Portuguese part-of-speech tagging with large margin structure learning

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Portuguese part-of-speech tagging with large margin structure learning. / Fernandes, Eraldo Rezende; Rodrigues, Irving Muller; Milidiú, Ruy Luiz.
BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings. Piscataway: Institute of Electrical and Electronics Engineers Inc., 2014. p. 25-30 6984802.

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Fernandes, ER, Rodrigues, IM & Milidiú, RL 2014, Portuguese part-of-speech tagging with large margin structure learning. in BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings., 6984802, Institute of Electrical and Electronics Engineers Inc., Piscataway, pp. 25-30, Brazilian Conference on Intelligent Systems - BRACIS 2014, Sao Carlos, Sao Paulo, Brazil, 18.10.14. https://doi.org/10.1109/BRACIS.2014.16

APA

Fernandes, E. R., Rodrigues, I. M., & Milidiú, R. L. (2014). Portuguese part-of-speech tagging with large margin structure learning. In BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings (pp. 25-30). Article 6984802 Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BRACIS.2014.16

Vancouver

Fernandes ER, Rodrigues IM, Milidiú RL. Portuguese part-of-speech tagging with large margin structure learning. In BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings. Piscataway: Institute of Electrical and Electronics Engineers Inc. 2014. p. 25-30. 6984802 doi: 10.1109/BRACIS.2014.16

Bibtex

@inbook{8908bae32b724634be8419587a8cc4be,
title = "Portuguese part-of-speech tagging with large margin structure learning",
abstract = "Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.",
keywords = "Machine Learning, Natural Language Processing, POS Tagging, Structure Learning, Informatics, Business informatics",
author = "Fernandes, {Eraldo Rezende} and Rodrigues, {Irving Muller} and Milidi{\'u}, {Ruy Luiz}",
year = "2014",
month = dec,
day = "12",
doi = "10.1109/BRACIS.2014.16",
language = "English",
isbn = "978-1-4799-7859-5",
pages = "25--30",
booktitle = "BRACIS 2014",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",
note = "Brazilian Conference on Intelligent Systems - BRACIS 2014 ; Conference date: 18-10-2014 Through 23-10-2014",
url = "https://ieeexplore.ieee.org/xpl/conhome/6979382/proceeding",

}

RIS

TY - CHAP

T1 - Portuguese part-of-speech tagging with large margin structure learning

AU - Fernandes, Eraldo Rezende

AU - Rodrigues, Irving Muller

AU - Milidiú, Ruy Luiz

N1 - Conference code: 3

PY - 2014/12/12

Y1 - 2014/12/12

N2 - Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

AB - Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

KW - Machine Learning

KW - Natural Language Processing

KW - POS Tagging

KW - Structure Learning

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84922535000&partnerID=8YFLogxK

U2 - 10.1109/BRACIS.2014.16

DO - 10.1109/BRACIS.2014.16

M3 - Article in conference proceedings

AN - SCOPUS:84922535000

SN - 978-1-4799-7859-5

SP - 25

EP - 30

BT - BRACIS 2014

PB - Institute of Electrical and Electronics Engineers Inc.

CY - Piscataway

T2 - Brazilian Conference on Intelligent Systems - BRACIS 2014

Y2 - 18 October 2014 through 23 October 2014

ER -

DOI

Recently viewed

Researchers

  1. Matthias Schmidt

Publications

  1. Structural ambidexterity, transition processes, and integration trade‐offs: a longitudinal study of failed exploration
  2. Video-, Text- oder Live-Coaching?
  3. 9th challenge on question answering over linked data (QALD-9)
  4. Export entry, export exit, and productivity in German manufacturing industries
  5. Time-varying persistence in real oil prices and its determinant
  6. Development and application of green and sustainable analytical methods for flavonoid extraction from Passiflora waste
  7. Training in Components of Problem-Solving Competence
  8. Foundational Aspects of Polycentric Governance
  9. Cross-Fertilizing Qualitative Perspectives on Effects of a Mindfulness-Based Intervention: An Empirical Comparison of Four Methodical Approaches
  10. The adoption and implementation of Activity-based Costing
  11. Keep calm and follow the news
  12. Neighbour species richness and local structural variability modulate aboveground allocation patterns and crown morphology of individual trees
  13. A Fictional Risk Narrative and Its Potential for Social Resonance: Reception of Barbara Kingsolver’s Flight Behavior in Reviews and Reading Groups
  14. Analyzing Pragmatic Variation in English
  15. History of Embryology: Visualizations Through Series and Animation
  16. Effect of Welding Speed on Friction Stir Welds of PM2000 Alloy
  17. Learning Analytics and Digital Badges
  18. The sources of entrepreneurial activity, ed. by Gary D. Libecap
  19. Score-Informed Analysis of Tuning, Intonation, Pitch Modulation, and Dynamics in Jazz Solos
  20. Insights into PBDE Uptake, Body Burden, and Elimination Gained from Australian Age-Concentration Trends Observed Shortly after Peak Exposure
  21. Handwerk hacken
  22. Vocational identity trajectories
  23. Music Scenes and Cultural Ecosystems - Alternative Practices between 'Official Policy Recognition' and 'Post-Pandemic Austerity'
  24. Guest editorial
  25. Symbolic Environmental Legislation and Societal Self-Deception
  26. Die Entwicklung der Rechtschreibkompetenz beim Textschreiben
  27. Common opossum population density in an agroforestry system in Bolivia
  28. Analyzing non-linear dynamics of organic growth: evidence from small german new ventures
  29. Mathematiklehrerausbildung zum Studienbeginn
  30. Can radicals get a seat on the negotiation table? A Dynamic Perspective on Legitimation Processes.
  31. Health promoting schools in germany. Mapping the implementation of holistic strategies to tackle ncds and promote health
  32. Leveraging Architectural Thinking for Large-Scale E-Government Projects
  33. The counter-revolution is really televised …
  34. Heuristics-in-use