Portuguese part-of-speech tagging with large margin structure learning

Publication: Contributions to collected works > Articles in conference proceedings > Research > Peer-reviewed

Standard

Portuguese part-of-speech tagging with large margin structure learning. / Fernandes, Eraldo Rezende; Rodrigues, Irving Muller; Milidiú, Ruy Luiz.
BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings. Piscataway: Institute of Electrical and Electronics Engineers Inc., 2014. pp. 25-30, article 6984802.

Harvard

Fernandes, ER, Rodrigues, IM & Milidiú, RL 2014, Portuguese part-of-speech tagging with large margin structure learning. in BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings, 6984802, Institute of Electrical and Electronics Engineers Inc., Piscataway, pp. 25-30, Brazilian Conference on Intelligent Systems - BRACIS 2014, Sao Carlos, Sao Paulo, Brazil, 18/10/14. https://doi.org/10.1109/BRACIS.2014.16

APA

Fernandes, E. R., Rodrigues, I. M., & Milidiú, R. L. (2014). Portuguese part-of-speech tagging with large margin structure learning. In BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings (pp. 25-30). Article 6984802. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/BRACIS.2014.16

Vancouver

Fernandes ER, Rodrigues IM, Milidiú RL. Portuguese part-of-speech tagging with large margin structure learning. In BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings. Piscataway: Institute of Electrical and Electronics Engineers Inc. 2014. p. 25-30. 6984802 doi: 10.1109/BRACIS.2014.16

Bibtex

@inbook{8908bae32b724634be8419587a8cc4be,
title = "Portuguese part-of-speech tagging with large margin structure learning",
abstract = "Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.",
keywords = "Machine Learning, Natural Language Processing, POS Tagging, Structure Learning, Informatics, Business informatics",
author = "Fernandes, {Eraldo Rezende} and Rodrigues, {Irving Muller} and Milidi{\'u}, {Ruy Luiz}",
year = "2014",
month = dec,
day = "12",
doi = "10.1109/BRACIS.2014.16",
language = "English",
isbn = "978-1-4799-7859-5",
pages = "25--30",
booktitle = "BRACIS 2014",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",
note = "Brazilian Conference on Intelligent Systems - BRACIS 2014 ; Conference date: 18-10-2014 Through 23-10-2014",
url = "https://ieeexplore.ieee.org/xpl/conhome/6979382/proceeding",

}

RIS

TY - CHAP

T1 - Portuguese part-of-speech tagging with large margin structure learning

AU - Fernandes, Eraldo Rezende

AU - Rodrigues, Irving Muller

AU - Milidiú, Ruy Luiz

N1 - Conference code: 3

PY - 2014/12/12

Y1 - 2014/12/12

AB - Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

KW - Machine Learning

KW - Natural Language Processing

KW - POS Tagging

KW - Structure Learning

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84922535000&partnerID=8YFLogxK

U2 - 10.1109/BRACIS.2014.16

DO - 10.1109/BRACIS.2014.16

M3 - Article in conference proceedings

AN - SCOPUS:84922535000

SN - 978-1-4799-7859-5

SP - 25

EP - 30

BT - BRACIS 2014

PB - Institute of Electrical and Electronics Engineers Inc.

CY - Piscataway

T2 - Brazilian Conference on Intelligent Systems - BRACIS 2014

Y2 - 18 October 2014 through 23 October 2014

ER -
