Portuguese part-of-speech tagging with large margin structure learning

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Portuguese part-of-speech tagging with large margin structure learning. / Fernandes, Eraldo Rezende; Rodrigues, Irving Muller; Milidiú, Ruy Luiz.
BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings. Piscataway: Institute of Electrical and Electronics Engineers Inc., 2014. p. 25-30 6984802.

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Fernandes, ER, Rodrigues, IM & Milidiú, RL 2014, Portuguese part-of-speech tagging with large margin structure learning. in BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings., 6984802, Institute of Electrical and Electronics Engineers Inc., Piscataway, pp. 25-30, Brazilian Conference on Intelligent Systems - BRACIS 2014, Sao Carlos, Sao Paulo, Brazil, 18.10.14. https://doi.org/10.1109/BRACIS.2014.16

APA

Fernandes, E. R., Rodrigues, I. M., & Milidiú, R. L. (2014). Portuguese part-of-speech tagging with large margin structure learning. In BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings (pp. 25-30). Article 6984802 Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BRACIS.2014.16

Vancouver

Fernandes ER, Rodrigues IM, Milidiú RL. Portuguese part-of-speech tagging with large margin structure learning. In BRACIS 2014: 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings. Piscataway: Institute of Electrical and Electronics Engineers Inc. 2014. p. 25-30. 6984802 doi: 10.1109/BRACIS.2014.16

Bibtex

@inbook{8908bae32b724634be8419587a8cc4be,
title = "Portuguese part-of-speech tagging with large margin structure learning",
abstract = "Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.",
keywords = "Machine Learning, Natural Language Processing, POS Tagging, Structure Learning, Informatics, Business informatics",
author = "Fernandes, {Eraldo Rezende} and Rodrigues, {Irving Muller} and Milidi{\'u}, {Ruy Luiz}",
year = "2014",
month = dec,
day = "12",
doi = "10.1109/BRACIS.2014.16",
language = "English",
isbn = "978-1-4799-7859-5",
pages = "25--30",
booktitle = "BRACIS 2014",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",
note = "Brazilian Conference on Intelligent Systems - BRACIS 2014 ; Conference date: 18-10-2014 Through 23-10-2014",
url = "https://ieeexplore.ieee.org/xpl/conhome/6979382/proceeding",

}

RIS

TY - CHAP

T1 - Portuguese part-of-speech tagging with large margin structure learning

AU - Fernandes, Eraldo Rezende

AU - Rodrigues, Irving Muller

AU - Milidiú, Ruy Luiz

N1 - Conference code: 3

PY - 2014/12/12

Y1 - 2014/12/12

N2 - Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

AB - Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

KW - Machine Learning

KW - Natural Language Processing

KW - POS Tagging

KW - Structure Learning

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84922535000&partnerID=8YFLogxK

U2 - 10.1109/BRACIS.2014.16

DO - 10.1109/BRACIS.2014.16

M3 - Article in conference proceedings

AN - SCOPUS:84922535000

SN - 978-1-4799-7859-5

SP - 25

EP - 30

BT - BRACIS 2014

PB - Institute of Electrical and Electronics Engineers Inc.

CY - Piscataway

T2 - Brazilian Conference on Intelligent Systems - BRACIS 2014

Y2 - 18 October 2014 through 23 October 2014

ER -

DOI

Recently viewed

Publications

  1. Perfectly nested or significantly nested - an important difference for conservation management
  2. Developing spatial biophysical accounting for multiple ecosystem services
  3. A Two-Stage Sliding-Mode High-Gain Observer to Reduce Uncertainties and Disturbances Effects for Sensorless Control in Automotive Applications
  4. Discourse, practice, policy and organizing
  5. A Unified Contextual Bandit Framework for Long- and Short-Term Recommendations
  6. Emergence of Responsiveness Across Organizations, Networks, and Clusters from a Dynamic Capability Perspective
  7. BUSINESS MODELS IN BANKING: A CLUSTER ANALYSIS USING ARCHIVAL DATA
  8. Simon Denny
  9. The Influence of Robots’ Emotion Expressions on the Uncanny-Valley-Effect
  10. Perception of Space and Time in a Created Environment
  11. Reconceptualizing the role of socioeconomic material stocks in the leverage points framework to enable transformative change
  12. German Utilities and distributed PV
  13. New descriptions and typifications of syntaxa within the project 'Plant communities of Mecklenburg-Vorpommern and their vulnerability' - Part II
  14. Solution for the direct kinematics problem of the general stewart-gough platform by using only linear actuators’ orientations
  15. Study of the solidification of AS alloys combining in situ synchrotron diffraction and differential scanning calorimetry
  16. Discourse of ‘Self’ and ‘Other’ in Newspaper Editorials on Insecurity in Nigeria
  17. Joint Proceedings of Scholarly QALD 2023 and SemREC 2023 co-located with 22nd International Semantic Web Conference ISWC 2023
  18. DigiSchreib
  19. From Planning to Implementation: Top-Down and Bottom-Up Approaches for Collaborative Watershed Management
  20. Modeling Self-Organization
  21. Equivalence unbalanced-metaphor, case, and example-from Aristotle to Derrida
  22. High-precision frequency measurements: indispensable tools at the core of the molecular-level analysis of complex systems.
  23. Implementation of formative assessment
  24. Chip extrusion with integrated equal channel angular pressing
  25. Education and Communication as Prerequisites for and Components of Sustainable Development. Reflections for Policies, Conceptual Work, and Theory, Based on Previous Practises