Portuguese part-of-speech tagging with large margin structure learning

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

Original languageEnglish
Title of host publicationBRACIS 2014 : 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings
Number of pages6
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers Inc.
Publication date12.12.2014
Pages25-30
Article number6984802
ISBN (print)978-1-4799-7859-5
ISBN (electronic)978-1-4799-5618-0
DOIs
Publication statusPublished - 12.12.2014
Externally publishedYes
EventBrazilian Conference on Intelligent Systems - BRACIS 2014 - Sao Carlos, Sao Paulo, Brazil
Duration: 18.10.201423.10.2014
Conference number: 3
https://ieeexplore.ieee.org/xpl/conhome/6979382/proceeding

DOI

Recently viewed

Publications

  1. Fusion of knowledge bases for better navigation of wheeled mobile robotic group with 3D TVS
  2. Analysis of the construction of an autonomous robot to improve its energy efficiency when traveling through irregular terrain
  3. Automatic Tuning of Extended Kalman Filter in Synchronous Reluctance Motor Drives with a Master-Slave Configuration
  4. Sensorless Control of AC Motor Drives with Adaptive Extended Kalman Filter
  5. Rapid grain refinement and compositional homogenization in a cast binary Cu50Ni alloy achieved by friction stir processing
  6. Improving Flood Forecasting in a Developing Country
  7. Influence of Mg content in Al alloys on processing characteristics and dynamically recrystallized microstructure of friction surfacing deposits
  8. Stimulating Computing
  9. Artificial intelligence in songwriting and composing - perspectives and challenges in creative practices
  10. Internal forces in robotic manipulation and in general mechanisms using a geometric approach
  11. How to support teachers to give feedback to modelling tasks effectively? Results from a teacher-training-study in the Co²CA project
  12. Introduction
  13. A dialectical perspective on innovation: Conflicting demands, multiple pathways, and ambidexterity
  14. On New Forms of Science Communication and Communication in Science
  15. Self-supervised Siamese Autoencoders
  16. Toward a methodical framework for comprehensively assessing forest multifunctionality
  17. Explaining Disagreement on Interest Rates in a Taylor-Rule Setting
  18. Optimal trajectory generation for camless internal combustion engine valve control
  19. Dispute and morality in the perception of societal risks: extending the psychometric model
  20. Where Tasks, Technology, and Textbooks Meet: An Exploratory Analysis of English Language Teachers’ Perceived Affordances of an Intelligent Language Tutoring System
  21. Towards a caring transdisciplinary research practice
  22. Deep drawing of high-strength tailored blanks by using tailored tools
  23. Practical critique: Bridging the gap between critical and practice oriented REDD+ research communities’
  24. The effect of psychotherapy for depression on improvements in social functioning
  25. Optimal grazing management rules in semi-arid rangelands with uncertain rainfall
  26. Fluorometer controlled apparatus designed for long-duration algal-feeding experiments and environmental effect studies with mussels
  27. The First 50 Contributions to the Data Observer Series - An Overview
  28. Comparing Instrument-induced effects in EFL requests
  29. Software and Web-Based Tools for Sustainability Management in Micro-, Small- and Medium-Sized Enterprises