Portuguese part-of-speech tagging with large margin structure learning

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

Original languageEnglish
Title of host publicationBRACIS 2014 : 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings
Number of pages6
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers Inc.
Publication date12.12.2014
Pages25-30
Article number6984802
ISBN (print)978-1-4799-7859-5
ISBN (electronic)978-1-4799-5618-0
DOIs
Publication statusPublished - 12.12.2014
Externally publishedYes
EventBrazilian Conference on Intelligent Systems - BRACIS 2014 - Sao Carlos, Sao Paulo, Brazil
Duration: 18.10.201423.10.2014
Conference number: 3
https://ieeexplore.ieee.org/xpl/conhome/6979382/proceeding

DOI

Recently viewed

Activities

  1. A Simple Likelihood-based Panel Cointegration Test in the Presence of a Linear Time Trend and Cross-sectional Dependence
  2. Methodology of Scenario Technique in Regional Development Processes
  3. Towards an Emotional Geography of Urban Policing: Exploring the Materialization of Police Territoriality with Emotional Mapping Interviews
  4. CES Summer School 2016
  5. Macro Opinion in Comparative Perspective Class policy moods: a new approach to responsiveness inequality
  6. Interpretation and contestation of fracking in a changing context: The case of Germany and its proclaimed energy transition
  7. Time-Induced Political Inequality: Why Future Generations Need Proxy Representation
  8. Anarchism as a Framework for Rethinking Educational Authority
  9. ‘Thinking the Problematic‘
  10. Transition into unemployment and demand for redistribution
  11. Ecological Applications (Zeitschrift)
  12. The hidden power dynamics of organizing through enterprise social media
  13. Emerging Visions of Seamless Travel: (En)Countering Camouflaged Sovereignty at the Frictionless Border
  14. Robots versus Machines
  15. ECPR Winter School in Methods and Techniques
  16. Exploring organizational processes:A tension-based view
  17. Digital Teaching and Learning
  18. Are Self-Employed Time and Money Poor? Dynamics of Interpendent Multidimensional Poverty with German Time Use Diary Data
  19. HyperKult XI - Computer als Medium: Das Unsichtbare - 2002
  20. The Relationship between Innovation and Creativity
  21. Open-source Citizenship Research: Learning from Anti-corporate Campaigning Methodologies
  22. Questioning societal assumptions and paradigms through transdisciplinary research
  23. ECPR Joint Sessions of Workshops - ECPR 2019

Publications

  1. A Two-Stage Sliding-Mode High-Gain Observer to Reduce Uncertainties and Disturbances Effects for Sensorless Control in Automotive Applications
  2. Archives
  3. Implementing the Kyoto Protocol without Russia
  4. Crowdsourcing
  5. Ob lang oder kurz, berührbar oder nicht: Ist die Längenschätzkompetenz eindimensional?
  6. Energy model, boundary object and societal lens
  7. Soft Skills for Hard Constraints
  8. Advancing Qualitative Meta-Studies (QMS)
  9. Introduction to Philosophy of Management
  10. How people explain their own and others’ behavior:
  11. Development and criterion validity of differentiated and elevated vocational interests in adolescence
  12. Calibration of a simple method for determining ammonia loss in the field
  13. Conceptualizing community in energy systems
  14. Embedding Evidence on Conservation Interventions Within a Context of Multilevel Governance
  15. A highly transparent method of assessing the contribution of incentives to meet various technical challenges in distributed energy systems
  16. Advanced extrusion processes
  17. Geometric control tools for robotic manipulators
  18. Sustainability and management control. Exploring and theorizing control patterns in large European firms
  19. Application of Friction Riveting technique for the assembly of electronic components on printed circuit boards (PCB)
  20. “Making Sense”
  21. A Genetic Algorithm for the Dynamic Management of Cellular Reconfigurable Manufacturing Systems
  22. Evaluating the (cost-)effectiveness of guided and unguided Internet-based self-help for problematic alcohol use in employees
  23. Do Specific Text Features Influence Click Probabilities in Paid Search Advertising?
  24. Lernwerkstatt