Portuguese part-of-speech tagging with large margin structure learning

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Part-of-Speech Tagging is a fundamental task on many Natural Language Processing systems. This task consists in identifying the syntactic category, i.e. the part of speech, of each word in a sentence. Despite the fact that the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, like Parsing, Semantic Role Labeling and Information Extraction. Thus, it is still relevant to explore this task. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. Our system accuracy on this latter version is competitive with a semi-supervised Neural Network trained on Mac-Morpho plus a very large non-annotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. Our system employs a Large Margin training criteria to derive a structure predictor that is more robust on unseen data.

OriginalspracheEnglisch
TitelBRACIS 2014 : 2014 Brazilian Conference on Intelligent Systems ; 19-23 October 2014, São Carlos, São Paulo, Brazil ; proceedings
Anzahl der Seiten6
ErscheinungsortPiscataway
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum12.12.2014
Seiten25-30
Aufsatznummer6984802
ISBN (Print)978-1-4799-7859-5
ISBN (elektronisch)978-1-4799-5618-0
DOIs
PublikationsstatusErschienen - 12.12.2014
Extern publiziertJa
VeranstaltungBrazilian Conference on Intelligent Systems - BRACIS 2014 - Sao Carlos, Sao Paulo, Brasilien
Dauer: 18.10.201423.10.2014
Konferenznummer: 3
https://ieeexplore.ieee.org/xpl/conhome/6979382/proceeding

DOI

Zuletzt angesehen

Publikationen

  1. Digital identity building:
  2. Feature Extraction and Aggregation for Predicting the Euro 2016
  3. Use of Recurrence Quantification Analysis to Examine Associations Between Changes in Text Structure Across an Expressive Writing Intervention and Reductions in Distress Symptoms in Women With Breast Cancer
  4. Integrating regional perceptions into climate change adaptation
  5. The “Fragment on Machines” as Science Fiction; Or, Reading the Grundrisse Politically
  6. Design It!
  7. Software and Web-Based Tools for Sustainability Management in Micro-, Small- and Medium-Sized Enterprises
  8. Determinants and consequences of corporate social responsibility decoupling—Status quo and limitations of recent empirical quantitative research
  9. Local expansion concepts for detecting transport barriers in dynamical systems
  10. Schreiben digital
  11. Ecosystem Services as a Contested Concept
  12. Mapping Urban Information as an Interdisciplinary Method for Geography, Art and Architecture Representations
  13. De-Anonymizing Anonymous
  14. Temporary organizing and acceleration
  15. Vom Sagbaren zum Machbaren?
  16. Predictive modeling in e-mental health
  17. Effect of Sn additions on the age hardening response, microstructures and corrosion resistance of Mg-0.8Ca (wt%) alloys
  18. Impacts of species richness on productivity in a large-scale subtropical forest experiment
  19. Flavonoids as biopesticides – Systematic assessment of sources, structures, activities and environmental fate
  20. A New Generation of CAPI
  21. Nucleation kinetics of the γ-phase in a binary Mg-Al alloy
  22. Der dunkle Transhumanismus
  23. Atmospheric mercury over sea ice during the OASIS-2009 campaign
  24. Coplanar micro-strips/electrospun sensor system to measure the electronics properties of the polyethylene oxide (PEO) electrospun
  25. 3. Methoden-Muster: Austausch, Koordination, Abstimmung
  26. Von Differenz zu Vielfalt zu Super-Diversity
  27. Nitrogen Addition Enhances Drought Sensitivity of Young Deciduous Tree Species
  28. Laypeople’s Affective Images of Energy Transition Pathways
  29. Das Bild in der Schrift
  30. Musikbegriff, erweiterter
  31. Imagining organization through metaphor and metonymy
  32. A Soft Piezo Mechanical Hydraulic Actuator with its Liquid Stiffness Identification and its Control
  33. Simulationen im Nawi-Unterricht
  34. Effective Strategies for Research Integrity Training—a Meta-analysis
  35. Agro-ecosystem services and dis-services in almond orchards are differentially influenced by the surrounding landscape
  36. Discomfort in Automated Driving –
  37. "i like reggae and Bob Marley is already dead"
  38. Identifying governance gaps among interlinked sustainability challenges
  39. Unobserved firm heterogeneity and the establishment size
  40. Article 75 CISG
  41. EWMN