RelHunter: A machine learning method for relation extraction from text

Publication: Contributions to journals › Journal articles › Research › Peer-reviewed

We propose RelHunter, a machine learning-based method for extracting structured information from text. RelHunter's key idea is to model the target structures as a relation over entities. The modeling effort is thereby reduced to identifying the entities and generating a candidate relation, both of which are simpler problems than the original one. RelHunter fits a very broad spectrum of complex computational linguistic problems. We apply it to five tasks: phrase chunking, clause identification, hedge detection, quotation extraction, and dependency parsing. We compare RelHunter to token classification approaches through several computational experiments on seven multilingual corpora. RelHunter outperforms the token classification approaches by 2.14% on average. Moreover, we compare the derived systems against state-of-the-art systems for each corpus. Our systems achieve state-of-the-art performance on three corpora: Portuguese phrase chunking, Portuguese clause identification, and English quotation extraction. The derived systems also perform well on the other four corpora.
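
As a rough illustration of the idea described in the abstract, and not the authors' implementation, the following Python sketch treats tokens as the entities, generates a candidate relation of token pairs (span start, span end), and keeps the pairs that a classifier accepts. The function names, the `max_len` window, and the rule-based `toy_classifier` are all assumptions made for this example; a real system would use a trained model in place of the toy rule.

```python
from typing import Callable, List, Tuple

# A candidate is a pair of token indices (start, end), both inclusive.
Candidate = Tuple[int, int]


def generate_candidates(tokens: List[str], max_len: int = 4) -> List[Candidate]:
    """Build the candidate relation: every (start, end) pair spanning at most max_len tokens."""
    n = len(tokens)
    return [(i, j) for i in range(n) for j in range(i, min(i + max_len, n))]


def extract_relation(
    tokens: List[str],
    is_member: Callable[[List[str], Candidate], bool],
    max_len: int = 4,
) -> List[Candidate]:
    """Keep only the candidates that the (hypothetical) classifier accepts."""
    return [c for c in generate_candidates(tokens, max_len) if is_member(tokens, c)]


# Toy stand-in for a trained classifier (an assumption for this sketch):
# accept spans that start with a determiner and end with a noun.
DETERMINERS = {"the", "a"}
NOUNS = {"cat", "mat"}


def toy_classifier(tokens: List[str], cand: Candidate) -> bool:
    start, end = cand
    return tokens[start] in DETERMINERS and tokens[end] in NOUNS


tokens = ["the", "cat", "sat", "on", "the", "mat"]
chunks = extract_relation(tokens, toy_classifier, max_len=2)
print(chunks)  # [(0, 1), (4, 5)]
```

The sketch separates candidate generation from the accept/reject decision, which mirrors the abstract's point that each sub-problem is simpler than predicting the full structure directly.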

Original language: English
Article number: 18
Journal: Journal of the Brazilian Computer Society
Volume: 16
Issue number: 3
Pages (from - to): 191-199
Number of pages: 9
ISSN: 0104-6500
Publication status: Published - 09.2010
Externally published: Yes
