RelHunter: A machine learning method for relation extraction from text

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Authors

We propose RelHunter, a machine learning-based method for the extraction of structured information from text. RelHunter's key idea is to model the target structures as a relation over entities. Hence, the modeling effort is reduced to the identification of entities and the generation of a candidate relation, which are simpler problems than the original one. RelHunter fits a very broad spectrum of complex computational linguistic problems. We apply it to five tasks: phrase chunking, clause identification, hedge detection, quotation extraction, and dependency parsing. We compare RelHunter to token classification approaches through several computational experiments on seven multilingual corpora. RelHunter outperforms the token classification approaches by 2.14% on average. Moreover, we compare the derived systems against state-of-the-art systems for each corpus. Our systems achieve state-of-the-art performances for three corpora: Portuguese phrase chunking, Portuguese clause identification, and English quotation extraction. Additionally, the derived systems show good quality performance for the other four corpora.

OriginalspracheEnglisch
Aufsatznummer18
ZeitschriftJournal of the Brazilian Computer Society
Jahrgang16
Ausgabenummer3
Seiten (von - bis)191-199
Anzahl der Seiten9
ISSN0104-6500
DOIs
PublikationsstatusErschienen - 09.2010
Extern publiziertJa

DOI

Zuletzt angesehen

Publikationen

  1. Swarm Robotics, or: The Smartness of 'a bunch of cheap dumb things'
  2. Perceptions of Organizational Downsizing
  3. Policy implementation through multi-level governance
  4. Pre-service mathematics teachers' modelling processes within model eliciting activity through digital technologies
  5. Advantages and difficulties of conducting thinking-aloud protocols in the school setting
  6. Development of a procedure for forming assisted thermal joining of tubes
  7. The complementarity of single-species and ecosystem-oriented research in conservation research
  8. Innovation in Continuing Engineering Education with focus on gender and non-traditional students' pathways
  9. Does transition to IFRS substantially affect key financial ratios in shareholder-oriented common law regimes?
  10. Do it again
  11. Classification of playing position in elite junior Australian football using technical skill indicators
  12. Global patterns of ecologically unequal exchange
  13. The use of force against terrorists
  14. Wir sind ihr
  15. Delivering community benefits through REDD plus : Lessons from Joint Forest Management in Zambia
  16. Internet-Based Prevention of Depression in Employees
  17. Toward a Production-Oriented Imagology
  18. The Computational Turn, or, a New Weltbild
  19. Archival research on carbon reporting quality. A review of determinants and consequences for firm value
  20. Community and Training in NFDI4DS
  21. Kriminalisierung und Versicherheitlichung von Migration. Editorial
  22. Assoggettamento/Soggettivazione
  23. On the micro-structure of the German export boom
  24. The Measurement of Grip-Strength in Automobiles
  25. Front in the mouth, front in the word
  26. Intra- and interspecific hybridization in invasive Siberian elm
  27. Design und Methode der Studie
  28. Benchmarking question answering systems
  29. Logistisches Montagecontrolling