RelHunter: A machine learning method for relation extraction from text

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Authors

We propose RelHunter, a machine learning-based method for the extraction of structured information from text. RelHunter's key idea is to model the target structures as a relation over entities. Hence, the modeling effort is reduced to the identification of entities and the generation of a candidate relation, which are simpler problems than the original one. RelHunter fits a very broad spectrum of complex computational linguistic problems. We apply it to five tasks: phrase chunking, clause identification, hedge detection, quotation extraction, and dependency parsing. We compare RelHunter to token classification approaches through several computational experiments on seven multilingual corpora. RelHunter outperforms the token classification approaches by 2.14% on average. Moreover, we compare the derived systems against state-of-the-art systems for each corpus. Our systems achieve state-of-the-art performances for three corpora: Portuguese phrase chunking, Portuguese clause identification, and English quotation extraction. Additionally, the derived systems show good quality performance for the other four corpora.

OriginalspracheEnglisch
Aufsatznummer18
ZeitschriftJournal of the Brazilian Computer Society
Jahrgang16
Ausgabenummer3
Seiten (von - bis)191-199
Anzahl der Seiten9
ISSN0104-6500
DOIs
PublikationsstatusErschienen - 09.2010
Extern publiziertJa

DOI

Zuletzt angesehen

Publikationen

  1. Enforcement concepts and strategies in the EU
  2. Large trees are keystone structures in urban parks
  3. Introduction
  4. The multiplicity of emotions: A framework of emotional functions in decision making
  5. Continuous Casting with Mid-Process Alloying
  6. A Subspace to Describe Grasping Internal Forces in Robotic Manipulation Systems
  7. A revised theory of contestable markets
  8. Einführung in Grundlagen der theoretischen Informatik
  9. Studying embodied encounters
  10. A fragile kaleidoscope
  11. Numerical dynamic simulation and analysis of a lithium bromide/water long term solar heat storage system
  12. Sustainable Statehood: Reflections on Critical (Pre-)Conditions, Requirements and Design Options
  13. Classification of playing position in elite junior Australian football using technical skill indicators
  14. A Performance Motivator in one Country, A Non-Motivator in Another?
  15. Der "getarnte" Arbeitnehmer-Geschäftsführer
  16. The Lotka-Volterra Model for Competition Controlled by a Sliding Mode Approach
  17. CSR and tax avoidance: A review of empirical research
  18. SemREC-SMART 2022
  19. Basic analysis of the incremental profile forming process
  20. The temporal factor of change in stressor-strain relationships
  21. Promoting diversity of thought: bridging knowledge systems for a pluriverse approach to research
  22. Analysis of observability of a differential equation system describing a synchronous electromagnetic drive
  23. Kriminalisierung und Versicherheitlichung von Migration. Editorial
  24. Launching insectphylo.org; a new hub facilitating construction and use of synthesis molecular phylogenies of insects
  25. Workshop: 20 years health promotion research in and on settings
  26. Open Innovation Networks
  27. A victim of regulatory arbitrage? Automatic exchange of information and the use of golden visas and corporate shells
  28. Understanding european union law
  29. Intentionalisten vs. Strukturalisten
  30. Effects of anthropogenic disturbances on soil microbial communities in oak forests persist for more than 100 years
  31. Metastable–Stable
  32. The impact of auditor rotation, audit firm rotation and non-audit services on earnings quality, audit quality and investor perceptions: A literature review
  33. Design und Methode der Studie
  34. Why reinvent the wheel
  35. Process limits of extrusion of multimaterial components
  36. Crowdsourcing Swiss Dialect Transcriptions for Assessing Factors in Writing Variations
  37. Edward Lear, A book of nonsense
  38. Contractualisation of Civil Litigation
  39. Beschreibungsmethodik für AAL-Integrationsprofile
  40. A systematic literature review of machine learning canvases
  41. Aesthetics Column With Blindman on the Documenta 13
  42. Sudoko mathematics for and done by younger students
  43. Computer-Kriegs-Spiele oder: eine Kultur der Gewalt
  44. Workshop on impacts of the EU-UK Trade and Cooperation Agreement on fisheries and aquaculture in the EU

Presse / Medien

  1. Mimesis und Mimikry