Hedge Detection Using the RelHunter Approach

Research output: Contributions to collected editions/works; Article in conference proceedings; Research; peer-review

Authors

RelHunter is a machine-learning-based method for extracting structured information from text. Here, we apply RelHunter to the Hedge Detection task, proposed as the CoNLL-2010 Shared Task. RelHunter's key design idea is to model the target structures as a relation over entities. The method decomposes the original task into three subtasks: (i) Entity Identification; (ii) Candidate Relation Generation; and (iii) Relation Recognition. In the Hedge Detection task, we define three types of entities: cue chunk, start scope token, and end scope token. Hence, the Entity Identification subtask is further decomposed into three token classification subtasks, one for each entity type. In the Candidate Relation Generation subtask, we apply a simple procedure to generate a ternary candidate relation. Each instance in this relation represents a hedge candidate composed of a cue chunk, a start scope token, and an end scope token. For the Relation Recognition subtask, we use a binary classifier to discriminate between true and false candidates. The four classifiers (three for entity identification and one for relation recognition) are trained with the Entropy Guided Transformation Learning algorithm. Compared with the other hedge detection systems in the CoNLL shared task, our scheme shows competitive performance. The F-score of our system is 54.05 on the evaluation corpus.
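The decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `HedgeCandidate`, `generate_candidates`, and `recognize` are hypothetical, the entity lists stand in for the output of the three token classifiers, and the rule that a scope must enclose its cue is an assumption, since the abstract only says the generation procedure is "simple". The lambda at the end is a toy stand-in for the trained binary classifier.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class HedgeCandidate:
    """One instance of the ternary relation (hypothetical representation)."""
    cue: Tuple[int, int]  # (first, last) token index of the cue chunk, inclusive
    start: int            # index of the start scope token
    end: int              # index of the end scope token


def generate_candidates(
    cues: List[Tuple[int, int]], starts: List[int], ends: List[int]
) -> List[HedgeCandidate]:
    """Candidate Relation Generation: combine the identified entities.

    Assumption: only triples whose scope encloses the cue chunk are kept.
    """
    return [
        HedgeCandidate(cue, s, e)
        for cue, s, e in product(cues, starts, ends)
        if s <= cue[0] and cue[1] <= e
    ]


def recognize(
    candidates: List[HedgeCandidate],
    is_hedge: Callable[[HedgeCandidate], bool],
) -> List[HedgeCandidate]:
    """Relation Recognition: keep the candidates the binary classifier accepts."""
    return [c for c in candidates if is_hedge(c)]


# Hand-written entities for the tokens: This may indicate a role .
cues = [(1, 1)]      # cue chunk "may"
starts = [0, 1, 3]   # predicted start scope tokens
ends = [2, 4]        # predicted end scope tokens

candidates = generate_candidates(cues, starts, ends)
# Toy stand-in for the trained classifier (illustration only).
hedges = recognize(candidates, lambda c: c.start == c.cue[0] and c.end == 4)
```

Splitting generation from recognition means the final classifier scores each (cue, start, end) triple as a whole, so it can reject entity combinations that the independent token classifiers could not rule out.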
Original language: English
Title of host publication: Proceedings of the Fourteenth Conference on Computational Natural Language Learning: Shared Task
Editors: Richard Farkas, Veronika Vincze, György Szarvas, György Mora, Janos Csirik
Number of pages: 6
Place of Publication: USA
Publisher: Association for Computational Linguistics (ACL)
Publication date: 2010
Pages: 64–69
ISBN (print): 978-1-932432-84-8
Publication status: Published - 2010
Externally published: Yes
Event: 14th Conference on Computational Natural Language Learning - CoNLL 2010: Shared Task - Uppsala, Uppsala, Sweden
Duration: 15.07.2010 – 17.07.2010
Conference number: 14
http://toc.proceedings.com/08986webtoc.pdf
