Hedge Detection Using the RelHunter Approach

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

RelHunter is a Machine Learning based method for the extraction of structured information from text. Here, we apply RelHunter to the Hedge Detection task, proposed as the CoNLL-2010 Shared Task. RelHunter's key design idea is to model the target structures as a relation over entities. The method decomposes the original task into three subtasks: (i) Entity Identification; (ii) Candidate Relation Generation; and (iii) Relation Recognition. In the Hedge Detection task, we define three types of entities: cue chunk, start scope token and end scope token. Hence, the Entity Identification subtask is further decomposed into three token classification subtasks, one for each entity type. In the Candidate Relation Generation sub-task, we apply a simple procedure to generate a ternary candidate relation. Each instance in this relation represents a hedge candidate composed by a cue chunk, a start scope token and an end scope token. For the Relation Recognition subtask, we use a binary classifier to discriminate between true and false candidates. The four classifiers are trained with the Entropy Guided Transformation Learning algorithm. When compared to the other hedge detection systems of the CoNLL shared task, our scheme shows a competitive performance. The F-score of our system is 54.05 on the evaluation corpus.
Original languageEnglish
Title of host publicationProceedings of the Fourteenth Conference on Computational Natural Language Learning --- Shared Task
EditorsRichard Farkas, Veronika Vincze, György Szarvas, György Mora, Janos Csirik
Number of pages6
Place of PublicationUSA
PublisherAssociation for Computational Linguistics (ACL)
Publication date2010
Pages64–69
ISBN (print)978-1-932432-84-8
Publication statusPublished - 2010
Externally publishedYes
Event14th Conference on Computational Natural Language Learning - CoNLL 2010: Shared Task - Uppsala, Uppsala, Sweden
Duration: 15.07.201017.07.2010
Conference number: 14
http://toc.proceedings.com/08986webtoc.pdf

Recently viewed

Publications

  1. Chronic effects of a static stretching intervention program on range of motion and tissue hardness in older adults
  2. How to support students-learning in mathematical bridging-courses using ITS? Remedial Scenarios in the EU-Project Math-Bridge
  3. Study of non-linear systems
  4. Using latent class analysis to produce a typology of environmental concern in the UK
  5. Ablation Study of a Multimodal Gat Network on Perfect Synthetic and Real-world Data to Investigate the Influence of Language Models in Invoice Recognition
  6. Implementation of Chemometric Tools to Improve Data Mining and Prioritization in LC-HRMS for Nontarget Screening of Organic Micropollutants in Complex Water Matrixes
  7. The language of situated joint activity: Social virtual reality and language learning in virtual exchange
  8. Anonymized firm data under test: evidence from a replication study
  9. How development leads to democracy
  10. Predicting recurrent chat contact in a psychological intervention for the youth using natural language processing
  11. Dimensions, dialectic, discourse
  12. Model-Based Optimization of Spiral Coils for Improving Wireless Power Transfer
  13. Synthesis and future research directions linking tree diversity to growth, survival, and damage in a global network of tree diversity experiments
  14. An assessment of the published results of animal relocations
  15. Intermediate `time-spaces' - The rediscovery of transition in spatial planning and environmental planning
  16. A path to clean water
  17. Optimising patterns of life conduct
  18. Accidental Representation–The Reconfiguration of Representation through Social Media
  19. Cyclooxygenase-2-expression in the outer root sheath of anagen but not telogen hair follicles of the mouse skin
  20. Enhancing the transformative potential of interventions for the sustainable use of natural resources
  21. Qualitative Daten computergestutzt auswerten
  22. Overyielding in experimental grassland communities - Irrespective of species pool or spatial scale