Latent trees for coreference resolution

Research output: Journal contributionsJournal articlesResearchpeer-review

Authors

We describe a structure learning system for unrestricted coreference resolution that explores two key modeling techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible because it incorporates a meaningful hidden structure. Additionally, using an automatic feature induction method, we can efficiently build enhanced nonlinear models using linear model learning algorithms. We present empirical results that highlight the contribution of each modeling technique used in the proposed system. Empirical evaluation is performed on the multilingual unrestricted coreference CoNLL-2012 Shared Task data sets, which comprise three languages: Arabic, Chinese, and English. We apply the same system to all languages, except for minor adaptations to some language-dependent features such as nested mentions and specific static pronoun lists. A previous version of this system was submitted to the CoNLL-2012 Shared Task closed track, achieving an official score of 58:69, the best among the competitors. The unique enhancement added to the current system version is the inclusion of candidate arcs linking nested mentions for the Chinese language. By including such arcs, the score increases by almost 4.5 points for that language. The current system shows a score of 60:15, which corresponds to a 3:5% error reduction, and is the best performing system for each of the three languages.

Original languageEnglish
JournalComputational Linguistics
Volume40
Issue number4
Pages (from-to)801-835
Number of pages35
ISSN0891-2017
DOIs
Publication statusPublished - 19.12.2014
Externally publishedYes

DOI

Recently viewed

Activities

  1. 23rd (EC)2 Conference - Hypothesis Testing - EC2 2012
  2. UV photodegradation of trimipramine under different environmental variables and chemical nature of aqueous solution - biodegradation and LC-MSn characterization of the formed transformation products
  3. Eigenzeiten of Creativity – Temporal Work as a Coordination Challenge in Artistic and Scientific Project Ecologies
  4. Transdisciplinary Evaluation of Different Coastal Adaptation Strategies: Integrating Regional Perceptions of Scientists, Practitioners and the Public
  5. Student Gender and Teachers' Grading and Written Feedback on Math or Language Assignments
  6. Prototyping in der transdisziplinären Teamarbeit
  7. Video or Text Cases in Problem-Oriented or Direct Instructional Settings for Preservice Teachers?
  8. Group Decision and Negotiation (Fachzeitschrift)
  9. The Rhetoric of Disillusionment. Discursive Shifts in the Rhetoric of "There is no alternative"
  10. Building Collective Institutional Infrastructures for Decent Platform Work: The Development of a Crowdwork Agreement in Germany
  11. It’s hard to part with gains, but what about losses. Contribution and Distribution of Benefits and Burdens in Integrative Negotiations
  12. 18th International Conference on Pragmatics and Language Learning - 2010 (Veranstaltung)
  13. Containing and Accomodating Salafism in the Sahel: Insights and Lessons from Niger
  14. Designmethoden in transdisziplinären Teams
  15. Social Entrepreneurship - an introduction: ERASMUS guest lecture
  16. The Influence of Media-Politics-Parallelism on Political Participation and Pluralism
  17. Founding moral theory: a Meillassouxian perspective on Kant’s postulate problem
  18. From sensors and trajectories to transport and mixing
  19. It’s hard to part with gains, but what about losses. Contribution and Distribution of Benefits and Burdens in Integrative Negotiations
  20. Veranstaltungsreihe "Brown Bag Lectures" am Institute of English Studies