Latent trees for coreference resolution

Research output: Journal contributions › Journal articles › Research › peer-review

Standard

Latent trees for coreference resolution. / Fernandes, Eraldo Rezende; dos Santos, Cícero Nogueira; Milidiú, Ruy Luiz.
In: Computational Linguistics, Vol. 40, No. 4, 19.12.2014, p. 801-835.

Vancouver

Fernandes ER, dos Santos CN, Milidiú RL. Latent trees for coreference resolution. Computational Linguistics. 2014 Dec 19;40(4):801-835. doi: 10.1162/COLI_a_00200

Bibtex

@article{77fc8ee145124047b4766a9cd67a110b,
title = "Latent trees for coreference resolution",
abstract = "We describe a structure learning system for unrestricted coreference resolution that explores two key modeling techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible because it incorporates a meaningful hidden structure. Additionally, using an automatic feature induction method, we can efficiently build enhanced nonlinear models using linear model learning algorithms. We present empirical results that highlight the contribution of each modeling technique used in the proposed system. Empirical evaluation is performed on the multilingual unrestricted coreference CoNLL-2012 Shared Task data sets, which comprise three languages: Arabic, Chinese, and English. We apply the same system to all languages, except for minor adaptations to some language-dependent features such as nested mentions and specific static pronoun lists. A previous version of this system was submitted to the CoNLL-2012 Shared Task closed track, achieving an official score of 58.69, the best among the competitors. The unique enhancement added to the current system version is the inclusion of candidate arcs linking nested mentions for the Chinese language. By including such arcs, the score increases by almost 4.5 points for that language. The current system shows a score of 60.15, which corresponds to a 3.5% error reduction, and is the best performing system for each of the three languages.",
keywords = "Informatics, Business informatics",
author = "Fernandes, {Eraldo Rezende} and {dos Santos}, {C{\'i}cero Nogueira} and Milidi{\'u}, {Ruy Luiz}",
year = "2014",
month = dec,
day = "19",
doi = "10.1162/COLI_a_00200",
language = "English",
volume = "40",
pages = "801--835",
journal = "Computational Linguistics",
issn = "0891-2017",
publisher = "MIT Press",
number = "4",

}

RIS

TY - JOUR

T1 - Latent trees for coreference resolution

AU - Fernandes, Eraldo Rezende

AU - dos Santos, Cícero Nogueira

AU - Milidiú, Ruy Luiz

PY - 2014/12/19

Y1 - 2014/12/19

N2 - We describe a structure learning system for unrestricted coreference resolution that explores two key modeling techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible because it incorporates a meaningful hidden structure. Additionally, using an automatic feature induction method, we can efficiently build enhanced nonlinear models using linear model learning algorithms. We present empirical results that highlight the contribution of each modeling technique used in the proposed system. Empirical evaluation is performed on the multilingual unrestricted coreference CoNLL-2012 Shared Task data sets, which comprise three languages: Arabic, Chinese, and English. We apply the same system to all languages, except for minor adaptations to some language-dependent features such as nested mentions and specific static pronoun lists. A previous version of this system was submitted to the CoNLL-2012 Shared Task closed track, achieving an official score of 58.69, the best among the competitors. The unique enhancement added to the current system version is the inclusion of candidate arcs linking nested mentions for the Chinese language. By including such arcs, the score increases by almost 4.5 points for that language. The current system shows a score of 60.15, which corresponds to a 3.5% error reduction, and is the best performing system for each of the three languages.

AB - We describe a structure learning system for unrestricted coreference resolution that explores two key modeling techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible because it incorporates a meaningful hidden structure. Additionally, using an automatic feature induction method, we can efficiently build enhanced nonlinear models using linear model learning algorithms. We present empirical results that highlight the contribution of each modeling technique used in the proposed system. Empirical evaluation is performed on the multilingual unrestricted coreference CoNLL-2012 Shared Task data sets, which comprise three languages: Arabic, Chinese, and English. We apply the same system to all languages, except for minor adaptations to some language-dependent features such as nested mentions and specific static pronoun lists. A previous version of this system was submitted to the CoNLL-2012 Shared Task closed track, achieving an official score of 58.69, the best among the competitors. The unique enhancement added to the current system version is the inclusion of candidate arcs linking nested mentions for the Chinese language. By including such arcs, the score increases by almost 4.5 points for that language. The current system shows a score of 60.15, which corresponds to a 3.5% error reduction, and is the best performing system for each of the three languages.

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84918531827&partnerID=8YFLogxK

U2 - 10.1162/COLI_a_00200

DO - 10.1162/COLI_a_00200

M3 - Journal articles

AN - SCOPUS:84918531827

VL - 40

SP - 801

EP - 835

JO - Computational Linguistics

JF - Computational Linguistics

SN - 0891-2017

IS - 4

ER -
