Latent trees for coreference resolution

Eraldo Rezende Fernandes; Cícero Nogueira dos Santos; Ruy Luiz Milidiú

doi:10.1162/COLI_a_00200

Latent trees for coreference resolution

Research output: Journal contributions › Journal articles › Research › peer-review

Standard

Latent trees for coreference resolution. / Fernandes, Eraldo Rezende; dos Santos, Cícero Nogueira; Milidiú, Ruy Luiz.
In: Computational Linguistics, Vol. 40, No. 4, 19.12.2014, p. 801-835.

Research output: Journal contributions › Journal articles › Research › peer-review

Harvard

Fernandes, ER, dos Santos, CN & Milidiú, RL 2014, 'Latent trees for coreference resolution', Computational Linguistics, vol. 40, no. 4, pp. 801-835. https://doi.org/10.1162/COLI_a_00200

APA

Fernandes, E. R., dos Santos, C. N., & Milidiú, R. L. (2014). Latent trees for coreference resolution. Computational Linguistics, 40(4), 801-835. https://doi.org/10.1162/COLI_a_00200

Vancouver

Fernandes ER, dos Santos CN, Milidiú RL. Latent trees for coreference resolution. Computational Linguistics. 2014 Dec 19;40(4):801-835. doi: 10.1162/COLI_a_00200

Bibtex

@article{77fc8ee145124047b4766a9cd67a110b,

title = "Latent trees for coreference resolution",

abstract = "We describe a structure learning system for unrestricted coreference resolution that explores two key modeling techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible because it incorporates a meaningful hidden structure. Additionally, using an automatic feature induction method, we can efficiently build enhanced nonlinear models using linear model learning algorithms. We present empirical results that highlight the contribution of each modeling technique used in the proposed system. Empirical evaluation is performed on the multilingual unrestricted coreference CoNLL-2012 Shared Task data sets, which comprise three languages: Arabic, Chinese, and English. We apply the same system to all languages, except for minor adaptations to some language-dependent features such as nested mentions and specific static pronoun lists. A previous version of this system was submitted to the CoNLL-2012 Shared Task closed track, achieving an official score of 58:69, the best among the competitors. The unique enhancement added to the current system version is the inclusion of candidate arcs linking nested mentions for the Chinese language. By including such arcs, the score increases by almost 4.5 points for that language. The current system shows a score of 60:15, which corresponds to a 3:5% error reduction, and is the best performing system for each of the three languages.",

keywords = "Informatics, Business informatics",

author = "Fernandes, {Eraldo Rezende} and {dos Santos}, {C{\'i}cero Nogueira} and Milidi{\'u}, {Ruy Luiz}",

year = "2014",

month = dec,

day = "19",

doi = "10.1162/COLI_a_00200",

language = "English",

volume = "40",

pages = "801--835",

journal = "Computational Linguistics",

issn = "0891-2017",

publisher = "MIT Press",

number = "4",

}

RIS

TY - JOUR

T1 - Latent trees for coreference resolution

AU - Fernandes, Eraldo Rezende

AU - dos Santos, Cícero Nogueira

AU - Milidiú, Ruy Luiz

PY - 2014/12/19

Y1 - 2014/12/19

N2 - We describe a structure learning system for unrestricted coreference resolution that explores two key modeling techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible because it incorporates a meaningful hidden structure. Additionally, using an automatic feature induction method, we can efficiently build enhanced nonlinear models using linear model learning algorithms. We present empirical results that highlight the contribution of each modeling technique used in the proposed system. Empirical evaluation is performed on the multilingual unrestricted coreference CoNLL-2012 Shared Task data sets, which comprise three languages: Arabic, Chinese, and English. We apply the same system to all languages, except for minor adaptations to some language-dependent features such as nested mentions and specific static pronoun lists. A previous version of this system was submitted to the CoNLL-2012 Shared Task closed track, achieving an official score of 58:69, the best among the competitors. The unique enhancement added to the current system version is the inclusion of candidate arcs linking nested mentions for the Chinese language. By including such arcs, the score increases by almost 4.5 points for that language. The current system shows a score of 60:15, which corresponds to a 3:5% error reduction, and is the best performing system for each of the three languages.

AB - We describe a structure learning system for unrestricted coreference resolution that explores two key modeling techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible because it incorporates a meaningful hidden structure. Additionally, using an automatic feature induction method, we can efficiently build enhanced nonlinear models using linear model learning algorithms. We present empirical results that highlight the contribution of each modeling technique used in the proposed system. Empirical evaluation is performed on the multilingual unrestricted coreference CoNLL-2012 Shared Task data sets, which comprise three languages: Arabic, Chinese, and English. We apply the same system to all languages, except for minor adaptations to some language-dependent features such as nested mentions and specific static pronoun lists. A previous version of this system was submitted to the CoNLL-2012 Shared Task closed track, achieving an official score of 58:69, the best among the competitors. The unique enhancement added to the current system version is the inclusion of candidate arcs linking nested mentions for the Chinese language. By including such arcs, the score increases by almost 4.5 points for that language. The current system shows a score of 60:15, which corresponds to a 3:5% error reduction, and is the best performing system for each of the three languages.

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84918531827&partnerID=8YFLogxK

U2 - 10.1162/COLI_a_00200

DO - 10.1162/COLI_a_00200

M3 - Journal articles

AN - SCOPUS:84918531827

VL - 40

SP - 801

EP - 835

JO - Computational Linguistics

JF - Computational Linguistics

SN - 0891-2017

IS - 4

ER -

Other publications by the same author(s)

Data practices in apps from Brazil: What do privacy policies inform us about?

Quadros dos Reis, V., Rabello, M. E. R., Lima, A. C., Jardim, G. P. S., Fernandes, E. R. & Brefeld, U., 10.02.2023, In: Journal on Interactive Systems. 14, 1, p. 1-8 8 p.

Research output: Journal contributions › Journal articles › Research › peer-review

Entity Extraction from Portuguese Legal Documents Using Distant Supervision

Navarezi, L. M., Sakiyama, K., Rodrigues, L. S., Robaldo, C. M. O., Lobato, G. R., Vilela, P. A., Matsubara, E. T. & Fernandes, E. R., 2022, Computational Processing of the Portuguese Language : 15th International Conference, PROPOR 2022, Fortaleza, Brazil, March 21-23, 2022, Proceedings. Pinheiro, V., Gamallo, P., Amaro, R., Scarton, C., Batista, F., Silva, D., Magro, C. & Pinto, H. (eds.). Cham: Springer Nature Switzerland AG, p. 166-176 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 13208 LNAI).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

FaST: A linear time stack trace alignment heuristic for crash report deduplication

Rodrigues, I. M., Aloise, D. & Fernandes, E. R., 17.10.2022, The 2022 Mining Software Repositories Conference: MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania. New York: Institute of Electrical and Electronics Engineers Inc., p. 549-560 12 p. (Proceedings - IEEE/ACM International Conference on Mining Software Repositories ).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Performance predictors for graphics processing units applied to dark-silicon-aware design space exploration

Sonohata, R., Arigoni, D. C. A., Fernandes, E. R., Ribeiro dos Santos, R. & Dessandre Duenha, L., 01.08.2023, In: Concurrency and Computation: Practice and Experience. 35, 17, 16 p., e6877.

Research output: Journal contributions › Journal articles › Research › peer-review

TraceSim: An Alignment Method for Computing Stack Trace Similarity

Rodrigues, I. M., Khvorov, A., Aloise, D., Vasiliev, R., Koznov, D., Fernandes, E. R., Chernishev, G., Luciv, D. & Povarov, N., 01.03.2022, In: Empirical Software Engineering. 27, 2, 41 p., 53.

Research output: Journal contributions › Journal articles › Research › peer-review

DOI

https://doi.org/10.1162/COLI_a_00200
Final published version

Latent trees for coreference resolution

Standard

Harvard

APA

Vancouver

Bibtex

RIS

Other publications by the same author(s)

Data practices in apps from Brazil: What do privacy policies inform us about?

Entity Extraction from Portuguese Legal Documents Using Distant Supervision

FaST: A linear time stack trace alignment heuristic for crash report deduplication

Performance predictors for graphics processing units applied to dark-silicon-aware design space exploration

TraceSim: An Alignment Method for Computing Stack Trace Similarity

DOI

Recently viewed

Activities

Prizes

Publications