Learning from partially annotated sequences

Eraldo R. Fernandes; Ulf Brefeld

doi:10.1007/978-3-642-23780-5_36

Learning from partially annotated sequences

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Authors

We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

Originalsprache	Englisch
Titel	Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings
Herausgeber	Dimitrios Gunopulos, Thomas Hofmann, Donato Malerba, Michalis Vazirgiannis
Anzahl der Seiten	16
Erscheinungsort	Heidelberg, Berlin
Verlag	Springer Verlag
Erscheinungsdatum	2011
Auflage	PART 1
Seiten	407-422
ISBN (Print)	978-3-642-23779-9
ISBN (elektronisch)	978-3-642-23780-5
DOIs	https://doi.org/10.1007/978-3-642-23780-5_36
Publikationsstatus	Erschienen - 2011
Extern publiziert	Ja
Veranstaltung	European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011 - Athen, Griechenland Dauer: 05.09.2011 → 09.09.2011 http://www.ecmlpkdd2011.org/ https://www.ecmlpkdd2011.org/

Fachgebiete

Informatik
Wirtschaftsinformatik

Weitere Publikationen dieser Person(en)

Interactive sequential generative models for team sports

Fassmeyer, D., Cordes, M. & Brefeld, U., 02.2025, in: Machine Learning. 114, 2, 15 S., 38.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Joint Item Response Models for Manual and Automatic Scores on Open-Ended Test Items

Bengs, D., Brefeld, U., Kroehne, U. & Zehner, F., 01.09.2025, in: Psychometrika. 90, 4, S. 1346-1367 22 S.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Machine Learning and Data Mining for Sports Analytics: 11th International Workshop, MLSA 2024, Vilnius, Lithuania, September 9, 2024, Revised Selected Papers

Brefeld, U. (Herausgeber*in), Davis, J. (Herausgeber*in), Van Haaren, J. (Herausgeber*in) & Zimmermann, A. (Herausgeber*in), 2025, Cham: Springer Verlag. 119 S. (Communications in Computer and Information Science; Band 2460)

Publikation: Bücher und Anthologien › Konferenzbände und -dokumentationen › Forschung

Masked autoencoder for multiagent trajectories

Rudolph, Y. & Brefeld, U., 02.2025, in: Machine Learning. 114, 2, 18 S., 44.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Self-improvement for Computerized Adaptive Testing

Rudolph, Y., Neubauer, K. & Brefeld, U., 2026, Machine Learning and Knowledge Discovery in Databases - Research Track: European Conference, ECML PKDD 2025, Porto, Portugal, September 15–19, 2025, Proceedings. Ribeiro, R. P., Jorge, A. M., Soares, C., Gama, J., Pfahringer, B., Japkowicz, N., Larrañaga, P. & Abreu, P. H. (Hrsg.). Cham: Springer International Publishing, Band 2. S. 70-86 17 S. (Lecture Notes in Computer Science; Band 16014 LNCS).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

DOI

https://doi.org/10.1007/978-3-642-23780-5_36
Endgültige, publizierte Fassung

Learning from partially annotated sequences

Authors

Fachgebiete

Weitere Publikationen dieser Person(en)

Interactive sequential generative models for team sports

Joint Item Response Models for Manual and Automatic Scores on Open-Ended Test Items

Machine Learning and Data Mining for Sports Analytics: 11th International Workshop, MLSA 2024, Vilnius, Lithuania, September 9, 2024, Revised Selected Papers

Masked autoencoder for multiagent trajectories

Self-improvement for Computerized Adaptive Testing

DOI

Zuletzt angesehen

Publikationen