Learning from partially annotated sequences

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Learning from partially annotated sequences. / Fernandes, Eraldo R.; Brefeld, Ulf.
Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings. ed. / Dimitrios Gunopulos; Thomas Hofmann; Donato Malerba; Michalis Vazirgiannis. PART 1. ed. Heidelberg, Berlin: Springer Verlag, 2011. p. 407-422 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6911 LNAI, No. PART 1).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Fernandes, ER & Brefeld, U 2011, Learning from partially annotated sequences. in D Gunopulos, T Hofmann, D Malerba & M Vazirgiannis (eds), Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings. PART 1 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 6911 LNAI, Springer Verlag, Heidelberg, Berlin, pp. 407-422, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011, Athen, Greece, 05.09.11. https://doi.org/10.1007/978-3-642-23780-5_36

APA

Fernandes, E. R., & Brefeld, U. (2011). Learning from partially annotated sequences. In D. Gunopulos, T. Hofmann, D. Malerba, & M. Vazirgiannis (Eds.), Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings (PART 1 ed., pp. 407-422). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6911 LNAI, No. PART 1). Springer Verlag. https://doi.org/10.1007/978-3-642-23780-5_36

Vancouver

Fernandes ER, Brefeld U. Learning from partially annotated sequences. In Gunopulos D, Hofmann T, Malerba D, Vazirgiannis M, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings. PART 1 ed. Heidelberg, Berlin: Springer Verlag. 2011. p. 407-422. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1). doi: 10.1007/978-3-642-23780-5_36

Bibtex

@inbook{0b7856608ef7418f8057a3ab0347cc36,
title = "Learning from partially annotated sequences",
abstract = "We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.",
keywords = "Informatics, Automatically generated, Cross-lingual, Labeled data, Named entity recognition, NAtural language processing, Perceptron, Semi-supervised, Sequential prediction, Hide Markov Model, Unlabeled Data, Neural Information Processing System, Entity Recognition, Annotate Sequence, Business informatics",
author = "Fernandes, {Eraldo R.} and Ulf Brefeld",
year = "2011",
doi = "10.1007/978-3-642-23780-5_36",
language = "English",
isbn = "978-3-642-23779-9",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
number = "PART 1",
pages = "407--422",
editor = "Dimitrios Gunopulos and Thomas Hofmann and Donato Malerba and Michalis Vazirgiannis",
booktitle = "Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings",
address = "Germany",
edition = "PART 1",
note = "European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011, ECML PKDD 2011 ; Conference date: 05-09-2011 Through 09-09-2011",
url = "http://www.ecmlpkdd2011.org/, https://www.ecmlpkdd2011.org/",

}

RIS

TY - CHAP

T1 - Learning from partially annotated sequences

AU - Fernandes, Eraldo R.

AU - Brefeld, Ulf

PY - 2011

Y1 - 2011

N2 - We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

AB - We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

KW - Informatics

KW - Automatically generated

KW - Cross-lingual

KW - Labeled data

KW - Named entity recognition

KW - NAtural language processing

KW - Perceptron

KW - Semi-supervised

KW - Sequential prediction

KW - Hide Markov Model

KW - Unlabeled Data

KW - Neural Information Processing System

KW - Entity Recognition

KW - Annotate Sequence

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=80052421057&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/047857db-fd1a-3b48-8b27-a1acc478a333/

U2 - 10.1007/978-3-642-23780-5_36

DO - 10.1007/978-3-642-23780-5_36

M3 - Article in conference proceedings

AN - SCOPUS:80052421057

SN - 978-3-642-23779-9

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 407

EP - 422

BT - Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings

A2 - Gunopulos, Dimitrios

A2 - Hofmann, Thomas

A2 - Malerba, Donato

A2 - Vazirgiannis, Michalis

PB - Springer Verlag

CY - Heidelberg, Berlin

T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011

Y2 - 5 September 2011 through 9 September 2011

ER -

Recently viewed

Publications

  1. Exploring the processes of emergent leadership in a netball team
  2. Mining Implications From Data
  3. Agile Portfolio Management Patterns
  4. Efficacy of a Web-Based Intervention With Mobile Phone Support in Treating Depressive Symptoms in Adults With Type 1 and Type 2 Diabetes
  5. Visual Frames – Framing Visuals
  6. Neural Networks for Energy Optimization of Production Processes in Small and Medium Sized Enterprises
  7. Educational reconstruction as model for the theory-based design of student-centered learning environments in electrical engineering courses
  8. Biodegradability and genotoxicity of surface functionalized colloidal silica (SiO2) particles in the aquatic environment
  9. Self-perceived quality of life predicts mortality risk better than a multi-biomarker panel, but the combination of both does best
  10. Clustering design science research based on the nature of the designed artifact
  11. Reconciling conservation and development in protected areas of the Global South
  12. Contextualizing the relationship between self-commitment and performance
  13. Polynomial Augmented Extended Kalman Filter to Estimate the State of Charge of Lithium-Ion Batteries
  14. What´s in a net? or: The end of the average
  15. Measurement in Machine Vision Editorial Paper
  16. Statistical precipitation bias correction of gridded model data using point measurements
  17. Value Structure and Dimensions
  18. Soft Skills for Hard Constraints
  19. Effect of gap distortion on the field splitting of collective modes in superfluid He3-B
  20. Orchestrating distributed data governance in open social innovation
  21. Non-acceptances in context
  22. Image, Process, Performance, Machine
  23. Effects of plyometric training on postural control in static and dynamic testing situations
  24. Self-perception of the internal audit function within the corporate governance system - Empirical evidence for the European Union
  25. Simulation-based Investigation of Energy Flexibility in the Optimization of Hinterland Drainage
  26. Design, Modeling and Control of an Over-actuated Hexacopter Tilt-Rotor