Learning from partially annotated sequences

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

OriginalspracheEnglisch
TitelMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings
HerausgeberDimitrios Gunopulos, Thomas Hofmann, Donato Malerba, Michalis Vazirgiannis
Anzahl der Seiten16
ErscheinungsortHeidelberg, Berlin
VerlagSpringer Verlag
Erscheinungsdatum2011
AuflagePART 1
Seiten407-422
ISBN (Print)978-3-642-23779-9
ISBN (elektronisch)978-3-642-23780-5
DOIs
PublikationsstatusErschienen - 2011
Extern publiziertJa
VeranstaltungEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011 - Athen, Griechenland
Dauer: 05.09.201109.09.2011
http://www.ecmlpkdd2011.org/
https://www.ecmlpkdd2011.org/

DOI

Zuletzt angesehen

Aktivitäten

  1. MIZ allgemein (Organisation)
  2. Dislimitation of Urban Tourism
  3. Implicit Stereotypes versus Explicit Notions – A Young Generation’s Ambiguity towards the Image of Entrepreneurs
  4. Transdisciplinary Evaluation of Different Coastal Adaptation Strategies: Integrating Regional Perceptions of Scientists, Practitioners and the Public
  5. BSc-Thesis: Thermal and temporal niches of dung beetles
  6. A Material Flow Cost Accounting Approach to Improvement Assessment in LCA
  7. The Social Organization of Arts – A Theoretical Compendium (Introduction)
  8. Scene & DIY vs. current social developments: updating concepts for future research?
  9. Weaving Fabrics
  10. Gutachtertätigkeit für European Association for Research in Learning and Instruction
  11. 29th International Workshop on Computational Mechanics of Materials - IWCMM29
  12. Environmental fate of S-metolachlor in its pure form and as a part of commercial product - Mercantor Gold®: biodegradation and sorption onto sediment
  13. Confident for the next job interview: virtual reality for effective training
  14. HyperKult 13
  15. Flexibility of Industrial Material Flow Networks
  16. Fakultät W allgemein (Organisation)
  17. Dissertation "Participation and Democracy: Dynamics, Causes and Consequences of Elite-Challenging Activities"
  18. J. Pedro Lorente
  19. Classification of the Rock Samples of the Apollo 14 Landing Site According to the Recommended Nomenclature of Lunar Highland Rocks
  20. Privacy by Design. Überwachung, Selbst, Kontrolle
  21. transcript Verlag (Herausgeber (Verlag))
  22. Transport policy change - An actor-centered analysis of Berlin’s Verkehrswende project, speed talk session
  23. EVA 2004