Learning from partially annotated sequences

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

OriginalspracheEnglisch
TitelMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings
HerausgeberDimitrios Gunopulos, Thomas Hofmann, Donato Malerba, Michalis Vazirgiannis
Anzahl der Seiten16
ErscheinungsortHeidelberg, Berlin
VerlagSpringer Verlag
Erscheinungsdatum2011
AuflagePART 1
Seiten407-422
ISBN (Print)978-3-642-23779-9
ISBN (elektronisch)978-3-642-23780-5
DOIs
PublikationsstatusErschienen - 2011
Extern publiziertJa
VeranstaltungEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011 - Athen, Griechenland
Dauer: 05.09.201109.09.2011
http://www.ecmlpkdd2011.org/
https://www.ecmlpkdd2011.org/

DOI

Zuletzt angesehen

Publikationen

  1. Are all errors created equal?
  2. Understanding and managing post-acquisition integration as change process
  3. Introduction
  4. Efficient co-regularised least squares regression
  5. Exchanging Knowledge and Good Practices of Education for Sustainable Development within a Global Student Organization (oikos)
  6. Finite element modeling of laser beam welding for residual stress calculation
  7. Lecture2Go
  8. On the Direct Kinematics Problem of Parallel Mechanisms
  9. Learning in Real-World Laboratories: A Systematic Impulse for Discussion
  10. RAWSim-O: A Simulation Framework for Robotic Mobile Fulfillment Systems
  11. Bifactor Models for Predicting Criteria by General and Specific Factors
  12. Optimal control strategies for PMSM with a decoupling super twisting SMC and inductance estimation in the presence of saturation
  13. Making transparency transparent
  14. Ablation Study of a Multimodal Gat Network on Perfect Synthetic and Real-world Data to Investigate the Influence of Language Models in Invoice Recognition
  15. Non-technical success factors for bioenergy projects-Learning from a multiple case study in Japan
  16. Examining how AI capabilities can foster organizational performance in public organizations
  17. A Two-Stage Sliding-Mode High-Gain Observer to Reduce Uncertainties and Disturbances Effects for Sensorless Control in Automotive Applications
  18. "If you like something, you want it to develop."
  19. Discourse, practice, policy and organizing
  20. FaQuAD
  21. Encoding the law of State responsibility with courage and resolve
  22. Complex predicates in German resultative constructions
  23. Species loss due to nutrient addition increases with spatial scale in global grasslands
  24. Optimising patterns of life conduct
  25. Eroding Patriarchy
  26. The Effects of Nonindependent Rater Sets in Multilevel–Multitrait–Multimethod Models