Learning from partially annotated sequences

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

OriginalspracheEnglisch
TitelMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings
HerausgeberDimitrios Gunopulos, Thomas Hofmann, Donato Malerba, Michalis Vazirgiannis
Anzahl der Seiten16
ErscheinungsortHeidelberg, Berlin
VerlagSpringer Verlag
Erscheinungsdatum2011
AuflagePART 1
Seiten407-422
ISBN (Print)978-3-642-23779-9
ISBN (elektronisch)978-3-642-23780-5
DOIs
PublikationsstatusErschienen - 2011
Extern publiziertJa
VeranstaltungEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011 - Athen, Griechenland
Dauer: 05.09.201109.09.2011
http://www.ecmlpkdd2011.org/
https://www.ecmlpkdd2011.org/

DOI

Zuletzt angesehen

Publikationen

  1. Modelling ammonia emissions after field application of biogas slurries
  2. Efficient co-regularised least squares regression
  3. Using measures of reading time regularity (RTR) to quantify eye movement dynamics, and how they are shaped by linguistic information
  4. Optimal control strategies for PMSM with a decoupling super twisting SMC and inductance estimation in the presence of saturation
  5. Key Element No. 2: Applying Diagnostic Forms of Assessment
  6. Experimentally validated multi-step simulation strategy to predict the fatigue crack propagation rate in residual stress fields after laser shock peening
  7. Perfectly nested or significantly nested - an important difference for conservation management
  8. Deciding between the Covariance Analytical Approach and the Change-Score Approach in Two Wave Panel Data
  9. Facing Up to Third Party Liability for Space Activities
  10. Playing in the Spaces: Anarchism in the Classroom
  11. Solution for the direct kinematics problem of the general stewart-gough platform by using only linear actuators’ orientations
  12. Sustainable Consumption - Mapping the Terrain
  13. Phase Shift APOD and POD Control Technique in Multi-Level Inverters to Mitigate Total Harmonic Distortion
  14. Integrating inductive and deductive analysis to identify and characterize archetypical social-ecological systems and their changes
  15. Framework for empirical research on science teaching and learning
  16. Supporting non-hierarchical supply chain networks in the electronics industry
  17. Towards a Concept for Integrating IT Innovation Management into Business IT Management
  18. Recurrence-based diagnostics of rotary systems
  19. „More than a game“
  20. Collaborative business in supply chains - a system dynamics approach
  21. From Enterprise Architecture to Business Ecosystem Architecture
  22. Why Notational Iconicity is a Form of Operational Iconicity
  23. Pathways and mechanisms for catalyzing social impact through Orchestration: Insights from an open social innovation project
  24. From Claiming to Creating Value
  25. Friedenspraxis
  26. A scale-up procedure to dialkyl carbonates; evaluation of their properties, biodegradability, and toxicity