Learning from partially annotated sequences

Research output: Contributions to collected editions/works; Article in conference proceedings; Research; peer-review

Standard

Learning from partially annotated sequences. / Fernandes, Eraldo R.; Brefeld, Ulf.
Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings. ed. / Dimitrios Gunopulos; Thomas Hofmann; Donato Malerba; Michalis Vazirgiannis. PART 1. ed. Heidelberg, Berlin: Springer Verlag, 2011. p. 407-422 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6911 LNAI, No. PART 1).

Harvard

Fernandes, ER & Brefeld, U 2011, Learning from partially annotated sequences. in D Gunopulos, T Hofmann, D Malerba & M Vazirgiannis (eds), Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings. PART 1 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 6911 LNAI, Springer Verlag, Heidelberg, Berlin, pp. 407-422, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011, Athens, Greece, 05.09.11. https://doi.org/10.1007/978-3-642-23780-5_36

APA

Fernandes, E. R., & Brefeld, U. (2011). Learning from partially annotated sequences. In D. Gunopulos, T. Hofmann, D. Malerba, & M. Vazirgiannis (Eds.), Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings (PART 1 ed., pp. 407-422). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6911 LNAI, No. PART 1). Springer Verlag. https://doi.org/10.1007/978-3-642-23780-5_36

Vancouver

Fernandes ER, Brefeld U. Learning from partially annotated sequences. In Gunopulos D, Hofmann T, Malerba D, Vazirgiannis M, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings. PART 1 ed. Heidelberg, Berlin: Springer Verlag. 2011. p. 407-422. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1). doi: 10.1007/978-3-642-23780-5_36

Bibtex

@inbook{0b7856608ef7418f8057a3ab0347cc36,
title = "Learning from partially annotated sequences",
abstract = "We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.",
keywords = "Informatics, Automatically generated, Cross-lingual, Labeled data, Named entity recognition, Natural language processing, Perceptron, Semi-supervised, Sequential prediction, Hidden Markov Model, Unlabeled Data, Neural Information Processing System, Entity Recognition, Annotate Sequence, Business informatics",
author = "Fernandes, {Eraldo R.} and Ulf Brefeld",
year = "2011",
doi = "10.1007/978-3-642-23780-5_36",
language = "English",
isbn = "978-3-642-23779-9",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
number = "PART 1",
pages = "407--422",
editor = "Dimitrios Gunopulos and Thomas Hofmann and Donato Malerba and Michalis Vazirgiannis",
booktitle = "Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings",
address = "Germany",
edition = "PART 1",
note = "European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011, ECML PKDD 2011 ; Conference date: 05-09-2011 Through 09-09-2011",
url = "http://www.ecmlpkdd2011.org/",
}

RIS

TY - CHAP

T1 - Learning from partially annotated sequences

AU - Fernandes, Eraldo R.

AU - Brefeld, Ulf

PY - 2011

Y1 - 2011

N2 - We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

AB - We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground-truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could for instance be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and cross-lingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.

KW - Informatics

KW - Automatically generated

KW - Cross-lingual

KW - Labeled data

KW - Named entity recognition

KW - Natural language processing

KW - Perceptron

KW - Semi-supervised

KW - Sequential prediction

KW - Hidden Markov Model

KW - Unlabeled Data

KW - Neural Information Processing System

KW - Entity Recognition

KW - Annotate Sequence

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=80052421057&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/047857db-fd1a-3b48-8b27-a1acc478a333/

U2 - 10.1007/978-3-642-23780-5_36

DO - 10.1007/978-3-642-23780-5_36

M3 - Article in conference proceedings

AN - SCOPUS:80052421057

SN - 978-3-642-23779-9

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 407

EP - 422

BT - Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings

A2 - Gunopulos, Dimitrios

A2 - Hofmann, Thomas

A2 - Malerba, Donato

A2 - Vazirgiannis, Michalis

PB - Springer Verlag

CY - Heidelberg, Berlin

T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - ECML PKDD 2011

Y2 - 5 September 2011 through 9 September 2011

ER -
