How difficult is the adaptation of POS taggers?

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Domain adaptation is a difficult problem, but also a relevant one. Unsupervised domain adaptation focuses on adapting a model from a source domain, which includes plenty of labeled data, to a target domain that provides no labeled data. This is the most compelling setting of domain adaptation, but it is also the most difficult one. We perform an experimental analysis to highlight how difficult this problem is. We show that the best available unsupervised domain adaptation system for POS tagging can be outperformed by a simple POS tagger that has access to only 250 labeled sentences from the target domain. This is not a fair comparison between these two systems, of course; but it highlights that unsupervised domain adaptation is not well solved yet. Moreover, the best available systems are not yet practical, since they are complex, difficult to implement, and do not achieve significant improvements.

OriginalspracheEnglisch
TitelBRACIS 2017 : 2017 Brazilian Conference on Intelligent Systems : Uberlândia, MG, Brazil, 2-5 October 2017 : proceedings
Anzahl der Seiten6
ErscheinungsortPiscataway
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum28.06.2017
Seiten360-365
ISBN (Print)978-1-5386-2408-1
ISBN (elektronisch)978-1-5386-2407-4
DOIs
PublikationsstatusErschienen - 28.06.2017
Extern publiziertJa
VeranstaltungBrazilian Conference on Intelligent Systems - BRACIS 2017 - Uberlandia, Brasilien
Dauer: 02.10.201705.10.2017
Konferenznummer: 6

DOI