How difficult is the adaptation of POS taggers?

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

How difficult is the adaptation of POS taggers? / Rodrigues, Irving Muller; Fernandes, Eraldo Rezende.

BRACIS 2017: 2017 Brazilian Conference on Intelligent Systems : Uberlândia, MG, Brazil, 2-5 October 2017 : proceedings. Piscataway : Institute of Electrical and Electronics Engineers Inc., 2017. p. 360-365.

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Rodrigues, IM & Fernandes, ER 2017, How difficult is the adaptation of POS taggers? in BRACIS 2017: 2017 Brazilian Conference on Intelligent Systems : Uberlândia, MG, Brazil, 2-5 October 2017 : proceedings. Institute of Electrical and Electronics Engineers Inc., Piscataway, pp. 360-365, Brazilian Conference on Intelligent Systems - BRACIS 2017, Uberlandia, Brazil, 02.10.17. https://doi.org/10.1109/BRACIS.2017.77

APA

Rodrigues, I. M., & Fernandes, E. R. (2017). How difficult is the adaptation of POS taggers? In BRACIS 2017: 2017 Brazilian Conference on Intelligent Systems : Uberlândia, MG, Brazil, 2-5 October 2017 : proceedings (pp. 360-365). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BRACIS.2017.77

Vancouver

Rodrigues IM, Fernandes ER. How difficult is the adaptation of POS taggers? In BRACIS 2017: 2017 Brazilian Conference on Intelligent Systems : Uberlândia, MG, Brazil, 2-5 October 2017 : proceedings. Piscataway: Institute of Electrical and Electronics Engineers Inc. 2017. p. 360-365 doi: 10.1109/BRACIS.2017.77

Bibtex

@inbook{153fbb723e3f41eeb19d2fdbfbd2bd65,
title = "How difficult is the adaptation of POS taggers?",
abstract = "Domain adaptation is a difficult problem, but also a relevant one. Unsupervised domain adaptation focuses on adapting a model from a source domain, which includes plenty of labeled data, to a target domain that provides no labeled data. This is the most compelling setting of domain adaptation, but it is also the most difficult one. We perform an experimental analysis to highlight how difficult this problem is. We show that the best available unsupervised domain adaptation system for POS tagging can be outperformed by a simple POS tagger that has access to only 250 labeled sentences from the target domain. This is not a fair comparison between these two systems, of course; but it highlights that unsupervised domain adaptation is not well solved yet. Moreover, the best available systems are not yet practical, since they are complex, difficult to implement, and do not achieve significant improvements.",
keywords = "Informatics, tagging, training, Feature extraction, adaptation models, natural language processing, syntactics, Business informatics",
author = "Rodrigues, {Irving Muller} and Fernandes, {Eraldo Rezende}",
year = "2017",
month = jun,
day = "28",
doi = "10.1109/BRACIS.2017.77",
language = "English",
isbn = "978-1-5386-2408-1",
pages = "360--365",
booktitle = "BRACIS 2017",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",
note = "Brazilian Conference on Intelligent Systems - BRACIS 2017 ; Conference date: 02-10-2017 Through 05-10-2017",

}

RIS

TY - CHAP

T1 - How difficult is the adaptation of POS taggers?

AU - Rodrigues, Irving Muller

AU - Fernandes, Eraldo Rezende

N1 - Conference code: 6

PY - 2017/6/28

Y1 - 2017/6/28

N2 - Domain adaptation is a difficult problem, but also a relevant one. Unsupervised domain adaptation focuses on adapting a model from a source domain, which includes plenty of labeled data, to a target domain that provides no labeled data. This is the most compelling setting of domain adaptation, but it is also the most difficult one. We perform an experimental analysis to highlight how difficult this problem is. We show that the best available unsupervised domain adaptation system for POS tagging can be outperformed by a simple POS tagger that has access to only 250 labeled sentences from the target domain. This is not a fair comparison between these two systems, of course; but it highlights that unsupervised domain adaptation is not well solved yet. Moreover, the best available systems are not yet practical, since they are complex, difficult to implement, and do not achieve significant improvements.

AB - Domain adaptation is a difficult problem, but also a relevant one. Unsupervised domain adaptation focuses on adapting a model from a source domain, which includes plenty of labeled data, to a target domain that provides no labeled data. This is the most compelling setting of domain adaptation, but it is also the most difficult one. We perform an experimental analysis to highlight how difficult this problem is. We show that the best available unsupervised domain adaptation system for POS tagging can be outperformed by a simple POS tagger that has access to only 250 labeled sentences from the target domain. This is not a fair comparison between these two systems, of course; but it highlights that unsupervised domain adaptation is not well solved yet. Moreover, the best available systems are not yet practical, since they are complex, difficult to implement, and do not achieve significant improvements.

KW - Informatics

KW - tagging

KW - training

KW - Feature extraction

KW - adaptation models

KW - natural language processing

KW - syntactics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=85049536443&partnerID=8YFLogxK

U2 - 10.1109/BRACIS.2017.77

DO - 10.1109/BRACIS.2017.77

M3 - Article in conference proceedings

AN - SCOPUS:85049536443

SN - 978-1-5386-2408-1

SP - 360

EP - 365

BT - BRACIS 2017

PB - Institute of Electrical and Electronics Engineers Inc.

CY - Piscataway

T2 - Brazilian Conference on Intelligent Systems - BRACIS 2017

Y2 - 2 October 2017 through 5 October 2017

ER -

DOI