Domain adaptation of POS taggers without handcrafted features

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Domain adaptation of POS taggers without handcrafted features. / Rodrigues, Irving M.; Fernandes, Eraldo R.; dos Santos, Cicero N.
IJCNN 2017: the International Joint Conference on Neural Networks. Piscataway: Institute of Electrical and Electronics Engineers Inc., 2017. p. 3331-3338 7966274 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2017).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Rodrigues, IM, Fernandes, ER & dos Santos, CN 2017, Domain adaptation of POS taggers without handcrafted features. in IJCNN 2017: the International Joint Conference on Neural Networks., 7966274, Proceedings of the International Joint Conference on Neural Networks, vol. 2017, Institute of Electrical and Electronics Engineers Inc., Piscataway, pp. 3331-3338, International Joint Conference on Neural Networks, Anchorage, United States, 14.05.17. https://doi.org/10.1109/IJCNN.2017.7966274

APA

Rodrigues, I. M., Fernandes, E. R., & dos Santos, C. N. (2017). Domain adaptation of POS taggers without handcrafted features. In IJCNN 2017: the International Joint Conference on Neural Networks (pp. 3331-3338). Article 7966274 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2017). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IJCNN.2017.7966274

Vancouver

Rodrigues IM, Fernandes ER, dos Santos CN. Domain adaptation of POS taggers without handcrafted features. In IJCNN 2017: the International Joint Conference on Neural Networks. Piscataway: Institute of Electrical and Electronics Engineers Inc. 2017. p. 3331-3338. 7966274. (Proceedings of the International Joint Conference on Neural Networks). doi: 10.1109/IJCNN.2017.7966274

Bibtex

@inbook{00f48f8535564d51896ec4b3ba5d0cd0,
title = "Domain adaptation of POS taggers without handcrafted features",
abstract = "Unsupervised domain adaptation is an attractive option when labeled data is lacking for some domain of interest but is available for other domain. Part-of-speech (POS) tagging is often considered a solved task when enough labeled data is available in the domain of interest. However, when considering a domain adaptation scenario, this is far from true. Several approaches have been proposed for domain adaptation of POS taggers, however as far as we know, all of them are based on handcrafted features. In this work, we employ a machine learning method whose input is exclusively composed of the raw text. This method learns word- and character-level representations (embeddings), and has been successfully applied to intra-domain tasks. We show that this method achieves strong performances on the domain adaptation of English and Portuguese POS taggers.",
keywords = "Informatics, tagging, syntactics, training, Vocabulary, training data, Feature extraction, Business informatics",
author = "Rodrigues, {Irving M.} and Fernandes, {Eraldo R.} and {dos Santos}, {Cicero N.}",
year = "2017",
month = jun,
day = "30",
doi = "10.1109/IJCNN.2017.7966274",
language = "English",
isbn = "978-1-5090-6183-9",
series = "Proceedings of the International Joint Conference on Neural Networks",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "3331--3338",
booktitle = "IJCNN 2017",
address = "United States",
note = "International Joint Conference on Neural Networks, IJCNN 2017 ; Conference date: 14-05-2017 Through 19-05-2017",
url = "https://ieeexplore.ieee.org/xpl/conhome/7958416/proceeding",

}

RIS

TY - CHAP

T1 - Domain adaptation of POS taggers without handcrafted features

AU - Rodrigues, Irving M.

AU - Fernandes, Eraldo R.

AU - dos Santos, Cicero N.

PY - 2017/6/30

Y1 - 2017/6/30

N2 - Unsupervised domain adaptation is an attractive option when labeled data is lacking for some domain of interest but is available for other domain. Part-of-speech (POS) tagging is often considered a solved task when enough labeled data is available in the domain of interest. However, when considering a domain adaptation scenario, this is far from true. Several approaches have been proposed for domain adaptation of POS taggers, however as far as we know, all of them are based on handcrafted features. In this work, we employ a machine learning method whose input is exclusively composed of the raw text. This method learns word- and character-level representations (embeddings), and has been successfully applied to intra-domain tasks. We show that this method achieves strong performances on the domain adaptation of English and Portuguese POS taggers.

AB - Unsupervised domain adaptation is an attractive option when labeled data is lacking for some domain of interest but is available for other domain. Part-of-speech (POS) tagging is often considered a solved task when enough labeled data is available in the domain of interest. However, when considering a domain adaptation scenario, this is far from true. Several approaches have been proposed for domain adaptation of POS taggers, however as far as we know, all of them are based on handcrafted features. In this work, we employ a machine learning method whose input is exclusively composed of the raw text. This method learns word- and character-level representations (embeddings), and has been successfully applied to intra-domain tasks. We show that this method achieves strong performances on the domain adaptation of English and Portuguese POS taggers.

KW - Informatics

KW - tagging

KW - syntactics

KW - training

KW - Vocabulary

KW - training data

KW - Feature extraction

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=85030974151&partnerID=8YFLogxK

U2 - 10.1109/IJCNN.2017.7966274

DO - 10.1109/IJCNN.2017.7966274

M3 - Article in conference proceedings

AN - SCOPUS:85030974151

SN - 978-1-5090-6183-9

T3 - Proceedings of the International Joint Conference on Neural Networks

SP - 3331

EP - 3338

BT - IJCNN 2017

PB - Institute of Electrical and Electronics Engineers Inc.

CY - Piscataway

T2 - International Joint Conference on Neural Networks

Y2 - 14 May 2017 through 19 May 2017

ER -

Recently viewed

Publications

  1. Critical evaluation of commonly used methods to determine the concordance between sonography and magnetic resonance imaging: A comparative study
  2. Embedded Self-Managing Modes of Organizing
  3. Love in Paramyth
  4. Firm size and the use of export intermediaries.
  5. A sensitive microsystem as biosensor for cell growth monitoring and antibiotic testing
  6. Frank Fischer/Herbert Gottweis (Hg.) The Argumentative Turn Revisited.
  7. Schreiben Englisch
  8. Tourismuswissenschaft
  9. Groundwater abstraction for irrigation and its impacts on low flows in a watershed in Northwest Germany
  10. How can we bring together empiricists and modellers in functional biodiversity research?
  11. Mimikbasierte Emotionserfassung anhand von dynamischen Flächen und Streckenveränderung
  12. The new US horizontal merger guidelines
  13. The global perspective of education for sustainable development
  14. Mildes Luthertum?
  15. Translating European labor relations practices to the United States through global framework agreements?
  16. The impact of growth markets in the downstream sector - the parameters for connectivity and services: Beyond outer space law
  17. Autonomie der Migration
  18. Artist Placement Group
  19. Vergütung, variable
  20. When status differences are illegitimate, groups' needs diverge
  21. Mehrsprachigkeit in der Grundschule
  22. Is innovative firm behavior correlated with age and gender composition of the workforce ?
  23. Unterrichtsqualität an Hamburger Grundschulen
  24. Pharmaceuticals in the Environment
  25. Interferences and Events
  26. Anton Schnack: Werke in zwei Bänden
  27. "Die Lüneburger Heide goes digital"
  28. Synapses in the Network
  29. Vergleichende Regionalismusforschung
  30. Standardized Tests Fail to Assess the Effects of Antibiotics on Environmental Bacteria
  31. Productivity premia for many modes of internationalization.
  32. Resilience and coastal governance
  33. From Blue Collar to Open Commons Region
  34. Jean Piaget zur Einführung
  35. Vegetation der Erde
  36. Autonomie der Migration
  37. Political Careers in Multi-Level Systems
  38. University-Industry Collaboration to Stimulate Learning in the Context of Sustainability-Oriented Innovations

Press / Media

  1. Der bessere Urlaub.