Domain adaptation of POS taggers without handcrafted features

Irving M. Rodrigues; Eraldo R. Fernandes; Cicero N. dos Santos

doi:10.1109/IJCNN.2017.7966274

Domain adaptation of POS taggers without handcrafted features

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Standard

Domain adaptation of POS taggers without handcrafted features. / Rodrigues, Irving M.; Fernandes, Eraldo R.; dos Santos, Cicero N.
IJCNN 2017: the International Joint Conference on Neural Networks. Piscataway: Institute of Electrical and Electronics Engineers Inc., 2017. p. 3331-3338 7966274 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2017).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Harvard

Rodrigues, IM, Fernandes, ER & dos Santos, CN 2017, Domain adaptation of POS taggers without handcrafted features. in IJCNN 2017: the International Joint Conference on Neural Networks., 7966274, Proceedings of the International Joint Conference on Neural Networks, vol. 2017, Institute of Electrical and Electronics Engineers Inc., Piscataway, pp. 3331-3338, International Joint Conference on Neural Networks, Anchorage, United States, 14.05.17. https://doi.org/10.1109/IJCNN.2017.7966274

APA

Rodrigues, I. M., Fernandes, E. R., & dos Santos, C. N. (2017). Domain adaptation of POS taggers without handcrafted features. In IJCNN 2017: the International Joint Conference on Neural Networks (pp. 3331-3338). Article 7966274 (Proceedings of the International Joint Conference on Neural Networks; Vol. 2017). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IJCNN.2017.7966274

Vancouver

Rodrigues IM, Fernandes ER, dos Santos CN. Domain adaptation of POS taggers without handcrafted features. In IJCNN 2017: the International Joint Conference on Neural Networks. Piscataway: Institute of Electrical and Electronics Engineers Inc. 2017. p. 3331-3338. 7966274. (Proceedings of the International Joint Conference on Neural Networks). doi: 10.1109/IJCNN.2017.7966274

Bibtex

@inbook{00f48f8535564d51896ec4b3ba5d0cd0,

title = "Domain adaptation of POS taggers without handcrafted features",

abstract = "Unsupervised domain adaptation is an attractive option when labeled data is lacking for some domain of interest but is available for other domain. Part-of-speech (POS) tagging is often considered a solved task when enough labeled data is available in the domain of interest. However, when considering a domain adaptation scenario, this is far from true. Several approaches have been proposed for domain adaptation of POS taggers, however as far as we know, all of them are based on handcrafted features. In this work, we employ a machine learning method whose input is exclusively composed of the raw text. This method learns word- and character-level representations (embeddings), and has been successfully applied to intra-domain tasks. We show that this method achieves strong performances on the domain adaptation of English and Portuguese POS taggers.",

keywords = "Informatics, tagging, syntactics, training, Vocabulary, training data, Feature extraction, Business informatics",

author = "Rodrigues, {Irving M.} and Fernandes, {Eraldo R.} and {dos Santos}, {Cicero N.}",

year = "2017",

month = jun,

day = "30",

doi = "10.1109/IJCNN.2017.7966274",

language = "English",

isbn = "978-1-5090-6183-9",

series = "Proceedings of the International Joint Conference on Neural Networks",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "3331--3338",

booktitle = "IJCNN 2017",

address = "United States",

note = "International Joint Conference on Neural Networks, IJCNN 2017 ; Conference date: 14-05-2017 Through 19-05-2017",

url = "https://ieeexplore.ieee.org/xpl/conhome/7958416/proceeding",

}

RIS

TY - CHAP

T1 - Domain adaptation of POS taggers without handcrafted features

AU - Rodrigues, Irving M.

AU - Fernandes, Eraldo R.

AU - dos Santos, Cicero N.

PY - 2017/6/30

Y1 - 2017/6/30

N2 - Unsupervised domain adaptation is an attractive option when labeled data is lacking for some domain of interest but is available for other domain. Part-of-speech (POS) tagging is often considered a solved task when enough labeled data is available in the domain of interest. However, when considering a domain adaptation scenario, this is far from true. Several approaches have been proposed for domain adaptation of POS taggers, however as far as we know, all of them are based on handcrafted features. In this work, we employ a machine learning method whose input is exclusively composed of the raw text. This method learns word- and character-level representations (embeddings), and has been successfully applied to intra-domain tasks. We show that this method achieves strong performances on the domain adaptation of English and Portuguese POS taggers.

AB - Unsupervised domain adaptation is an attractive option when labeled data is lacking for some domain of interest but is available for other domain. Part-of-speech (POS) tagging is often considered a solved task when enough labeled data is available in the domain of interest. However, when considering a domain adaptation scenario, this is far from true. Several approaches have been proposed for domain adaptation of POS taggers, however as far as we know, all of them are based on handcrafted features. In this work, we employ a machine learning method whose input is exclusively composed of the raw text. This method learns word- and character-level representations (embeddings), and has been successfully applied to intra-domain tasks. We show that this method achieves strong performances on the domain adaptation of English and Portuguese POS taggers.

KW - Informatics

KW - tagging

KW - syntactics

KW - training

KW - Vocabulary

KW - training data

KW - Feature extraction

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=85030974151&partnerID=8YFLogxK

U2 - 10.1109/IJCNN.2017.7966274

DO - 10.1109/IJCNN.2017.7966274

M3 - Article in conference proceedings

AN - SCOPUS:85030974151

SN - 978-1-5090-6183-9

T3 - Proceedings of the International Joint Conference on Neural Networks

SP - 3331

EP - 3338

BT - IJCNN 2017

PB - Institute of Electrical and Electronics Engineers Inc.

CY - Piscataway

T2 - International Joint Conference on Neural Networks

Y2 - 14 May 2017 through 19 May 2017

ER -

Other publications by the same author(s)

Data practices in apps from Brazil: What do privacy policies inform us about?

Quadros dos Reis, V., Rabello, M. E. R., Lima, A. C., Jardim, G. P. S., Fernandes, E. R. & Brefeld, U., 10.02.2023, In: Journal on Interactive Systems. 14, 1, p. 1-8 8 p.

Research output: Journal contributions › Journal articles › Research › peer-review

Entity Extraction from Portuguese Legal Documents Using Distant Supervision

Navarezi, L. M., Sakiyama, K., Rodrigues, L. S., Robaldo, C. M. O., Lobato, G. R., Vilela, P. A., Matsubara, E. T. & Fernandes, E. R., 2022, Computational Processing of the Portuguese Language : 15th International Conference, PROPOR 2022, Fortaleza, Brazil, March 21-23, 2022, Proceedings. Pinheiro, V., Gamallo, P., Amaro, R., Scarton, C., Batista, F., Silva, D., Magro, C. & Pinto, H. (eds.). Cham: Springer Nature Switzerland AG, p. 166-176 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 13208 LNAI).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

FaST: A linear time stack trace alignment heuristic for crash report deduplication

Rodrigues, I. M., Aloise, D. & Fernandes, E. R., 17.10.2022, The 2022 Mining Software Repositories Conference: MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania. New York: Institute of Electrical and Electronics Engineers Inc., p. 549-560 12 p. (Proceedings - IEEE/ACM International Conference on Mining Software Repositories ).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Performance predictors for graphics processing units applied to dark-silicon-aware design space exploration

Sonohata, R., Arigoni, D. C. A., Fernandes, E. R., Ribeiro dos Santos, R. & Dessandre Duenha, L., 01.08.2023, In: Concurrency and Computation: Practice and Experience. 35, 17, 16 p., e6877.

Research output: Journal contributions › Journal articles › Research › peer-review

TraceSim: An Alignment Method for Computing Stack Trace Similarity

Rodrigues, I. M., Khvorov, A., Aloise, D., Vasiliev, R., Koznov, D., Fernandes, E. R., Chernishev, G., Luciv, D. & Povarov, N., 01.03.2022, In: Empirical Software Engineering. 27, 2, 41 p., 53.

Research output: Journal contributions › Journal articles › Research › peer-review

DOI

https://doi.org/10.1109/IJCNN.2017.7966274
Final published version

Domain adaptation of POS taggers without handcrafted features

Standard

Harvard

APA

Vancouver

Bibtex

RIS

Other publications by the same author(s)

Data practices in apps from Brazil: What do privacy policies inform us about?

Entity Extraction from Portuguese Legal Documents Using Distant Supervision

FaST: A linear time stack trace alignment heuristic for crash report deduplication

Performance predictors for graphics processing units applied to dark-silicon-aware design space exploration

TraceSim: An Alignment Method for Computing Stack Trace Similarity

DOI

Recently viewed

Researchers

Projects

Activities

Publications

Press / Media