ETL ensembles for chunking, NER and SRL
Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review
Standard
Computational Linguistics and Intelligent Text Processing: 11th International Conference, CICLing 2010, Iaşi, Romania, March 21-27, 2010. Proceedings. ed. / Alexander Gelbukh. Berlin: Springer, 2010. p. 100-112 (Lecture Notes in Computer Science; Vol. 6008).
RIS
TY - CHAP
T1 - ETL ensembles for chunking, NER and SRL
AU - dos Santos, Cícero N.
AU - Milidiú, Ruy L.
AU - Crestana, Carlos E.M.
AU - Fernandes, Eraldo R.
N1 - Conference code: 11
PY - 2010
Y1 - 2010
N2 - We present a new ensemble method that uses Entropy Guided Transformation Learning (ETL) as the base learner. The proposed approach, ETL Committee, combines the main ideas of Bagging and Random Subspaces. We also propose a strategy to include redundancy in transformation-based models. To evaluate the effectiveness of the ensemble method, we apply it to three Natural Language Processing tasks: Text Chunking, Named Entity Recognition and Semantic Role Labeling. Our experimental findings indicate that ETL Committee significantly outperforms single ETL models, achieving results competitive with the state of the art. Some positive characteristics of the proposed ensemble strategy are worth mentioning. First, it improves ETL effectiveness without any additional human effort. Second, it is particularly useful when dealing with very complex tasks that use large feature sets. And finally, the resulting training and classification processes are very easy to parallelize.
AB - We present a new ensemble method that uses Entropy Guided Transformation Learning (ETL) as the base learner. The proposed approach, ETL Committee, combines the main ideas of Bagging and Random Subspaces. We also propose a strategy to include redundancy in transformation-based models. To evaluate the effectiveness of the ensemble method, we apply it to three Natural Language Processing tasks: Text Chunking, Named Entity Recognition and Semantic Role Labeling. Our experimental findings indicate that ETL Committee significantly outperforms single ETL models, achieving results competitive with the state of the art. Some positive characteristics of the proposed ensemble strategy are worth mentioning. First, it improves ETL effectiveness without any additional human effort. Second, it is particularly useful when dealing with very complex tasks that use large feature sets. And finally, the resulting training and classification processes are very easy to parallelize.
KW - Ensemble methods
KW - Entropy guided transformation learning
KW - Named entity recognition
KW - Semantic role labeling
KW - Text chunking
KW - Informatics
KW - Business informatics
UR - http://www.scopus.com/inward/record.url?scp=78650569670&partnerID=8YFLogxK
UR - https://d-nb.info/1000252183
U2 - 10.1007/978-3-642-12116-6_9
DO - 10.1007/978-3-642-12116-6_9
M3 - Article in conference proceedings
AN - SCOPUS:78650569670
SN - 3-642-12115-2
SN - 978-3-642-12115-9
T3 - Lecture Notes in Computer Science
SP - 100
EP - 112
BT - Computational Linguistics and Intelligent Text Processing
A2 - Gelbukh, Alexander
PB - Springer
CY - Berlin
T2 - 11th International Conference on Computational Linguistics and Intelligent Text Processing - CICLing 2010
Y2 - 21 March 2010 through 27 March 2010
ER -
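
The abstract above describes the ETL Committee strategy only at a high level: bootstrap resampling (Bagging) combined with random feature subspaces, with the committee's outputs combined by voting. The sketch below illustrates that generic combination in Python. It is a minimal, hypothetical example: the MajorityTagger base learner, the CommitteeSketch class, and all parameter names are illustrative assumptions and do not reproduce the authors' ETL learner or their redundancy mechanism.

# Minimal sketch of the generic "Bagging + Random Subspaces" ensemble idea
# summarized in the abstract above. The base learner is a toy placeholder,
# NOT the authors' ETL learner; all names here are illustrative assumptions.
import random
from collections import Counter


class MajorityTagger:
    """Toy base learner: predicts the most frequent label seen per feature tuple."""

    def fit(self, X, y):
        table = {}
        for features, label in zip(X, y):
            table.setdefault(features, Counter())[label] += 1
        self.table = {k: c.most_common(1)[0][0] for k, c in table.items()}
        self.default = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.table.get(features, self.default) for features in X]


class CommitteeSketch:
    """Bagging over training examples plus random feature subspaces, with majority voting."""

    def __init__(self, n_members=10, subspace_ratio=0.7, seed=0):
        self.n_members = n_members
        self.subspace_ratio = subspace_ratio
        self.rng = random.Random(seed)

    def fit(self, X, y):
        n_features = len(X[0])
        k = max(1, int(self.subspace_ratio * n_features))
        self.members = []
        for _ in range(self.n_members):
            # Bootstrap sample of the training examples (bagging).
            idx = [self.rng.randrange(len(X)) for _ in range(len(X))]
            # Random subspace: each member sees only a subset of the features.
            cols = sorted(self.rng.sample(range(n_features), k))
            Xb = [tuple(X[i][c] for c in cols) for i in idx]
            yb = [y[i] for i in idx]
            self.members.append((cols, MajorityTagger().fit(Xb, yb)))
        return self

    def predict(self, X):
        votes = [
            member.predict([tuple(x[c] for c in cols) for x in X])
            for cols, member in self.members
        ]
        # Majority vote across committee members for each example.
        return [Counter(col).most_common(1)[0][0] for col in zip(*votes)]


if __name__ == "__main__":
    # Tiny toy data: (word shape, suffix, previous tag) -> chunk tag.
    X = [("Xx", "on", "O"), ("x", "he", "B-NP"), ("Xx", "rk", "O"), ("x", "og", "B-NP")]
    y = ["B-NP", "I-NP", "B-NP", "I-NP"]
    model = CommitteeSketch(n_members=5, subspace_ratio=0.7, seed=42).fit(X, y)
    print(model.predict(X))

Because each committee member is trained on an independent bootstrap sample and feature subset, training (and likewise classification) of the members can be run in parallel, which matches the parallelization point made in the abstract.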