Entropy-guided feature generation for structured learning of Portuguese dependency parsing

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Standard

Entropy-guided feature generation for structured learning of Portuguese dependency parsing. / Fernandes, Eraldo R.; Milidiú, Ruy L.
Computational Processing of the Portuguese Language: 10th International Conference, PROPOR 2012, Coimbra, Portugal, April 17-20, 2012. Proceedings. Hrsg. / Helena Caseli; Aline Villavicencio; Antonio Teixeira; Fernando Perdigao. Berlin, Heidelberg: Springer Verlag, 2012. S. 146-156 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 7243 LNAI).

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Harvard

Fernandes, ER & Milidiú, RL 2012, Entropy-guided feature generation for structured learning of Portuguese dependency parsing. in H Caseli, A Villavicencio, A Teixeira & F Perdigao (Hrsg.), Computational Processing of the Portuguese Language: 10th International Conference, PROPOR 2012, Coimbra, Portugal, April 17-20, 2012. Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Bd. 7243 LNAI, Springer Verlag, Berlin, Heidelberg, S. 146-156, International Conference on Computational Processing of Portuguese, Coimbra, Portugal, 17.04.12. https://doi.org/10.1007/978-3-642-28885-2_17

APA

Fernandes, E. R., & Milidiú, R. L. (2012). Entropy-guided feature generation for structured learning of Portuguese dependency parsing. In H. Caseli, A. Villavicencio, A. Teixeira, & F. Perdigao (Hrsg.), Computational Processing of the Portuguese Language: 10th International Conference, PROPOR 2012, Coimbra, Portugal, April 17-20, 2012. Proceedings (S. 146-156). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 7243 LNAI). Springer Verlag. https://doi.org/10.1007/978-3-642-28885-2_17

Vancouver

Fernandes ER, Milidiú RL. Entropy-guided feature generation for structured learning of Portuguese dependency parsing. in Caseli H, Villavicencio A, Teixeira A, Perdigao F, Hrsg., Computational Processing of the Portuguese Language: 10th International Conference, PROPOR 2012, Coimbra, Portugal, April 17-20, 2012. Proceedings. Berlin, Heidelberg: Springer Verlag. 2012. S. 146-156. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-28885-2_17

Bibtex

@inbook{f536d12ded6d493bb6a28d5057fd8d2a,
title = "Entropy-guided feature generation for structured learning of Portuguese dependency parsing",
abstract = "Feature generation is a difficult, yet highly necessary, subtask of machine learning modeling. Usually, it is partially solved by a domain expert that generates complex and discriminative feature templates by conjoining the available basic features. This is a limited and expensive way to obtain feature templates and is recognized as a modeling bottleneck. In this work, we propose an automatic method to generate feature templates for structured learning algorithms. The method receives as input the training dataset with basic features and produces a set of feature templates by conjoining basic features that are highly discriminative together. We denote this method entropy guided since it is based on the conditional entropy of local decision variables given the feature values. We illustrate our approach on the Portuguese dependency parsing task and report on experiments with the Bosque corpus. We show that the entropy-guided templates outperform the manually built templates used by MSTParser, which was the best performing system on the Bosque corpus up to now. Furthermore, our approach allows an effortless inclusion of two new basic features that automatically generate additional templates. As a result, our system achieves a per-token accuracy of 92.66%, what represents a reduction by more than 15% on the previous smallest error rate for Portuguese dependency parsing.",
keywords = "dependency parsing, entropy-guided feature generation, machine learning, structured learning, Informatics, Business informatics",
author = "Fernandes, {Eraldo R.} and Milidi{\'u}, {Ruy L.}",
year = "2012",
doi = "10.1007/978-3-642-28885-2_17",
language = "English",
isbn = "978-3-642-28884-5",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "146--156",
editor = "Helena Caseli and Aline Villavicencio and Antonio Teixeira and Fernando Perdigao",
booktitle = "Computational Processing of the Portuguese Language",
address = "Germany",
note = "International Conference on Computational Processing of Portuguese, PROPOR 2012 ; Conference date: 17-04-2012 Through 20-04-2012",
url = "https://aclweb.org/portal/content/10th-international-conference-computational-processing-portuguese-propor-2012",

}

RIS

TY - CHAP

T1 - Entropy-guided feature generation for structured learning of Portuguese dependency parsing

AU - Fernandes, Eraldo R.

AU - Milidiú, Ruy L.

N1 - Conference code: 10

PY - 2012

Y1 - 2012

N2 - Feature generation is a difficult, yet highly necessary, subtask of machine learning modeling. Usually, it is partially solved by a domain expert that generates complex and discriminative feature templates by conjoining the available basic features. This is a limited and expensive way to obtain feature templates and is recognized as a modeling bottleneck. In this work, we propose an automatic method to generate feature templates for structured learning algorithms. The method receives as input the training dataset with basic features and produces a set of feature templates by conjoining basic features that are highly discriminative together. We denote this method entropy guided since it is based on the conditional entropy of local decision variables given the feature values. We illustrate our approach on the Portuguese dependency parsing task and report on experiments with the Bosque corpus. We show that the entropy-guided templates outperform the manually built templates used by MSTParser, which was the best performing system on the Bosque corpus up to now. Furthermore, our approach allows an effortless inclusion of two new basic features that automatically generate additional templates. As a result, our system achieves a per-token accuracy of 92.66%, what represents a reduction by more than 15% on the previous smallest error rate for Portuguese dependency parsing.

AB - Feature generation is a difficult, yet highly necessary, subtask of machine learning modeling. Usually, it is partially solved by a domain expert that generates complex and discriminative feature templates by conjoining the available basic features. This is a limited and expensive way to obtain feature templates and is recognized as a modeling bottleneck. In this work, we propose an automatic method to generate feature templates for structured learning algorithms. The method receives as input the training dataset with basic features and produces a set of feature templates by conjoining basic features that are highly discriminative together. We denote this method entropy guided since it is based on the conditional entropy of local decision variables given the feature values. We illustrate our approach on the Portuguese dependency parsing task and report on experiments with the Bosque corpus. We show that the entropy-guided templates outperform the manually built templates used by MSTParser, which was the best performing system on the Bosque corpus up to now. Furthermore, our approach allows an effortless inclusion of two new basic features that automatically generate additional templates. As a result, our system achieves a per-token accuracy of 92.66%, what represents a reduction by more than 15% on the previous smallest error rate for Portuguese dependency parsing.

KW - dependency parsing

KW - entropy-guided feature generation

KW - machine learning

KW - structured learning

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84858599304&partnerID=8YFLogxK

UR - https://d-nb.info/1019948167

U2 - 10.1007/978-3-642-28885-2_17

DO - 10.1007/978-3-642-28885-2_17

M3 - Article in conference proceedings

AN - SCOPUS:84858599304

SN - 978-3-642-28884-5

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 146

EP - 156

BT - Computational Processing of the Portuguese Language

A2 - Caseli, Helena

A2 - Villavicencio, Aline

A2 - Teixeira, Antonio

A2 - Perdigao, Fernando

PB - Springer Verlag

CY - Berlin, Heidelberg

T2 - International Conference on Computational Processing of Portuguese

Y2 - 17 April 2012 through 20 April 2012

ER -

DOI

Zuletzt angesehen

Publikationen

  1. The signal location task as a method quantifying the distribution of attention
  2. A computational study of a model of single-crystal strain-gradient viscoplasticity with an interactive hardening relation
  3. Supporting discourse in a synchronous learning environment
  4. Predicting the Difficulty of Exercise Items for Dynamic Difficulty Adaptation in Adaptive Language Tutoring
  5. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  6. Lyapunov Convergence Analysis for Asymptotic Tracking Using Forward and Backward Euler Approximation of Discrete Differential Equations
  7. A sufficient asymptotic stability condition in generalised model predictive control to avoid input saturation
  8. Modelling the Complexity of Measurement Estimation Situations - A Theoretical Framework for the Estimation of Lengths
  9. Distinguishing state variability from trait change in longitudinal data
  10. Return of Fibonacci random walks
  11. Optimization Analysis for an Uncovered Wagon Transportation with an Interactive Animated Simulation-Based Platform for Multidisciplinary Learning
  12. Knowledge Graph Question Answering Using Graph-Pattern Isomorphism
  13. Modeling items for text comprehension assessment using confirmatory factor analysis
  14. Foundations and applications of computer based material flow networks for einvironmental management
  15. Towards a Dynamic Interpretation of Subjective and Objective Values
  16. Performance analysis for loss systems with many subscribers and concurrent services
  17. Gaussian processes for dispatching rule selection in production scheduling
  18. Using mixture distribution models to test the construct validity of the Physical Self-Description Questionnaire
  19. Analysis of PI controllers with anti-windup techniques on level systems
  20. Artificial Intelligence Algorithms for Collaborative Book Recommender Systems
  21. A discrete approximate solution for the asymptotic tracking problem in affine nonlinear systems
  22. Improving students’ science text comprehension through metacognitive self-regulation when applying learning strategies
  23. Authenticity and authentication in language learning
  24. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  25. Appendix A: Design, implementation, and analysis of the iGOES project
  26. A guided simulated annealing search for solving the pick-up and delivery problem with time windows and capacity constraints
  27. Analyzing math teacher students' sensitivity for aspects of the complexity of problem oriented mathematics instruction
  28. ActiveMath - a Learning Platform With Semantic Web Features
  29. A Service-oriented Search framework for full text, geospatial and semantic search
  30. Evaluation of Time/Phase Parameters in Frequency Measurements for Inertial Navigation Systems
  31. Modelling and implementation of an Order2Cash Process in distributed systems
  32. Investigation and modeling of the material behavior due to evolving dislocation microstructures in fcc and bcc metals
  33. The Scalable Question Answering Over Linked Data (SQA) Challenge 2018
  34. FFTSMC with Optimal Reference Trajectory Generated by MPC in Robust Robotino Motion Planning with Saturating Inputs
  35. On the Nonlinearity Compensation in Permanent Magnet Machine Using a Controller Based on a Controlled Invariant Subspace
  36. A Review of Latent Variable Modeling Using R - A Step-by-Step-Guide
  37. Analysis and Implementation of a Resistance Temperature Estimator Based on Bi-Polynomial Least Squares Method and Discrete Kalman Filter
  38. An expert-based reference list of variables for characterizing and monitoring social-ecological systems
  39. Homogenization modeling of thin-layer-type microstructures
  40. Using nonlinear methods to quantify changes in infant limb movements and vocalizations
  41. Guided discovery learning with computer-based simulation games
  42. Machine Learning and Knowledge Discovery in Databases