FaQuAD: Reading comprehension dataset in the domain of brazilian higher education

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Academic secretaries and faculty members of higher education institutions face a common problem: the abundance of questions sent by academics whose answers are found in available institutional documents. The official documents produced by Brazilian public universities are vast and disperse, which discourage students to further search for answers in such sources. In order to lessen this problem, we present FaQuAD: a novel machine reading comprehension dataset in the domain of Brazilian higher education institutions. FaQuAD follows the format of SQuAD (Stanford Question Answering Dataset) [Rajpurkar et al.2016]. It comprises 900 questions about 249 reading passages(paragraphs), which were taken from 18 official documents of a computer science college from a Brazilian federal university and 21 Wikipedia articles related to Brazilian higher education system. As far as we know, this is the first Portuguese reading comprehension dataset in this format. We trained a state-of-the-art model on this dataset, which is based on the Bi-Directional Attention Flow model [Seo et al. 2016]. We report on several ablation tests to assess different aspects of both the model and the dataset. For instance, we report learning curves to assess the amount of training data, the use of different levels of pre-trained models, and the use of more than one correct answer for each question.

OriginalspracheEnglisch
Titel2019 Brazilian Conference on Intelligent Systems : BRACIS 2019 : 15-18 October 2019, Salvador, Bahia, Brazil : proceedings
Anzahl der Seiten6
ErscheinungsortPiscataway
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum10.2019
Seiten443-448
Aufsatznummer8923668
ISBN (Print)978-1-7281-4254-8
ISBN (elektronisch)978-1-7281-4253-1
DOIs
PublikationsstatusErschienen - 10.2019
Extern publiziertJa
VeranstaltungBrazilian Conference on Intelligent Systems - BRACIS 2019 - Salvador, Bahia, Brasilien
Dauer: 15.10.201918.10.2019
Konferenznummer: 8
http://www.bracis2019.ufba.br/#:~:text=The%208th%20Brazilian%20Conference%20on,October%2015%20to%2018%2C%202019.

DOI

Zuletzt angesehen

Publikationen

  1. How do distinct facets of tree diversity and community assembly respond to environmental variables in the subtropical Atlantic Forest?
  2. Complex Trait-Treatment-Interaction analysis
  3. Playing with Information
  4. Clusteranalyse als Methode zur Strukturierung großer Datenmodelle
  5. A Multilab Replication of the Ego Depletion Effect
  6. Towards a Comprehensive Framework for Environmental Management Accounting
  7. Evolutionary clustering of Lagrangian trajectories in turbulent Rayleigh-Bénard convection flows
  8. Optimum parameters and rate-controlling mechanisms for hot working of extruded Mg-3Sn-1Ca alloy
  9. Modelling lateness and schedule reliability
  10. Properties of some overlapping self-similar and some self-affine measures
  11. Toward a gecko-inspired, climbing soft robot
  12. Supporting non-hierarchical supply chain networks in the electronics industry
  13. Plastic deformation induced microstructure evolution through gradient enhanced crystal plasticity based on a non-convex Helmholtz energy
  14. Similarity of molecular descriptors: The equivalence of Zagreb indices and walk counts
  15. Towards a heuristic for assessing adaptation knowledge: impacts, implications, decisions and actions
  16. Substance Flows Associated with Medical Care - Significance of Different Sources
  17. Contributing to sustainable development pathways in the South Pacific through transdisciplinary research
  18. The Multiple Self Objection to the Prudential Lifespan Account
  19. Expert*inneninterview
  20. Anonymized firm data under test: evidence from a replication study
  21. On walks in molecular graphs.
  22. Crop rotation modelling
  23. Teaching content and language in the multilingual classroom

Presse / Medien

  1. Asset Backed Securities