AmQA: Amharic Question Answering Dataset

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearch

Authors

Question Answering (QA) returns concise answers or answer lists from natural language text given a context document. Many resources go into curating QA datasets to advance robust models' development. There is a surge of QA datasets for languages like English, however, this is not true for Amharic. Amharic, the official language of Ethiopia, is the second most spoken Semitic language in the world. There is no published or publicly available Amharic QA dataset. Hence, to foster the research in Amharic QA, we present the first Amharic QA (AmQA) dataset. We crowdsourced 2628 question-answer pairs over 378 Wikipedia articles. Additionally, we run an XLMR Large-based baseline model to spark open-domain QA research interest. The best-performing baseline achieves an F-score of 69.58 and 71.74 in reader-retriever QA and reading comprehension settings respectively.
Original languageEnglish
Title of host publicationConference XXX
Number of pages7
DOIs
Publication statusIn preparation - 06.03.2023
Externally publishedYes

Recently viewed

Publications

  1. Reallabore im Kontext Transformativer Forschung
  2. Connecting feedback to self-efficacy
  3. Regulatory focus and thinking about the future versus reality.
  4. Was Polybios an einer modernen Universität zu suchen hat
  5. Archival research on carbon reporting quality. A review of determinants and consequences for firm value
  6. Environmental Shareholder Value
  7. Correlation of trends in cashmere production and declines of large wild mammals
  8. Think globally, learn locally!
  9. Collective emotions in institutional creation work
  10. Adapting Growth Models for Digital Startups
  11. Learning to spend time in unusual times
  12. Contrasting changes in the abundance and diversity of North American bird assemblages from 1971 to 2010
  13. John Stuart Mill: Über die Freiheit
  14. Nichts wie weg
  15. Tortenschlacht
  16. How Individuals React Emotionally to Others’ (Mis)Fortunes
  17. Buffer Institutions in Public Higher Education in the Context of Institutional Autonomy and Governmental Control: A Comparative View of the United States and Germany
  18. Das Anfertigen von Notizen als Lernstrategie beim mathematischen Modellieren
  19. Temperature-dependent mechanical behavior of aluminum AM structures generated via multi-layer friction surfacing
  20. Bird community responses to the edge between suburbs and reserves
  21. Vom Cassislikör zur E-Commerce-Richtlinie
  22. Online CSR communication by listed companies: a factor for enthusiasm or disappointment?
  23. Tree diversity effects on litter decomposition are mediated by litterfall and microbial processes
  24. Bats in a Farming Landscape Benefit from Linear Remnants and Unimproved Pastures
  25. Morphometric differentiation in a specialised snail predatior
  26. Toward a Framework for University-Based Entrepreneurial Ecosystems and Human Capital Development in Sub-Saharan Africa
  27. Effectiveness of psychological interventions in preventing recurrence of depressive disorder
  28. Leading digital innovation in schools
  29. Organizational Wrongdoing, Boundary Work, and Systems of Exclusion
  30. Attention and Information Acquisition
  31. Embracing conflicts for interpersonal competence development in project-based sustainability courses
  32. Moral sensitivity in business
  33. Provisions for nullification of conservation and management measures in RFMO objection procedures
  34. Absenteeism as a Reaction to Harmful Behavior in the Workplace from a Stress Theory Point of View
  35. Where Are the Organizations? Accounting for the Fluidity and Ambiguity of Organizing in the Arts