AmQA: Amharic Question Answering Dataset

Research output: Contributions to collected editions/works › Article in conference proceedings › Research

Question Answering (QA) returns concise answers or answer lists from natural language text, given a context document. Many resources go into curating QA datasets to advance the development of robust models. QA datasets have surged for languages like English, but not for Amharic. Amharic, the official language of Ethiopia, is the second most spoken Semitic language in the world. There is no published or publicly available Amharic QA dataset. Hence, to foster research in Amharic QA, we present the first Amharic QA (AmQA) dataset. We crowdsourced 2628 question-answer pairs over 378 Wikipedia articles. Additionally, we run an XLMR Large-based baseline model to spark interest in open-domain QA research. The best-performing baseline achieves F-scores of 69.58 and 71.74 in the reader-retriever QA and reading comprehension settings, respectively.
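
The abstract does not say how the F-score is computed. As a minimal sketch, assuming a SQuAD-style token-overlap F1 between a predicted and a reference answer, the metric could look like the following. The function name `token_f1` and the whitespace tokenization are illustrative assumptions, not details taken from the paper.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer.

    Assumption: answers are tokenized on whitespace; the actual AmQA
    evaluation may normalize or tokenize Amharic text differently.
    """
    pred_tokens = prediction.split()
    ref_tokens = reference.split()

    # If either answer is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most as often
    # as it occurs in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Two of three reference tokens are predicted: precision 1, recall 2/3.
    print(token_f1("አዲስ አበባ", "አዲስ አበባ ከተማ"))
```

A dataset-level score under this assumption would be the mean of `token_f1` over all question-answer pairs, scaled to 0-100.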
Original language: English
Title of host publication: Conference XXX
Number of pages: 7
Publication status: In preparation - 06.03.2023
Externally published: Yes