Low Resource Question Answering: An Amharic Benchmarking Dataset

Publication: Contributions to collected editions, Articles in conference proceedings, Research, peer-reviewed

Question Answering (QA) systems return concise answers or answer lists from natural language text, given a context document. Considerable resources go into curating QA datasets to advance the development of robust QA models. QA datasets have surged for languages such as English, but not for low-resource languages like Amharic. Indeed, there is no published or publicly available Amharic QA dataset. Hence, to foster further research in low-resource QA, we present the first publicly available benchmarking Amharic Question Answering Dataset (Amh-QuAD). We crowdsource 2,628 question-answer pairs from over 378 Amharic Wikipedia articles. Using the training set, we fine-tune an XLM-R-based language model and introduce a new reader model. Leveraging our newly fine-tuned reader, we run a baseline model to spark interest in open-domain Amharic QA research. The best-performing baseline QA achieves F-scores of 80.3 and 81.34 in the retriever-reader and reading comprehension settings, respectively.
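
The abstract reports F-scores but this page does not define them. As a minimal sketch, assuming the scores follow the token-overlap F1 commonly used to evaluate extractive QA (an assumption, not something the abstract confirms), a per-answer score could be computed as below. The function name `token_f1`, the whitespace tokenization, and the example strings are illustrative and not taken from the paper.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer.

    Assumption: answers are tokenized on whitespace; the paper may
    normalize or tokenize Amharic text differently.
    """
    pred_tokens = prediction.split()
    ref_tokens = reference.split()
    if not pred_tokens or not ref_tokens:
        # Two empty answers count as a match; one empty answer is a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: predicted "አዲስ አበባ" against reference "አዲስ አበባ ከተማ".
score = token_f1("አዲስ አበባ", "አዲስ አበባ ከተማ")
print(round(score, 2))  # precision 1.0, recall 2/3
```

A dataset-level figure would typically be the mean of this score over all questions, expressed as a percentage; whether the reported 80.3 and 81.34 were aggregated that way is likewise an assumption.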

Original language: English
Title: The Fifth Workshop on Resources for African Indigenous Languages @LREC-COLING-2024 (RAIL): Workshop Proceedings
Editors: Rooweither Mabuya, Muzi Matfunjwa, Mmasibidi Setaka, Menno van Zaanen
Number of pages: 9
Place of publication: Paris
Publisher: European Language Resources Association (ELRA)
Publication date: 2024
Pages: 124-132
ISBN (print): 9782493814401
ISBN (electronic): 978-2-493814-40-1
Publication status: Published - 2024
Event: 5th Workshop on Resources for African Indigenous Languages - RAIL 2024 - Lingotto Conference Centre, Torino, Italy
Duration: 25.05.2024 → …
Conference number: 5
https://bit.ly/rail2024

Bibliographic note

Publisher Copyright:
© 2024 ELRA Language Resource Association.
