QALD-9-plus: A Multilingual Dataset for Question Answering over DBpedia and Wikidata Translated by Native Speakers

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

The ability to have the same experience for different user groups (i.e., accessibility) is one of the most important characteristics of Web-based systems. The same is true for Knowledge Graph Question Answering (KGQA) systems that provide the access to Semantic Web data via natural language interface. While following our research agenda on the multilingual aspect of accessibility of KGQA systems, we identified several ongoing challenges. One of them is the lack of multilingual KGQA benchmarks. In this work, we extend one of the most popular KGQA benchmarks - QALD-9 by introducing high-quality questions' translations to 8 languages provided by native speakers, and transferring the SPARQL queries of QALD-9 from DBpedia to Wikidata, s.t., the usability and relevance of the dataset is strongly increased. Five of the languages - Armenian, Ukrainian, Lithuanian, Bashkir and Belarusian - to our best knowledge were never considered in KGQA research community before. The latter two of the languages are considered as 'endangered' by UNESCO. We call the extended dataset QALD-9-plus and made it available online11Figshare: https://doi.org/10.6084/m9.figshare.16864273. GitHub: https://github.com/Perevalov/qald-9-plus.

OriginalspracheEnglisch
TitelProceedings - 16th IEEE International Conference on Semantic Computing, ICSC 2022
Anzahl der Seiten6
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum2022
Seiten229-234
ISBN (Print)978-1-6654-3419-5
ISBN (elektronisch)978-1-6654-3418-8
DOIs
PublikationsstatusErschienen - 2022
Extern publiziertJa
Veranstaltung16th IEEE International Conference on Semantic Computing, ICSC 2022 - Virtual, Online, USA / Vereinigte Staaten
Dauer: 26.01.202228.01.2022
http://pa.icar.cnr.it/scsn22/

Bibliographische Notiz

Publisher Copyright:
© 2022 IEEE.

DOI

Zuletzt angesehen

Publikationen

  1. Common Commercial Policy and External Trade
  2. Der Kunde als Innovationsquelle
  3. The role of technological change for a sustainable development
  4. US agricultural sector analysis on pesticide externalities – the impact of climate change and a Pigovian tax
  5. Hello
  6. Panta Rhei
  7. Zur vollendeten Tatsache
  8. Über den sinn von Thematisierungstabus und die unmöglichkeit einer soziologischen analyse der soziologie
  9. Hold Back The River
  10. Die Kraft der digitalen Ästhetik
  11. Aquatic and terrestrial proxy evidence for Middle Pleistocene palaeolake and lake-shore development at two Lower Palaeolithic sites of Schöningen, Germany
  12. SPS steuern Assistenzsysteme in der Digitalen Fabrik
  13. Kurd Laßwitz’ Homchen
  14. Cutting Across Lines: Lil Picard and the Reorienting Effects of Collage
  15. Measurement and modelling of NH3 emissions from field-applied biogas residues in North German energy crop rotations
  16. Indikatorenmodell des schulischen Schreibens
  17. Musik in transkulturellen Kontexten
  18. Technikfolgenabschätzung - eine Einführung
  19. Conclusion
  20. Sustainability Management in Business Enterprises
  21. Polychlorinated biphenyls in glaciers
  22. Book Review of Amador-Moreno, C.P., McCafferty, K., and Vaughan, E. (Eds.): Pragmatic Markers in Irish English
  23. Housing in the Margins: Negotiating Urban Formalities in Berlin's Allotment Gardens
  24. Christopher H. Achen / Larry M. Bartels: Democracy for Realists. Why Elections Do Not Produce Responsive Government, Princeton: Princeton University Press 2017
  25. Fehlgeburt und Stillgeburt
  26. Der Beitrag zivilgesellschaftlicher Partizipation zur Effektivitatssteigerung von Governance
  27. Was vom Leben bleibt
  28. Corrosion properties of secondary AZ91 alloys
  29. Stufen der Privatheit und die diskursive Ordnung der Familie