QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Authors

  • Ricardo Usbeck
  • Xi Yan
  • Aleksandr Perevalov
  • Longquan Jiang
  • Julius Schulz
  • Angelie Kraft
  • Cedric Möller
  • Junbo Huang
  • Jan Reineke
  • Axel-Cyrille Ngonga Ngomo
  • Muhammad Saleem
  • Andreas Both
Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.
OriginalspracheEnglisch
ZeitschriftSemantic Web
Jahrgang15
Ausgabenummer6
Seiten (von - bis)2193-2207
Anzahl der Seiten14
ISSN1570-0844
DOIs
PublikationsstatusErschienen - 2023
Extern publiziertJa

Zuletzt angesehen

Publikationen

  1. Impacts beyond experimentation - Conceptualising emergent impacts from long-term real-world laboratory processes
  2. Multitrophic diversity in a biodiverse forest is highly nonlinear across spatial scales
  3. Does location really matter? An inter-colony comparison of seabirds breeding at varying distances from productive oceanographic features in the Bering Sea
  4. Effect of salinity-changing rates on filtration activity of mussels from two sites within the Baltic Mytilus hybrid zone
  5. Reference wages and turnover intentions
  6. Study of the solidification of AS alloys combining in situ synchrotron diffraction and differential scanning calorimetry
  7. Reprocessing from the inside
  8. The Integration of Wheelchair Users in Team Handball
  9. Study harder? the relationship of achievement goals to attitudes and self-reported use of desirable difficulties in self-regulated learning
  10. Export Boosting Policies and Firm Performance
  11. To Own or to Use?
  12. The Crowd in Flux
  13. Consumers' Responses to CSR Activities
  14. Dialogic interactions in higher vocational learning environments in mainland China
  15. Can Geodesign Be Used to Facilitate Boundary Management for Planning and Implementation of Nature-based Solutions?
  16. Effekte unterschiedlicher Kollaborationsskripte in chatbasiertem Computer-Supported Collaborative Learning am Beispiel von Lernprotokollen
  17. Development and prospects of degradable magnesium alloys for structural and functional applications in the fields of environment and energy
  18. Das relationale Apriori Wiens / Das städtische Apriori des Relationalismus