QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Authors

  • Ricardo Usbeck
  • Xi Yan
  • Aleksandr Perevalov
  • Longquan Jiang
  • Julius Schulz
  • Angelie Kraft
  • Cedric Möller
  • Junbo Huang
  • Jan Reineke
  • Axel-Cyrille Ngonga Ngomo
  • Muhammad Saleem
  • Andreas Both
Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.
OriginalspracheEnglisch
ZeitschriftSemantic Web
Jahrgang15
Ausgabenummer6
Seiten (von - bis)2193-2207
Anzahl der Seiten14
ISSN1570-0844
DOIs
PublikationsstatusErschienen - 2023
Extern publiziertJa

Zuletzt angesehen

Publikationen

  1. Impacts beyond experimentation - Conceptualising emergent impacts from long-term real-world laboratory processes
  2. Multitrophic diversity in a biodiverse forest is highly nonlinear across spatial scales
  3. A Conceptual Structure of Justice - Providing a Tool to Analyse Conceptions of Justice
  4. Effect of salinity-changing rates on filtration activity of mussels from two sites within the Baltic Mytilus hybrid zone
  5. Towards a dimensional approach to common mental disorders in the ICD-11?
  6. Study harder? the relationship of achievement goals to attitudes and self-reported use of desirable difficulties in self-regulated learning
  7. Export Boosting Policies and Firm Performance
  8. The Crowd in Flux
  9. Consumers' Responses to CSR Activities
  10. The dynamics of prioritizing
  11. Development and prospects of degradable magnesium alloys for structural and functional applications in the fields of environment and energy
  12. On the impact of network size and average degree on the robustness of centrality measures
  13. Othering Space
  14. Bifurcation loads of beams of glued-laminated timber with intermediate lateral supports
  15. Effectiveness of self-generation during learning is dependent on individual differences in need for cognition
  16. Standing Still
  17. Climate-smart socially innovative tools and approaches for marine pollution science in support of sustainable development
  18. Essays on Network Regulation
  19. Skills and knowledge management in higher education
  20. An InfoSpace Paradigm for Local and ad hoc Peer-to-Peer Communication
  21. Development and testing of the insulin treatment experience questionnaire (ITEQ)