QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA

Research output: Journal contributionsJournal articlesResearchpeer-review

Authors

  • Ricardo Usbeck
  • Xi Yan
  • Aleksandr Perevalov
  • Longquan Jiang
  • Julius Schulz
  • Angelie Kraft
  • Cedric Möller
  • Junbo Huang
  • Jan Reineke
  • Axel-Cyrille Ngonga Ngomo
  • Muhammad Saleem
  • Andreas Both
Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.
Original languageEnglish
JournalSemantic Web
Volume15
Issue number6
Pages (from-to)2193-2207
Number of pages14
ISSN1570-0844
DOIs
Publication statusPublished - 2023
Externally publishedYes

Recently viewed

Publications

  1. Introduction
  2. Analyzing Emotional Styles in the Field of Christian Religion and The Relevance of New Types of Visualization
  3. Enhancement of workability in AZ31 alloy - Processing maps
  4. Analytics and Intuition in the Process of Selecting Talent
  5. L'agenda 21 locale
  6. Managing information in the case of opinion spamming
  7. The pace of range expansion
  8. Decision-making models for Robotic Warehouse
  9. Accidental Representation–The Reconfiguration of Representation through Social Media
  10. Conceptualizing community in energy systems
  11. Lifeworld and System
  12. The temporal factor of change in stressor-strain relationships
  13. Datenstrukturen & Algorithmen
  14. Current issues in competence modeling and assessment
  15. Effects of strategy instructions on learning from text and pictures
  16. RelHunter
  17. Enforcement concepts and strategies in the EU
  18. Large trees are keystone structures in urban parks
  19. Introduction
  20. The multiplicity of emotions: A framework of emotional functions in decision making
  21. Continuous Casting with Mid-Process Alloying
  22. A Subspace to Describe Grasping Internal Forces in Robotic Manipulation Systems
  23. A revised theory of contestable markets
  24. Einführung in Grundlagen der theoretischen Informatik
  25. Studying embodied encounters
  26. A fragile kaleidoscope
  27. Numerical dynamic simulation and analysis of a lithium bromide/water long term solar heat storage system
  28. Sustainable Statehood: Reflections on Critical (Pre-)Conditions, Requirements and Design Options
  29. Classification of playing position in elite junior Australian football using technical skill indicators
  30. A Performance Motivator in one Country, A Non-Motivator in Another?
  31. Der "getarnte" Arbeitnehmer-Geschäftsführer
  32. The Lotka-Volterra Model for Competition Controlled by a Sliding Mode Approach
  33. CSR and tax avoidance: A review of empirical research
  34. SemREC-SMART 2022
  35. Basic analysis of the incremental profile forming process
  36. Promoting diversity of thought: bridging knowledge systems for a pluriverse approach to research