QALD-10 — The 10th Challenge on Question Answering over Linked Data

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Authors

  • Ricardo Usbeck
  • Xi Yan
  • Aleksandr Perevalov
  • Longquan Jiang
  • Julius Schulz
  • Angelie Kraft
  • Cedric Möller
  • Junbo Huang
  • Jan Reineke
  • Axel-Cyrille Ngonga Ngomo
  • Muhammad Saleem
  • Andreas Both
Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.
OriginalspracheEnglisch
ZeitschriftSemantic Web
Jahrgang2024
Anzahl der Seiten14
ISSN1570-0844
PublikationsstatusAngenommen/Im Druck - 08.02.2023
Extern publiziertJa

Zuletzt angesehen

Publikationen

  1. Analysing the gender wage gap (GWG) using personnel records
  2. Does board composition have an impact on CSR reporting?
  3. Schreiben
  4. The effectiveness of nudging
  5. DECODING SUSTAINABILITY IN THE HEALTHCARE SYSTEM. TEACHING STUDENTS HOW TO PROBLEMATIZE COMPLEX CONCEPTS
  6. Advances in Laser Positioning of Machine Vision System and Their Impact on 3D Coordinates Measurement
  7. Briefe schreiben in der Sekundarstufe I
  8. Question answering over linked data
  9. The challenges of gamifying CSR communication
  10. Problemlösen in der Sekundarstufe I
  11. New validated liquid chromatographic and chemometrics-assisted UV spectroscopic methods for the determination of two multicomponent cough mixtures in syrup.
  12. Organisationen hacken
  13. Determiner Ellipsis in Electronic Writing - Discourse or Syntax?
  14. Gehen in der Datenbank – Der BMLwalker
  15. Computer als Medium (Hyperkult VI)
  16. Evaluation of a temporal causal model for predicting the mood of clients in an online therapy
  17. A duty-block network approach for an integrated driver rostering problem in public bus transport
  18. Georeferencing System for Maneuvering of Autonomous Truck in Mining Environment
  19. Almost-invariant sets and invariant manifolds
  20. Bridging scenario planning and backcasting
  21. Irish English and Variational Pragmatics
  22. The role of the situation model in mathematical modelling
  23. Credit constraints and margins of import
  24. How does economic integration influence employment and wages in border regions?
  25. Resisting alignment
  26. Unobtrusive Measurement of Vital Signs Through Ultra-Wideband Sensing in the Domain of AAL
  27. Repräsentative Wahlstatistik
  28. Activating an Integrative Mindset Improves the Subjective Outcomes of Value-Driven Conflicts
  29. Der FFB-Server mit Microsoft Windows Server 2003
  30. Structure matters
  31. Going beyond certificates
  32. Converging perspectives in audience studies and digital literacies
  33. Developing pragmatic competence in a study abroad context
  34. Multifractal analysis reveals music-like dynamic structure in songbird rhythms