QALD-10 — The 10th Challenge on Question Answering over Linked Data

Research output: Journal contributionsJournal articlesResearchpeer-review

Authors

  • Ricardo Usbeck
  • Xi Yan
  • Aleksandr Perevalov
  • Longquan Jiang
  • Julius Schulz
  • Angelie Kraft
  • Cedric Möller
  • Junbo Huang
  • Jan Reineke
  • Axel-Cyrille Ngonga Ngomo
  • Muhammad Saleem
  • Andreas Both
Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.
Original languageEnglish
JournalSemantic Web
Volume2024
Number of pages14
ISSN1570-0844
Publication statusAccepted/In press - 08.02.2023
Externally publishedYes

Recently viewed

Publications

  1. Measurement invariance in a grid-based measure of academic self-concept
  2. Logistical Potentials of Load Balancing via the Build-up and Reduction of Stock
  3. Multifractality Versus (Mono-) Fractality as Evidence of Nonlinear Interactions Across Timescales
  4. Points of cooperation: integrating cooperative learning into web-based courses
  5. Fallstudie
  6. Explaining Investment Dynamics: Empirical Evidence from German New Ventures
  7. What drives the spatial distribution and dynamics of local species richness in tropical forest?
  8. Electrical and Mechanical Characterization of Polymer Nanofibers for Sensor Application
  9. Recruitment practices in small and medium size enterprises.
  10. Determining Lot Sizes in Production Areas
  11. Using Multi-Label Classification for Improved Question Answering
  12. Using a Seminorm for Wavelet Denoising of sEMG Signals for Monitoring during Rehabilitation with Embedded Orthosis System
  13. On the role of linguistic features for comprehension and learning from STEM texts. A meta-analysis
  14. Is There a Way Back or Can the Internet Remember its Own History?
  15. Deep drawing of high-strength tailored blanks by using tailored tools
  16. Introduction: A strategy for overcoming the definitional struggle
  17. Sprachen in Liechtenstein
  18. Language Model Transformers as Evaluators for Open-domain Dialogues
  19. Understanding Context Collapse for Social Media Users
  20. Transparency in an Age of Digitalization and Responsibility
  21. Low Resource Question Answering: An Amharic Benchmarking Dataset
  22. Heterogenität
  23. Are the terms “Socio-economic status” and “Class status” a warped form of reasoning for Max Weber?
  24. Microsatellites and allozymes as the genetic memory of habitat fragmentation and defragmentation in populations of the ground beetle Carabus auronitens (Col., Carabidae)
  25. A practical perspective on repatriate knowledge transfer
  26. Semi-polar root exudates in natural grassland communities
  27. Sol-gel technology for greener and more sustainable antimicrobial textiles that use silica matrices with C, and Ag and ZnO as biocides
  28. Fallstudie
  29. Actor analysis as a tool for exploring the decision-making processes in environmental governance
  30. Carbocyclic cis-[1.1.1]-tris-σ-homobenzenes - Syntheses by triple epoxide → cyclopropane conversions, structural data, [σ2s+σ2s+σ2s] cycloreversions
  31. Interventionen im Datenraum
  32. Time and Income Poverty – An Interdependent Multidimensional Poverty Approach with German Time Use Diary Data
  33. Plants, Androids and Operators
  34. The Use of Media in Intercultural Dialogue "dialogo_dialog"!
  35. SAP exchange infrastructure for developers
  36. Fallstudie
  37. Customer Orientation of Service Employees—Toward a Conceptual Framework of a Key Relationship Marketing Construct
  38. Study of digital morphing tools in the architectural design process
  39. Does the introduction of the Euro have an effect on subjective hypotheses about the price-quality relationship?
  40. Performance Saga: Interview 03