QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA

Publication: Contributions to journals › Journal articles › Research › Peer-reviewed

Standard

QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA. / Usbeck, Ricardo; Yan, Xi; Perevalov, Aleksandr et al.
In: Semantic Web, Vol. 15, No. 6, 2023, pp. 2193-2207.

Harvard

Usbeck, R, Yan, X, Perevalov, A, Jiang, L, Schulz, J, Kraft, A, Möller, C, Huang, J, Reineke, J, Ngomo, A-CN, Saleem, M & Both, A 2023, 'QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA', Semantic Web, vol. 15, no. 6, pp. 2193-2207. https://doi.org/10.3233/SW-233471

APA

Usbeck, R., Yan, X., Perevalov, A., Jiang, L., Schulz, J., Kraft, A., Möller, C., Huang, J., Reineke, J., Ngomo, A.-C. N., Saleem, M., & Both, A. (2023). QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA. Semantic Web, 15(6), 2193-2207. https://doi.org/10.3233/SW-233471

Vancouver

BibTeX

@article{717c33cb84e64001a853e5366c53693f,
title = "QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA",
abstract = "Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.",
keywords = "Informatics",
author = "Ricardo Usbeck and Xi Yan and Aleksandr Perevalov and Longquan Jiang and Julius Schulz and Angelie Kraft and Cedric M{\"o}ller and Junbo Huang and Jan Reineke and Ngomo, {Axel-Cyrille Ngonga} and Muhammad Saleem and Andreas Both",
year = "2023",
doi = "10.3233/SW-233471",
language = "English",
volume = "15",
pages = "2193--2207",
journal = "Semantic Web",
issn = "1570-0844",
publisher = "SAGE Publications Inc.",
number = "6",
}

RIS

TY - JOUR
T1 - QALD-10 — The 10th Challenge on Question Answering over Linked Data
T2 - Shifting from DBpedia to Wikidata as a KG for KGQA
AU - Usbeck, Ricardo
AU - Yan, Xi
AU - Perevalov, Aleksandr
AU - Jiang, Longquan
AU - Schulz, Julius
AU - Kraft, Angelie
AU - Möller, Cedric
AU - Huang, Junbo
AU - Reineke, Jan
AU - Ngomo, Axel-Cyrille Ngonga
AU - Saleem, Muhammad
AU - Both, Andreas
PY - 2023
Y1 - 2023

AB - Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.

KW - Informatics
U2 - 10.3233/SW-233471
DO - 10.3233/SW-233471
M3 - Journal articles
VL - 15
SP - 2193
EP - 2207
JO - Semantic Web
JF - Semantic Web
SN - 1570-0844
IS - 6
ER -
