QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA

Ricardo Usbeck; Xi Yan; Aleksandr Perevalov; Longquan Jiang; Julius Schulz; Angelie Kraft; Cedric Möller; Junbo Huang; Jan Reineke; Axel-Cyrille Ngonga Ngomo; Muhammad Saleem; Andreas Both

doi:10.3233/SW-233471

QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Standard

QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA. / Usbeck, Ricardo; Yan, Xi; Perevalov, Aleksandr et al.
in: Semantic Web, Jahrgang 15, Nr. 6, 2023, S. 2193-2207.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Bibtex

@article{717c33cb84e64001a853e5366c53693f,

title = "QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA",

abstract = "Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.",

keywords = "Informatics",

author = "Ricardo Usbeck and Xi Yan and Aleksandr Perevalov and Longquan Jiang and Julius Schulz and Angelie Kraft and Cedric M{\"o}ller and Junbo Huang and Jan Reineke and Ngomo, {Axel-Cyrille Ngonga} and Muhammad Saleem and Andreas Both",

year = "2023",

doi = "10.3233/SW-233471",

language = "English",

volume = "15",

pages = "2193--2207",

journal = "Semantic Web",

issn = "1570-0844",

publisher = "SAGE Publications Inc.",

number = "6",

}

RIS

TY - JOUR

T1 - QALD-10 — The 10th Challenge on Question Answering over Linked Data

T2 - Shifting from DBpedia to Wikidata as a KG for KGQA

AU - Usbeck, Ricardo

AU - Yan, Xi

AU - Perevalov, Aleksandr

AU - Jiang, Longquan

AU - Schulz, Julius

AU - Kraft, Angelie

AU - Möller, Cedric

AU - Huang, Junbo

AU - Reineke, Jan

AU - Ngomo, Axel-Cyrille Ngonga

AU - Saleem, Muhammad

AU - Both, Andreas

PY - 2023

Y1 - 2023

N2 - Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.

AB - Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing the development in this field forward. Many of these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA benchmarks that depend on Freebase and DBpedia are gradually less studied and used, because Freebase is defunct and DBpedia lacks the structural validity of Wikidata. Therefore, research is gravitating toward Wikidata-based benchmarks. That is, new KGQA benchmarks are created on the basis of Wikidata and existing ones are migrated. We present a new, multilingual, complex KGQA benchmarking dataset as the 10th part of the Question Answering over Linked Data (QALD) benchmark series. This corpus formerly depended on DBpedia. Since QALD serves as a base for many machine-generated benchmarks, we increased the size and adjusted the benchmark to Wikidata and its ranking mechanism of properties. These measures foster novel KGQA developments by more demanding benchmarks. Creating a benchmark from scratch or migrating it from DBpedia to Wikidata is non-trivial due to the complexity of the Wikidata knowledge graph, mapping issues between different languages, and the ranking mechanism of properties using qualifiers. We present our creation strategy and the challenges we faced that will assist other researchers in their future work. Our case study, in the form of a conference challenge, is accompanied by an in-depth analysis of the created benchmark.

KW - Informatics

U2 - 10.3233/SW-233471

DO - 10.3233/SW-233471

M3 - Journal articles

VL - 15

SP - 2193

EP - 2207

JO - Semantic Web

JF - Semantic Web

SN - 1570-0844

IS - 6

ER -

In der gleichen Zeitschrift

Survey on English Entity Linking on Wikidata: Datasets and approaches

Möller, C., Lehmann, J. & Usbeck, R., 26.09.2022, in: Semantic Web. 13, 6, S. 925-966 42 S.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Benchmarking question answering systems

Usbeck, R., Röder, M., Hoffmann, M., Conrads, F., Huthmann, J., Ngonga-Ngomo, A. C., Demmler, C. & Unger, C., 2019, in: Semantic Web. 10, 2, S. 293-304 12 S.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Gerbil – Benchmarking named entity recognition and linking consistently

Röder, M., Usbeck, R. & Ngonga Ngomo, A. C., 2018, in: Semantic Web. 9, 5

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Survey on challenges of Question Answering in the Semantic Web

Höffner, K., Walter, S., Marx, E., Usbeck, R., Lehmann, J. & Ngonga Ngomo, A. C., 07.08.2017, in: Semantic Web. 8, 6, S. 895-920 26 S.

Publikation: Beiträge in Zeitschriften › Zeitschriftenaufsätze › Forschung › begutachtet

Weitere Publikationen dieser Person(en)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Salnikov, M., Sakhovskiy, A., Nikishina, I., Usmanova, A., Kraft, A., Möller, C., Banerjee, D., Huang, J., Jiang, L., Abdullah, R., Yan, X., Tutubalina, E., Usbeck, R. & Panchenko, A., 2026, Natural Language Processing and Information Systems: 30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, Proceedings. Ichise, R. (Hrsg.). Springer Science and Business Media Deutschland, S. 95-110 16 S. (Lecture Notes in Computer Science; Band 15836 LNCS).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Analyzing the Influence of Knowledge Graph Information on Relation Extraction.

Möller, C. & Usbeck, R., 2025

Publikation: Andere wissenschaftliche Beiträge › Andere › Forschung

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Möller, C. & Usbeck, R., 2025, The Semantic Web: 22nd European Semantic Web Conference, ESWC 2025 Portoroz, Slovenia, June 1–5, 2025 Proceedings, Part I. Curry, E., Acosta, M., Poveda-Villalón, M., van Erp, M., Ojo, A., Hose, K., Shimizu, C. & Lisena, P. (Hrsg.). Cham: Springer Nature Switzerland AG, Band 1. S. 460-480 21 S. (Lecture Notes in Computer Science ; Band 15718).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Automating SPARQL Query Translations between DBpedia and Wikidata

Bartels, M. C., Banerjee, D. & Usbeck, R., 14.07.2025, Linking Meaning: Semantic Technologies Shaping the Future of AI: Cover 74617 Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Spahiu, B., Vahdati, S., Salatino, A., Pellegrini, T. & Havur, G. (Hrsg.). IOS Press BV, S. 176-193 18 S. (Studies on the Semantic Web; Band 62).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung

Best Practices in AI and Data Science Models Evaluation

Banerjee, D., Taffa, T. A. & Usbeck, R., 2025, 55. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2025: The Wide Open - Offenheit von Source bis Science, Potsdam, Germany, September 16-19, 2025. Lucke, U., Stieglitz, S., Uebernickel, F., Lamprecht, A.-L. & Klein, M. (Hrsg.). Gesellschaft für Informatik, Bonn, Band P-366. S. 1211-1219 9 S. (LNI).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

DOI

https://doi.org/10.3233/SW-233471
Endgültige, publizierte Fassung

QALD-10 — The 10th Challenge on Question Answering over Linked Data: Shifting from DBpedia to Wikidata as a KG for KGQA

Standard

Harvard

APA

Vancouver

Bibtex

RIS

In der gleichen Zeitschrift

Survey on English Entity Linking on Wikidata: Datasets and approaches

Benchmarking question answering systems

Gerbil – Benchmarking named entity recognition and linking consistently

Survey on challenges of Question Answering in the Semantic Web

Weitere Publikationen dieser Person(en)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Analyzing the Influence of Knowledge Graph Information on Relation Extraction.

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Automating SPARQL Query Translations between DBpedia and Wikidata

Best Practices in AI and Data Science Models Evaluation

Links

DOI

Zuletzt angesehen

Publikationen