Automating SPARQL Query Translations between DBpedia and Wikidata

Malte Christian Bartels; Debayan Banerjee; Ricardo Usbeck

doi:10.3233/SSW250019

Automating SPARQL Query Translations between DBpedia and Wikidata

Research output: Contributions to collected editions/works › Article in conference proceedings › Research

Authors

Malte Christian Bartels
Debayan Banerjee
Ricardo Usbeck

Professorship for Information Systems, in particular Artificial Intelligence and Explainability

Purpose:

This paper investigates whether state-of-the-art Large Language Models (LLMs) can automatically translate SPARQL between popular Knowledge Graph (KG) schemas. We focus on translations between the DBpedia and Wikidata KG, and later on DBLP and OpenAlex KG. This study addresses a notable gap in KG interoperability research by evaluating LLM performance on SPARQL-to-SPARQL translation.
Methodology:

Two benchmarks are assembled, where the first aligns 100 DBpedia–Wikidata queries from QALD-9-Plus dataset; the second contains 100 DBLP queries aligned to OpenAlex, testing generalizability beyond encyclopaedic KGs. Three open LLMs: Llama-3-8B, DeepSeek-R1-Distill-Llama-70B, and Mistral-Large-Instruct-2407 are selected based on their sizes and architectures and tested with zero-shot, few-shot, and two chain-of-thought variants. Outputs were compared with gold-standard answers, and resulting errors were systematically categorized.
Findings:

We find that the performance varies markedly across models and prompting strategies, and that translations for Wikidata to DBpedia work far better than translations for DBpedia to Wikidata. The largest model, Mistral-Large-Instruct-2407, achieved the highest accuracy, reaching 86% on the Wikidata → DBpedia task using a Chain-of-Thought approach. This performance was replicated in the DBLP → OpenAlex generalization task, which achieved similar results with a few- shot setup, underscoring the critical role of in-context examples.
Value:

This study demonstrates a viable and scalable pathway toward KG interoperability by using LLMs with structured prompting and explicit schema-mapping tables to translate queries across heterogeneous KGs. The method’s strong performance when applied to general purpose KGs and specialized scholarly domain suggests its potential as a promising approach to reduce the manual effort required for cross-KG data integration and analysis.

Original language	English
Title of host publication	Linking Meaning: Semantic Technologies Shaping the Future of AI : Cover 74617 Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria
Editors	Blerina Spahiu, Sahar Vahdati, Angelo Salatino, Tassilo Pellegrini, Giray Havur
Number of pages	18
Publisher	IOS Press BV
Publication date	14.07.2025
Pages	176-193
ISBN (electronic)	978-1-64368-616-5
DOIs	https://doi.org/10.3233/SSW250019
Publication status	Published - 14.07.2025

Research areas

cs.AI, cs.CL
Informatics

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Salnikov, M., Sakhovskiy, A., Nikishina, I., Usmanova, A., Kraft, A., Möller, C., Banerjee, D., Huang, J., Jiang, L., Abdullah, R., Yan, X., Tutubalina, E., Usbeck, R. & Panchenko, A., 2026, Natural Language Processing and Information Systems: 30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, Proceedings. Ichise, R. (ed.). Springer Science and Business Media Deutschland, p. 95-110 16 p. (Lecture Notes in Computer Science; vol. 15836 LNCS).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Möller, C. & Usbeck, R., 2025, The Semantic Web: 22nd European Semantic Web Conference, ESWC 2025 Portoroz, Slovenia, June 1–5, 2025 Proceedings, Part I. Curry, E., Acosta, M., Poveda-Villalón, M., van Erp, M., Ojo, A., Hose, K., Shimizu, C. & Lisena, P. (eds.). Cham: Springer Nature Switzerland AG, Vol. 1. p. 460-480 21 p. (Lecture Notes in Computer Science ; vol. 15718).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Bridge-Generate: Scholarly Hybrid Question Answering

Taffa, T. A. & Usbeck, R., 23.05.2025, WWW Companion 2025 - Companion Proceedings of the ACM Web Conference 2025: Companion Proceedings of the ACM Web Conference 2025, April 28-May 2, 2025 Sydney, NSW, Australia. Long, G., Blumestein, M., Chang, Y., Lewin-Eytan, L., Huang, H. & Yom-Tov, E. (eds.). New York: Association for Computing Machinery, Inc, p. 1321-1325 5 p.

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

DBLPLink 2.0 -- An Entity Linker for the DBLP Scholarly Knowledge Graph

Banerjee, D., Taffa, T. A. & Usbeck, R., 30.07.2025

Research output: other publications › Other › Research

HySQA: Hybrid Scholarly Question Answering

Taffa, T., Banerjee, D., Assabie, Y. & Usbeck, R., 26.08.2025, Linking Meaning: Semantic Technologies Shaping the Future of AI: Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Spahiu, B., Vahdati, S., Salatino, A., Pellegrini, T. & Havur, G. (eds.). Amsterdam: IOS Press BV, p. 247-263 17 p. (Studies on the Semantic Web; vol. 62).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Documents

Download (Post-Print)
802 KB, PDF document

DOI

https://doi.org/10.3233/SSW250019
Final published version

Automating SPARQL Query Translations between DBpedia and Wikidata

Authors

Research areas

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Bridge-Generate: Scholarly Hybrid Question Answering

DBLPLink 2.0 -- An Entity Linker for the DBLP Scholarly Knowledge Graph

HySQA: Hybrid Scholarly Question Answering

Documents

DOI

Recently viewed

Projects

Activities

Publications

Press / Media