Automating SPARQL Query Translations between DBpedia and Wikidata

Research output: Contributions to collected editions/works › Article in conference proceedings › Research


Purpose:

This paper investigates whether state-of-the-art Large Language Models (LLMs) can automatically translate SPARQL queries between popular Knowledge Graph (KG) schemas. We focus first on translations between the DBpedia and Wikidata KGs, and then on translations between the DBLP and OpenAlex KGs. The study addresses a notable gap in KG interoperability research by evaluating LLM performance on SPARQL-to-SPARQL translation.
Methodology:

Two benchmarks were assembled: the first aligns 100 DBpedia–Wikidata queries from the QALD-9-Plus dataset; the second contains 100 DBLP queries aligned to OpenAlex, testing generalizability beyond encyclopaedic KGs. Three open LLMs (Llama-3-8B, DeepSeek-R1-Distill-Llama-70B, and Mistral-Large-Instruct-2407) were selected based on their sizes and architectures and tested with zero-shot, few-shot, and two chain-of-thought prompting variants. Outputs were compared with gold-standard answers, and the resulting errors were systematically categorized.
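The comparison against gold-standard answers can be illustrated with a minimal sketch. The abstract does not specify the exact metric, so this assumes an exact set match between the answers returned by the translated query and the gold answers; the function names and the sample data are hypothetical.

```python
from typing import Iterable


def answers_match(predicted: Iterable[str], gold: Iterable[str]) -> bool:
    """Assumed criterion: identical answer sets, ignoring order and duplicates."""
    return set(predicted) == set(gold)


def accuracy(results: list[tuple[list[str], list[str]]]) -> float:
    """Fraction of (predicted, gold) pairs whose answer sets match exactly."""
    if not results:
        return 0.0
    correct = sum(answers_match(pred, gold) for pred, gold in results)
    return correct / len(results)


# Hypothetical answer sets, one (predicted, gold) pair per translated query.
runs = [
    (["http://dbpedia.org/resource/Berlin"], ["http://dbpedia.org/resource/Berlin"]),
    (["http://dbpedia.org/resource/Paris"], ["http://dbpedia.org/resource/Lyon"]),
    ([], ["http://dbpedia.org/resource/Rome"]),  # translated query returned nothing
    (["a", "b"], ["b", "a"]),                    # order does not matter
]
score = accuracy(runs)
print(score)
```

An exact-match criterion is strict: a translation that returns a superset or subset of the gold answers counts as wrong, which is one reason a separate error categorization is useful.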
Findings:

Performance varies markedly across models and prompting strategies, and translations from Wikidata to DBpedia work far better than translations from DBpedia to Wikidata. The largest model, Mistral-Large-Instruct-2407, achieved the highest accuracy, reaching 86% on the Wikidata → DBpedia task using a chain-of-thought approach. This performance was replicated in the DBLP → OpenAlex generalization task, which achieved similar results with a few-shot setup, underscoring the critical role of in-context examples.
Value:

This study demonstrates a viable and scalable pathway toward KG interoperability: LLMs combined with structured prompting and explicit schema-mapping tables can translate queries across heterogeneous KGs. The method's strong performance on both general-purpose KGs and a specialized scholarly domain suggests that it can reduce the manual effort required for cross-KG data integration and analysis.
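As an illustration of how a schema-mapping table and in-context examples might be combined into a translation prompt, here is a minimal sketch. The prompt wording, the `build_prompt` helper, and the mapping entries are assumptions made for illustration; they are not the prompts or tables used in the paper.

```python
# Illustrative mapping entries (Wikidata property -> DBpedia property); assumed, not from the paper.
MAPPING = {
    "wdt:P50": "dbo:author",
    "wdt:P19": "dbo:birthPlace",
}

# One hypothetical in-context example: (Wikidata query, DBpedia query).
EXAMPLES = [
    (
        "SELECT ?place WHERE { wd:Q42 wdt:P19 ?place }",
        "SELECT ?place WHERE { dbr:Douglas_Adams dbo:birthPlace ?place }",
    ),
]


def build_prompt(query: str, mapping: dict[str, str], examples=()) -> str:
    """Assemble a prompt; passing no examples gives a zero-shot variant."""
    lines = [
        "Translate the following SPARQL query from Wikidata to DBpedia.",
        "",
        "Schema mapping (Wikidata -> DBpedia):",
    ]
    lines += [f"{src} -> {dst}" for src, dst in mapping.items()]
    for src_query, dst_query in examples:
        lines += ["", "Wikidata query:", src_query, "DBpedia query:", dst_query]
    # The prompt ends with an open slot for the model to complete.
    lines += ["", "Wikidata query:", query, "DBpedia query:"]
    return "\n".join(lines)


prompt = build_prompt("SELECT ?book WHERE { ?book wdt:P50 wd:Q42 }", MAPPING, EXAMPLES)
print(prompt)
```

A chain-of-thought variant would, under the same assumptions, add an instruction asking the model to list the mapped entities and properties before writing the final query.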
Original language: English
Title of host publication: Linking Meaning: Semantic Technologies Shaping the Future of AI : Cover 74617 Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria
Editors: Blerina Spahiu, Sahar Vahdati, Angelo Salatino, Tassilo Pellegrini, Giray Havur
Number of pages: 18
Publisher: IOS Press BV
Publication date: 14.07.2025
Pages: 176-193
ISBN (electronic): 978-1-64368-616-5
Publication status: Published - 14.07.2025
