Modern Baselines for SPARQL Semantic Parsing
Publication: Contributions in collected editions › Conference proceedings papers › Research › peer-reviewed
Standard
SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. Ed. / Enrique Amigo; Pablo Castells; Julio Gonzalo. Association for Computing Machinery, Inc, 2022. pp. 2260-2265 (SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval).
RIS
TY - CHAP
T1 - Modern Baselines for SPARQL Semantic Parsing
AU - Banerjee, Debayan
AU - Nair, Pranav Ajit
AU - Kaur, Jivat Neet
AU - Usbeck, Ricardo
AU - Biemann, Chris
N1 - Conference code: 45
PY - 2022/7/6
Y1 - 2022/7/6
N2 - In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entities and relations have been provided, and the remaining task is to arrange them in the right order, along with SPARQL vocabulary and input tokens, to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state-of-the-art performance on the LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.
AB - In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entities and relations have been provided, and the remaining task is to arrange them in the right order, along with SPARQL vocabulary and input tokens, to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state-of-the-art performance on the LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.
KW - knowledge graph
KW - question answering
KW - semantic parsing
KW - sparql
KW - Business informatics
KW - Informatics
UR - http://www.scopus.com/inward/record.url?scp=85135063193&partnerID=8YFLogxK
U2 - 10.1145/3477495.3531841
DO - 10.1145/3477495.3531841
M3 - Article in conference proceedings
AN - SCOPUS:85135063193
T3 - SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
SP - 2260
EP - 2265
BT - SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
A2 - Amigo, Enrique
A2 - Castells, Pablo
A2 - Gonzalo, Julio
PB - Association for Computing Machinery, Inc
T2 - 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2022
Y2 - 11 July 2022 through 15 July 2022
ER -
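
A minimal, illustrative sketch (in Python, using the Hugging Face transformers library) of the sequence-to-sequence formulation described in the abstract; it is not the authors' released pipeline. The question is serialised together with the gold entities and relations into one source string, and a T5 model generates the SPARQL query from it. The checkpoint name, prompt format and example DBpedia URIs are assumptions for illustration only; in practice T5 would first be fine-tuned on (question + gold entities/relations -> SPARQL) pairs from LC-QuAD 1.0/2.0, and the special input tokenisation the paper finds necessary for T5 is not reproduced here.

# Sketch only: frame SPARQL generation as seq2seq with T5.
# "t5-small" is a hypothetical stand-in for a checkpoint fine-tuned on
# (question + gold entities/relations -> SPARQL) pairs.
from transformers import T5ForConditionalGeneration, T5Tokenizer

MODEL_NAME = "t5-small"  # placeholder checkpoint name (assumption)
tokenizer = T5Tokenizer.from_pretrained(MODEL_NAME)
model = T5ForConditionalGeneration.from_pretrained(MODEL_NAME)

question = "Who is the mayor of Berlin?"   # example question
gold_entities = ["dbr:Berlin"]             # example gold entity annotation
gold_relations = ["dbo:leader"]            # example gold relation annotation

# Serialise the question plus gold entities/relations into a single source
# string, so the decoder can reuse these URIs when generating the query.
source = (
    "parse to sparql: " + question
    + " | entities: " + " ".join(gold_entities)
    + " | relations: " + " ".join(gold_relations)
)

inputs = tokenizer(source, return_tensors="pt")
output_ids = model.generate(**inputs, max_length=128, num_beams=4)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))

Serialising the gold entities and relations into the encoder input is what makes the copy-style behaviour mentioned in the abstract possible: the decoder can reproduce input URIs, or literal spans from the question, directly in the output query instead of having to memorise them.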