Modern Baselines for SPARQL Semantic Parsing

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Modern Baselines for SPARQL Semantic Parsing. / Banerjee, Debayan; Nair, Pranav Ajit; Kaur, Jivat Neet et al.
SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ed. / Enrique Amigo; Pablo Castells; Julio Gonzalo. Association for Computing Machinery, Inc, 2022. p. 2260-2265 (SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Banerjee, D, Nair, PA, Kaur, JN, Usbeck, R & Biemann, C 2022, Modern Baselines for SPARQL Semantic Parsing. in E Amigo, P Castells & J Gonzalo (eds), SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery, Inc, pp. 2260-2265, 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2022, Madrid, Spain, 11.07.22. https://doi.org/10.1145/3477495.3531841

APA

Banerjee, D., Nair, P. A., Kaur, J. N., Usbeck, R., & Biemann, C. (2022). Modern Baselines for SPARQL Semantic Parsing. In E. Amigo, P. Castells, & J. Gonzalo (Eds.), SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2260-2265). (SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval). Association for Computing Machinery, Inc. https://doi.org/10.1145/3477495.3531841

Vancouver

Banerjee D, Nair PA, Kaur JN, Usbeck R, Biemann C. Modern Baselines for SPARQL Semantic Parsing. In Amigo E, Castells P, Gonzalo J, editors, SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery, Inc. 2022. p. 2260-2265. (SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval). doi: 10.1145/3477495.3531841

Bibtex

@inbook{611debf5b1ff4a079c5e865344a099c6,
title = "Modern Baselines for SPARQL Semantic Parsing",
abstract = "In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entity and relations have been provided, and the remaining task is to arrange them in the right order along with SPARQL vocabulary, and input tokens to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state of the art performance on LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.",
keywords = "knowledge graph, question answering, semantic parsing, sparql, Business informatics, Informatics",
author = "Debayan Banerjee and Nair, {Pranav Ajit} and Kaur, {Jivat Neet} and Ricardo Usbeck and Chris Biemann",
note = "Publisher Copyright: {\textcopyright} 2022 ACM.; 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2022, ACM SIGIR 2022 ; Conference date: 11-07-2022 Through 15-07-2022",
year = "2022",
month = jul,
day = "7",
doi = "10.1145/3477495.3531841",
language = "English",
series = "SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval",
publisher = "Association for Computing Machinery, Inc",
pages = "2260--2265",
editor = "Enrique Amigo and Pablo Castells and Julio Gonzalo",
booktitle = "SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval",
address = "United States",
url = "https://sigir.org/sigir2022/",

}

RIS

TY - CHAP

T1 - Modern Baselines for SPARQL Semantic Parsing

AU - Banerjee, Debayan

AU - Nair, Pranav Ajit

AU - Kaur, Jivat Neet

AU - Usbeck, Ricardo

AU - Biemann, Chris

N1 - Conference code: 45

PY - 2022/7/7

Y1 - 2022/7/7

N2 - In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entity and relations have been provided, and the remaining task is to arrange them in the right order along with SPARQL vocabulary, and input tokens to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state of the art performance on LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.

AB - In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entity and relations have been provided, and the remaining task is to arrange them in the right order along with SPARQL vocabulary, and input tokens to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state of the art performance on LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.

KW - knowledge graph

KW - question answering

KW - semantic parsing

KW - sparql

KW - Business informatics

KW - Informatics

UR - http://www.scopus.com/inward/record.url?scp=85135063193&partnerID=8YFLogxK

U2 - 10.1145/3477495.3531841

DO - 10.1145/3477495.3531841

M3 - Article in conference proceedings

AN - SCOPUS:85135063193

T3 - SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

SP - 2260

EP - 2265

BT - SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

A2 - Amigo, Enrique

A2 - Castells, Pablo

A2 - Gonzalo, Julio

PB - Association for Computing Machinery, Inc

T2 - 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2022

Y2 - 11 July 2022 through 15 July 2022

ER -

DOI

Recently viewed

Prizes

  1. Transferpreis

Publications

  1. Root-root interactions: extending our perspective to be more inclusive of the range of theories in ecology and agriculture using in-vivo analyses
  2. Modelling, explaining, enacting and getting feedback: How can the acquisition of core practices in teacher education be optimally fostered?
  3. An Adaptive Resonance Regulator for an Actuator using Periodic Signals in Camless Engine Systems
  4. Measurement invariance in a grid-based measure of academic self-concept
  5. Datenstrukturen & Algorithmen
  6. Points of cooperation: integrating cooperative learning into web-based courses
  7. The effect of yield surface curvature change by cross hardening on forming limit diagrams of sheets
  8. Using latent class analysis to produce a typology of environmental concern in the UK
  9. An Experimental Approach to the Optimization of Customer Information at the Point of Sale
  10. Home range size and resource use of breeding and non-breeding white storks along a land use gradient
  11. Class size, student performance and Tiebout bias
  12. Feature Extraction and Aggregation for Predicting the Euro 2016
  13. Ablation Study of a Multimodal Gat Network on Perfect Synthetic and Real-world Data to Investigate the Influence of Language Models in Invoice Recognition
  14. Perfectly nested or significantly nested - an important difference for conservation management
  15. The role of plant biodiversity in modifying the structure and functioning of higher tropic Levels in species-rich forests
  16. Computational modelling of submicron-sized metallic glasses
  17. Determinants and Outcomes of Dual Distribution:
  18. Facing Up to Third Party Liability for Space Activities
  19. Erroneous examples as desirable difficulty
  20. Implementation of Chemometric Tools to Improve Data Mining and Prioritization in LC-HRMS for Nontarget Screening of Organic Micropollutants in Complex Water Matrixes
  21. Deactivation and transformation products in biodegradability testing of ß-lactams amoxicillin and piperacillin
  22. Explaining Disagreement on Interest Rates in a Taylor-Rule Setting
  23. The implications of knowledge hiding at work for recovery after work: A diary study
  24. Fusion of knowledge bases for better navigation of wheeled mobile robotic group with 3D TVS
  25. 2. Advent
  26. Consular Assistance: Rights, Remedies, and Responsibility Comments on the ICJ's Judgment in the LaGrand Case
  27. An Approach for Ex-Post-Facto Analysis of Knowledge Graph-Driven Chatbots – The DBpedia Chatbot