Modern Baselines for SPARQL Semantic Parsing

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entity and relations have been provided, and the remaining task is to arrange them in the right order along with SPARQL vocabulary, and input tokens to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state of the art performance on LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.

Original languageEnglish
Title of host publicationSIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
EditorsEnrique Amigo, Pablo Castells, Julio Gonzalo
Number of pages6
PublisherAssociation for Computing Machinery, Inc
Publication date07.07.2022
Pages2260-2265
ISBN (electronic)978-1-4503-8732-3
DOIs
Publication statusPublished - 07.07.2022
Externally publishedYes
Event45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2022 - Online + Círculo de Bellas Artes (Circle of Beaux Arts), Madrid, Spain
Duration: 11.07.202215.07.2022
Conference number: 45
https://sigir.org/sigir2022/

Bibliographical note

Publisher Copyright:
© 2022 ACM.

DOI

Recently viewed

Publications

  1. Model based logistic monitoring for supply and assembly processes
  2. Using Digitalization As An Enabler For Changeability In Production Systems In A Learning Factory Environment
  3. Differentiating forest types using TerraSAR–X spotlight images based on inferential statistics and multivariate analysis
  4. A Hermeneutic Interpretation of Concepts in a Cooperative Multicultural Working Project
  5. Intraspecific trait variation patterns along a precipitation gradient in Mongolian rangelands
  6. Multilayer neural networks
  7. THE PARALLAX OF INDIVIDUATION
  8. Development and comparison of processing maps of Mg-3Sn-1Ca alloy from data obtained in tension versus compression
  9. Nonlinear anisotropic boundary value problems – regularity results and multiscale discretizations
  10. Determining Lot Sizes in Production Areas
  11. An Optimization Approach for Crew Rostering in Public Bus Transit
  12. Cost effectiveness of guided Internet-based interventions for depression in comparison with control conditions
  13. Robust Control of Excavation Mobile Robot with Dynamic Triangulation Vision
  14. Walk counts, labyrinthicity, and complexity of acyclic and cyclic graphs and molecules.
  15. Using an adaptive memory strategy to improve a multistart heuristic for sequencing by hybridization
  16. Agile Portfolio Management Patterns
  17. Educational reconstruction as model for the theory-based design of student-centered learning environments in electrical engineering courses
  18. Image, Process, Performance, Machine
  19. Effects of plyometric training on postural control in static and dynamic testing situations
  20. Development and application of a laboratory flux measurement system (LFMS) for the investigation of the kinetics of mercury emissions from soils
  21. Hacking the Classroom
  22. Development and evaluation of a training program for dialysis nurses - An intervention study
  23. Tree diversity and mycorrhizal type co-determine multitrophic ecosystem functions
  24. Res Lunae: Characterizing Diverse Lunar Resource Systems Using the Social-Ecological System Framework
  25. The Creation of the Concept through the Interaction of Philosophy with Science and Art
  26. Continental mapping of forest ecosystem functions reveals a high but unrealised potential for forest multifunctionality.
  27. Microstructure and mechanical properties of as-cast Mg-Sn-Ca alloys and effect of alloying elements
  28. Effect of thermo-mechanical conditions during constrained friction processing on the particle refinement of AM50 Mg-alloy phases
  29. Embarrassment as a public vs. private emotion and symbolic coping behaviour