Modern Baselines for SPARQL Semantic Parsing

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entity and relations have been provided, and the remaining task is to arrange them in the right order along with SPARQL vocabulary, and input tokens to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state of the art performance on LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.

Original languageEnglish
Title of host publicationSIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
EditorsEnrique Amigo, Pablo Castells, Julio Gonzalo
Number of pages6
PublisherAssociation for Computing Machinery, Inc
Publication date06.07.2022
Pages2260-2265
ISBN (electronic)978-1-4503-8732-3
DOIs
Publication statusPublished - 06.07.2022
Externally publishedYes
Event45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2022 - Online + Círculo de Bellas Artes (Circle of Beaux Arts), Madrid, Spain
Duration: 11.07.202215.07.2022
Conference number: 45
https://sigir.org/sigir2022/

Bibliographical note

Publisher Copyright:
© 2022 ACM.

DOI

Recently viewed

Activities

  1. (De)composing Public Value: New Evidence for Basic Structures
  2. Speaking about vision, talking in the name of so much more: A methodological framework for ventriloquial analyses in organization studies
  3. Explaining Healthcare System Change
  4. Knowledge Space(s) of Globalization – Musealizing Things, People and Spaces of Global Trade
  5. Digital Games Lab Lecture Series - 2018
  6. How does tree sapling diversity influence browsing intensity by deer across spatial scales?
  7. Vortrag: Teaching football skills with digitally supported learning processes in primary school
  8. Going Green: Digital project work as a transdisciplinary and transcultural task in the foreign language and STEM classrooms
  9. Trajectory-based Lagrangian approaches for the extraction and characterization of coherent structures in turbulent convection
  10. International Conference of Computational Methods in Engineering Science - Chair of Session III
  11. Liquidity, Flows, Circulation: The Cultural Logic of Environmentalization (2nd part) 2021
  12. Using Ethnographic Methods in Organizational Communication Research: Considering Materiality, Aesthetics and Embodiment
  13. Organizational Practices for the Aging Workforce: Validation of an English Version of the Later Life Workplace Index
  14. Was ist besser? Video-, Text- oder Live-Reflexion?
  15. Challenges for the Positioning of Destinations: Destination Formation Processes and Territorial Boundaries
  16. Transdisciplinary Evaluation of Different Coastal Adaptation Strategies: Integrating Regional Perceptions of Scientists, Practitioners and the Public
  17. Agile Portfolio Management Patterns - A Research Design
  18. Current Developments in Environmental Management Accounting: Towards a Comprehensive Framework for Environmental Management Accounting
  19. International Conference of EAS and ISME - 2007

Publications

  1. Performance of methods to select landscape metrics for modelling species richness
  2. Earnings Less Risk-Free Interest Charge (ERIC) and Stock Returns—A Value-Based Management Perspective on ERIC’s Relative and Incremental Information Content
  3. Automatic three-dimensional geometry and mesh generation of periodic representative volume elements for matrix-inclusion composites
  4. An empirically grounded ontology for analyzing IT-based interventions in business ecosystems
  5. Reducing mean tardiness in a flexible job shop containing AGVs with optimized combinations of sequencing and routing rules
  6. The relationship between audit committees, external auditors, and internal control systems
  7. Germination changes can restructure communities through priority effects
  8. German Utilities and distributed PV
  9. The Structure of Student Interest in Computers and Information Technology
  10. Exploring Leverages and Pitfalls of Context Collapse in Modern Communication
  11. General Patterns and Conclusions
  12. Mechanistic Realization of the Turtle Shell
  13. Reporting and Analysing the Environmental Impact of Language Models on the Example of Commonsense Question Answering with External Knowledge
  14. A Developmental Trend in the Structure of Time-Estimation Performance
  15. Explaining the (Non-) Adoption of Advanced Data Analytics in Auditing
  16. A cognitive mapping approach to understanding public objection to energy infrastructure
  17. How difficult is the adaptation of POS taggers?
  18. Online-scheduling using past and real-time data
  19. Processing of CSR communication
  20. Robust and Optimal Control Designed for Autonomous Surface Vessel Prototypes
  21. Res Lunae: Characterizing Diverse Lunar Resource Systems Using the Social-Ecological System Framework
  22. Enterprise Architecture Management Support for Digital Transformation Projects in Very Large Enterprises
  23. Excellence in Teaching and Learning
  24. Putting inquiry-based learning into practice
  25. A Study on the Impact of Intradomain Finetuning of Deep Language Models for Legal Named Entity Recognition in Portuguese
  26. Anwendungsprogrammierung mit Embedded-SQL