The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.
Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023
EditorsAnna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki
Number of pages10
Place of PublicationStroudsburg
PublisherAssociation for Computational Linguistics (ACL)
Publication date01.07.2023
Pages12219-12228
ISBN (electronic)978-1-959429-62-3
DOIs
Publication statusPublished - 01.07.2023
Externally publishedYes
Event61st Annual Meeting of the Association for Computational Linguistics - Toronto, Canada
Duration: 09.07.202314.07.2023
Conference number: 61
https://2023.aclweb.org

Bibliographical note

Publisher Copyright:
© 2023 Association for Computational Linguistics.

Recently viewed

Publications

  1. Earnings Less Risk-Free Interest Charge (ERIC) and Stock Returns—A Value-Based Management Perspective on ERIC’s Relative and Incremental Information Content
  2. Short run comovement, persistent shocks and the business cycle
  3. Peter Hay, Advanced Introduction to Private International Law and Procedure
  4. A comparison between private and public access rules to bottlenecks - experiences and expectations from telecommunication and energy
  5. Hill–Chao numbers allow decomposing gamma multifunctionality into alpha and beta components
  6. Low working memory reduces the use of mental contrasting
  7. Clusteranalyse als Methode zur Strukturierung großer Datenmodelle
  8. Analysis of the forming behaviour of in-situ drawn sandwich sheets
  9. Putting Architecture in its Social Space: the Fields and Skills of Planning Maastricht
  10. Resolving potential conflicts between different heathland ecosystem services through adaptive management
  11. Belief in free will affects causal attributions when judging others’ behavior
  12. Towards Computer Simulations of Virtue Ethics
  13. Habitual Actions as a Challenge to the Standard Theory of Action
  14. Ionic liquids vs. ethanol as extraction media of algicidal compounds from mango processing waste
  15. Edward Lear, A book of nonsense
  16. A trait-based framework linking the soil metabolome to plant–soil feedbacks
  17. Managing Knowledge in Organization Studies Through Instrumentation
  18. Conjectural variations equilibrium in a mixed duopoly
  19. Benno Reifenberg (1892-1970)
  20. Augmented space
  21. Determinants and Development of Schools in Organization Theory
  22. Extraction of information from invoices - challenges in the extraction pipeline
  23. Structuring Sustainability Reports for Environmental Standards with LLMs guided by Ontology
  24. Contested future-making in containment: temporalities, infrastructures and agency
  25. Who is a Migrant? Abandoning the Nation-State Point of View in the Study of Migration
  26. Synthesis of Room-Temperature Ionic Liquids with the Weakly Coordinating [Al(ORF)(4)](-) Anion (R-F = C(H)(CF3)(2)) and the Determination of Their Principal Physical Properties
  27. Vom Sagbaren zum Machbaren?
  28. The Eschatical Perfection of the World in God
  29. Temporal changes in taxonomic and functional alpha and beta diversity across tree communities in subtropical Atlantic forests
  30. Bridge-Generate
  31. Case Study Analysis
  32. Differential mortality rates in major and subthreshold depression
  33. Inflation Narratives from a Machine Learning Perspective
  34. Free work