The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.
OriginalspracheEnglisch
TitelFindings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023
HerausgeberAnna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki
Anzahl der Seiten10
ErscheinungsortStroudsburg
VerlagAssociation for Computational Linguistics (ACL)
Erscheinungsdatum01.07.2023
Seiten12219-12228
ISBN (elektronisch)978-1-959429-62-3
DOIs
PublikationsstatusErschienen - 01.07.2023
Extern publiziertJa
Veranstaltung61st Annual Meeting of the Association for Computational Linguistics - Toronto, Kanada
Dauer: 09.07.202314.07.2023
Konferenznummer: 61
https://2023.aclweb.org

Bibliographische Notiz

Publisher Copyright:
© 2023 Association for Computational Linguistics.

Zuletzt angesehen

Forschende

  1. Oliver Mock

Publikationen

  1. Differentiating forest types using TerraSAR–X spotlight images based on inferential statistics and multivariate analysis
  2. Estimation of minimal data sets sizes for machine learning predictions in digital mental health interventions
  3. Towards Computer Simulations of Virtue Ethics
  4. On the role of linguistic features for comprehension and learning from STEM texts. A meta-analysis
  5. Careless responding detection revisited
  6. Do consumers prefer pasture-raised dual-purpose cattle when considering meat products? A hypothetical discrete choice experiment for the case of minced beef
  7. The Pricing of Default-free Interest Rate Cap, Floor, and Collar Agreements
  8. Frames of systems change in sustainability transformations: Lessons from sociotechnical systems and circular economy case studies
  9. Explorations in Social Spaces
  10. Urgent need for updating the slogan of global climate actions from 'tree planting' to 'restore native vegetation'
  11. From railroad imperialism to neoliberal reprimarization: Lessons from regime-shifts in the Global Soybean Complex
  12. Planning for Sea Spaces I: Processes, Practices and Future Perspectives
  13. “If It Bleeds It Leads”
  14. Matching between oral inward–outward movements of object names and oral movements associated with denoted objects
  15. Hindering and Facilitating Factors for Developing and Implementing HR Measures for Older Workers
  16. Industrial Clusters as a Factor for Innovative Drive- in Regions of Transformation and Structural Change
  17. Wer wird subventioniert?
  18. When to sample in an inaccessible landscape
  19. Industry Transformations for High Service Provisioning with Lower Energy and Material Demand
  20. The Utilization of Artificial Intelligence in Higher Education Institutions in Germany
  21. Storytelling in instant messenger communication