The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

Publication: Contributions to collected editions, Articles in conference proceedings, Research, peer-reviewed

Authors

In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are predominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.
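The idea of a vocabulary substitution can be illustrated with a minimal sketch. The substitution table below is hypothetical and chosen only for illustration; it is not the set of replacements used in the paper, and the function names `encode` and `decode` are likewise assumptions. The sketch rewrites SPARQL keywords and braces into plain-English words before a model would be trained on them, and maps a generated string back to SPARQL afterwards.

```python
import re

# Hypothetical substitution table (illustrative only, not taken from the paper):
# SPARQL keywords and symbols mapped to ordinary English words.
SUBSTITUTIONS = {
    "SELECT": "choose",
    "DISTINCT": "unique",
    "WHERE": "given",
    "FILTER": "keep",
    "{": "begin",
    "}": "end",
}
# Inverse table, used to restore the original query vocabulary.
REVERSE = {new: old for old, new in SUBSTITUTIONS.items()}


def tokenize(text):
    """Split text into single braces and whitespace-delimited tokens."""
    return re.findall(r"[{}]|[^\s{}]+", text)


def substitute(text, table):
    """Replace every token found in `table`; leave other tokens unchanged."""
    return " ".join(table.get(tok, tok) for tok in tokenize(text))


def encode(query):
    """Rewrite a SPARQL query into the substituted vocabulary."""
    return substitute(query, SUBSTITUTIONS)


def decode(text):
    """Map a string in the substituted vocabulary back to SPARQL."""
    return substitute(text, REVERSE)


query = "SELECT DISTINCT ?x WHERE { ?x :type :City }"
encoded = encode(query)
print(encoded)          # choose unique ?x given begin ?x :type :City end
print(decode(encoded))  # SELECT DISTINCT ?x WHERE { ?x :type :City }
```

This sketch assumes that none of the replacement words already occur as tokens in the query (for example as a literal or identifier); a real substitution scheme would need to guarantee that the mapping is reversible.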
Original language: English
Title: Findings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023
Editors: Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki
Number of pages: 10
Place of publication: Stroudsburg
Publisher: Association for Computational Linguistics (ACL)
Publication date: 01.07.2023
Pages: 12219-12228
ISBN (electronic): 978-1-959429-62-3
DOIs
Publication status: Published - 01.07.2023
Externally published: Yes
Event: 61st Annual Meeting of the Association for Computational Linguistics - Toronto, Canada
Duration: 09.07.2023 to 14.07.2023
Conference number: 61
https://2023.aclweb.org

Bibliographic note

Publisher Copyright:
© 2023 Association for Computational Linguistics.
