The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.
OriginalspracheEnglisch
TitelFindings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023
HerausgeberAnna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki
Anzahl der Seiten10
ErscheinungsortStroudsburg
VerlagAssociation for Computational Linguistics (ACL)
Erscheinungsdatum01.07.2023
Seiten12219-12228
ISBN (elektronisch)978-1-959429-62-3
DOIs
PublikationsstatusErschienen - 01.07.2023
Extern publiziertJa
Veranstaltung61st Annual Meeting of the Association for Computational Linguistics - Toronto, Kanada
Dauer: 09.07.202314.07.2023
Konferenznummer: 61
https://2023.aclweb.org

Bibliographische Notiz

Publisher Copyright:
© 2023 Association for Computational Linguistics.

Zuletzt angesehen

Aktivitäten

  1. Provenienza e Intelligenza Artificiale (Provenance and Artificial Intelligence)
  2. Intra-firm Wage Dispersion and Cost Coverage of Training: Evidence from German Linked Employer-Employee Data
  3. 45th IEEE Conference on Decision and Control - CDC 2006
  4. Optimal scheduling for Automated Guided Vehicles (AGV) in blocking job-shops
  5. Leveraging digital affordances to make language learning stick
  6. The hidden power dynamics of organizing through enterprise social media
  7. Artistic Utopian Spaces and the Promise of Urban Development
  8. Crumpled Times. Temporal and Epistemological Depths of Agent-Based Traffic Simulations
  9. 1st ECPR Winter School in Methods and Techniques 2012
  10. Life Cycle Assessment and Material Flow Analysis
  11. C- and Si-analogous compounds – comparison of their behaviour in a test for ready biodegradability
  12. Workshop: "Mit Leben Rechnen"
  13. Campusemerge 2011
  14. “The Bigger Picture of Corruption: A Comparative Analysis of Europe and the Rest of the World”, 03.03.2014.
  15. The Predictive Power of Social Media Sentiment for Short-Term Stock Movements
  16. Internet-based guided self-help to reduce depressive symptoms in teachers: Results from a randomized controlled trial
  17. Intercultural Relations in Practice 2017
  18. Das Unding. Colonial Gothic in Contemporary Art
  19. Ocean eddies and the polar vortex: coherence in complex systems
  20. Mercator School of Management
  21. Science Slam a New Popularized and Artistic Way of Informal Science Communication. Challenges of Contemporary Science Communication
  22. Elsevier B.V. (Externe Organisation)

Publikationen

  1. Do abundance distributions and species aggregation correctly predict macroecological biodiversity patterns in tropical forests?
  2. Performance of methods to select landscape metrics for modelling species richness
  3. Can we use isotopes to capture the speed of link between photosynthesis and soil respiration?
  4. Failing and the perception of failure in student-driven transdisciplinary projects
  5. Internet-based public debate of CCS
  6. Embedded, not plugged-in
  7. Modelling ammonia emissions after field application of biogas slurries
  8. Understanding and managing post-acquisition integration as change process
  9. Safer Spaces
  10. Searching for New Languages, Searching for Minor Voices in the Archive
  11. Governing Objects from a Distance
  12. Non-technical success factors for bioenergy projects-Learning from a multiple case study in Japan
  13. Examining how AI capabilities can foster organizational performance in public organizations
  14. Migration-Based Multilingualism in the English as a Foreign Language Classroom
  15. Teaching Sustainable Development in a Sensory and Artful Way — Concepts, Methods, and Examples
  16. Do children with deficits in basic cognitive functions profit from mixed age primary schools?
  17. Multi-view hidden markov perceptrons
  18. Implementation of formative assessment
  19. Assessing empirical research on value-based management
  20. A highly transparent method of assessing the contribution of incentives to meet various technical challenges in distributed energy systems
  21. Negotiation complexity
  22. Enhancing the structural diversity between forest patches — A concept and real-world experiment to study biodiversity, multifunctionality and forest resilience across spatial scales
  23. Microstructure, mechanical properties and fracture behaviors of large-scale sand-cast Mg-3Y-2Gd-1Nd-0.4Zr alloy