The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.
OriginalspracheEnglisch
TitelFindings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023
HerausgeberAnna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki
Anzahl der Seiten10
ErscheinungsortStroudsburg
VerlagAssociation for Computational Linguistics (ACL)
Erscheinungsdatum01.07.2023
Seiten12219-12228
ISBN (elektronisch)978-1-959429-62-3
DOIs
PublikationsstatusErschienen - 01.07.2023
Extern publiziertJa
Veranstaltung61st Annual Meeting of the Association for Computational Linguistics - Toronto, Kanada
Dauer: 09.07.202314.07.2023
Konferenznummer: 61
https://2023.aclweb.org

Bibliographische Notiz

Publisher Copyright:
© 2023 Association for Computational Linguistics.

Zuletzt angesehen

Publikationen

  1. A blackboard architecture for workflows
  2. Modeling Grounding Processes in Chat-based CSCL
  3. Mapping Amazon's logistical footprint on the Ruhr
  4. Developing spatial biophysical accounting for multiple ecosystem services
  5. A cascade regulator using Lyapunov's PID-PID controllers for an aggregate actuator in automotive applications
  6. Other spaces
  7. The link between in- and external rotation of the auditor and the quality of financial accounting and external audit
  8. Communicating Uncertainties About the Effects of Medical Interventions Using Different Display Formats
  9. Effekte unterschiedlicher Kollaborationsskripte in chatbasiertem Computer-Supported Collaborative Learning am Beispiel von Lernprotokollen
  10. Root-root interactions: extending our perspective to be more inclusive of the range of theories in ecology and agriculture using in-vivo analyses
  11. What Role for Public Participation in Implementing the EU Floods Directive? A comparison with the Water Framework Directive, early evidence from Germany, and a research agenda
  12. § 37a
  13. Confidence levels and likelihood terms in IPCC reports
  14. Entangled – But How?
  15. Explaining Investment Dynamics: Empirical Evidence from German New Ventures
  16. Assessing Exposure of Pesticides to Bees
  17. Process Analyses of Grounding in Chat-based CSCL
  18. Wie lang sollte eine Rollstuhlrampe sein?
  19. Learner pragmatics at the discourse level: Staying “on topic” in a telecollaborative eTandem task
  20. Atomare Hinterlassenschaften
  21. Performancemanagement in Projekten durch Earned Value Management (EVM)
  22. A Unified Contextual Bandit Framework for Long- and Short-Term Recommendations
  23. Redemption Restored: The Star in the Context of Modernity
  24. Pathways for Transformation
  25. Domain adaptation of POS taggers without handcrafted features
  26. The Measurement of Grip-Strength in Automobiles
  27. Disentangling trade-offs and synergies around ecosystem services with the influence network framework
  28. HPLC and chemometrics-assisted UV-spectroscopy methods for the simultaneous determination of ambroxol and doxycycline in capsule.
  29. Work availability types and well-being in Germany–a latent class analysis among a nationally representative sample
  30. Tree ring isotopic composition, radial increment and height growth reveal provenance-specific reactions of Douglas-fir towards environmental parameters
  31. Part I: Too much change is not enough
  32. Mapping social values of ecosystem services: What is behind the map?
  33. Exercise of members' rights
  34. Utilizing Synchrotron Radiation for the Characterization of Biodegradable Magnesium Alloys — From Alloy Development to the Application as Implant Material
  35. Recycling-oriented fabrication of soft robots
  36. Design of a Master of Science Sustainable Chemistry
  37. Atlas
  38. Alltag