The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Standard

The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing. / Banerjee, Debayan; Nair, Pranav; Usbeck, Ricardo et al.
Findings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023. Hrsg. / Anna Rogers; Jordan L. Boyd-Graber; Naoaki Okazaki. Stroudsburg: Association for Computational Linguistics (ACL), 2023. S. 12219-12228 (Proceedings of the Annual Meeting of the Association for Computational Linguistics).

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Harvard

Banerjee, D, Nair, P, Usbeck, R & Biemann, C 2023, The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing. in A Rogers, JL Boyd-Graber & N Okazaki (Hrsg.), Findings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics (ACL), Stroudsburg, S. 12219-12228, 61st Annual Meeting of the Association for Computational Linguistics, Toronto, Ontario, Kanada, 09.07.23. https://doi.org/10.18653/v1/2023.findings-acl.774, https://doi.org/10.48550/arXiv.2305.15108

APA

Banerjee, D., Nair, P., Usbeck, R., & Biemann, C. (2023). The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing. In A. Rogers, J. L. Boyd-Graber, & N. Okazaki (Hrsg.), Findings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023 (S. 12219-12228). (Proceedings of the Annual Meeting of the Association for Computational Linguistics). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.774, https://doi.org/10.48550/arXiv.2305.15108

Vancouver

Banerjee D, Nair P, Usbeck R, Biemann C. The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing. in Rogers A, Boyd-Graber JL, Okazaki N, Hrsg., Findings of the Association for Computational Linguistics: ACL 2023 : July 9-14, 2023. Stroudsburg: Association for Computational Linguistics (ACL). 2023. S. 12219-12228. (Proceedings of the Annual Meeting of the Association for Computational Linguistics). doi: 10.18653/v1/2023.findings-acl.774, 10.48550/arXiv.2305.15108

Bibtex

@inbook{5d6369c3cb2e4ab19990cad2ed7aa3f7,
title = "The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing",
abstract = "In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.",
keywords = "Business informatics, Informatics",
author = "Debayan Banerjee and Pranav Nair and Ricardo Usbeck and Chris Biemann",
note = "Publisher Copyright: {\textcopyright} 2023 Association for Computational Linguistics.; 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 ; Conference date: 09-07-2023 Through 14-07-2023",
year = "2023",
month = jul,
day = "1",
doi = "10.18653/v1/2023.findings-acl.774",
language = "English",
series = "Proceedings of the Annual Meeting of the Association for Computational Linguistics",
publisher = "Association for Computational Linguistics (ACL)",
pages = "12219--12228",
editor = "Anna Rogers and Boyd-Graber, {Jordan L.} and Naoaki Okazaki",
booktitle = "Findings of the Association for Computational Linguistics: ACL 2023",
address = "United States",
url = "https://2023.aclweb.org",

}

RIS

TY - CHAP

T1 - The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

AU - Banerjee, Debayan

AU - Nair, Pranav

AU - Usbeck, Ricardo

AU - Biemann, Chris

N1 - Conference code: 61

PY - 2023/7/1

Y1 - 2023/7/1

N2 - In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.

AB - In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.

KW - Business informatics

KW - Informatics

UR - http://www.scopus.com/inward/record.url?scp=85175442200&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/ef5423a8-c326-3827-a9f1-bbd3d0e21ba1/

U2 - 10.18653/v1/2023.findings-acl.774

DO - 10.18653/v1/2023.findings-acl.774

M3 - Article in conference proceedings

T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics

SP - 12219

EP - 12228

BT - Findings of the Association for Computational Linguistics: ACL 2023

A2 - Rogers, Anna

A2 - Boyd-Graber, Jordan L.

A2 - Okazaki, Naoaki

PB - Association for Computational Linguistics (ACL)

CY - Stroudsburg

T2 - 61st Annual Meeting of the Association for Computational Linguistics

Y2 - 9 July 2023 through 14 July 2023

ER -

Zuletzt angesehen

Publikationen

  1. Was fehlt in der EVS?
  2. The Transition to Renewable Energy Systems - On the Way to a Comprehensive Transition Concept
  3. Understanding Innovation
  4. Expert*inneninterview
  5. Anonymized firm data under test: evidence from a replication study
  6. Crop rotation modelling
  7. Value of large-scale linear networks for bird conservation
  8. Small-scale soil patterns drive sharp boundaries between succulent "dwarf" biomes (or habitats) in the arid Succulent Karoo, South Africa
  9. New descriptions and typifications of syntaxa within the project 'Plant communities of Mecklenburg-Vorpommern and their vulnerability' - Part I
  10. Use of Chemotaxonomy To Study the Influence of Benzalkonium Chloride on Bacterial Populations in Biodegradation Testing
  11. Size, composition and provenance of fragmental particles in Apollo 14 breccias
  12. Analysis of the forming behaviour of in-situ drawn sandwich sheets
  13. Researching collaborative interdisciplinary teams
  14. On the geometric control of internal forces in power grasps
  15. ETL ensembles for chunking, NER and SRL
  16. Multitrait-multimethod-analysis
  17. Cross-level Information and Influence in Mandated Participatory Planning: Alternative Pathways to Sustainable Water Management in Germany’s Implementation of the EU Water Framework Directive
  18. Rate constants for the gas-phase reaction of OH with amines
  19. Vertical gradient in soil temperature stimulates development and increases biomass accumulation in barley
  20. Different sizes, similar challenges
  21. Identity without Membership?
  22. 2. Advent
  23. Two-way NxP fertilisation experiment on barley (Hordeum vulgare) reveals shift from additive to synergistic N-P interactions at critical phosphorus fertilisation level
  24. Evaluating social learning in participatory mapping of ecosystem services
  25. Zum Begriff der Repräsentation
  26. Payments for ecosystem services – for efficiency and for equity?
  27. From niche to mainstream
  28. The recent double paradigm shift in restoration ecology
  29. Mining for critical stock price movements using temporal power laws and integrated autoregressive models
  30. Explaining Convergence and Common Trends in the Role of the State in OECD Healthcare Systems
  31. Introduction
  32. Towards a global understanding of tree mortality

Presse / Medien

  1. Der neue Hass