Transformer with Tree-order Encoding for Neural Program Generation
Research output: Contributions to collected editions/works › Article in conference proceedings › Research
Standard
Conference XXX. 2022.
RIS
TY - CHAP
T1 - Transformer with Tree-order Encoding for Neural Program Generation
AU - Thellmann, Klaudia-Doris
AU - Stadler, Bernhard
AU - Usbeck, Ricardo
AU - Lehmann, Jens
N1 - This paper was, for the most part, authored in late 2020 and early 2021
PY - 2022/5/30
Y1 - 2022/5/30
N2 - While a considerable number of semantic parsing approaches have employed RNN architectures for code generation tasks, there have been only a few attempts to investigate the applicability of Transformers to this task. Including hierarchical information of the underlying programming language syntax has proven effective for code generation. Since the positional encoding of the Transformer can only represent positions in a flat sequence, we have extended the encoding scheme to allow the attention mechanism to also attend over hierarchical positions in the input. Furthermore, we have implemented a decoder based on a restrictive grammar graph model to improve generation accuracy and ensure the well-formedness of the generated code. While we did not surpass the state of the art, our findings suggest that employing a tree-based positional encoding in combination with a shared natural-language subword vocabulary improves generation performance over sequential positional encodings.
AB - While a considerable number of semantic parsing approaches have employed RNN architectures for code generation tasks, there have been only a few attempts to investigate the applicability of Transformers to this task. Including hierarchical information of the underlying programming language syntax has proven effective for code generation. Since the positional encoding of the Transformer can only represent positions in a flat sequence, we have extended the encoding scheme to allow the attention mechanism to also attend over hierarchical positions in the input. Furthermore, we have implemented a decoder based on a restrictive grammar graph model to improve generation accuracy and ensure the well-formedness of the generated code. While we did not surpass the state of the art, our findings suggest that employing a tree-based positional encoding in combination with a shared natural-language subword vocabulary improves generation performance over sequential positional encodings.
KW - cs.CL
KW - cs.AI
KW - 68T07, 68T50
KW - I.2.7
KW - Informatics
U2 - 10.48550/arXiv.2206.13354
DO - 10.48550/arXiv.2206.13354
M3 - Article in conference proceedings
BT - Conference XXX
ER -
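
For readers unfamiliar with tree-order positional encodings, the sketch below illustrates the general idea in PyTorch: each token's position is represented by its root-to-node path of child indices in the syntax tree, and a learned embedding is looked up and summed per tree level. This is a minimal sketch of the technique family only, not the encoding scheme or grammar-graph decoder described in the paper; the module name, dimensions, and path representation are assumptions made for this example.

import torch
import torch.nn as nn


class TreePositionalEncoding(nn.Module):
    """Illustrative tree-order positional encoding (not the paper's implementation)."""

    def __init__(self, d_model: int, max_depth: int = 16, max_children: int = 32):
        super().__init__()
        self.max_depth = max_depth
        # One embedding table per tree level; index 0 is reserved as padding for
        # nodes that sit shallower than max_depth.
        self.level_embeddings = nn.ModuleList(
            [nn.Embedding(max_children + 1, d_model, padding_idx=0)
             for _ in range(max_depth)]
        )

    def forward(self, paths: torch.Tensor) -> torch.Tensor:
        # paths: (batch, seq_len, max_depth) root-to-node child indices shifted
        # by +1 so that 0 can act as padding. Returns (batch, seq_len, d_model).
        encoding = torch.zeros(*paths.shape[:2],
                               self.level_embeddings[0].embedding_dim,
                               device=paths.device)
        for level in range(self.max_depth):
            encoding = encoding + self.level_embeddings[level](paths[..., level])
        return encoding


# Toy usage: two AST nodes, reached via root->child 0->child 2 and root->child 1.
enc = TreePositionalEncoding(d_model=64, max_depth=4)
paths = torch.tensor([[[1, 3, 0, 0],    # path [0, 2] shifted by +1 and padded
                       [2, 0, 0, 0]]])  # path [1] shifted by +1 and padded
print(enc(paths).shape)  # torch.Size([1, 2, 64])

In a Transformer, such a vector would typically be added to the token embedding in place of the standard sequential (e.g. sinusoidal) positional encoding, so that attention can distinguish hierarchical rather than flat positions.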