Why reinvent the wheel: Let's build question answering systems together

Kuldeep Singh; Arun Sethupat Radhakrishna; Andreas Both; Saeedeh Shekarpour; Ioanna Lytra; Ricardo Usbeck; Akhilesh Vyas; Akmal Khikmatullaev; Dharmen Punjani; Christoph Lange; Maria Esther Vidal; Jens Lehmann; Sören Auer

doi:10.1145/3178876.3186023

Why reinvent the wheel: Let's build question answering systems together

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Standard

Why reinvent the wheel: Let's build question answering systems together. / Singh, Kuldeep; Radhakrishna, Arun Sethupat; Both, Andreas et al.
The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018. ed. / Pierre-Antoine Champin; Fabien Gandon; Lionel Medini. Association for Computing Machinery, Inc, 2018. p. 1247-1256 (The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Harvard

Singh, K, Radhakrishna, AS, Both, A, Shekarpour, S, Lytra, I, Usbeck, R, Vyas, A, Khikmatullaev, A, Punjani, D, Lange, C, Vidal, ME, Lehmann, J & Auer, S 2018, Why reinvent the wheel: Let's build question answering systems together. in P-A Champin, F Gandon & L Medini (eds), The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018. The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018, Association for Computing Machinery, Inc, pp. 1247-1256, 27th International World Wide Web, WWW 2018, Lyon, France, 23.04.18. https://doi.org/10.1145/3178876.3186023

APA

Singh, K., Radhakrishna, A. S., Both, A., Shekarpour, S., Lytra, I., Usbeck, R., Vyas, A., Khikmatullaev, A., Punjani, D., Lange, C., Vidal, M. E., Lehmann, J., & Auer, S. (2018). Why reinvent the wheel: Let's build question answering systems together. In P.-A. Champin, F. Gandon, & L. Medini (Eds.), The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018 (pp. 1247-1256). (The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018). Association for Computing Machinery, Inc. https://doi.org/10.1145/3178876.3186023

Vancouver

Singh K, Radhakrishna AS, Both A, Shekarpour S, Lytra I, Usbeck R et al. Why reinvent the wheel: Let's build question answering systems together. In Champin PA, Gandon F, Medini L, editors, The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018. Association for Computing Machinery, Inc. 2018. p. 1247-1256. (The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018). doi: 10.1145/3178876.3186023

Bibtex

@inbook{3df1a7d4f3b5490a9080b9378bce8e36,

title = "Why reinvent the wheel: Let's build question answering systems together",

abstract = "Modern question answering (QA) systems need to flexibly integrate a number of components specialised to fulfil specific tasks in a QA pipeline. Key QA tasks include Named Entity Recognition and Disambiguation, Relation Extraction, and Query Building. Since a number of different software components exist that implement different strategies for each of these tasks, it is a major challenge to select and combine the most suitable components into a QA system, given the characteristics of a question. We study this optimisation problem and train classifiers, which take features of a question as input and have the goal of optimising the selection of QA components based on those features. We then devise a greedy algorithm to identify the pipelines that include the suitable components and can effectively answer the given question. We implement this model within Frankenstein, a QA framework able to select QA components and compose QA pipelines. We evaluate the effectiveness of the pipelines generated by Frankenstein using the QALD and LC-QuAD benchmarks. These results not only suggest that Frankenstein precisely solves the QA optimisation problem but also enables the automatic composition of optimised QA pipelines, which outperform the static Baseline QA pipeline. Thanks to this flexible and fully automated pipeline generation process, new QA components can be easily included in Frankenstein, thus improving the performance of the generated pipelines.",

keywords = "QA framework, Question answering, Semantic search, Semantic web, Software reusability, Informatics, Business informatics",

author = "Kuldeep Singh and Radhakrishna, {Arun Sethupat} and Andreas Both and Saeedeh Shekarpour and Ioanna Lytra and Ricardo Usbeck and Akhilesh Vyas and Akmal Khikmatullaev and Dharmen Punjani and Christoph Lange and Vidal, {Maria Esther} and Jens Lehmann and S{\"o}ren Auer",

note = "Publisher Copyright: {\textcopyright} 2018 IW3C2 (International World Wide Web Conference Committee), published under Creative Commons CC BY 4.0 License.; 27th International World Wide Web, WWW 2018 : Bridging natural and artificial intelligence worldwide ; Conference date: 23-04-2018 Through 27-04-2018",

year = "2018",

month = apr,

day = "10",

doi = "10.1145/3178876.3186023",

language = "English",

isbn = "978-1-4503-5639-8",

series = "The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018",

publisher = "Association for Computing Machinery, Inc",

pages = "1247--1256",

editor = "Pierre-Antoine Champin and Fabien Gandon and Lionel Medini",

booktitle = "The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018",

address = "United States",

url = "https://archives.iw3c2.org/www2018/",

}

RIS

TY - CHAP

T1 - Why reinvent the wheel

T2 - 27th International World Wide Web, WWW 2018

AU - Singh, Kuldeep

AU - Radhakrishna, Arun Sethupat

AU - Both, Andreas

AU - Shekarpour, Saeedeh

AU - Lytra, Ioanna

AU - Usbeck, Ricardo

AU - Vyas, Akhilesh

AU - Khikmatullaev, Akmal

AU - Punjani, Dharmen

AU - Lange, Christoph

AU - Vidal, Maria Esther

AU - Lehmann, Jens

AU - Auer, Sören

PY - 2018/4/10

Y1 - 2018/4/10

N2 - Modern question answering (QA) systems need to flexibly integrate a number of components specialised to fulfil specific tasks in a QA pipeline. Key QA tasks include Named Entity Recognition and Disambiguation, Relation Extraction, and Query Building. Since a number of different software components exist that implement different strategies for each of these tasks, it is a major challenge to select and combine the most suitable components into a QA system, given the characteristics of a question. We study this optimisation problem and train classifiers, which take features of a question as input and have the goal of optimising the selection of QA components based on those features. We then devise a greedy algorithm to identify the pipelines that include the suitable components and can effectively answer the given question. We implement this model within Frankenstein, a QA framework able to select QA components and compose QA pipelines. We evaluate the effectiveness of the pipelines generated by Frankenstein using the QALD and LC-QuAD benchmarks. These results not only suggest that Frankenstein precisely solves the QA optimisation problem but also enables the automatic composition of optimised QA pipelines, which outperform the static Baseline QA pipeline. Thanks to this flexible and fully automated pipeline generation process, new QA components can be easily included in Frankenstein, thus improving the performance of the generated pipelines.

AB - Modern question answering (QA) systems need to flexibly integrate a number of components specialised to fulfil specific tasks in a QA pipeline. Key QA tasks include Named Entity Recognition and Disambiguation, Relation Extraction, and Query Building. Since a number of different software components exist that implement different strategies for each of these tasks, it is a major challenge to select and combine the most suitable components into a QA system, given the characteristics of a question. We study this optimisation problem and train classifiers, which take features of a question as input and have the goal of optimising the selection of QA components based on those features. We then devise a greedy algorithm to identify the pipelines that include the suitable components and can effectively answer the given question. We implement this model within Frankenstein, a QA framework able to select QA components and compose QA pipelines. We evaluate the effectiveness of the pipelines generated by Frankenstein using the QALD and LC-QuAD benchmarks. These results not only suggest that Frankenstein precisely solves the QA optimisation problem but also enables the automatic composition of optimised QA pipelines, which outperform the static Baseline QA pipeline. Thanks to this flexible and fully automated pipeline generation process, new QA components can be easily included in Frankenstein, thus improving the performance of the generated pipelines.

KW - QA framework

KW - Question answering

KW - Semantic search

KW - Semantic web

KW - Software reusability

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=85076220402&partnerID=8YFLogxK

U2 - 10.1145/3178876.3186023

DO - 10.1145/3178876.3186023

M3 - Article in conference proceedings

AN - SCOPUS:85076220402

SN - 978-1-4503-5639-8

T3 - The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018

SP - 1247

EP - 1256

BT - The Web Conference 2018 - Proceedings of the World Wide Web Conference, WWW 2018

A2 - Champin, Pierre-Antoine

A2 - Gandon, Fabien

A2 - Medini, Lionel

PB - Association for Computing Machinery, Inc

Y2 - 23 April 2018 through 27 April 2018

ER -

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Salnikov, M., Sakhovskiy, A., Nikishina, I., Usmanova, A., Kraft, A., Möller, C., Banerjee, D., Huang, J., Jiang, L., Abdullah, R., Yan, X., Tutubalina, E., Usbeck, R. & Panchenko, A., 2026, Natural Language Processing and Information Systems: 30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, Proceedings. Ichise, R. (ed.). Springer Science and Business Media Deutschland, p. 95-110 16 p. (Lecture Notes in Computer Science; vol. 15836 LNCS).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Möller, C. & Usbeck, R., 2025, The Semantic Web: 22nd European Semantic Web Conference, ESWC 2025 Portoroz, Slovenia, June 1–5, 2025 Proceedings, Part I. Curry, E., Acosta, M., Poveda-Villalón, M., van Erp, M., Ojo, A., Hose, K., Shimizu, C. & Lisena, P. (eds.). Cham: Springer Nature Switzerland AG, Vol. 1. p. 460-480 21 p. (Lecture Notes in Computer Science ; vol. 15718).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Automating SPARQL Query Translations between DBpedia and Wikidata

Bartels, M. C., Banerjee, D. & Usbeck, R., 14.07.2025, SEMANTiCS Conference 2025.

Research output: Contributions to collected editions/works › Article in conference proceedings › Research

Bridge-Generate: Scholarly Hybrid Question Answering

Taffa, T. A. & Usbeck, R., 23.05.2025, WWW Companion 2025 - Companion Proceedings of the ACM Web Conference 2025: Companion Proceedings of the ACM Web Conference 2025, April 28-May 2, 2025 Sydney, NSW, Australia. Long, G., Blumestein, M., Chang, Y., Lewin-Eytan, L., Huang, H. & Yom-Tov, E. (eds.). New York: Association for Computing Machinery, Inc, p. 1321-1325 5 p.

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Junior fellows and distinguished dissertation of the GI and AI for crisis

Usbeck, R., Kraft, A. & Westphal, P., 01.02.2025, In: IT - Information Technology. 67, 1, p. 1-2 2 p.

Research output: Journal contributions › Other (editorial matter etc.) › Research

DOI

https://doi.org/10.1145/3178876.3186023
Final published version

Why reinvent the wheel: Let's build question answering systems together

Standard

Harvard

APA

Vancouver

Bibtex

RIS

Other publications by the same author(s)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Automating SPARQL Query Translations between DBpedia and Wikidata

Bridge-Generate: Scholarly Hybrid Question Answering

Junior fellows and distinguished dissertation of the GI and AI for crisis

DOI

Recently viewed

Researchers

Activities

Publications

Press / Media