DBLP-QuAD: A Question Answering Dataset over the DBLP Scholarly Knowledge Graph

Research output: Contributions to collected editions/works; Article in conference proceedings; Research; peer-review

In this work, we create a question answering dataset over the DBLP scholarly knowledge graph (KG). DBLP is an online reference for bibliographic information on major computer science publications that indexes over 4.4 million publications by more than 2.2 million authors. Our dataset consists of 10,000 question-answer pairs with corresponding SPARQL queries that can be executed over the DBLP KG to fetch the correct answer. DBLP-QuAD is the largest scholarly question answering dataset.
Original language: English
Title of host publication: BIR 2023 - Bibliometric-enhanced Information Retrieval : Proceedings of the 13th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 45th European Conference on Information Retrieval (ECIR 2023)
Editors: Ingo Frommholz, Philipp Mayr, Guillaume Cabanac, Suzan Verberne, Jordan Brennan
Number of pages: 15
Place of Publication: Aachen
Publisher: Sun Site Central Europe (RWTH Aachen University)
Publication date: 17.01.2024
Article number: 5
Publication status: Published - 17.01.2024
Externally published: Yes
Event: 13th International Workshop on Bibliometric-enhanced Information Retrieval - BIR 2023 - Dublin, Ireland
Duration: 02.04.2023 - 02.04.2023
Conference number: 13
https://ceur-ws.org/Vol-3617/
https://sites.google.com/view/bir-ws/bir-2023

Bibliographical note

12 pages, CEUR-WS 1-column format, accepted at the International Bibliometric Information Retrieval Workshop @ ECIR 2023