LC-QuAD 2.0: A Large Dataset for Complex Question Answering over Wikidata and DBpedia
Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review
Standard
LC-QuAD 2.0 : A Large Dataset for Complex Question Answering over Wikidata and DBpedia. / Dubey, Mohnish; Banerjee, Debayan; Abdelkawi, Abdelrahman et al.
The Semantic Web – ISWC 2019 : 18th International Semantic Web Conference, Auckland, New Zealand, October 26-30, 2019 : proceedings. ed. / Chiara Ghidini; Olaf Hartig; Maria Maleshkova; Vojtech Svátek; Isabel Cruz; Aidan Hogan; Jie Song; Maxime Lefrançois; Fabien Gandon. Vol. 2 Cham : Springer, 2019. p. 69-78 (Lecture Notes in Computer Science; Vol. 11779 LNCS).
RIS
TY - CHAP
T1 - LC-QuAD 2.0
T2 - 18th International Semantic Web Conference - ISWC 2019
AU - Dubey, Mohnish
AU - Banerjee, Debayan
AU - Abdelkawi, Abdelrahman
AU - Lehmann, Jens
N1 - Conference code: 18
PY - 2019
Y1 - 2019
AB - Providing machines with the capability of exploring knowledge graphs and answering natural language questions has been an active area of research over the past decade. In this direction, translating natural language questions to formal queries has been one of the key approaches. To advance the research area, several datasets such as WebQuestions, QALD and LC-QuAD have been published in the past. The largest dataset available for complex questions over knowledge graphs (LC-QuAD) contains five thousand questions. We now provide LC-QuAD 2.0 (Large-Scale Complex Question Answering Dataset) with 30,000 questions, their paraphrases and their corresponding SPARQL queries. LC-QuAD 2.0 is compatible with both the Wikidata and DBpedia 2018 knowledge graphs. In this article, we explain how the dataset was created and illustrate the variety of questions available with examples. We further provide a statistical analysis of the dataset. Resource Type: Dataset; Website and documentation: http://lc-quad.sda.tech/; Permanent URL: https://figshare.com/projects/LCQuAD_2_0/62270.
KW - Informatics
UR - http://www.scopus.com/inward/record.url?scp=85077909314&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-30796-7_5
DO - 10.1007/978-3-030-30796-7_5
M3 - Article in conference proceedings
AN - SCOPUS:85077909314
SN - 978-3-030-30795-0
VL - 2
T3 - Lecture Notes in Computer Science
SP - 69
EP - 78
BT - The Semantic Web – ISWC 2019
A2 - Ghidini, Chiara
A2 - Hartig, Olaf
A2 - Maleshkova, Maria
A2 - Svátek, Vojtech
A2 - Cruz, Isabel
A2 - Hogan, Aidan
A2 - Song, Jie
A2 - Lefrançois, Maxime
A2 - Gandon, Fabien
PB - Springer
CY - Cham
Y2 - 26 October 2019 through 30 October 2019
ER -