LC-QuAD 2.0: A Large Dataset for Complex Question Answering over Wikidata and DBpedia

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Providing machines with the capability of exploring knowledge graphs and answering natural language questions has been an active area of research over the past decade. In this direction, translating natural language questions to formal queries has been one of the key approaches. To advance the research area, several datasets such as WebQuestions, QALD and LC-QuAD have been published in the past. The largest dataset available for complex questions over knowledge graphs (LC-QuAD) contains five thousand questions. We now provide LC-QuAD 2.0 (Large-Scale Complex Question Answering Dataset) with 30,000 questions, their paraphrases and their corresponding SPARQL queries. LC-QuAD 2.0 is compatible with both the Wikidata and DBpedia 2018 knowledge graphs. In this article, we explain how the dataset was created and illustrate the variety of questions available with examples. We further provide a statistical analysis of the dataset.

Resource Type: Dataset
Website and documentation: http://lc-quad.sda.tech/
Permanent URL: https://figshare.com/projects/LCQuAD_2_0/62270

Original language: English
Title of host publication: The Semantic Web – ISWC 2019: 18th International Semantic Web Conference, Auckland, New Zealand, October 26-30, 2019: proceedings
Editors: Chiara Ghidini, Olaf Hartig, Maria Maleshkova, Vojtech Svátek, Isabel Cruz, Aidan Hogan, Jie Song, Maxime Lefrançois, Fabien Gandon
Number of pages: 10
Volume: 2
Place of Publication: Cham
Publisher: Springer Verlag
Publication date: 2019
Pages: 69-78
ISBN (print): 978-3-030-30795-0
ISBN (electronic): 978-3-030-30796-7
DOIs
Publication status: Published - 2019
Externally published: Yes
Event: 18th International Semantic Web Conference - ISWC 2019 - Auckland, New Zealand
Duration: 26.10.2019 to 30.10.2019
Conference number: 18
https://iswc2019.semanticweb.org/
https://files.ifi.uzh.ch/ddis/iswc_archive/iswc/ab/2019/iswc2019.semanticweb.org/index.html

Bibliographical note

Funding Information:
Acknowledgements. This work has mainly been supported by the Fraunhofer-Cluster of Excellence “Cognitive Internet Technologies” (CCIT). It has also partly been supported by the German Federal Ministry of Education and Research (BMBF) in the context of the research project “InclusiveOCW” (grant no. 01PE17004D).

Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
