Bridging the Gap: Generating a Comprehensive Biomedical Knowledge Graph Question Answering Dataset

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Despite the plethora of resources such as large-scale corpora and manually curated Knowledge Graphs (KGs), the ability to perform reasoning with natural language inputs over biomedical graphs remains challenging due to insufficient training data. We propose a novel method for automatically constructing a Biomedical Knowledge Graph Question Answering (BioKGQA) dataset sourced from PrimeKG, the largest precision medicine-oriented KG. In total, we create 85,368 question-answer pairs along with their respective SPARQL queries. Our approach generates a diverse array of contextually relevant questions covering a wide spectrum of biomedical concepts and levels of complexity. We evaluate our method based on automatic metrics alongside manual annotations. We establish novel standards tailored for KGQA systems to highlight the linguistic correctness and semantical faithfulness of the generated questions based on extracted KG facts. The compiled dataset – PrimeKGQA – serves as a valuable benchmarking resource for advancing knowledge-driven biomedical research and evaluating KGQA systems.
OriginalspracheEnglisch
TitelECAI 2024 : 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain; including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings
HerausgeberUlle Endriss, Francisco S. Melo, Kerstin Bach, Alberto José Bugarín Diz, Jose Maria Alonso-Moral, Senén Barro, Fredrik Heintz
Anzahl der Seiten8
ErscheinungsortAmsterdam
VerlagIOS Press BV
Erscheinungsdatum16.10.2024
Seiten1198-1205
ISBN (elektronisch)978-1-64368-548-9
DOIs
PublikationsstatusErschienen - 16.10.2024
Veranstaltung27th European Conference on Artificial Intelligence - ECAI 2024: "Celebrating the past. Inspiring the future" - University of Santiago de Compostela., Santiago de Compostela, Spanien
Dauer: 19.10.202424.10.2024
Konferenznummer: 27
https://www.ecai2024.eu/

Bibliographische Notiz

Publisher Copyright:
© 2024 The Authors.

DOI

Zuletzt angesehen

Publikationen

  1. Proxy Indicators for the Quality of Open-domain Dialogues
  2. Unveiling local knowledge
  3. Pathways of Data-driven Business Model Design and Realization
  4. Offline question answering over linked data using limited resources
  5. Geodesign as a boundary management process
  6. Life Cycle Assessment of Consumption Patterns – Understanding the links between changing social practices and environmental impacts
  7. Consequences of extreme weather events for developing countries based on the example of Mongolia
  8. Creating Value from in-Vehicle Data
  9. Challenges for biodiversity monitoring using citizen science in transitioning social-ecological systems
  10. Operationalization of the concept of sustainable development on different time scales
  11. Performance incentives in activity-based management
  12. The impact of explicit references in computer supported collaborative learning: Evidence from eye movement analyses
  13. Employing A-B tests for optimizing prices levels in e-commerce applications
  14. Integrating teacher and student workspaces in a technology-enhanced mathematics lecture
  15. Multi-view hidden markov perceptrons
  16. Exploring the dark and unexpected sides of digitalization
  17. Tschick
  18. Probabilistic movement models and zones of control
  19. Decision-making models for Robotic Warehouse
  20. One step forward, two steps back
  21. Performance Saga: Interview 06
  22. A PD Fuzzy Control of a Nonholonomic Car-Like Robot for Drive Assistant Systems
  23. Integrating multiple elements of environmental justice into urban blue space planning using public participation geographic information systems
  24. Sustainable use of ecosystem services under multiple risks
  25. Children's interpretation of ambiguous pronouns based on prior discourse
  26. Organizational practices for the aging workforce
  27. Conditionality of EU funds: an instrument to enforce EU fundamental values?
  28. The micro-processes during repatriate knowledge transfer
  29. Utilization of protein-rich residues in biotechnological processes
  30. Pathways to Implementation: Evidence on How Participation in Environmental Governance Impacts on Environmental Outcomes
  31. Quantifying ecosystem services of rewetted peatlands − the MoorFutures methodologies
  32. Learning Analytics
  33. The Role of Assessment and Quality Management in Transformations towards Sustainable Development
  34. To help or not to help an outgroup member
  35. Mathematics-specific motivations for choosing a mathematics teaching degree study programme
  36. Top-down biological motion perception does not differ between adults scoring high versus low on autism traits
  37. Soil carbon sequestration
  38. Utilizing Synchrotron Radiation for Phase Identification in Mg Alloys
  39. Learning Analytics an Hochschulen
  40. Standing up against Discrimination and Exclusion