Bridging the Gap: Generating a Comprehensive Biomedical Knowledge Graph Question Answering Dataset

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Despite the plethora of resources such as large-scale corpora and manually curated Knowledge Graphs (KGs), the ability to perform reasoning with natural language inputs over biomedical graphs remains challenging due to insufficient training data. We propose a novel method for automatically constructing a Biomedical Knowledge Graph Question Answering (BioKGQA) dataset sourced from PrimeKG, the largest precision medicine-oriented KG. In total, we create 85,368 question-answer pairs along with their respective SPARQL queries. Our approach generates a diverse array of contextually relevant questions covering a wide spectrum of biomedical concepts and levels of complexity. We evaluate our method based on automatic metrics alongside manual annotations. We establish novel standards tailored for KGQA systems to highlight the linguistic correctness and semantical faithfulness of the generated questions based on extracted KG facts. The compiled dataset – PrimeKGQA – serves as a valuable benchmarking resource for advancing knowledge-driven biomedical research and evaluating KGQA systems.
OriginalspracheEnglisch
TitelECAI 2024 : 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain; including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings
HerausgeberUlle Endriss, Francisco S. Melo, Kerstin Bach, Alberto José Bugarín Diz, Jose Maria Alonso-Moral, Senén Barro, Fredrik Heintz
Anzahl der Seiten8
ErscheinungsortAmsterdam
VerlagIOS Press BV
Erscheinungsdatum16.10.2024
Seiten1198-1205
ISBN (elektronisch)978-1-64368-548-9
DOIs
PublikationsstatusErschienen - 16.10.2024
Veranstaltung27th European Conference on Artificial Intelligence - ECAI 2024: "Celebrating the past. Inspiring the future" - University of Santiago de Compostela., Santiago de Compostela, Spanien
Dauer: 19.10.202424.10.2024
Konferenznummer: 27
https://www.ecai2024.eu/

Bibliographische Notiz

Publisher Copyright:
© 2024 The Authors.

DOI

Zuletzt angesehen

Publikationen

  1. A Structure and Content Prompt-based Method for Knowledge Graph Question Answering over Scholarly Data
  2. Assessing Effects Through Semi-Field and Field Toxicity Testing
  3. Effects of diversity versus segregation on automatic approach and avoidance behavior towards own and other ethnic groups
  4. Green software engineering with agile methods
  5. Multiphase-field modeling of temperature-driven intermetallic compound evolution in an Al-Mg system for application to solid-state joining processes
  6. Controlling a Bank Model Economy by Using an Adaptive Model Predictive Control with Help of an Extended Kalman Filter
  7. Passive Rotation of Rotational Joints and Its Computation Method
  8. A Theoretical Dynamical Noninteracting Model for General Manipulation Systems Using Axiomatic Geometric Structures
  9. Dynamic priority based dispatching of AGVs in flexible job shops
  10. HAWK - hybrid question answering using linked data
  11. How, when and why do negotiators use reference points?
  12. Modelling biodegradability based on OECD 301D data for the design of mineralising ionic liquids
  13. Reading Comprehension as Embodied Action: Exploratory Findings on Nonlinear Eye Movement Dynamics and Comprehension of Scientific Texts
  14. Multi-view discriminative sequential learning
  15. Cross-case knowledge transfer in transformative research: enabling learning in and across sustainability-oriented labs through case reporting
  16. WHICH ESTIMATION SITUATIONS ARE RELEVANT FOR A VALID ASSESSMENT OF MEASUREMENT ESTIMATION SKILLS
  17. Bifactor Models for Predicting Criteria by General and Specific Factors
  18. Repeat Receipts: A device for generating visible data in market research focus groups
  19. Rotational complexity in mental rotation tests
  20. On the Direct Kinematics Problem of Parallel Mechanisms
  21. Individual Scans Fusion in Virtual Knowledge Base for Navigation of Mobile Robotic Group with 3D TVS
  22. IWRM through WFD implementation? Drivers for integration in polycentric water governance systems
  23. Special Issue The Discourse of Redundancy Introduction
  24. Using data mining techniques to investigate the correlation between surface cracks and flange lengths in deep drawn sheet metals
  25. Dividing Apples and Pears: Towards a Taxonomy for Agile Transformation
  26. DISKNET – A Platform for the Systematic Accumulation of Knowledge in IS Research
  27. The Open Anchoring Quest Dataset: Anchored Estimates from 96 Studies on Anchoring Effects
  28. “Circuits of Commons”: Exploring the Connections Between Economic Lives and the Commons