Bridging the Gap: Generating a Comprehensive Biomedical Knowledge Graph Question Answering Dataset

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Despite the plethora of resources such as large-scale corpora and manually curated Knowledge Graphs (KGs), the ability to perform reasoning with natural language inputs over biomedical graphs remains challenging due to insufficient training data. We propose a novel method for automatically constructing a Biomedical Knowledge Graph Question Answering (BioKGQA) dataset sourced from PrimeKG, the largest precision medicine-oriented KG. In total, we create 85,368 question-answer pairs along with their respective SPARQL queries. Our approach generates a diverse array of contextually relevant questions covering a wide spectrum of biomedical concepts and levels of complexity. We evaluate our method based on automatic metrics alongside manual annotations. We establish novel standards tailored for KGQA systems to highlight the linguistic correctness and semantical faithfulness of the generated questions based on extracted KG facts. The compiled dataset – PrimeKGQA – serves as a valuable benchmarking resource for advancing knowledge-driven biomedical research and evaluating KGQA systems.
OriginalspracheEnglisch
TitelECAI 2024 : 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain; including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings
HerausgeberUlle Endriss, Francisco S. Melo, Kerstin Bach, Alberto José Bugarín Diz, Jose Maria Alonso-Moral, Senén Barro, Fredrik Heintz
Anzahl der Seiten8
ErscheinungsortAmsterdam
VerlagIOS Press BV
Erscheinungsdatum16.10.2024
Seiten1198-1205
ISBN (elektronisch)978-1-64368-548-9
DOIs
PublikationsstatusErschienen - 16.10.2024
Veranstaltung27th European Conference on Artificial Intelligence - ECAI 2024: "Celebrating the past. Inspiring the future" - University of Santiago de Compostela., Santiago de Compostela, Spanien
Dauer: 19.10.202424.10.2024
Konferenznummer: 27
https://www.ecai2024.eu/

Bibliographische Notiz

Publisher Copyright:
© 2024 The Authors.

DOI

Zuletzt angesehen

Publikationen

  1. Modelling biodegradability based on OECD 301D data for the design of mineralising ionic liquids
  2. Reading Comprehension as Embodied Action: Exploratory Findings on Nonlinear Eye Movement Dynamics and Comprehension of Scientific Texts
  3. Performance of methods to select landscape metrics for modelling species richness
  4. Determination of the construction and the material identity values of outside building components with the help of in-situ measuring procedures and FEM-simulation calculations
  5. DISKNET – A Platform for the Systematic Accumulation of Knowledge in IS Research
  6. A Sensitive Microsystem as Biosensor for Cell Growth Monitoring and Antibiotic Testing
  7. Experimental investigation of the fluid-structure interaction during deep drawing of fiber metal laminates in the in-situ hybridization process
  8. The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing
  9. Late developers and the inequity of "equitable utilization" and the harm of "do no harm"
  10. The impact of goal focus, task type and group size on synchronous net-based collaborative learning discourses
  11. Differences in adjustment flexibility between regular and temporary agency work
  12. Mechanical characterization of as-cast AA7075/6060 and CuSn6/Cu99.5 compounds using an experimental and numerical push-out test
  13. Integrating errors into the training process
  14. Dichotomy or continuum? A global review of the interaction between autonomous and planned adaptations
  15. Lost-customers approximation of semi-open queueing networks with backordering
  16. Quality System Development at the University of Graz
  17. Collaborative benchmarking of functional-structural root architecture models
  18. Unlocking knowledge-policy action gaps in disaster-recovery-risk governance cycle
  19. Variational pragmatics in the foreign language classroom