Bridging the Gap: Generating a Comprehensive Biomedical Knowledge Graph Question Answering Dataset

Publication: Contributions to collected editions > Articles in conference proceedings > Research > peer-reviewed

Authors

Despite the plethora of resources such as large-scale corpora and manually curated Knowledge Graphs (KGs), the ability to perform reasoning with natural language inputs over biomedical graphs remains challenging due to insufficient training data. We propose a novel method for automatically constructing a Biomedical Knowledge Graph Question Answering (BioKGQA) dataset sourced from PrimeKG, the largest precision medicine-oriented KG. In total, we create 85,368 question-answer pairs along with their respective SPARQL queries. Our approach generates a diverse array of contextually relevant questions covering a wide spectrum of biomedical concepts and levels of complexity. We evaluate our method based on automatic metrics alongside manual annotations. We establish novel standards tailored for KGQA systems to highlight the linguistic correctness and semantical faithfulness of the generated questions based on extracted KG facts. The compiled dataset – PrimeKGQA – serves as a valuable benchmarking resource for advancing knowledge-driven biomedical research and evaluating KGQA systems.
Original language: English
Title: ECAI 2024: 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain; including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024), Proceedings
Editors: Ulle Endriss, Francisco S. Melo, Kerstin Bach, Alberto José Bugarín Diz, Jose Maria Alonso-Moral, Senén Barro, Fredrik Heintz
Number of pages: 8
Place of publication: Amsterdam
Publisher: IOS Press BV
Publication date: 16 October 2024
Pages: 1198-1205
ISBN (electronic): 978-1-64368-548-9
Publication status: Published - 16 October 2024
Event: 27th European Conference on Artificial Intelligence - ECAI 2024: "Celebrating the past. Inspiring the future" - University of Santiago de Compostela, Santiago de Compostela, Spain
Duration: 19 October 2024 to 24 October 2024
Conference number: 27
https://www.ecai2024.eu/

Bibliographical note

Publisher Copyright:
© 2024 The Authors.
