HySQA: Hybrid Scholarly Question Answering

Research output: Contributions to collected editions/works › Chapter › peer-review

Authors

Purpose:

The heterogeneity of scholarly information in knowledge graphs (KGs) and unstructured textual sources poses challenges in building robust Scholarly Question Answering (SQA) systems. Existing datasets and models typically address a narrow spectrum, focusing exclusively on either KGs or unstructured sources and limiting evaluation to simple factoid questions. This gap leaves current systems unable to answer complex, hybrid scholarly questions that require integrating evidence from multiple heterogeneous data sources.

Methodology:

We introduce HySQA (Hybrid Scholarly Question Answering), a large-scale benchmark dataset of hybrid questions over scholarly KGs (SKGs) and Wikipedia text. Its complex questions require traversing facts across structured and unstructured sources. We also develop a baseline model that adaptively decomposes each question into sub-questions, identifies the answer source for each sub-question, retrieves relevant information from SKGs and Wikipedia, and generates an answer using a hybrid augmented answer generation framework.
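
The baseline steps described above (decompose, route each sub-question to a source, retrieve, then generate) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy KG and text stores, the `SubQuestion` type, the `#prev` placeholder, and the fixed decomposition plan are hypothetical and are not taken from HySQA or its baseline model. The generation step is omitted; the sketch only collects the evidence chain that a generator would consume.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Toy evidence stores (illustrative data, not from HySQA).
TOY_KG = {("paper:P1", "author"): "author:A1"}                   # structured source
TOY_TEXT = {"author:A1": "A1 is affiliated with Example University."}  # text source


@dataclass
class SubQuestion:
    source: str  # "kg" or "text": which source should answer this hop
    key: str     # lookup key; "#prev" stands for the previous hop's answer


def decompose(question: str) -> List[SubQuestion]:
    """Hypothetical fixed plan: one KG hop, then one text hop.

    A real system would derive the plan from the question itself.
    """
    return [
        SubQuestion(source="kg", key="paper:P1|author"),
        SubQuestion(source="text", key="#prev"),
    ]


def retrieve(sub: SubQuestion, prev: Optional[str]) -> Optional[str]:
    """Route a sub-question to its source and look up the answer."""
    if sub.source == "kg":
        subject, predicate = sub.key.split("|")
        if subject == "#prev":
            subject = prev
        return TOY_KG.get((subject, predicate))
    entity = prev if sub.key == "#prev" else sub.key
    return TOY_TEXT.get(entity)


def collect_evidence(question: str) -> List[Tuple[str, str]]:
    """Run the hops in order, feeding each answer into the next hop."""
    prev: Optional[str] = None
    evidence: List[Tuple[str, str]] = []
    for sub in decompose(question):
        prev = retrieve(sub, prev)
        if prev is None:  # stop when a hop finds nothing
            break
        evidence.append((sub.source, prev))
    return evidence


if __name__ == "__main__":
    for source, fact in collect_evidence("Where is the author of P1 affiliated?"):
        print(source, "->", fact)
```

The point of the sketch is the hybrid chain: the answer to the KG hop becomes the lookup key for the text hop, which is the kind of cross-source traversal the dataset's questions require.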

Findings:

The experimental results show that integrating static and adaptive decomposition methods is more effective than static decomposition alone.

Value:

Introducing HySQA provides the community with resources for evaluating advances in scholarly QA research.
Original language: English
Title of host publication: Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria
Number of pages: 17
Volume: 62
Publication date: 26.08.2025
Pages: 247
ISBN (electronic): 978-1-64368-616-5
Publication status: Published - 26.08.2025
