HySQA: Hybrid Scholarly Question Answering

Tilahun Taffa; Debayan Banerjee; Yaregal Assabie; Ricardo Usbeck

doi:10.3233/SSW250024

HySQA: Hybrid Scholarly Question Answering

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Standard

HySQA: Hybrid Scholarly Question Answering. / Taffa, Tilahun ; Banerjee, Debayan; Assabie, Yaregal et al.
Linking Meaning: Semantic Technologies Shaping the Future of AI: Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Hrsg. / Blerina Spahiu; Sahar Vahdati; Angelo Salatino; Tassilo Pellegrini; Giray Havur. Amsterdam: IOS Press BV, 2025. S. 247-263 (Studies on the Semantic Web; Band 62).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Harvard

Taffa, T , Banerjee, D, Assabie, Y & Usbeck, R 2025, HySQA: Hybrid Scholarly Question Answering. in B Spahiu, S Vahdati, A Salatino, T Pellegrini & G Havur (Hrsg.), Linking Meaning: Semantic Technologies Shaping the Future of AI: Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Studies on the Semantic Web, Bd. 62, IOS Press BV, Amsterdam, S. 247-263, 21st International Conference on Semantic Systems, Wien, Österreich, 03.09.25. https://doi.org/10.3233/SSW250024

APA

Taffa, T., Banerjee, D., Assabie, Y., & Usbeck, R. (2025). HySQA: Hybrid Scholarly Question Answering. In B. Spahiu, S. Vahdati, A. Salatino, T. Pellegrini, & G. Havur (Hrsg.), Linking Meaning: Semantic Technologies Shaping the Future of AI: Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria (S. 247-263). (Studies on the Semantic Web; Band 62). IOS Press BV. https://doi.org/10.3233/SSW250024

Vancouver

Taffa T , Banerjee D, Assabie Y, Usbeck R. HySQA: Hybrid Scholarly Question Answering. in Spahiu B, Vahdati S, Salatino A, Pellegrini T, Havur G, Hrsg., Linking Meaning: Semantic Technologies Shaping the Future of AI: Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Amsterdam: IOS Press BV. 2025. S. 247-263. (Studies on the Semantic Web). doi: 10.3233/SSW250024

Bibtex

@inbook{d2ef6373814e43db826d9dea058cf22e,

title = "HySQA: Hybrid Scholarly Question Answering",

abstract = "Purpose:The heterogeneity of scholarly information in knowledge graphs (KGs) and unstructured textual sources poses challenges in building robust Scholarly Question Answering (SQA) systems. Existing datasets and models typically address a narrow spectrum, focusing exclusively on KGs or unstructured sources and limiting evaluation to simple factoid questions. This gap leaves current systems unable to answer complex, hybrid scholarly questions that require integrating evidence from multiple heterogeneous data sources.Methodology:We introduce HySQA (Hybrid Scholarly Question Answering), a large-scale benchmarking dataset containing hybrid questions over scholarly KGs and Wikipedia text. HySQA contains complex questions that need to traverse facts across structured and unstructured sources. We also develop a baseline model that adaptively decomposes each question into sub-questions, identifies their answer sources, retrieves relevant information from SKGs and Wikipedia, and generates an answer using a hybrid augmented answer generation framework.Findings:The experimental results show that integrating static and adaptive decomposition methods is more effective than static decomposition alone.Value:Introducing HySQA provides the community with resources for evaluating the advancements in scholarly QA research.",

keywords = "Business informatics, Scholarly hybrid questions, Scholarly Question Answering, Hybrid Question Answering, Complex Question Answering",

author = "Tilahun Taffa and Debayan Banerjee and Yaregal Assabie and Ricardo Usbeck",

year = "2025",

month = aug,

day = "26",

doi = "10.3233/SSW250024",

language = "English",

series = "Studies on the Semantic Web",

publisher = "IOS Press BV",

pages = "247--263",

editor = "Blerina Spahiu and Sahar Vahdati and Angelo Salatino and Tassilo Pellegrini and Giray Havur",

booktitle = "Linking Meaning: Semantic Technologies Shaping the Future of AI",

address = "Netherlands",

note = "21st International Conference on Semantic Systems : Linking Meaning: Semantic Technologies Shaping the Future of AI ; Conference date: 03-09-2025 Through 05-09-2025",

}

RIS

TY - CHAP

T1 - HySQA: Hybrid Scholarly Question Answering

AU - Taffa, Tilahun

AU - Banerjee, Debayan

AU - Assabie, Yaregal

AU - Usbeck, Ricardo

N1 - Conference code: 21

PY - 2025/8/26

Y1 - 2025/8/26

N2 - Purpose:The heterogeneity of scholarly information in knowledge graphs (KGs) and unstructured textual sources poses challenges in building robust Scholarly Question Answering (SQA) systems. Existing datasets and models typically address a narrow spectrum, focusing exclusively on KGs or unstructured sources and limiting evaluation to simple factoid questions. This gap leaves current systems unable to answer complex, hybrid scholarly questions that require integrating evidence from multiple heterogeneous data sources.Methodology:We introduce HySQA (Hybrid Scholarly Question Answering), a large-scale benchmarking dataset containing hybrid questions over scholarly KGs and Wikipedia text. HySQA contains complex questions that need to traverse facts across structured and unstructured sources. We also develop a baseline model that adaptively decomposes each question into sub-questions, identifies their answer sources, retrieves relevant information from SKGs and Wikipedia, and generates an answer using a hybrid augmented answer generation framework.Findings:The experimental results show that integrating static and adaptive decomposition methods is more effective than static decomposition alone.Value:Introducing HySQA provides the community with resources for evaluating the advancements in scholarly QA research.

AB - Purpose:The heterogeneity of scholarly information in knowledge graphs (KGs) and unstructured textual sources poses challenges in building robust Scholarly Question Answering (SQA) systems. Existing datasets and models typically address a narrow spectrum, focusing exclusively on KGs or unstructured sources and limiting evaluation to simple factoid questions. This gap leaves current systems unable to answer complex, hybrid scholarly questions that require integrating evidence from multiple heterogeneous data sources.Methodology:We introduce HySQA (Hybrid Scholarly Question Answering), a large-scale benchmarking dataset containing hybrid questions over scholarly KGs and Wikipedia text. HySQA contains complex questions that need to traverse facts across structured and unstructured sources. We also develop a baseline model that adaptively decomposes each question into sub-questions, identifies their answer sources, retrieves relevant information from SKGs and Wikipedia, and generates an answer using a hybrid augmented answer generation framework.Findings:The experimental results show that integrating static and adaptive decomposition methods is more effective than static decomposition alone.Value:Introducing HySQA provides the community with resources for evaluating the advancements in scholarly QA research.

KW - Business informatics

KW - Scholarly hybrid questions

KW - Scholarly Question Answering

KW - Hybrid Question Answering

KW - Complex Question Answering

UR - https://ebooks.iospress.nl/ISBN/978-1-64368-616-5

U2 - 10.3233/SSW250024

DO - 10.3233/SSW250024

M3 - Article in conference proceedings

T3 - Studies on the Semantic Web

SP - 247

EP - 263

BT - Linking Meaning: Semantic Technologies Shaping the Future of AI

A2 - Spahiu, Blerina

A2 - Vahdati, Sahar

A2 - Salatino, Angelo

A2 - Pellegrini, Tassilo

A2 - Havur, Giray

PB - IOS Press BV

CY - Amsterdam

T2 - 21st International Conference on Semantic Systems

Y2 - 3 September 2025 through 5 September 2025

ER -

Weitere Publikationen dieser Person(en)

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Salnikov, M., Sakhovskiy, A., Nikishina, I., Usmanova, A., Kraft, A., Möller, C., Banerjee, D., Huang, J., Jiang, L., Abdullah, R., Yan, X., Tutubalina, E., Usbeck, R. & Panchenko, A., 2026, Natural Language Processing and Information Systems: 30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, Proceedings. Ichise, R. (Hrsg.). Springer Science and Business Media Deutschland, S. 95-110 16 S. (Lecture Notes in Computer Science; Band 15836 LNCS).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Analyzing the Influence of Knowledge Graph Information on Relation Extraction

Möller, C. & Usbeck, R., 2025, The Semantic Web: 22nd European Semantic Web Conference, ESWC 2025 Portoroz, Slovenia, June 1–5, 2025 Proceedings, Part I. Curry, E., Acosta, M., Poveda-Villalón, M., van Erp, M., Ojo, A., Hose, K., Shimizu, C. & Lisena, P. (Hrsg.). Cham: Springer Nature Switzerland AG, Band 1. S. 460-480 21 S. (Lecture Notes in Computer Science ; Band 15718).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

ASK-DBLP: Answering Questions over DBLP

Taffa, T., Neises, P., Ollinger, S., Westphal, P., Ackermann, M. R., Banerjee, D. & Usbeck, R., 02.11.2025, ISWC-C 2025, Industry, Doctoral Consortium, Posters and Demos at ISWC 2025: Joint Proceedings of Industry, Doctoral Consortium, Posters and Demos of the 24th International Semantic Web Conference (ISWC-C 2025), ISWC 2025 Companion Volume. Celino, I., Hassanzadeh, O., Bernstein, A., Noy, N., Cheng, G., Wang, S., Ferrada, S., Soulard, T., Kozaki, K., Takeda, H. & Gentile, A. L. (Hrsg.). Aachen: Sun Site Central Europe (RWTH Aachen University), S. 435-440 6 S. D13. (CEUR Workshop Proceedings; Band 4085).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

Automating SPARQL Query Translations between DBpedia and Wikidata

Bartels, M. C., Banerjee, D. & Usbeck, R., 14.07.2025, Linking Meaning: Semantic Technologies Shaping the Future of AI: Cover 74617 Proceedings of the 21st International Conference on Semantic Systems, 3-5 September 2025, Vienna, Austria. Spahiu, B., Vahdati, S., Salatino, A., Pellegrini, T. & Havur, G. (Hrsg.). IOS Press BV, S. 176-193 18 S. (Studies on the Semantic Web; Band 62).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung

Best Practices in AI and Data Science Models Evaluation

Banerjee, D., Taffa, T. A. & Usbeck, R., 2025, INFORMATIK 2025 : The Wide Open - Offenheit von Source bis Science, 16.-19.September 2025 Potsdam. Lucke, U., Stieglitz, S., Uebernickel, F., Lamprecht, A.-L. & Klein, M. (Hrsg.). Bonn: Gesellschaft für Informatik, Bonn, S. 1211-1219 9 S. (Lecture Notes in Informatics; Band P366).

Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet

DOI

https://doi.org/10.3233/SSW250024
Endgültige, publizierte Fassung