Construct relation extraction from scientific papers: Is it automatable yet?
Research output: Contributions to collected editions/works › Published abstract in conference proceedings › Research › peer-review
Authors
The process of identifying relevant prior research articles is crucial for theoretical advancements, but often requires significant human effort. This study examines the feasibility of using large language models (LLMs) to support this task by extracting tested hypotheses, which consist of related constructs, moderators or mediators, path coefficients, and p-values, from empirical studies using structural equation modeling (SEM). We combine state-of-the-art LLMs with a variety of post-processing measures to improve the relation extraction quality. An extensive evaluation yields recall scores of up to 79.2% in construct entity extraction, 58.4% in construct-mediator/moderator-construct extraction, and 39.3% in extracting the full tested hypotheses. We provide a manually annotated dataset of 72 SEM articles and 749 construct relations to facilitate future research. Our findings offer critical insights and suggest promising directions for advancing the field of automated construct relation extraction from scholarly documents.
Original language | English |
---|---|
Title of host publication | Proceedings of the 58th Hawaii International Conference on System Sciences |
Editors | Tung Bui |
Number of pages | 10 |
Place of Publication | Honolulu |
Publisher | University of Hawaiʻi at Mānoa |
Publication date | 07.01.2025 |
Pages | 4675-4684 |
ISBN (electronic) | 978-0-9981331-8-8 |
Publication status | Published - 07.01.2025 |
Event | 58th Hawaii International Conference on System Sciences - HICSS 2025 - Hilton Waikoloa Village, Waikoloa, United States Duration: 07.01.2025 → 10.01.2025 Conference number: 58 |
Bibliographical note
Collections: AI Assistants and Generative AI for Knowledge Creation, Retention, and Use
- Business informatics - AI Assistants and Generative AI for Knowledge Creation, Retention, and Use, Large Language Models, natural language processing, Relation extraction, structural equation modeling