Construct relation extraction from scientific papers: Is it automatable yet?

Burkhardt Funk; Jonas Scharfenberger

doi:10.24251/hicss.2025.563

Research output: Contributions to collected editions/works › Published abstract in conference proceedings › Research › peer-review

Authors

Professorship for Information Systems, in particular Data Science

The process of identifying relevant prior research articles is crucial for theoretical advancements, but often requires significant human effort. This study examines the feasibility of using large language models (LLMs) to support this task by extracting tested hypotheses, which consist of related constructs, moderators or mediators, path coefficients, and p-values, from empirical studies using structural equation modeling (SEM). We combine state-of-the-art LLMs with a variety of post-processing measures to improve the relation extraction quality. An extensive evaluation yields recall scores of up to 79.2% in construct entity extraction, 58.4% in construct-mediator/moderator-construct extraction, and 39.3% in extracting the full tested hypotheses. We provide a manually annotated dataset of 72 SEM articles and 749 construct relations to facilitate future research. Our findings offer critical insights and suggest promising directions for advancing the field of automated construct relation extraction from scholarly documents.
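As a rough illustration of the three evaluation levels named in the abstract (construct entities, construct-mediator/moderator-construct relations, and full tested hypotheses), the following minimal sketch computes recall at each level. It is not the authors' implementation: the record type `HypothesisRecord`, the helper names, the exact-match criterion, and the toy values are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set, Tuple


@dataclass(frozen=True)
class HypothesisRecord:
    """Hypothetical record for one tested hypothesis from an SEM study."""
    source: str                             # construct the path starts from
    target: str                             # construct the path points to
    third_variable: Optional[str] = None    # moderator or mediator, if any
    path_coefficient: Optional[float] = None
    p_value: Optional[str] = None           # kept as reported, e.g. "<0.01"


def recall(gold: Set, predicted: Set) -> float:
    """Share of gold items that also appear in the predictions (exact match)."""
    if not gold:
        return 0.0
    return len(gold & predicted) / len(gold)


def entities(records: Iterable[HypothesisRecord]) -> Set[str]:
    """All construct names mentioned in any role."""
    names: Set[str] = set()
    for r in records:
        names.update({r.source, r.target})
        if r.third_variable is not None:
            names.add(r.third_variable)
    return names


def relations(records: Iterable[HypothesisRecord]) -> Set[Tuple[str, Optional[str], str]]:
    """(source, moderator/mediator, target) triples, ignoring statistics."""
    return {(r.source, r.third_variable, r.target) for r in records}


# Toy annotations (invented values, not taken from the dataset).
gold = [
    HypothesisRecord("trust", "intention", None, 0.41, "<0.01"),
    HypothesisRecord("ease of use", "intention", "experience", 0.12, "<0.05"),
]
# The prediction misses the moderator of the second hypothesis.
predicted = [
    HypothesisRecord("trust", "intention", None, 0.41, "<0.01"),
    HypothesisRecord("ease of use", "intention", None, 0.12, "<0.05"),
]

entity_recall = recall(entities(gold), entities(predicted))
relation_recall = recall(relations(gold), relations(predicted))
full_recall = recall(set(gold), set(predicted))
print(entity_recall, relation_recall, full_recall)
```

Exact string matching is the simplest possible criterion and is used here only to keep the sketch short; a real evaluation would likely need some normalization of construct names before comparison.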

Original language	English
Title of host publication	Proceedings of the 58th Hawaii International Conference on System Sciences, HICSS 2025
Editors	Tung X. Bui
Number of pages	10
Place of Publication	Honolulu
Publisher	University of Hawaii at Manoa
Publication date	07.01.2025
Pages	4675-4684
ISBN (electronic)	978-0-9981331-8-8
DOIs	https://doi.org/10.24251/hicss.2025.563
Publication status	Published - 07.01.2025
Event	58th Hawaii International Conference on System Sciences (HICSS 2025), Hilton Waikoloa Village, Waikoloa, United States; Duration: 07.01.2025 → 10.01.2025; Conference number: 58; https://hicss.hawaii.edu/; https://doi.org/10.25798/rch5-7d05

Bibliographical note

Collections: AI Assistants and Generative AI for Knowledge Creation, Retention, and Use

Publisher Copyright:
© 2025 IEEE Computer Society. All rights reserved.

Research areas

Business informatics - AI Assistants and Generative AI for Knowledge Creation, Retention, and Use, Large Language Models, natural language processing, Relation extraction, structural equation modeling

Other publications by the same author(s)

Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)

Zainal, N. H., Eckhardt, R., Rackoff, G. N., Fitzsimmons-Craft, E. E., Rojas-Ashe, E., Barr Taylor, C., Funk, B., Eisenberg, D., Wilfley, D. E. & Newman, M. G., 02.04.2025, In: Psychological Medicine. 55, e106.

Research output: Journal contributions › Journal articles › Research › peer-review

Enhancing Invoice Recognition with LLM Embeddings in GAT Networks

Thiée, L. W. & Funk, B., 08.2025, Americas Conference on Information Systems, AMCIS 2025. The Association for Information Systems (AIS), p. 4483-4492 10 p. (Americas Conference on Information Systems, AMCIS 2025; vol. 7).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

From Feedback to Formative Guidance: Leveraging LLMs for Personalized Support in Programming Projects

Ghoochani, F., Scharfenberger, J., Funk, B., Doublan, R., Jakharabhai Odedra, M. & Etsiwah, B., 12.06.2025, UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization. Conati, C., Narducci, F., Rossiello, G., Musto, C. & Vassileva, J. (eds.). Association for Computing Machinery, Inc, p. 398-403 6 p. (UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

The promise and challenges of computer mouse trajectories in DMHIs – A feasibility study on pre-treatment dropout predictions

Zantvoort, K., Matthiesen, J., Bjurner, P., Bendix, M., Brefeld, U., Funk, B. & Kaldo, V., 06.2025, In: Internet Interventions. 40, 7 p., 100828.

Research output: Journal contributions › Journal articles › Research › peer-review

A Universal Digital Stress Management Intervention for Employees: Randomized Controlled Trial with Health-Economic Evaluation

Freund, J., Smit, F., Lehr, D., Zarski, A. C., Berking, M., Riper, H., Funk, B., Ebert, D. D. & Buntrock, C., 22.10.2024, In: Journal of Medical Internet Research. 26, 13 p., e48481.

Research output: Journal contributions › Journal articles › Research › peer-review
