Construct relation extraction from scientific papers: Is it automatable yet?
Research output: Contributions to collected editions/works › Published abstract in conference proceedings › Research › peer-review
Standard
Proceedings of the 58th Hawaii International Conference on System Sciences 2025. 2025. p. 4675.
Research output: Contributions to collected editions/works › Published abstract in conference proceedings › Research › peer-review
Harvard
APA
Vancouver
Bibtex
}
RIS
TY - CHAP
T1 - Construct relation extraction from scientific papers: Is it automatable yet?
AU - Funk, Burkhardt
AU - Scharfenberger, Jonas
N1 - Conference code: 58
PY - 2025
Y1 - 2025
N2 - The process of identifying relevant prior researcharticles is crucial for theoretical advancements, butoften requires significant human effort. This studyexamines the feasibility of using large languagemodels (LLMs) to support this task by extractingtested hypotheses, which consist of related constructs,moderators or mediators, path coefficients, andp-values, from empirical studies using structuralequation modeling (SEM). We combine state-of-the-artLLMs with a variety of post-processing measuresto improve the relation extraction quality. Anextensive evaluation yields recall scores of up to79.2% in construct entity extraction, 58.4% inconstruct-mediator/moderator-construct extraction,and 39.3% in extracting the full tested hypotheses.We provide a manually annotated dataset of 72 SEMarticles and 749 construct relations to facilitate futureresearch. Our findings offer critical insights andsuggest promising directions for advancing the field ofautomated construct relation extraction from scholarlydocuments.
AB - The process of identifying relevant prior researcharticles is crucial for theoretical advancements, butoften requires significant human effort. This studyexamines the feasibility of using large languagemodels (LLMs) to support this task by extractingtested hypotheses, which consist of related constructs,moderators or mediators, path coefficients, andp-values, from empirical studies using structuralequation modeling (SEM). We combine state-of-the-artLLMs with a variety of post-processing measuresto improve the relation extraction quality. Anextensive evaluation yields recall scores of up to79.2% in construct entity extraction, 58.4% inconstruct-mediator/moderator-construct extraction,and 39.3% in extracting the full tested hypotheses.We provide a manually annotated dataset of 72 SEMarticles and 749 construct relations to facilitate futureresearch. Our findings offer critical insights andsuggest promising directions for advancing the field ofautomated construct relation extraction from scholarlydocuments.
M3 - Published abstract in conference proceedings
SP - 4675
BT - Proceedings of the 58th Hawaii International Conference on System Sciences 2025
T2 - 58th Hawaii International Conference on System Sciences - HICSS 2025
Y2 - 7 January 2025 through 10 January 2025
ER -