Applying Large Language Models to Enhance Data Extraction for Clinical Research
Project: Research
Project participants
- Usbeck, Ricardo (Project manager, academic)
Description
Acute heart failure including cardiogenic shock is a life-threatening condition with high 30-day mortality up to 60%.
In order to understand these critical conditions better, large registries are being established. These are most valuable primarily in generating hypotheses for further assessment. This assessment is usually performed in randomised controlled trials (RCTs) which permit insights into causal relationships between medical interventions (like the use of novel medical drugs or mechanical circulatory support devices like the veno-arterial extracorporeal membrane oxygenation, so-called VA-ECMO) and patient outcomes. Unfortunately, building the registries as well as performing RCTs are labour-intensive endeavours and, thus, both time-consuming and costly.
In the applicants’ opinion, leveraging existing data collected from clinical routine is of utmost importance to advance the research in understanding the critical conditions. Also, increasing the rate of patient recruitment by better embedding research-related tasks into clinical routine will reduce RCT durations necessary to obtain sufficient numbers of patients according to power calculations. On top, building larger registries from automatically collected data in a multi-centre setting will be simplified.
Applying artificial intelligence (AI) through large language models (LLMs) addresses this need: Much information of interest to the clinical researcher is contained in discharge letters from hospitals in a more or less structured way. Instead of spending labour force to collect these data, applying AI is a valuable and cost-saving alternative.
In order to understand these critical conditions better, large registries are being established. These are most valuable primarily in generating hypotheses for further assessment. This assessment is usually performed in randomised controlled trials (RCTs) which permit insights into causal relationships between medical interventions (like the use of novel medical drugs or mechanical circulatory support devices like the veno-arterial extracorporeal membrane oxygenation, so-called VA-ECMO) and patient outcomes. Unfortunately, building the registries as well as performing RCTs are labour-intensive endeavours and, thus, both time-consuming and costly.
In the applicants’ opinion, leveraging existing data collected from clinical routine is of utmost importance to advance the research in understanding the critical conditions. Also, increasing the rate of patient recruitment by better embedding research-related tasks into clinical routine will reduce RCT durations necessary to obtain sufficient numbers of patients according to power calculations. On top, building larger registries from automatically collected data in a multi-centre setting will be simplified.
Applying artificial intelligence (AI) through large language models (LLMs) addresses this need: Much information of interest to the clinical researcher is contained in discharge letters from hospitals in a more or less structured way. Instead of spending labour force to collect these data, applying AI is a valuable and cost-saving alternative.
Status | Active |
---|---|
Period | 01.09.24 → 28.02.26 |