A Framework for Applying Natural Language Processing in Digital Health Interventions

Burkhardt Funk; Shiri Sadeh-Sharvit; Ellen E. Fitzsimmons-Craft; Mickey Todd Trockel; Grace E Monterubio; Neha J Goel; Katherine N Balantekin; Dawn M Eichen; Rachael E Flatt; Marie-Laure Firebaugh; Corinna Jacobi; Andrea K. Graham; Mark Hoogendoorn; Denise E Wilfley; C Barr Taylor

doi:10.2196/13855

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

Burkhardt Funk
Shiri Sadeh-Sharvit
Ellen E. Fitzsimmons-Craft
Mickey Todd Trockel
Grace E Monterubio
Neha J Goel
Katherine N Balantekin
Dawn M Eichen
Rachael E Flatt
Marie-Laure Firebaugh
Corinna Jacobi
Andrea K. Graham
Mark Hoogendoorn
Denise E Wilfley
C Barr Taylor

Professorship for Information Systems, in particular Data Science

Background: Digital health interventions (DHIs) are poised to reduce target symptoms in a scalable, affordable, and empirically supported way. DHIs that involve coaching or clinical support often collect text data from 2 sources: (1) open correspondence between users and the trained practitioners supporting them through a messaging system and (2) text data recorded during the intervention by users, such as diary entries. Natural language processing (NLP) offers methods for analyzing text, augmenting the understanding of intervention effects, and informing therapeutic decision making.

Objective: This study aimed to present a technical framework that supports the automated analysis of both types of text data often present in DHIs. This framework generates text features and helps to build statistical models to predict target variables, including user engagement, symptom change, and therapeutic outcomes.
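The feature-generation step described in the Objective can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the example diary entry, and the three-word vocabulary are hypothetical, and simple term counts stand in for whatever features the framework actually produces.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def text_features(text, vocabulary):
    """Return the token count plus one count per vocabulary term."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    features = {"n_tokens": len(tokens)}
    for term in vocabulary:
        # Counter returns 0 for terms that never occur in the text.
        features["count_" + term] = counts[term]
    return features


# Hypothetical diary entry and vocabulary, for illustration only.
entry = "I felt stressed today and skipped lunch. Stressed again tonight."
vocab = ["stressed", "lunch", "binge"]
features = text_features(entry, vocab)
print(features)
```

A feature dictionary of this kind, computed per snippet or per message, could then serve as input to a statistical model of a target variable such as engagement or symptom change.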

Methods: We first discussed various NLP techniques and demonstrated how they are implemented in the presented framework. We then applied the framework in a case study of the Healthy Body Image Program, a Web-based intervention trial for eating disorders (EDs). A total of 372 participants who screened positive for an ED received a DHI aimed at reducing ED psychopathology (including binge eating and purging behaviors) and improving body image. These users generated 37,228 intervention text snippets and exchanged 4285 user-coach messages, which were analyzed using the proposed model.

Results: We applied the framework to predict binge eating behavior, resulting in an area under the curve between 0.57 (when applied to new users) and 0.72 (when applied to new symptom reports of known users). In addition, initial evidence indicated that specific text features predicted the therapeutic outcome of reducing ED symptoms.
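The two evaluation settings behind the reported range differ only in how data are held out. The sketch below is a hedged illustration rather than the study's procedure: the record fields `user` and `week`, the helper names, and the toy scores are assumptions. It computes the area under the curve as the share of positive-negative pairs ranked correctly, and shows one way to hold out whole users versus the latest reports of known users.

```python
def auc(labels, scores):
    """Probability that a random positive outranks a random negative.

    Ties count as one half. This equals the area under the ROC curve.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def split_new_users(records, test_users):
    """Hold out every report of the given users (the 'new users' setting)."""
    train = [r for r in records if r["user"] not in test_users]
    test = [r for r in records if r["user"] in test_users]
    return train, test


def split_new_reports(records, n_last=1):
    """Hold out each user's last n_last reports (the 'known users' setting)."""
    if n_last < 1:
        raise ValueError("n_last must be at least 1")
    by_user = {}
    for r in records:
        by_user.setdefault(r["user"], []).append(r)
    train, test = [], []
    for reports in by_user.values():
        ordered = sorted(reports, key=lambda r: r["week"])
        train.extend(ordered[:-n_last])
        test.extend(ordered[-n_last:])
    return train, test


# Toy data, for illustration only: three users with three weekly reports each.
records = [{"user": u, "week": w} for u in ("a", "b", "c") for w in (1, 2, 3)]
train_u, test_u = split_new_users(records, {"c"})
train_r, test_r = split_new_reports(records, n_last=1)
example_auc = auc([1, 0, 1, 0], [0.9, 0.4, 0.6, 0.7])
print(len(train_u), len(test_u), len(train_r), len(test_r), example_auc)
```

In the first split the model never sees any text from the held-out users, whereas in the second it has already seen earlier reports from every user in the test set, which is consistent with the higher value reported for known users.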

Conclusions: The case study demonstrates the usefulness of a structured approach to text data analytics. NLP techniques improve the prediction of symptom changes in DHIs. We present a technical framework that can be easily applied in other clinical trials and clinical presentations and encourage other groups to apply the framework in similar contexts.

Original language	English
Article number	e13855
Journal	Journal of Medical Internet Research
Volume	22
Issue number	2
Number of pages	13
ISSN	1439-4456
DOIs	https://doi.org/10.2196/13855
Publication status	Published - 19.02.2020

Bibliographical note

Publisher Copyright:
© Burkhardt Funk, Shiri Sadeh-Sharvit, Ellen E Fitzsimmons-Craft, Mickey Todd Trockel, Grace E Monterubio, Neha J Goel, Katherine N Balantekin, Dawn M Eichen, Rachael E Flatt, Marie-Laure Firebaugh, Corinna Jacobi, Andrea K Graham, Mark Hoogendoorn, Denise E Wilfley, C Barr Taylor. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 19.02.2020. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.

ASJC Scopus Subject Areas

Health Informatics

Research areas

Business informatics - Digital Health Interventions Text Analytics (DHITA), Text mining, Digital health interventions, Eating disorders, Guided self-help, Natural language processing

Related by journal

Digital Health Literacy of Children and Adolescents and Its Association With Sociodemographic Factors: Representative Study Findings From Germany

Stauch, L., Renninger, D., Rangnow, P., Hartmann, A., Fischer, L., Dadaczynski, K. & Okan, O., 05.05.2025, In: Journal of Medical Internet Research. 27, 15 p., e69170.

Research output: Journal contributions › Journal articles › Research › peer-review

Efficacy of a Self-Guided Internet Intervention With Optional On-Demand Feedback Versus Digital Psychoeducation on Sleep Hygiene for University Students With Insomnia: Randomized Controlled Trial

Zarski, A. C., Bernstein, K., Baumeister, H., Lehr, D., Wernicke, S., Küchler, A. M., Kählke, F., Spiegelhalder, K. & Ebert, D. D., 08.05.2025, In: Journal of Medical Internet Research. 27, e58024.

Research output: Journal contributions › Journal articles › Research › peer-review

Efficacy of a Web-Based Stress Management Intervention for Beginning Teachers on Reducing Stress and Mechanisms of Change: Randomized Controlled Trial

Heckendorf, H. & Lehr, D., 16.06.2025, In: Journal of Medical Internet Research. 27, 28 p., e58475.

Research output: Journal contributions › Journal articles › Research › peer-review

A Universal Digital Stress Management Intervention for Employees: Randomized Controlled Trial with Health-Economic Evaluation

Freund, J., Smit, F., Lehr, D., Zarski, A. C., Berking, M., Riper, H., Funk, B., Ebert, D. D. & Buntrock, C., 22.10.2024, In: Journal of Medical Internet Research. 26, 13 p., e48481.

Research output: Journal contributions › Journal articles › Research › peer-review

eHealth Literacy and Web-Based Health Information–Seeking Behaviors on COVID-19 in Japan: Internet-Based Mixed Methods Study

Mitsutake, S., Oka, K., Okan, O., Dadaczynski, K., Ishizaki, T., Nakayama, T. & Takahashi, Y., 11.07.2024, In: Journal of Medical Internet Research. 26, 1, 27 p., e57842.

Research output: Journal contributions › Journal articles › Research › peer-review

Other publications by the same author(s)

Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)

Zainal, N. H., Eckhardt, R., Rackoff, G. N., Fitzsimmons-Craft, E. E., Rojas-Ashe, E., Barr Taylor, C., Funk, B., Eisenberg, D., Wilfley, D. E. & Newman, M. G., 02.04.2025, In: Psychological Medicine. 55, e106.

Research output: Journal contributions › Journal articles › Research › peer-review

Construct relation extraction from scientific papers: Is it automatable yet?

Funk, B. & Scharfenberger, J., 07.01.2025, Proceedings of the 58th Hawaii International Conference on System Sciences, HICSS 2025. Bui, T. X. (ed.). Honolulu: University of Hawaii at Manoa, p. 4675-4684 10 p. (Hawaii International Conference on System Sciences (HICSS); vol. 2025).

Research output: Contributions to collected editions/works › Published abstract in conference proceedings › Research › peer-review

From Feedback to Formative Guidance: Leveraging LLMs for Personalized Support in Programming Projects

Ghoochani, F., Scharfenberger, J., Funk, B., Doublan, R., Jakharabhai Odedra, M. & Etsiwah, B., 12.06.2025, UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization. Conati, C., Narducci, F., Rossiello, G., Musto, C. & Vassileva, J. (eds.). Association for Computing Machinery, Inc, p. 398-403 6 p. (UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

The promise and challenges of computer mouse trajectories in DMHIs – A feasibility study on pre-treatment dropout predictions

Zantvoort, K., Matthiesen, J., Bjurner, P., Bendix, M., Brefeld, U., Funk, B. & Kaldo, V., 06.2025, In: Internet Interventions. 40, 7 p., 100828.

Research output: Journal contributions › Journal articles › Research › peer-review

A Universal Digital Stress Management Intervention for Employees: Randomized Controlled Trial with Health-Economic Evaluation

Freund, J., Smit, F., Lehr, D., Zarski, A. C., Berking, M., Riper, H., Funk, B., Ebert, D. D. & Buntrock, C., 22.10.2024, In: Journal of Medical Internet Research. 26, 13 p., e48481.

Research output: Journal contributions › Journal articles › Research › peer-review
