Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)

Nur Hani Zainal; Regina Eckhardt; Gavin N. Rackoff; Ellen E. Fitzsimmons-Craft; Elsa Rojas-Ashe; Craig Barr Taylor; Burkhardt Funk; Daniel Eisenberg; Denise E. Wilfley; Michelle G. Newman

doi:10.1017/S0033291725000340

Research output: Journal contributions › Journal articles › Research › peer-review

Standard

Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT). / Zainal, Nur Hani; Eckhardt, Regina; Rackoff, Gavin N. et al.
In: Psychological Medicine, Vol. 55, e106, 02.04.2025.

Bibtex

@article{3cce202d951741bf8ca36f669098de6a,

title = "Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)",

abstract = "Background As the use of guided digitally-delivered cognitive-behavioral therapy (GdCBT) grows, pragmatic analytic tools are needed to evaluate coaches' implementation fidelity. Aims We evaluated how natural language processing (NLP) and machine learning (ML) methods might automate the monitoring of coaches' implementation fidelity to GdCBT delivered as part of a randomized controlled trial. Method Coaches served as guides to 6-month GdCBT with 3,381 assigned users with or at risk for anxiety, depression, or eating disorders. CBT-trained and supervised human coders used a rubric to rate the implementation fidelity of 13,529 coach-to-user messages. NLP methods abstracted data from text-based coach-to-user messages, and 11 ML models predicting coach implementation fidelity were evaluated. Results Inter-rater agreement by human coders was excellent (intra-class correlation coefficient .980-.992). Coaches achieved behavioral targets at the start of the GdCBT and maintained strong fidelity throughout most subsequent messages. Coaches also avoided prohibited actions (e.g. reinforcing users' avoidance). Sentiment analyses generally indicated a higher frequency of coach-delivered positive than negative sentiment words and predicted coach implementation fidelity with acceptable performance metrics (e.g. area under the receiver operating characteristic curve [AUC] = 74.48%). The final best-performing ML algorithms that included a more comprehensive set of NLP features performed well (e.g. AUC = 76.06%). Conclusions NLP and ML tools could help clinical supervisors automate monitoring of coaches' implementation fidelity to GdCBT. These tools could maximize allocation of scarce resources by reducing the personnel time needed to measure fidelity, potentially freeing up more time for high-quality clinical care.",

keywords = "anxiety, depression, digital mental health intervention, eating disorders, guided internet-delivered cognitive-behavioral therapy, implementation fidelity, machine learning, natural language processing, Business informatics",

author = "Zainal, {Nur Hani} and Regina Eckhardt and Rackoff, {Gavin N.} and Fitzsimmons-Craft, {Ellen E.} and Elsa Rojas-Ashe and {Barr Taylor}, Craig and Burkhardt Funk and Daniel Eisenberg and Wilfley, {Denise E.} and Newman, {Michelle G.}",

note = "Publisher Copyright: {\textcopyright} The Author(s), 2025. Published by Cambridge University Press.",

year = "2025",

month = apr,

day = "2",

doi = "10.1017/S0033291725000340",

language = "English",

volume = "55",

journal = "Psychological Medicine",

issn = "0033-2917",

publisher = "Cambridge University Press",

}

RIS

TY - JOUR

T1 - Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)

AU - Zainal, Nur Hani

AU - Eckhardt, Regina

AU - Rackoff, Gavin N.

AU - Fitzsimmons-Craft, Ellen E.

AU - Rojas-Ashe, Elsa

AU - Barr Taylor, Craig

AU - Funk, Burkhardt

AU - Eisenberg, Daniel

AU - Wilfley, Denise E.

AU - Newman, Michelle G.

N1 - Publisher Copyright: © The Author(s), 2025. Published by Cambridge University Press.

PY - 2025/4/2

Y1 - 2025/4/2

N2 - Background As the use of guided digitally-delivered cognitive-behavioral therapy (GdCBT) grows, pragmatic analytic tools are needed to evaluate coaches' implementation fidelity. Aims We evaluated how natural language processing (NLP) and machine learning (ML) methods might automate the monitoring of coaches' implementation fidelity to GdCBT delivered as part of a randomized controlled trial. Method Coaches served as guides to 6-month GdCBT with 3,381 assigned users with or at risk for anxiety, depression, or eating disorders. CBT-trained and supervised human coders used a rubric to rate the implementation fidelity of 13,529 coach-to-user messages. NLP methods abstracted data from text-based coach-to-user messages, and 11 ML models predicting coach implementation fidelity were evaluated. Results Inter-rater agreement by human coders was excellent (intra-class correlation coefficient .980-.992). Coaches achieved behavioral targets at the start of the GdCBT and maintained strong fidelity throughout most subsequent messages. Coaches also avoided prohibited actions (e.g. reinforcing users' avoidance). Sentiment analyses generally indicated a higher frequency of coach-delivered positive than negative sentiment words and predicted coach implementation fidelity with acceptable performance metrics (e.g. area under the receiver operating characteristic curve [AUC] = 74.48%). The final best-performing ML algorithms that included a more comprehensive set of NLP features performed well (e.g. AUC = 76.06%). Conclusions NLP and ML tools could help clinical supervisors automate monitoring of coaches' implementation fidelity to GdCBT. These tools could maximize allocation of scarce resources by reducing the personnel time needed to measure fidelity, potentially freeing up more time for high-quality clinical care.

KW - anxiety

KW - depression

KW - digital mental health intervention

KW - eating disorders

KW - guided internet-delivered cognitive-behavioral therapy

KW - implementation fidelity

KW - machine learning

KW - natural language processing

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=105001856822&partnerID=8YFLogxK

U2 - 10.1017/S0033291725000340

DO - 10.1017/S0033291725000340

M3 - Journal articles

C2 - 40170669

AN - SCOPUS:105001856822

VL - 55

JO - Psychological Medicine

JF - Psychological Medicine

SN - 0033-2917

M1 - e106

ER -

Related by journal

Who benefits from indirect prevention and treatment of depression using an online intervention for insomnia? Results from an individual-participant data meta-analysis

Thielecke, J., Kuper, P., Lehr, D., Schuurmans, L., Harrer, M., Ebert, D. D., Cuijpers, P., Behrendt, D., Brückner, H. A., Horvath, H., Riper, H. & Buntrock, C., 07.2024, In: Psychological Medicine. 54, 10, p. 2389-2402 14 p.

Research output: Journal contributions › Journal articles › Research › peer-review

Trust in government regarding COVID-19 and its associations with preventive health behaviour and prosocial behaviour during the pandemic: A cross-sectional and longitudinal study

PsyCorona Collaboration, 26.01.2023, In: Psychological Medicine. 53, 1, p. 149-159 11 p.

Research output: Journal contributions › Journal articles › Research › peer-review

Does Internet-based guided self-help for depression cause harm? An individual participant data meta-analysis on deterioration rates and its moderators in randomized controlled trials

Ebert, D. D., Donkin, L., Andersson, G., Andrews, G., Berger, T. K., Carlbring, P., Rozental, A., Choi, I., Laferton, J. A. C., Johansson, R., Kleiboer, A., Lange, A., Lehr, D., Reins, J. A., Funk, B., Newby, J., Perini, S., Riper, H., Ruwaard, J., Sheeber, L., Snoek, F., Titov, N., Ünlü Ince, B., Van Bastelaar, K. M. P., Vernmark, K., Van Straten, A., Warmerdam, L., Salsman, N. & Cuijpers, P., 01.10.2016, In: Psychological Medicine. 46, 13, p. 2679-2693 15 p.

Research output: Journal contributions › Scientific review articles › Research

Guided Internet-delivered cognitive behavioural treatment for insomnia: a randomized trial

Van Straten, A., Emmelkamp, J., de Wit, J., Lancee, J., Andersson, G., van Someren, E. J. W. & Cuijpers, P., 05.2014, In: Psychological Medicine. 44, 7, p. 1521-1532 12 p.

Research output: Journal contributions › Journal articles › Research › peer-review

Online cognitive-based intervention for depression: exploring possible circularity in mechanisms of change

van der Zanden, R., Galindo-Garre, F., Curie, K., Kramer, J. & Cuijpers, P., 04.2014, In: Psychological Medicine. 44, 6, p. 1159-1170 12 p.

Research output: Journal contributions › Journal articles › Research › peer-review

Other publications by the same author(s)

Construct relation extraction from scientific papers: Is it automatable yet?

Funk, B. & Scharfenberger, J., 07.01.2025, Proceedings of the 58th Hawaii International Conference on System Sciences, HICSS 2025. Bui, T. X. (ed.). Honolulu: University of Hawaii at Manoa, p. 4675-4684 10 p. (Hawaii International Conference on System Sciences (HICSS); vol. 2025).

Research output: Contributions to collected editions/works › Published abstract in conference proceedings › Research › peer-review

From Feedback to Formative Guidance: Leveraging LLMs for Personalized Support in Programming Projects

Ghoochani, F., Scharfenberger, J., Funk, B., Doublan, R., Jakharabhai Odedra, M. & Etsiwah, B., 12.06.2025, UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization. Conati, C., Narducci, F., Rossiello, G., Musto, C. & Vassileva, J. (eds.). Association for Computing Machinery, Inc, p. 398-403 6 p. (UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

The promise and challenges of computer mouse trajectories in DMHIs – A feasibility study on pre-treatment dropout predictions

Zantvoort, K., Matthiesen, J., Bjurner, P., Bendix, M., Brefeld, U., Funk, B. & Kaldo, V., 06.2025, In: Internet Interventions. 40, 7 p., 100828.

Research output: Journal contributions › Journal articles › Research › peer-review

A Universal Digital Stress Management Intervention for Employees: Randomized Controlled Trial with Health-Economic Evaluation

Freund, J., Smit, F., Lehr, D., Zarski, A. C., Berking, M., Riper, H., Funk, B., Ebert, D. D. & Buntrock, C., 22.10.2024, In: Journal of Medical Internet Research. 26, 13 p., e48481.

Research output: Journal contributions › Journal articles › Research › peer-review

Dataset size versus homogeneity: A machine learning study on pooling intervention data in e-mental health dropout predictions

Zantvoort, K., Hentati Isacsson, N., Funk, B. & Kaldo, V., 15.05.2024, (E-pub ahead of print) In: Digital Health. 10, 10 p.

Research output: Journal contributions › Journal articles › Research › peer-review

DOI

https://doi.org/10.1017/S0033291725000340
Final published version
