Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)

Research output: Journal contributions › Journal articles › Research › peer-review

Standard

Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT). / Zainal, Nur Hani; Eckhardt, Regina; Rackoff, Gavin N. et al.
In: Psychological Medicine, Vol. 55, e106, 02.04.2025.

Harvard

Zainal, NH, Eckhardt, R, Rackoff, GN, Fitzsimmons-Craft, EE, Rojas-Ashe, E, Barr Taylor, C, Funk, B, Eisenberg, D, Wilfley, DE & Newman, MG 2025, 'Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)', Psychological Medicine, vol. 55, e106. https://doi.org/10.1017/S0033291725000340

APA

Zainal, N. H., Eckhardt, R., Rackoff, G. N., Fitzsimmons-Craft, E. E., Rojas-Ashe, E., Barr Taylor, C., Funk, B., Eisenberg, D., Wilfley, D. E., & Newman, M. G. (2025). Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT). Psychological Medicine, 55, Article e106. https://doi.org/10.1017/S0033291725000340

Vancouver

Zainal NH, Eckhardt R, Rackoff GN, Fitzsimmons-Craft EE, Rojas-Ashe E, Barr Taylor C et al. Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT). Psychological Medicine. 2025 Apr 2;55:e106. doi: 10.1017/S0033291725000340

Bibtex

@article{3cce202d951741bf8ca36f669098de6a,
title = "Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)",
abstract = "Background As the use of guided digitally-delivered cognitive-behavioral therapy (GdCBT) grows, pragmatic analytic tools are needed to evaluate coaches' implementation fidelity. Aims We evaluated how natural language processing (NLP) and machine learning (ML) methods might automate the monitoring of coaches' implementation fidelity to GdCBT delivered as part of a randomized controlled trial. Method Coaches served as guides to 6-month GdCBT with 3,381 assigned users with or at risk for anxiety, depression, or eating disorders. CBT-trained and supervised human coders used a rubric to rate the implementation fidelity of 13,529 coach-to-user messages. NLP methods abstracted data from text-based coach-to-user messages, and 11 ML models predicting coach implementation fidelity were evaluated. Results Inter-rater agreement by human coders was excellent (intra-class correlation coefficient .980-.992). Coaches achieved behavioral targets at the start of the GdCBT and maintained strong fidelity throughout most subsequent messages. Coaches also avoided prohibited actions (e.g. reinforcing users' avoidance). Sentiment analyses generally indicated a higher frequency of coach-delivered positive than negative sentiment words and predicted coach implementation fidelity with acceptable performance metrics (e.g. area under the receiver operating characteristic curve [AUC] = 74.48%). The final best-performing ML algorithms that included a more comprehensive set of NLP features performed well (e.g. AUC = 76.06%). Conclusions NLP and ML tools could help clinical supervisors automate monitoring of coaches' implementation fidelity to GdCBT. These tools could maximize allocation of scarce resources by reducing the personnel time needed to measure fidelity, potentially freeing up more time for high-quality clinical care.",
keywords = "anxiety, depression, digital mental health intervention, eating disorders, guided internet-delivered cognitive-behavioral therapy, implementation fidelity, machine learning, natural language processing, Business informatics",
author = "Zainal, {Nur Hani} and Regina Eckhardt and Rackoff, {Gavin N.} and Fitzsimmons-Craft, {Ellen E.} and Elsa Rojas-Ashe and {Barr Taylor}, Craig and Burkhardt Funk and Daniel Eisenberg and Wilfley, {Denise E.} and Newman, {Michelle G.}",
note = "Publisher Copyright: {\textcopyright} The Author(s), 2025. Published by Cambridge University Press.",
year = "2025",
month = apr,
day = "2",
doi = "10.1017/S0033291725000340",
language = "English",
volume = "55",
journal = "Psychological Medicine",
issn = "0033-2917",
publisher = "Cambridge University Press",

}

RIS

TY - JOUR

T1 - Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)

AU - Zainal, Nur Hani

AU - Eckhardt, Regina

AU - Rackoff, Gavin N.

AU - Fitzsimmons-Craft, Ellen E.

AU - Rojas-Ashe, Elsa

AU - Barr Taylor, Craig

AU - Funk, Burkhardt

AU - Eisenberg, Daniel

AU - Wilfley, Denise E.

AU - Newman, Michelle G.

N1 - Publisher Copyright: © The Author(s), 2025. Published by Cambridge University Press.

PY - 2025/4/2

Y1 - 2025/4/2

N2 - Background As the use of guided digitally-delivered cognitive-behavioral therapy (GdCBT) grows, pragmatic analytic tools are needed to evaluate coaches' implementation fidelity. Aims We evaluated how natural language processing (NLP) and machine learning (ML) methods might automate the monitoring of coaches' implementation fidelity to GdCBT delivered as part of a randomized controlled trial. Method Coaches served as guides to 6-month GdCBT with 3,381 assigned users with or at risk for anxiety, depression, or eating disorders. CBT-trained and supervised human coders used a rubric to rate the implementation fidelity of 13,529 coach-to-user messages. NLP methods abstracted data from text-based coach-to-user messages, and 11 ML models predicting coach implementation fidelity were evaluated. Results Inter-rater agreement by human coders was excellent (intra-class correlation coefficient .980-.992). Coaches achieved behavioral targets at the start of the GdCBT and maintained strong fidelity throughout most subsequent messages. Coaches also avoided prohibited actions (e.g. reinforcing users' avoidance). Sentiment analyses generally indicated a higher frequency of coach-delivered positive than negative sentiment words and predicted coach implementation fidelity with acceptable performance metrics (e.g. area under the receiver operating characteristic curve [AUC] = 74.48%). The final best-performing ML algorithms that included a more comprehensive set of NLP features performed well (e.g. AUC = 76.06%). Conclusions NLP and ML tools could help clinical supervisors automate monitoring of coaches' implementation fidelity to GdCBT. These tools could maximize allocation of scarce resources by reducing the personnel time needed to measure fidelity, potentially freeing up more time for high-quality clinical care.

AB - Background As the use of guided digitally-delivered cognitive-behavioral therapy (GdCBT) grows, pragmatic analytic tools are needed to evaluate coaches' implementation fidelity. Aims We evaluated how natural language processing (NLP) and machine learning (ML) methods might automate the monitoring of coaches' implementation fidelity to GdCBT delivered as part of a randomized controlled trial. Method Coaches served as guides to 6-month GdCBT with 3,381 assigned users with or at risk for anxiety, depression, or eating disorders. CBT-trained and supervised human coders used a rubric to rate the implementation fidelity of 13,529 coach-to-user messages. NLP methods abstracted data from text-based coach-to-user messages, and 11 ML models predicting coach implementation fidelity were evaluated. Results Inter-rater agreement by human coders was excellent (intra-class correlation coefficient .980-.992). Coaches achieved behavioral targets at the start of the GdCBT and maintained strong fidelity throughout most subsequent messages. Coaches also avoided prohibited actions (e.g. reinforcing users' avoidance). Sentiment analyses generally indicated a higher frequency of coach-delivered positive than negative sentiment words and predicted coach implementation fidelity with acceptable performance metrics (e.g. area under the receiver operating characteristic curve [AUC] = 74.48%). The final best-performing ML algorithms that included a more comprehensive set of NLP features performed well (e.g. AUC = 76.06%). Conclusions NLP and ML tools could help clinical supervisors automate monitoring of coaches' implementation fidelity to GdCBT. These tools could maximize allocation of scarce resources by reducing the personnel time needed to measure fidelity, potentially freeing up more time for high-quality clinical care.

KW - anxiety

KW - depression

KW - digital mental health intervention

KW - eating disorders

KW - guided internet-delivered cognitive-behavioral therapy

KW - implementation fidelity

KW - machine learning

KW - natural language processing

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=105001856822&partnerID=8YFLogxK

U2 - 10.1017/S0033291725000340

DO - 10.1017/S0033291725000340

M3 - Journal articles

C2 - 40170669

AN - SCOPUS:105001856822

VL - 55

JO - Psychological Medicine

JF - Psychological Medicine

SN - 0033-2917

M1 - e106

ER -
