A Framework for Applying Natural Language Processing in Digital Health Interventions

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

  • Burkhardt Funk
  • Shiri Sadeh-Sharvit
  • Ellen E. Fitzsimmons-Craft
  • Mickey Todd Trockel
  • Grace E Monterubio
  • Neha J Goel
  • Katherine N Balantekin
  • Dawn M Eichen
  • Rachael E Flatt
  • Marie-Laure Firebaugh
  • Corinna Jacobi
  • Andrea K. Graham
  • Mark Hoogendoorn
  • Denise E Wilfley
  • C Barr Taylor
Background: Digital health interventions (DHIs) are poised to reduce target symptoms in a scalable, affordable, and empirically supported way. DHIs that involve coaching or clinical support often collect text data from 2 sources: (1) open correspondence between users and the trained practitioners supporting them through a messaging system and (2) text data recorded during the intervention by users, such as diary entries. Natural language processing (NLP) offers methods for analyzing text, augmenting the understanding of intervention effects, and informing therapeutic decision making.

Objective: This study aimed to present a technical framework that supports the automated analysis of both types of text data often present in DHIs. This framework generates text features and helps to build statistical models to predict target variables, including user engagement, symptom change, and therapeutic outcomes.
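
As an illustration of what generating text features from a diary entry or message can look like, the following is a minimal sketch and not the framework's actual implementation. The function names, the choice of features (token count and relative term frequencies), and the example sentence are all assumptions made for this illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def text_features(text, vocabulary):
    """Return a token count plus the relative frequency of each vocabulary term.

    Hypothetical helper: the feature set here is an assumption, not the
    feature set used in the published framework.
    """
    tokens = tokenize(text)
    counts = Counter(tokens)
    n_tokens = len(tokens)
    features = {"n_tokens": n_tokens}
    for term in vocabulary:
        # Guard against empty texts to avoid division by zero.
        features[f"tf_{term}"] = counts[term] / n_tokens if n_tokens else 0.0
    return features


# Invented example entry and vocabulary, for illustration only.
example = text_features(
    "I felt stressed and I skipped lunch",
    ["i", "stressed", "lunch"],
)
```

Feature dictionaries of this kind can be collected per user or per text snippet and passed to any statistical model that predicts a target variable such as engagement or symptom change.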

Methods: We first discussed various NLP techniques and demonstrated how they are implemented in the presented framework. We then applied the framework in a case study of the Healthy Body Image Program, a Web-based intervention trial for eating disorders (EDs). A total of 372 participants who screened positive for an ED received a DHI aimed at reducing ED psychopathology (including binge eating and purging behaviors) and improving body image. These users generated 37,228 intervention text snippets and exchanged 4285 user-coach messages, which were analyzed using the proposed model.

Results: We applied the framework to predict binge eating behavior, resulting in an area under the curve between 0.57 (when applied to new users) and 0.72 (when applied to new symptom reports of known users). In addition, initial evidence indicated that specific text features predicted the therapeutic outcome of reducing ED symptoms.
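
The two evaluation settings named above (new users versus new symptom reports of known users) correspond to two different ways of holding out test data. The sketch below shows one way to set this up, together with a plain computation of the area under the curve. It is an assumption-laden illustration: the abstract does not state how the authors split their data, and the function names and parameters are hypothetical.

```python
import random


def auc(labels, scores):
    """Area under the ROC curve, computed as the probability that a randomly
    chosen positive case scores higher than a randomly chosen negative case
    (ties count one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def split_by_user(user_ids, test_fraction=0.25, seed=0):
    """Hold out whole users, so every test row comes from an unseen user."""
    users = sorted(set(user_ids))
    random.Random(seed).shuffle(users)
    n_test = max(1, round(len(users) * test_fraction))
    test_users = set(users[:n_test])
    train_idx = [i for i, u in enumerate(user_ids) if u not in test_users]
    test_idx = [i for i, u in enumerate(user_ids) if u in test_users]
    return train_idx, test_idx


def split_by_report(n_rows, test_fraction=0.25, seed=0):
    """Hold out random rows, so test rows may belong to users seen in training."""
    idx = list(range(n_rows))
    random.Random(seed).shuffle(idx)
    n_test = max(1, round(n_rows * test_fraction))
    return sorted(idx[n_test:]), sorted(idx[:n_test])
```

Under the user-level split the model cannot rely on anything it learned about an individual, which is one plausible reason why performance on new users is lower than on new reports from known users.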

Conclusions: The case study demonstrates the usefulness of a structured approach to text data analytics. NLP techniques improve the prediction of symptom changes in DHIs. We present a technical framework that can be easily applied in other clinical trials and clinical presentations and encourage other groups to apply the framework in similar contexts.
Original language: English
Article number: e13855
Journal: Journal of Medical Internet Research
Volume: 22
Issue number: 2
Number of pages: 13
ISSN: 1439-4456
Publication status: Published - 19.02.2020

Bibliographical note

Publisher Copyright:
© Burkhardt Funk, Shiri Sadeh-Sharvit, Ellen E Fitzsimmons-Craft, Mickey Todd Trockel, Grace E Monterubio, Neha J Goel, Katherine N Balantekin, Dawn M Eichen, Rachael E Flatt, Marie-Laure Firebaugh, Corinna Jacobi, Andrea K Graham, Mark Hoogendoorn, Denise E Wilfley, C Barr Taylor. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 19.02.2020. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.

Research areas

  • Business informatics - Digital Health Interventions Text Analytics (DHITA), Text mining, Digital health interventions, Eating disorders, Guided self-help, Natural language processing
