A Framework for Applying Natural Language Processing in Digital Health Interventions

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

  • Burkhardt Funk
  • Shiri Sadeh-Sharvit
  • Ellen E Fitzsimmons-Craft
  • Mickey Todd Trockel
  • Grace E Monterubio
  • Neha J Goel
  • Katherine N Balantekin
  • Dawn M Eichen
  • Rachael E Flatt
  • Marie-Laure Firebaugh
  • Corinna Jacobi
  • Andrea K Graham
  • Mark Hoogendoorn
  • Denise E Wilfley
  • C Barr Taylor
Background: Digital health interventions (DHIs) are poised to reduce target symptoms in a scalable, affordable, and empirically supported way. DHIs that involve coaching or clinical support often collect text data from 2 sources: (1) open correspondence between users and the trained practitioners supporting them through a messaging system and (2) text data recorded during the intervention by users, such as diary entries. Natural language processing (NLP) offers methods for analyzing text, augmenting the understanding of intervention effects, and informing therapeutic decision making.

Objective: This study aimed to present a technical framework that supports the automated analysis of both types of text data often present in DHIs. This framework generates text features and helps to build statistical models to predict target variables, including user engagement, symptom change, and therapeutic outcomes.
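As a rough illustration of what "generating text features" can mean, the sketch below turns free-text snippets into bag-of-words count vectors that a statistical model could consume. It is not the framework's actual implementation: the function names, the tokenization rule, and the sample diary entries are all hypothetical and chosen only for this example.

```python
import re
from collections import Counter

# Assumption: a simple lowercase word tokenizer; the framework may tokenize differently.
TOKEN_PATTERN = re.compile(r"[a-z']+")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return TOKEN_PATTERN.findall(text.lower())


def build_vocabulary(documents, min_count=1):
    """Collect tokens seen at least min_count times, sorted for a stable column order."""
    counts = Counter(tok for doc in documents for tok in tokenize(doc))
    return sorted(tok for tok, n in counts.items() if n >= min_count)


def bag_of_words(text, vocabulary):
    """Return one count per vocabulary term for a single text."""
    counts = Counter(tokenize(text))
    return [counts[term] for term in vocabulary]


# Hypothetical diary entries, invented for illustration only.
snippets = [
    "I skipped lunch and felt anxious",
    "Felt calm today, ate lunch with a friend",
]
vocab = build_vocabulary(snippets)
features = [bag_of_words(s, vocab) for s in snippets]
```

Each row of `features` lines up with `vocab`, so the vectors can be passed to any classifier or regression model that predicts a target such as engagement or symptom change.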

Methods: We first discussed various NLP techniques and demonstrated how they are implemented in the presented framework. We then applied the framework in a case study of the Healthy Body Image Program, a Web-based intervention trial for eating disorders (EDs). A total of 372 participants who screened positive for an ED received a DHI aimed at reducing ED psychopathology (including binge eating and purging behaviors) and improving body image. These users generated 37,228 intervention text snippets and exchanged 4285 user-coach messages, which were analyzed using the proposed model.

Results: We applied the framework to predict binge eating behavior, resulting in an area under the curve between 0.57 (when applied to new users) and 0.72 (when applied to new symptom reports of known users). In addition, initial evidence indicated that specific text features predicted the therapeutic outcome of reducing ED symptoms.
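The two AUC values correspond to two evaluation settings: predicting for users the model has never seen versus predicting new reports from users already present in the training data. The sketch below shows one way to compute AUC and to build both kinds of train/test split. It is an assumption-laden illustration, not the study's evaluation code: the record layout, the split fractions, and all function names are hypothetical.

```python
import random


def auc(labels, scores):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as 0.5.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def split_by_user(records, test_fraction=0.25, seed=0):
    """Hold out whole users, so test reports come from users unseen in training."""
    users = sorted({r["user"] for r in records})
    random.Random(seed).shuffle(users)
    n_test = max(1, round(len(users) * test_fraction))
    test_users = set(users[:n_test])
    train = [r for r in records if r["user"] not in test_users]
    test = [r for r in records if r["user"] in test_users]
    return train, test


def split_by_report(records, test_fraction=0.25, seed=0):
    """Hold out individual reports, so test users may also appear in training."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical symptom reports: 4 users with 3 reports each.
records = [{"user": u, "report": i} for u in "ABCD" for i in range(3)]
train_u, test_u = split_by_user(records)    # "new users" setting
train_r, test_r = split_by_report(records)  # "new reports of known users" setting
```

Holding out whole users is the stricter test, because the model cannot rely on anything it learned about a specific person's writing, which is consistent with the lower AUC reported for new users.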

Conclusions: The case study demonstrates the usefulness of a structured approach to text data analytics. NLP techniques improve the prediction of symptom changes in DHIs. We present a technical framework that can be easily applied in other clinical trials and clinical presentations and encourage other groups to apply the framework in similar contexts.
Original language: English
Article number: e13855
Journal: Journal of Medical Internet Research
Volume: 22
Issue number: 2
Number of pages: 13
ISSN: 1439-4456
Publication status: Published - 19.02.2020

Bibliographical note

Publisher Copyright:
© Burkhardt Funk, Shiri Sadeh-Sharvit, Ellen E Fitzsimmons-Craft, Mickey Todd Trockel, Grace E Monterubio, Neha J Goel, Katherine N Balantekin, Dawn M Eichen, Rachael E Flatt, Marie-Laure Firebaugh, Corinna Jacobi, Andrea K Graham, Mark Hoogendoorn, Denise E Wilfley, C Barr Taylor. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 19.02.2020. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.

    Research areas

  • Business informatics - Digital Health Interventions Text Analytics (DHITA), Text mining, Digital health interventions, Eating disorders, Guided self-help, Natural language processing
