A Framework for Applying Natural Language Processing in Digital Health Interventions

Research output: Journal contributionsJournal articlesResearchpeer-review

Authors

  • Burkhardt Funk
  • Shiri Sadeh-Sharvit
  • Ellen E. Fitzsimmons-Craft
  • Mickey Todd Trockel
  • Grace E Monterubio
  • Neha J Goel
  • Katherine N Balantekin
  • Dawn M Eichen
  • Rachael E Flatt
  • Marie-Laure Firebaugh
  • Corinna Jacobi
  • Andrea K. Graham
  • Mark Hoogendoorn
  • Denise E Wilfley
  • C Barr Taylor
Background: Digital health interventions (DHIs) are poised to reduce target symptoms in a scalable, affordable, and empirically supported way. DHIs that involve coaching or clinical support often collect text data from 2 sources: (1) open correspondence between users and the trained practitioners supporting them through a messaging system and (2) text data recorded during the intervention by users, such as diary entries. Natural language processing (NLP) offers methods for analyzing text, augmenting the understanding of intervention effects, and informing therapeutic decision making.

Objective: This study aimed to present a technical framework that supports the automated analysis of both types of text data often present in DHIs. This framework generates text features and helps to build statistical models to predict target variables, including user engagement, symptom change, and therapeutic outcomes.

Methods: We first discussed various NLP techniques and demonstrated how they are implemented in the presented framework. We then applied the framework in a case study of the Healthy Body Image Program, a Web-based intervention trial for eating disorders (EDs). A total of 372 participants who screened positive for an ED received a DHI aimed at reducing ED psychopathology (including binge eating and purging behaviors) and improving body image. These users generated 37,228 intervention text snippets and exchanged 4285 user-coach messages, which were analyzed using the proposed model.

Results: We applied the framework to predict binge eating behavior, resulting in an area under the curve between 0.57 (when applied to new users) and 0.72 (when applied to new symptom reports of known users). In addition, initial evidence indicated that specific text features predicted the therapeutic outcome of reducing ED symptoms.

Conclusions: The case study demonstrates the usefulness of a structured approach to text data analytics. NLP techniques improve the prediction of symptom changes in DHIs. We present a technical framework that can be easily applied in other clinical trials and clinical presentations and encourage other groups to apply the framework in similar contexts.
Original languageEnglish
Article numbere13855
JournalJournal of Medical Internet Research
Volume22
Issue number2
Number of pages13
ISSN1439-4456
DOIs
Publication statusPublished - 19.02.2020

Bibliographical note

Publisher Copyright:
© Burkhardt Funk, Shiri Sadeh-Sharvit, Ellen E Fitzsimmons-Craft, Mickey Todd Trockel, Grace E Monterubio, Neha J Goel, Katherine N Balantekin, Dawn M Eichen, Rachael E Flatt, Marie-Laure Firebaugh, Corinna Jacobi, Andrea K Graham, Mark Hoogendoorn, Denise E Wilfley, C Barr Taylor. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 19.02.2020. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.

    Research areas

  • Business informatics - Digital Health Interventions Text Analytics (DHITA), Text mining, Digital health interventions, Eating disorders, Guided self-help, Natural language processing

DOI

Recently viewed

Activities

  1. Contagious Agents: From Generative Social Science to the Computer Simulation of Epidemics
  2. A Framework for Text Analytics in Online Interventions
  3. Tilling the fields of knowledge in sustainability-oriented science
  4. Performativity and Authenticity in the Web 2.0-Enhanced Foreign Language Classroom
  5. Where is language use in the description of the Englishes? - ESSE 2006
  6. LC-MS identification of the photo-transformation products of desipramine with studying the effect of different environmental variables on the kinetics of their formation
  7. Modeling Grounding Processes in Chat-Based CSCL
  8. Artifacts and frames in socio-technical anticipation: The case of responsible AI
  9. IEEE Transactions on Control Systems Technology (Zeitschrift)
  10. Keep Calm and Solve the Problem: An Integrated Model to Reduce Threat and Defense in Conflicts
  11. Transformations 2017
  12. Peter G. Mahaffy
  13. Process Tracing Methodology - 2011
  14. Conference on Participatory Approaches in Science & Technology - PATH 2006
  15. Everything flows – identification and characterization of coherent patterns
  16. Simulation and Evaluation of Control Mechanisms for Mobile Robot Fulfillment Systems
  17. International Conference on Mathematical Models & Computational Techniques in Science & Engineering - MMCTSE2020 
  18. The many paths one picture can paint: Tracing a visual’s boundary work
  19. 2021 3rd International Conference on Soft Computing and its Engineering Applications
  20. Blogs in the Foreign Language Classroom
  21. Teams are changing! Going into the wild to expand theory on dynamics in modern teamwork settings
  22. Is a better understanding of assembly a way to help reassemble communities for restoration?

Publications

  1. Gamma GAMM applied on tree growth data
  2. Analysis And Comparison Of Dispatching RuleBased Scheduling In Dual-Resource Constrained Shop-Floor Scenarios
  3. Integrating adaptation and mitigation to climatic changes
  4. Enhancing EFL classroom instruction via the FeedBook: effects on language development and communicative language use.
  5. Contextualizing the relationship between self-commitment and performance
  6. The Dialectics of Open Access
  7. Factored MDPs for detecting topics of user sessions
  8. Sliding Mode Control Strategies for Maglev Systems Based on Kalman Filtering
  9. A tutorial introduction to adaptive fractal analysis
  10. Mining Implications From Data
  11. An approach for dynamic triangulation using servomotors
  12. Control system strategy of a modular omnidirectional AGV
  13. Robust Control of Excavation Mobile Robot with Dynamic Triangulation Vision
  14. Scholarly Question Answering Using Large Language Models in the NFDI4DataScience Gateway
  15. Application of design of experiments for laser shock peening process optimization
  16. Mimicking and anticipating others’ actions is linked to social information processing
  17. A high-resolution approach for the spatiotemporal analysis of forest canopy space using terrestrial laser scanning data
  18. Collaborative open science as a way to reproducibility and new insights in primate cognition research
  19. Direct parameter specification of an attention shift: Evidence from perceptual latency priming
  20. Chapter 9: Particular Remedies for Non-performance: Section 1: Right to Performance
  21. Processing of CSR communication: insights from the ELM
  22. Modeling the distribution of white spruce (Picea glauca) for Alaska with high accuracy: an open access role-model for predicting tree species in last remaining wilderness areas
  23. Web-scale extension of RDF knowledge bases from templated websites
  24. Formative Perspectives on the Relation Between CSR Communication and CSR Practices
  25. Study of fuzzy controllers performance
  26. Learning from partially annotated sequences
  27. Active learning for network intrusion detection