A Framework for Applying Natural Language Processing in Digital Health Interventions

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

  • Burkhardt Funk
  • Shiri Sadeh-Sharvit
  • Ellen E. Fitzsimmons-Craft
  • Mickey Todd Trockel
  • Grace E Monterubio
  • Neha J Goel
  • Katherine N Balantekin
  • Dawn M Eichen
  • Rachael E Flatt
  • Marie-Laure Firebaugh
  • Corinna Jacobi
  • Andrea K. Graham
  • Mark Hoogendoorn
  • Denise E Wilfley
  • C Barr Taylor
Background: Digital health interventions (DHIs) are poised to reduce target symptoms in a scalable, affordable, and empirically supported way. DHIs that involve coaching or clinical support often collect text data from 2 sources: (1) open correspondence between users and the trained practitioners supporting them through a messaging system and (2) text data recorded during the intervention by users, such as diary entries. Natural language processing (NLP) offers methods for analyzing text, augmenting the understanding of intervention effects, and informing therapeutic decision making.

Objective: This study aimed to present a technical framework that supports the automated analysis of both types of text data often present in DHIs. This framework generates text features and helps to build statistical models to predict target variables, including user engagement, symptom change, and therapeutic outcomes.
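As an illustration of what "generating text features" can mean in practice, the following minimal sketch turns one text snippet into a small feature dictionary. It is not the framework described in the article: the function names, the chosen features (token count, type-token ratio, term counts), the vocabulary, and the diary entry are all invented for this example.

```python
import re
from collections import Counter

# Hypothetical tokenizer: lowercase words, apostrophes allowed.
TOKEN_RE = re.compile(r"[a-z']+")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return TOKEN_RE.findall(text.lower())


def text_features(text, vocabulary):
    """Build a feature dict for one snippet.

    Features (all illustrative assumptions, not the article's feature set):
    - n_tokens: number of word tokens
    - type_token_ratio: distinct tokens divided by total tokens
    - count_<term>: how often each vocabulary term occurs
    """
    tokens = tokenize(text)
    counts = Counter(tokens)
    features = {
        "n_tokens": len(tokens),
        "type_token_ratio": len(counts) / len(tokens) if tokens else 0.0,
    }
    for term in vocabulary:
        features["count_" + term] = counts[term]
    return features


# Invented diary entry and vocabulary, for demonstration only.
entry = "I felt stressed today and ate more than I planned. Felt guilty after."
vocab = ["stressed", "guilty", "felt"]
feats = text_features(entry, vocab)
print(feats)
```

Feature dictionaries of this kind, one per snippet or message, could then serve as inputs to a statistical model that predicts a target variable such as engagement or symptom change.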

Methods: We first discussed various NLP techniques and demonstrated how they are implemented in the presented framework. We then applied the framework in a case study of the Healthy Body Image Program, a Web-based intervention trial for eating disorders (EDs). A total of 372 participants who screened positive for an ED received a DHI aimed at reducing ED psychopathology (including binge eating and purging behaviors) and improving body image. These users generated 37,228 intervention text snippets and exchanged 4285 user-coach messages, which were analyzed using the proposed model.

Results: We applied the framework to predict binge eating behavior, resulting in an area under the curve between 0.57 (when applied to new users) and 0.72 (when applied to new symptom reports of known users). In addition, initial evidence indicated that specific text features predicted the therapeutic outcome of reducing ED symptoms.
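The two evaluation settings reported above differ in what is held out: whole users, or later reports from users already seen in training. The sketch below shows one way such splits and an area-under-the-curve score could be computed. It assumes the reported metric is the area under the ROC curve, and the record layout (`user`, `report`) and the function names are hypothetical; the abstract does not describe the authors' actual procedure.

```python
def auc(labels, scores):
    """Area under the ROC curve, computed as the probability that a randomly
    chosen positive case scores higher than a randomly chosen negative case
    (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def split_by_user(records, test_users):
    """New-user setting: every report from a held-out user goes to the test set."""
    train = [r for r in records if r["user"] not in test_users]
    test = [r for r in records if r["user"] in test_users]
    return train, test


def split_by_report(records, n_test_per_user):
    """Known-user setting: each user's last n reports are held out, so every
    test user also appears in the training set."""
    by_user = {}
    for r in records:
        by_user.setdefault(r["user"], []).append(r)
    train, test = [], []
    for reports in by_user.values():
        cut = max(len(reports) - n_test_per_user, 0)
        train.extend(reports[:cut])
        test.extend(reports[cut:])
    return train, test


# Invented toy data: three users with three symptom reports each.
records = [{"user": u, "report": i} for u in "abc" for i in range(3)]
new_user_train, new_user_test = split_by_user(records, {"c"})
known_train, known_test = split_by_report(records, 1)

# Toy labels and model scores to exercise the metric.
example_auc = auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
print(example_auc)
```

Holding out whole users is the stricter test, because the model cannot rely on anything it learned about an individual's writing, which is consistent with the lower score reported for new users.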

Conclusions: The case study demonstrates the usefulness of a structured approach to text data analytics. NLP techniques improve the prediction of symptom changes in DHIs. We present a technical framework that can be easily applied in other clinical trials and clinical presentations and encourage other groups to apply the framework in similar contexts.
Original language: English
Article number: e13855
Journal: Journal of Medical Internet Research
Volume: 22
Issue number: 2
Number of pages: 13
ISSN: 1439-4456
Publication status: Published - 19.02.2020

Bibliographical note

Publisher Copyright:
© Burkhardt Funk, Shiri Sadeh-Sharvit, Ellen E Fitzsimmons-Craft, Mickey Todd Trockel, Grace E Monterubio, Neha J Goel, Katherine N Balantekin, Dawn M Eichen, Rachael E Flatt, Marie-Laure Firebaugh, Corinna Jacobi, Andrea K Graham, Mark Hoogendoorn, Denise E Wilfley, C Barr Taylor. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 19.02.2020. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.

    Research areas

  • Business informatics - Digital Health Interventions Text Analytics (DHITA), Text mining, Digital health interventions, Eating disorders, Guided self-help, Natural language processing
