Developing a Process for the Analysis of User Journeys and the Prediction of Dropout in Digital Health Interventions: Machine Learning Approach

Research output: Journal contributionsConference article in journalResearchpeer-review

Authors

Background: User dropout is a widespread concern in the delivery and evaluation of digital (ie, web and mobile apps) health interventions. Researchers have yet to fully realize the potential of the large amount of data generated by these technology-based programs. Of particular interest is the ability to predict who will drop out of an intervention. This may be possible through the analysis of user journey data—self-reported as well as system-generated data—produced by the path (or journey) an individual takes to navigate through a digital health intervention.

Objective: The purpose of this study is to provide a step-by-step process for the analysis of user journey data and eventually to predict dropout in the context of digital health interventions. The process is applied to data from an internet-based intervention for insomnia as a way to illustrate its use. The completion of the program is contingent upon completing 7 sequential cores, which include an initial tutorial core. Dropout is defined as not completing the seventh core.

Methods: Steps of user journey analysis, including data transformation, feature engineering, and statistical model analysis and evaluation, are presented. Dropouts were predicted based on data from 151 participants from a fully automated web-based program (Sleep Healthy Using the Internet) that delivers cognitive behavioral therapy for insomnia. Logistic regression with L1 and L2 regularization, support vector machines, and boosted decision trees were used and evaluated based on their predictive performance. Relevant features from the data are reported that predict user dropout.

Results: Accuracy of predicting dropout (area under the curve [AUC] values) varied depending on the program core and the machine learning technique. After model evaluation, boosted decision trees achieved AUC values ranging between 0.6 and 0.9. Additional handcrafted features, including time to complete certain steps of the intervention, time to get out of bed, and days since the last interaction with the system, contributed to the prediction performance.

Conclusions: The results support the feasibility and potential of analyzing user journey data to predict dropout. Theory-driven handcrafted features increased the prediction performance. The ability to predict dropout at an individual level could be used to enhance decision making for researchers and clinicians as well as inform dynamic intervention regimens.
Original languageEnglish
Article numbere17738
JournalJournal of Medical Internet Research
Volume22
Issue number10
Number of pages20
ISSN1439-4456
DOIs
Publication statusPublished - 28.10.2020

Bibliographical note

Publisher Copyright:
©Vincent Bremer, Philip I Chow, Burkhardt Funk, Frances P Thorndike, Lee M Ritterband.

Documents

DOI

Recently viewed

Publications

  1. Hill–Chao numbers allow decomposing gamma multifunctionality into alpha and beta components
  2. How alloying and processing effects can influence the microstructure and mechanical properties of directly extruded thin zinc wires
  3. Teaching Sustainable Development in a Sensory and Artful Way — Concepts, Methods, and Examples
  4. Mapping Complexity in Environmental Governance
  5. Managing (in) times of uncertainty
  6. Dynamic capabilities and routinization
  7. Gerbil – Benchmarking named entity recognition and linking consistently
  8. A simple control strategy for increasing the soft bending actuator performance by using a pressure boost
  9. Constructing strangeness
  10. Duration of Organizational Decision Processes in Organizations in View of Simulation Calculations
  11. The impact of linguistic complexity on the solution of mathematical modelling tasks
  12. Metaheuristics approach for solving personalized crew rostering problem in public bus transit
  13. Simulation and optimization of material and energy flow systems
  14. Using Daily Stretching to Counteract Performance Decreases as a Result of Reduced Physical Activity—A Controlled Trial
  15. Unraveling Privacy Concerns in Complex Data Ecosystems with Architectural Thinking
  16. Using measures of reading time regularity (RTR) to quantify eye movement dynamics, and how they are shaped by linguistic information
  17. Life satisfaction in Germany after reunification: Additional insights on the pattern of convergence
  18. Public perceptions of CCS in context
  19. Scaling-based Least Squares Methods with Implemented Kalman filter Approach for Nano-Parameters Identification
  20. Design for Product Care—Development of Design Strategies and a Toolkit for Sustainable Consumer Behaviour
  21. Digging into the roots
  22. Teachers’ temporary support and worked-out examples as elements of scaffolding in mathematical modeling
  23. Outperformed by a Computer? - Comparing Human Decisions to Reinforcement Learning Agents, Assigning Lot Sizes in a Learning Factory
  24. Comparison of three methods of length compensation in a parallel kinematic and their equivalence conditions
  25. Towards Advanced Learning in Dispatching Rule-Based Scheuling