Developing a Process for the Analysis of User Journeys and the Prediction of Dropout in Digital Health Interventions: Machine Learning Approach

Publikation: Beiträge in ZeitschriftenKonferenzaufsätze in FachzeitschriftenForschungbegutachtet

Authors

Background: User dropout is a widespread concern in the delivery and evaluation of digital (ie, web and mobile apps) health interventions. Researchers have yet to fully realize the potential of the large amount of data generated by these technology-based programs. Of particular interest is the ability to predict who will drop out of an intervention. This may be possible through the analysis of user journey data—self-reported as well as system-generated data—produced by the path (or journey) an individual takes to navigate through a digital health intervention.

Objective: The purpose of this study is to provide a step-by-step process for the analysis of user journey data and eventually to predict dropout in the context of digital health interventions. The process is applied to data from an internet-based intervention for insomnia as a way to illustrate its use. The completion of the program is contingent upon completing 7 sequential cores, which include an initial tutorial core. Dropout is defined as not completing the seventh core.

Methods: Steps of user journey analysis, including data transformation, feature engineering, and statistical model analysis and evaluation, are presented. Dropouts were predicted based on data from 151 participants from a fully automated web-based program (Sleep Healthy Using the Internet) that delivers cognitive behavioral therapy for insomnia. Logistic regression with L1 and L2 regularization, support vector machines, and boosted decision trees were used and evaluated based on their predictive performance. Relevant features from the data are reported that predict user dropout.

Results: Accuracy of predicting dropout (area under the curve [AUC] values) varied depending on the program core and the machine learning technique. After model evaluation, boosted decision trees achieved AUC values ranging between 0.6 and 0.9. Additional handcrafted features, including time to complete certain steps of the intervention, time to get out of bed, and days since the last interaction with the system, contributed to the prediction performance.

Conclusions: The results support the feasibility and potential of analyzing user journey data to predict dropout. Theory-driven handcrafted features increased the prediction performance. The ability to predict dropout at an individual level could be used to enhance decision making for researchers and clinicians as well as inform dynamic intervention regimens.
OriginalspracheEnglisch
Aufsatznummere17738
ZeitschriftJournal of Medical Internet Research
Jahrgang22
Ausgabenummer10
Anzahl der Seiten20
ISSN1439-4456
DOIs
PublikationsstatusErschienen - 28.10.2020

Bibliographische Notiz

Funding Information:
This study was supported by grant R01 MH86758 from the National Institute of Mental Health. The funding source had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. The authors thank Christina Frederick, BS, for her help with the study administration tasks. The authors specially thank Gabe D Heath, BA, and Steve P Johnson, BA, developers of the SHUTi intervention, for extracting and making all the data readily available for analysis.

Publisher Copyright:
©Vincent Bremer, Philip I Chow, Burkhardt Funk, Frances P Thorndike, Lee M Ritterband.

Dokumente

DOI

Zuletzt angesehen

Forschende

  1. Neele Puhlmann

Publikationen

  1. Learning shortest paths in word graphs
  2. A Two-Stage Sliding-Mode High-Gain Observer to Reduce Uncertainties and Disturbances Effects for Sensorless Control in Automotive Applications
  3. archiDART: an R package for the automated computation of plant root architectural traits
  4. Second-Order Sliding Mode Control with State and Disturbance Estimation for a Permanent Magnet Linear Motor
  5. Deciding between the Covariance Analytical Approach and the Change-Score Approach in Two Wave Panel Data
  6. Encoding the law of State responsibility with courage and resolve
  7. Investigating the Promotional Effect of Green Signals in Sponsored Search Advertising Using Bayesian Parameter Estimation
  8. The effect of structural complexity on large mammal occurrence in revegetation
  9. Telecoupling as a framework to support a more nuanced understanding of causality in land system science
  10. Single-Word Recognition Need Not Depend on Single-Word Features
  11. Playing in the Spaces: Anarchism in the Classroom
  12. Evaluating A Teaching-Learning Sequence (TLS) About Acid-Base Reactions In Upper Secondary School
  13. Anatomy of Chua's System - Nonlinear Dynamic Electronics for Chaos in the Lab
  14. Ontology-based automatic classification for Web pages
  15. Development of Early Spatial Perspective-Taking - Toward a Three-Level Model
  16. A highly transparent method of assessing the contribution of incentives to meet various technical challenges in distributed energy systems
  17. Understanding the error-structure of Time-driven Activity-based Costing
  18. Phase Shift APOD and POD Control Technique in Multi-Level Inverters to Mitigate Total Harmonic Distortion

Presse / Medien

  1. Computersimulation