Dataset size versus homogeneity: A machine learning study on pooling intervention data in e-mental health dropout predictions

Research output: Journal contributions, Journal articles, Research, peer-review

Objective: This study proposes pooling interventions as a way to increase dataset sizes for machine learning tasks in Internet-based Cognitive Behavioral Therapy. To this end, it (1) examines similarities in user behavior and symptom data among online interventions for patients with depression, social anxiety, and panic disorder and (2) explores whether these similarities suffice to allow pooling the data, which yields more training data when predicting intervention dropout.

Methods: A total of 6418 routine care patients from the Internet Psychiatry in Stockholm are analyzed using (1) clustering and (2) dropout prediction models. For the latter, prediction models trained on each individual intervention's data are compared to those trained on all three interventions pooled into one dataset. To investigate whether results vary with dataset size, the prediction is repeated using small and medium dataset sizes.

Results: The clustering analysis identified three distinct groups that are almost equally spread across interventions and are instead characterized by different activity levels. In eight out of nine settings investigated, pooling the data improves prediction results compared to models trained on a single intervention dataset. It is further confirmed that models trained on small datasets are more likely to overestimate prediction results.

Conclusion: The study reveals similar patterns among patients with depression, social anxiety, and panic disorder regarding online activity and intervention dropout. As such, this work offers pooling different interventions’ data as a possible approach to counter the problem of small dataset sizes in psychological research.
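The comparison described in the Methods (a model trained on one intervention's data versus a model trained on all three interventions pooled) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the records are synthetic, the field names `intervention`, `activity`, and `dropout` are hypothetical, and the one-feature threshold rule is a toy stand-in, not the prediction models used in the study. It only shows the evaluation layout, in which both models are scored on the same held-out patients of the target intervention.

```python
import random
from collections import defaultdict


def synthetic_records(n_per_group=200, seed=1):
    """Generate made-up patients; low activity makes dropout more likely."""
    rng = random.Random(seed)
    records = []
    for name in ("depression", "social_anxiety", "panic"):
        for _ in range(n_per_group):
            activity = rng.random()
            dropout = rng.random() < (0.8 if activity < 0.4 else 0.2)
            records.append(
                {"intervention": name, "activity": activity, "dropout": dropout}
            )
    return records


def split_by_intervention(records, test_fraction=0.25, seed=0):
    """Shuffle each intervention's records and split them into train and test."""
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for rec in records:
        by_group[rec["intervention"]].append(rec)
    train, test = {}, {}
    for name, rows in by_group.items():
        rows = rows[:]
        rng.shuffle(rows)
        cut = int(len(rows) * (1 - test_fraction))
        train[name], test[name] = rows[:cut], rows[cut:]
    return train, test


def accuracy(threshold, rows):
    """Predict dropout when activity is below the threshold; return hit rate."""
    hits = sum((r["activity"] < threshold) == r["dropout"] for r in rows)
    return hits / len(rows)


def fit_threshold(rows):
    """Toy model: choose the activity threshold with the best training accuracy."""
    best_t, best_acc = 0.0, -1.0
    for t in sorted({r["activity"] for r in rows}):
        acc = accuracy(t, rows)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t


def compare(records, seed=0):
    """Score single-intervention and pooled models on the same test patients."""
    train, test = split_by_intervention(records, seed=seed)
    pooled_train = [r for rows in train.values() for r in rows]
    pooled_model = fit_threshold(pooled_train)
    results = {}
    for name in train:
        single_model = fit_threshold(train[name])
        results[name] = {
            "single": accuracy(single_model, test[name]),
            "pooled": accuracy(pooled_model, test[name]),
        }
    return results


results = compare(synthetic_records())
for name, scores in results.items():
    print(name, scores)
```

Because the synthetic groups are drawn from the same distribution, the sketch says nothing about whether pooling helps on real data; repeating `compare` with a smaller `n_per_group` merely mirrors the idea of rerunning the comparison at different dataset sizes.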

Original language: English
Journal: Digital Health
Volume: 10
Number of pages: 10
Publication status: E-pub ahead of print - 15.05.2024

Bibliographical note

Publisher Copyright: © The Author(s) 2024.
