Predicting recurrent chat contact in a psychological intervention for the youth using natural language processing

Research output: Journal contributions > Journal articles > Research > peer-review

Standard

Predicting recurrent chat contact in a psychological intervention for the youth using natural language processing. / Hornstein, Silvan; Scharfenberger, Jonas; Lueken, Ulrike et al.
In: npj Digital Medicine, Vol. 7, No. 1, 132, 12.2024.

BibTeX

@article{cefe076ae3194fdab8e903499f3dd24d,
title = "Predicting recurrent chat contact in a psychological intervention for the youth using natural language processing",
abstract = "Chat-based counseling hotlines emerged as a promising low-threshold intervention for youth mental health. However, despite the resulting availability of large text corpora, little work has investigated Natural Language Processing (NLP) applications within this setting. Therefore, this preregistered approach (OSF: XA4PN) utilizes a sample of approximately 19,000 children and young adults that received a chat consultation from a 24/7 crisis service in Germany. Around 800,000 messages were used to predict whether chatters would contact the service again, as this would allow the provision of or redirection to additional treatment. We trained an XGBoost Classifier on the words of the anonymized conversations, using repeated cross-validation and Bayesian optimization for hyperparameter search. The best model was able to achieve an AUROC score of 0.68 (p < 0.01) on the previously unseen 3942 newest consultations. A Shapley-based explainability approach revealed that words indicating younger age or female gender and terms related to self-harm and suicidal thoughts were associated with a higher chance of recontacting. We conclude that NLP-based predictions of recurrent contact are a promising path toward personalized care at chat hotlines.",
keywords = "Informatics",
author = "Silvan Hornstein and Jonas Scharfenberger and Ulrike Lueken and Richard Wundrack and Kevin Hilbert",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",
year = "2024",
month = dec,
doi = "10.1038/s41746-024-01121-9",
language = "English",
volume = "7",
journal = "npj Digital Medicine",
issn = "2398-6352",
publisher = "Nature Publishing Group",
number = "1",
pages = "132",

}

RIS

TY - JOUR

T1 - Predicting recurrent chat contact in a psychological intervention for the youth using natural language processing

AU - Hornstein, Silvan

AU - Scharfenberger, Jonas

AU - Lueken, Ulrike

AU - Wundrack, Richard

AU - Hilbert, Kevin

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024/12

Y1 - 2024/12

N2 - Chat-based counseling hotlines emerged as a promising low-threshold intervention for youth mental health. However, despite the resulting availability of large text corpora, little work has investigated Natural Language Processing (NLP) applications within this setting. Therefore, this preregistered approach (OSF: XA4PN) utilizes a sample of approximately 19,000 children and young adults that received a chat consultation from a 24/7 crisis service in Germany. Around 800,000 messages were used to predict whether chatters would contact the service again, as this would allow the provision of or redirection to additional treatment. We trained an XGBoost Classifier on the words of the anonymized conversations, using repeated cross-validation and Bayesian optimization for hyperparameter search. The best model was able to achieve an AUROC score of 0.68 (p < 0.01) on the previously unseen 3942 newest consultations. A Shapley-based explainability approach revealed that words indicating younger age or female gender and terms related to self-harm and suicidal thoughts were associated with a higher chance of recontacting. We conclude that NLP-based predictions of recurrent contact are a promising path toward personalized care at chat hotlines.

AB - Chat-based counseling hotlines emerged as a promising low-threshold intervention for youth mental health. However, despite the resulting availability of large text corpora, little work has investigated Natural Language Processing (NLP) applications within this setting. Therefore, this preregistered approach (OSF: XA4PN) utilizes a sample of approximately 19,000 children and young adults that received a chat consultation from a 24/7 crisis service in Germany. Around 800,000 messages were used to predict whether chatters would contact the service again, as this would allow the provision of or redirection to additional treatment. We trained an XGBoost Classifier on the words of the anonymized conversations, using repeated cross-validation and Bayesian optimization for hyperparameter search. The best model was able to achieve an AUROC score of 0.68 (p < 0.01) on the previously unseen 3942 newest consultations. A Shapley-based explainability approach revealed that words indicating younger age or female gender and terms related to self-harm and suicidal thoughts were associated with a higher chance of recontacting. We conclude that NLP-based predictions of recurrent contact are a promising path toward personalized care at chat hotlines.

KW - Informatics

UR - http://www.scopus.com/inward/record.url?scp=85193567276&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/827faade-aab3-33c2-af97-fd7b135056ba/

U2 - 10.1038/s41746-024-01121-9

DO - 10.1038/s41746-024-01121-9

M3 - Journal articles

C2 - 38762694

AN - SCOPUS:85193567276

VL - 7

JO - npj Digital Medicine

JF - npj Digital Medicine

SN - 2398-6352

IS - 1

M1 - 132

ER -
