# A proof-of-concept machine learning model for short-term suicide risk stratification in depressed youth

**Authors:** Bin Sun, Jie Zhang, Yarong Ma, Hongbo He

PMC · DOI: 10.1038/s41398-026-03944-4 · Translational Psychiatry · 2026-03-19

## TL;DR

This study shows that machine learning can help identify young depressed patients at higher short-term risk of suicide, but more research is needed for reliable clinical use.

## Contribution

A proof-of-concept ML model for short-term suicide risk prediction in depressed youth using clinical and psychosocial data.

## Key findings

- Support Vector Machine and Elastic Net models achieved AUCs of 0.831 and 0.811, respectively, in predicting 30-day suicide attempts.
- An ensemble model identified a high-risk group with a 20% suicide attempt rate compared to 3.6% in others (RR = 5.53).
- Model performance remained stable when using only 15 selected predictors, suggesting robustness against overfitting.

## Abstract

Machine learning (ML) offers promise for suicide risk stratification in depressed youth, yet its clinical application remains methodologically challenging. Using prospective data from 602 Chinese patients aged 15–24 years collected between January 2022 and June 2023, we developed ML models to predict suicide attempts within 30 days after treatment. From 102 clinical and psychosocial predictors, only 30 suicide attempts (5.0%) were observed, resulting in a limited predictor-to-event ratio. Seven algorithms were trained on 70% of the sample (n = 421; 21 events) using 10‑fold cross‑validation and tested on the remaining 30% (n = 181; 9 events), with model selection emphasizing regularization and parsimony to reduce overfitting risk. Among the algorithms, the Support Vector Machine (AUC = 0.831) and Elastic Net (AUC = 0.811) achieved the best test performance, while more complex models such as random forests and deep learning exhibited poor generalization. A combined SVM + EN ensemble reached an AUC of 0.84 in cross‑validation and identified a high‑risk decile with a 20% suicide attempt rate compared to 3.6% among remaining patients (RR = 5.53), although confidence intervals were wide due to the small number of events. These findings demonstrate the technical feasibility of ML‑based short‑term risk stratification but also underscore important methodological constraints. When retrained using only 15 LASSO-selected predictors, the model’s discrimination remained comparable (AUC = 0.82), supporting robustness against over-fitting. Low event counts limited model stability, cohort homogeneity and single‑country recruitment restricted generalizability, and the lack of temporal validation precluded assessment of model drift. Consequently, the models presented here should be viewed as proof‑of‑concept rather than evidence of clinical readiness, providing an empirical basis for future validation in larger and more diverse longitudinal cohorts.

## Linked entities

- **Diseases:** depression (MONDO:0002050)

## Full-text entities

- **Diseases:** ML (MESH:D007859), depressed (MESH:D003866), deaths (MESH:D003643), Obsessive Compulsive (MESH:D009771), Suicidal behavior (MESH:D001523), impulsivity (MESH:D007174), physical, cognitive, or intellectual disabilities (MESH:D008607), bipolar disorder (MESH:D001714), self -harm (MESH:D012652), Insomnia (MESH:D007319), suicidal ideation (MESH:D001072), Anxiety (MESH:D001007), substance misuse (MESH:D009293)
- **Chemicals:** alcohol (MESH:D000438)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13039823/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13039823/full.md

---
Source: https://tomesphere.com/paper/PMC13039823