Recurrence-Based Nonlinear Vocal Dynamics as Digital Biomarkers for Depression Detection from Conversational Speech
Himadri S Samanta

TL;DR
This study introduces recurrence-based biomarkers derived from nonlinear dynamical analysis of vocal trajectories, which improve depression detection accuracy over traditional static acoustic features.
Contribution
It demonstrates that recurrence structure analysis of vocal dynamics can serve as effective digital biomarkers for depression detection, surpassing conventional methods.
Findings
Recurrence-based biomarkers achieved a mean AUC of 0.689 in depression classification.
They significantly outperformed static acoustic, entropy, Hurst, determinism, and Lyapunov-like features.
Permutation testing confirmed the statistical significance of the results with p=0.004.
Abstract
Digital biomarkers for depression have largely relied on static acoustic descriptors, pooled summary statistics, or conventional machine learning representations. Such approaches may miss nonlinear temporal organization embedded in conversational vocal dynamics. We hypothesized that depression is associated with altered recurrence structure in vocal state trajectories, reflecting changes in how the vocal system revisits acoustic states over time. Using the depression subset of the DAIC-WOZ corpus with 142 labeled participants, we modeled frame-level COVAREP trajectories as nonlinear dynamical systems and derived recurrence-based biomarkers from 74 vocal channels. Logistic regression with feature selection and stratified cross-validation evaluated classification performance. Recurrence-based biomarkers achieved a mean cross-validated AUC of 0.689, exceeding static acoustic baselines,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
