Audio Frequency-Time Dual Domain Evaluation on Depression Diagnosis

Yu Luo; Nan Huang; Sophie Yu; Hendry Xu; Jerry Wang; Colin Wang; Zhichao Liu; Chen Zeng

arXiv:2510.22225·cs.CV·October 28, 2025

TL;DR

This paper introduces a novel deep learning-based method utilizing frequency-time dual domain analysis of voice signals for improved depression diagnosis, addressing current challenges in mental health assessment.

Contribution

The paper proposes a multimodal voice analysis approach that combines frequency-domain and time-domain features with deep learning for depression detection.

Findings

1. High classification accuracy in depression diagnosis

2. Effective utilization of voice's dual domain features

3. Provides new tools for depression screening and diagnosis

Abstract

Depression, as a typical mental disorder, has become a prevalent issue significantly impacting public health. However, the prevention and treatment of depression still face multiple challenges, including complex diagnostic procedures, ambiguous criteria, and low consultation rates, which severely hinder timely assessment and intervention. To address these issues, this study adopts voice as a physiological signal and leverages its frequency-time dual domain multimodal characteristics along with deep learning models to develop an intelligent assessment and diagnostic algorithm for depression. Experimental results demonstrate that the proposed method achieves excellent performance in the classification task for depression diagnosis, offering new insights and approaches for the assessment, screening, and diagnosis of depression.
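
The abstract does not specify which time-domain or frequency-domain features are used, or what model consumes them. As a rough illustration of what "frequency-time dual domain" views of a voice signal can look like, the following minimal sketch derives two parallel feature sets from the same framed waveform. Everything in it is an assumption made for illustration and not the authors' pipeline: the function names `frame_signal` and `dual_domain_features`, the 25 ms frame and 10 ms hop at 16 kHz, the choice of energy and zero-crossing rate for the time branch, and the log-magnitude spectrum for the frequency branch. A synthetic sine wave stands in for a speech recording.

```python
import numpy as np

def frame_signal(x, frame_len=400, hop=160):
    """Split a 1-D signal into overlapping frames (hypothetical helper)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]  # shape: (n_frames, frame_len)

def dual_domain_features(x, frame_len=400, hop=160):
    """Return (time_feats, freq_feats) computed from the same frames.

    Illustrative feature choices only; the paper's actual features are
    not described in the abstract.
    """
    frames = frame_signal(x, frame_len, hop)

    # Time-domain branch: per-frame energy and zero-crossing rate.
    energy = np.mean(frames ** 2, axis=1)
    zcr = np.mean(np.abs(np.diff(np.sign(frames), axis=1)) > 0, axis=1)
    time_feats = np.stack([energy, zcr], axis=1)

    # Frequency-domain branch: log-magnitude spectrum of Hann-windowed frames.
    window = np.hanning(frame_len)
    spectrum = np.abs(np.fft.rfft(frames * window, axis=1))
    freq_feats = np.log1p(spectrum)

    return time_feats, freq_feats

# Synthetic stand-in for one second of voice audio at 16 kHz (assumed rate).
sr = 16000
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 200 * t)

time_feats, freq_feats = dual_domain_features(x)
print(time_feats.shape, freq_feats.shape)
```

In a setup like this, the two arrays could be fed to separate branches of a classifier and fused later, which is one common way to combine complementary views of a signal; whether the paper does this is not stated in the abstract.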
