# ML-based detection of depressive profile through voice analysis in WhatsApp™ audio messages of Brazilian Portuguese Speakers

**Authors:** Victor H. O. Otani, Felipe O. Aguiar, Thiago P. Justino, Hudson S. Buck, Luiza B. Grilo, Matheus F. Figueiredo, Pedro M. Uchida, Daniel A. C. Vasques, Thaís Z. S. Otani, João Ricardo N. Vissoci, Lucas M. Marques, Ricardo R. Uchida, Ariel Teles, Ariel Teles, Ariel Teles, Ariel Teles

PMC · DOI: 10.1371/journal.pmen.0000357 · PLOS Mental Health · 2026-01-21

## TL;DR

This study uses machine learning to detect depression in WhatsApp™ audio messages from Brazilian Portuguese speakers, achieving high accuracy in identifying depressive profiles.

## Contribution

The novel contribution is applying machine learning to WhatsApp™ audio messages for depression detection in Brazilian Portuguese speakers.

## Key findings

- Machine learning models achieved 91.67% accuracy for women and 80% for men in detecting depression from WhatsApp™ audio messages.
- The accuracy of ML classification depends on the type of audio instruction provided in the recordings.

## Abstract

Depression is a prevalent mental health condition that significantly impacts individuals’ daily lives, work productivity, relationships, and overall well-being. The lack of reliable biomarkers complicates screening, contributing to underdiagnosis. Depression’s impact on voice and acoustic parameters enables differentiation between adaptive and non-adaptive mood profiles, offering potential classifiers for screening. This study evaluates the capability of seven distinct machine learning models to identify depression in speech samples. WhatsApp™ audio messages (WA), clinical, and sociodemographic data were collected from 160 individuals divided into two groups: one for algorithm development and the other for testing. Each group included patients with Major Depressive Disorder and healthy controls. In the test group, participants were interviewed using the Mini-International Neuropsychiatric Interview (MINI), and their WhatsApp™ audio recordings included both structured and semi-structured formats. After pre-processing the audio, 68 acoustic features were used to train the machine learning models. Results shows that: i) The algorithms evaluated WhatsApp™ audio recordings from the test group, achieving peak accuracies of 91.67% for women and 80% for men, with an AUC of 91.9% for women and 78.33% for men. ii) The accuracy of Machine Learning (ML) classification varies depending on the type of audio instruction provided. ML can classify, with reasonable accuracy, whether a WhatsApp™ audio message represents a depressive patient or a healthy individual. Future studies should further explore the relationship between voice characteristics, different mood profiles, and emotional states.

## Linked entities

- **Diseases:** Major Depressive Disorder (MONDO:0002009)

## Full-text entities

- **Diseases:** mental (MESH:D008607), Depression (MESH:D003866), Major Depressive Disorder (MESH:D003865)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12822941/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12822941/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/PMC12822941/full.md

---
Source: https://tomesphere.com/paper/PMC12822941