# Deep learning-based classification of speech disorder in stroke and hearing impairment

**Authors:** Joo Kyung Park, Sae Byeol Mun, Young Jae Kim, Kwang Gi Kim, Diego A. Forero, Diego A. Forero, Diego A. Forero

PMC · DOI: 10.1371/journal.pone.0315286 · PLOS One · 2025-05-28

## TL;DR

This paper explores using deep learning to classify the causes of speech disorders, such as stroke and hearing impairment, based on voice data.

## Contribution

The study introduces a deep learning approach to classify specific causes of speech disorders from abnormal voice data.

## Key findings

- ResNet-18, Inception V3, and SEResNeXt-18 models achieved AUC values of 0.839, 0.913, and 0.906, respectively.
- AI can efficiently classify the origins of speech disorders through voice data analysis.

## Abstract

Speech disorders can arise from various causes, including congenital conditions, neurological damage, diseases, and other disorders. Traditionally, medical professionals have used changes in voice to diagnose the underlying causes of these disorders. With the advancement of artificial intelligence (AI), new possibilities have emerged in this field. However, most existing studies primarily focus on comparing voice data between normal individuals and those with speech disorders. Research that classifies the causes of these disorders within the abnormal voice data, attributing them to specific etiologies, remains limited. Therefore, our objective was to classify the specific causes of speech disorders from voice data resulting from various conditions, such as stroke and hearing impairments (HI).

We experimentally developed a deep learning model to analyze Korean speech disorder voice data caused by stroke and HI. Our goal was to classify the disorders caused by these specific conditions. To achieve effective classification, we employed the ResNet-18, Inception V3, and SEResNeXt-18 models for feature extraction and training processes.

The models demonstrated promising results, with area under the curve (AUC) values of 0.839 for ResNet-18, 0.913 for Inception V3, and 0.906 for SEResNeXt-18, respectively.

These outcomes suggest the feasibility of using AI to efficiently classify the origins of speech disorders through the analysis of voice data.

## Linked entities

- **Diseases:** stroke (MONDO:0005098)

## Full-text entities

- **Diseases:** neurological damage (MESH:D020196), Speech disorders (MESH:D013064), stroke (MESH:D020521), HI (MESH:D034381)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12118888/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12118888/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12118888/full.md

---
Source: https://tomesphere.com/paper/PMC12118888