Impact of Speech Mode in Automatic Pathological Speech Detection

Shakeel A. Sheikh; Ina Kodrasi

arXiv:2406.09968·cs.LG·June 17, 2024·1 cites

Impact of Speech Mode in Automatic Pathological Speech Detection

Shakeel A. Sheikh, Ina Kodrasi

PDF

Open Access

TL;DR

This paper investigates how speech mode affects automatic detection of pathological speech, revealing that deep learning methods outperform classical ones in spontaneous speech scenarios by capturing subtle cues.

Contribution

It provides a comparative analysis of classical and deep learning approaches for pathological speech detection across different speech modes, highlighting the advantages of deep learning in spontaneous speech.

Findings

01

Deep learning approaches outperform classical methods in spontaneous speech.

02

Classical approaches struggle with subtle cues in spontaneous speech.

03

Deep learning extracts additional pathological cues in spontaneous speech.

Abstract

Automatic pathological speech detection approaches yield promising results in identifying various pathologies. These approaches are typically designed and evaluated for phonetically-controlled speech scenarios, where speakers are prompted to articulate identical phonetic content. While gathering controlled speech recordings can be laborious, spontaneous speech can be conveniently acquired as potential patients navigate their daily routines. Further, spontaneous speech can be valuable in detecting subtle and abstract cues of pathological speech. Nonetheless, the efficacy of automatic pathological speech detection for spontaneous speech remains unexplored. This paper analyzes the influence of speech mode on pathological speech detection approaches, examining two distinct categories of approaches, i.e., classical machine learning and deep learning. Results indicate that classical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis