Impact of Speech Mode in Automatic Pathological Speech Detection
Shakeel A. Sheikh, Ina Kodrasi

TL;DR
This paper investigates how speech mode affects automatic detection of pathological speech, revealing that deep learning methods outperform classical ones in spontaneous speech scenarios by capturing subtle cues.
Contribution
It provides a comparative analysis of classical and deep learning approaches for pathological speech detection across different speech modes, highlighting the advantages of deep learning in spontaneous speech.
Findings
Deep learning approaches outperform classical methods in spontaneous speech.
Classical approaches struggle with subtle cues in spontaneous speech.
Deep learning extracts additional pathological cues in spontaneous speech.
Abstract
Automatic pathological speech detection approaches yield promising results in identifying various pathologies. These approaches are typically designed and evaluated for phonetically-controlled speech scenarios, where speakers are prompted to articulate identical phonetic content. While gathering controlled speech recordings can be laborious, spontaneous speech can be conveniently acquired as potential patients navigate their daily routines. Further, spontaneous speech can be valuable in detecting subtle and abstract cues of pathological speech. Nonetheless, the efficacy of automatic pathological speech detection for spontaneous speech remains unexplored. This paper analyzes the influence of speech mode on pathological speech detection approaches, examining two distinct categories of approaches, i.e., classical machine learning and deep learning. Results indicate that classical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
