Comparing Supervised Models And Learned Speech Representations For Classifying Intelligibility Of Disordered Speech On Selected Phrases

Subhashini Venugopalan; Joel Shor; Manoj Plakal; Jimmy Tobin; Katrin Tomanek; Jordan R. Green; Michael P. Brenner

arXiv:2107.03985·eess.AS·July 9, 2021

TL;DR

This study compares deep learning methods for classifying disordered speech intelligibility, finding that embeddings from an automatic speech recognition system outperform other approaches in accuracy.

Contribution

It introduces a comparative analysis of CNN-based classifiers, unsupervised speech representations, and ASR encoder embeddings for disordered speech intelligibility classification.

Findings

01

Classifiers trained on ASR encoder embeddings outperform the other two approaches

02

Longer phrases provide better intelligibility indicators

03

ASR encoder embeddings cluster speech by spoken phrase, while non-semantic embeddings cluster speech by speaker

Abstract

Automatic classification of disordered speech can provide an objective tool for identifying the presence and severity of speech impairment. Classification approaches can also help identify hard-to-recognize speech samples to teach ASR systems about the variable manifestations of impaired speech. Here, we develop and compare different deep learning techniques to classify the intelligibility of disordered speech on selected phrases. We collected samples from a diverse set of 661 speakers with a variety of self-reported disorders speaking 29 words or phrases, which were rated by speech-language pathologists for their overall intelligibility using a five-point Likert scale. We then evaluated classifiers developed using 3 approaches: (1) a convolutional neural network (CNN) trained for the task, (2) classifiers trained on non-semantic speech representations from CNNs that used an…
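The embedding-based approaches described above share one pattern: a pretrained model produces a fixed vector per utterance, and a small classifier is trained on those vectors to predict the intelligibility rating. The sketch below illustrates that pattern only. It is not the authors' model: the embeddings are synthetic random vectors, the classifier is a plain softmax regression, and all function names, dimensions, and hyperparameters are assumptions chosen for illustration. The five classes stand in for the five-point Likert ratings, indexed 0 to 4.

```python
import numpy as np


def make_synthetic_embeddings(num_per_class=40, dim=16, num_classes=5, seed=0):
    """Generate stand-in 'utterance embeddings' (hypothetical data, not real speech)."""
    rng = np.random.default_rng(seed)
    centers = rng.normal(scale=4.0, size=(num_classes, dim))
    labels = np.repeat(np.arange(num_classes), num_per_class)
    embeddings = centers[labels] + rng.normal(size=(labels.size, dim))
    return embeddings, labels


def standardize(embeddings):
    """Scale each embedding dimension to zero mean and unit variance."""
    return (embeddings - embeddings.mean(axis=0)) / embeddings.std(axis=0)


def train_softmax_classifier(embeddings, labels, num_classes=5, lr=0.1, steps=500):
    """Fit multinomial logistic regression on frozen embeddings by gradient descent."""
    n, d = embeddings.shape
    weights = np.zeros((d, num_classes))
    bias = np.zeros(num_classes)
    one_hot = np.eye(num_classes)[labels]
    for _ in range(steps):
        logits = embeddings @ weights + bias
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        grad = (probs - one_hot) / n  # gradient of mean cross-entropy w.r.t. logits
        weights -= lr * (embeddings.T @ grad)
        bias -= lr * grad.sum(axis=0)
    return weights, bias


def predict(embeddings, weights, bias):
    """Return the most likely rating index (0 to 4) for each embedding."""
    return np.argmax(embeddings @ weights + bias, axis=1)


raw_embeddings, labels = make_synthetic_embeddings()
embeddings = standardize(raw_embeddings)
weights, bias = train_softmax_classifier(embeddings, labels)
accuracy = float(np.mean(predict(embeddings, weights, bias) == labels))
print(f"training accuracy on synthetic data: {accuracy:.2f}")
```

In a real setting, `make_synthetic_embeddings` would be replaced by embeddings extracted from a pretrained speech model, and accuracy would be measured on held-out speakers rather than on the training set.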
