Nonverbal Sound Detection for Disordered Speech

Colin Lea; Zifang Huang; Dhruv Jain; Lauren Tooley; Zeinab Liaghat,; Shrinath Thelapurath; Leah Findlater; Jeffrey P. Bigham

arXiv:2202.07750·eess.AS·February 17, 2022·1 cites

Nonverbal Sound Detection for Disordered Speech

Colin Lea, Zifang Huang, Dhruv Jain, Lauren Tooley, Zeinab Liaghat,, Shrinath Thelapurath, Leah Findlater, Jeffrey P. Bigham

PDF

Open Access

TL;DR

This paper presents a nonverbal sound detection system using mouth sounds to improve voice assistant accessibility for individuals with speech disorders, demonstrating high accuracy and effective personalization.

Contribution

It introduces a novel sound event detection approach using nonverbal mouth sounds, with dataset design, model considerations, and personalization strategies for disordered speech.

Findings

01

Achieves 88.6% precision and 88.4% recall on internal dataset

02

0.31 false positives per hour on speech aggressors

03

84.5% success rate in personalized model performance

Abstract

Voice assistants have become an essential tool for people with various disabilities because they enable complex phone- or tablet-based interactions without the need for fine-grained motor control, such as with touchscreens. However, these systems are not tuned for the unique characteristics of individuals with speech disorders, including many of those who have a motor-speech disorder, are deaf or hard of hearing, have a severe stutter, or are minimally verbal. We introduce an alternative voice-based input system which relies on sound event detection using fifteen nonverbal mouth sounds like "pop," "click," or "eh." This system was designed to work regardless of ones' speech abilities and allows full access to existing technology. In this paper, we describe the design of a dataset, model considerations for real-world deployment, and efforts towards model personalization. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVoice and Speech Disorders · Speech Recognition and Synthesis · Speech and Audio Processing