AequeVox: Automated Fairness Testing of Speech Recognition Systems

Sai Sathiesh Rajan (1); Sakshi Udeshi (1); and Sudipta Chattopadhyay; (1) ((1) Singapore University of Technology; Design)

arXiv:2110.09843·cs.LG·January 14, 2022

AequeVox: Automated Fairness Testing of Speech Recognition Systems

Sai Sathiesh Rajan (1), Sakshi Udeshi (1), and Sudipta Chattopadhyay, (1) ((1) Singapore University of Technology, Design)

PDF

Open Access 1 Repo

TL;DR

AequeVox is an automated framework for testing and identifying fairness issues in speech recognition systems across different populations and environments, without requiring ground truth data.

Contribution

It introduces a novel automated fairness testing framework for ASR systems, including environment simulation, human comprehensibility assessment, and fault localization techniques.

Findings

01

Non-native English, female, and Nigerian English speakers produce significantly more errors.

02

Most simulations are rated highly comprehensible by users.

03

Fault localization predicts words with substantially more errors than robust words.

Abstract

Automatic Speech Recognition (ASR) systems have become ubiquitous. They can be found in a variety of form factors and are increasingly important in our daily lives. As such, ensuring that these systems are equitable to different subgroups of the population is crucial. In this paper, we introduce, AequeVox, an automated testing framework for evaluating the fairness of ASR systems. AequeVox simulates different environments to assess the effectiveness of ASR systems for different populations. In addition, we investigate whether the chosen simulations are comprehensible to humans. We further propose a fault localization technique capable of identifying words that are not robust to these varying environments. Both components of AequeVox are able to operate in the absence of ground truth data. We evaluated AequeVox on speech from four different datasets using three different commercial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sparkssss/aequevox
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Speech Recognition and Synthesis