AequeVox: Automated Fairness Testing of Speech Recognition Systems
Sai Sathiesh Rajan (1), Sakshi Udeshi (1), and Sudipta Chattopadhyay, (1) ((1) Singapore University of Technology, Design)

TL;DR
AequeVox is an automated framework for testing and identifying fairness issues in speech recognition systems across different populations and environments, without requiring ground truth data.
Contribution
It introduces a novel automated fairness testing framework for ASR systems, including environment simulation, human comprehensibility assessment, and fault localization techniques.
Findings
Non-native English, female, and Nigerian English speakers produce significantly more errors.
Most simulations are rated highly comprehensible by users.
Fault localization predicts words with substantially more errors than robust words.
Abstract
Automatic Speech Recognition (ASR) systems have become ubiquitous. They can be found in a variety of form factors and are increasingly important in our daily lives. As such, ensuring that these systems are equitable to different subgroups of the population is crucial. In this paper, we introduce, AequeVox, an automated testing framework for evaluating the fairness of ASR systems. AequeVox simulates different environments to assess the effectiveness of ASR systems for different populations. In addition, we investigate whether the chosen simulations are comprehensible to humans. We further propose a fault localization technique capable of identifying words that are not robust to these varying environments. Both components of AequeVox are able to operate in the absence of ground truth data. We evaluated AequeVox on speech from four different datasets using three different commercial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification · Speech Recognition and Synthesis
