Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
Chlo\'e Sekkat, Fanny Leroy, Salima Mdhaffar, Blake Perry Smith,, Yannick Est\`eve, Joseph Dureau, Alice Coucke

TL;DR
This paper introduces a new dataset and methodology for assessing demographic bias in voice assistants, revealing significant performance disparities across different demographic groups.
Contribution
It provides the first large, demographically annotated dataset and a novel bias assessment method using spoken language understanding metrics.
Findings
Significant performance differences across age, dialect, and ethnicity.
Multivariate analysis uncovers complex interactions between demographics.
Demonstrates the dataset and method's effectiveness in bias detection.
Abstract
Recent works demonstrate that voice assistants do not perform equally well for everyone, but research on demographic robustness of speech technologies is still scarce. This is mainly due to the rarity of large datasets with controlled demographic tags. This paper introduces the Sonos Voice Control Bias Assessment Dataset, an open dataset composed of voice assistant requests for North American English in the music domain (1,038 speakers, 166 hours, 170k audio samples, with 9,040 unique labelled transcripts) with a controlled demographic diversity (gender, age, dialectal region and ethnicity). We also release a statistical demographic bias assessment methodology, at the univariate and multivariate levels, tailored to this specific use case and leveraging spoken language understanding metrics rather than transcription accuracy, which we believe is a better proxy for user experience. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Speech Recognition and Synthesis · Speech and Audio Processing
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide)
