Greater accessibility can amplify discrimination in generative AI
Carolin Holtermann, Minh Duc Bui, Kaitlyn Zhou, Valentin Hofmann, Katharina von der Wense, Anne Lauscher

TL;DR
This paper reveals that voice-enabled large language models can amplify gender biases based on speaker voice, raising concerns about fairness and accessibility, and proposes pitch manipulation as a mitigation strategy.
Contribution
It uncovers gender discrimination in audio-based LLMs linked to paralinguistic cues and demonstrates a mitigation method, highlighting the need to address fairness alongside accessibility.
Findings
Audio LLMs show gender bias based on speaker voice.
Pitch manipulation can reduce gender-discriminatory responses.
Users are more concerned about attribute inference when biases are disclosed.
Abstract
Hundreds of millions of people rely on large language models (LLMs) for education, work, and even healthcare. Yet these models are known to reproduce and amplify social biases present in their training data. Moreover, text-based interfaces remain a barrier for many, for example, users with limited literacy, motor impairments, or mobile-only devices. Voice interaction promises to expand accessibility, but unlike text, speech carries identity cues that users cannot easily mask, raising concerns about whether accessibility gains may come at the cost of equitable treatment. Here we show that audio-enabled LLMs exhibit systematic gender discrimination, shifting responses toward gender-stereotyped adjectives and occupations solely on the basis of speaker voice, and amplifying bias beyond that observed in text-based interaction. Thus, voice interfaces do not merely extend text models to a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Artificial Intelligence in Healthcare and Education · Text Readability and Simplification
