Who Gets the Mic? Investigating Gender Bias in the Speaker Assignment of a Speech-LLM

Dariia Puhach; Amir H. Payberah; \'Eva Sz\'ekely

arXiv:2508.13603·cs.CL·August 20, 2025

Who Gets the Mic? Investigating Gender Bias in the Speaker Assignment of a Speech-LLM

Dariia Puhach, Amir H. Payberah, \'Eva Sz\'ekely

PDF

TL;DR

This paper investigates gender bias in Speech-LLMs by analyzing speaker assignment patterns in Bark, revealing gender awareness and inclinations but no systematic bias, using specially constructed datasets.

Contribution

It introduces a novel methodology using speaker assignment as an explicit bias analysis tool for Speech-LLMs, expanding bias detection beyond text-based models.

Findings

01

Bark shows gender awareness and some inclinations.

02

No systematic gender bias found in speaker assignment.

03

Constructed datasets effectively reveal gender-related patterns.

Abstract

Similar to text-based Large Language Models (LLMs), Speech-LLMs exhibit emergent abilities and context awareness. However, whether these similarities extend to gender bias remains an open question. This study proposes a methodology leveraging speaker assignment as an analytic tool for bias investigation. Unlike text-based models, which encode gendered associations implicitly, Speech-LLMs must produce a gendered voice, making speaker selection an explicit bias cue. We evaluate Bark, a Text-to-Speech (TTS) model, analyzing its default speaker assignments for textual prompts. If Bark's speaker selection systematically aligns with gendered associations, it may reveal patterns in its training data or model design. To test this, we construct two datasets: (i) Professions, containing gender-stereotyped occupations, and (ii) Gender-Colored Words, featuring gendered connotations. While Bark does…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.