Gender Bias in Instruction-Guided Speech Synthesis Models

Chun-Yi Kuan; Hung-yi Lee

arXiv:2502.05649·cs.CL·February 11, 2025

Gender Bias in Instruction-Guided Speech Synthesis Models

Chun-Yi Kuan, Hung-yi Lee

PDF

Open Access 1 Video

TL;DR

This paper investigates gender bias in instruction-guided speech synthesis models, revealing that models tend to reinforce stereotypes in interpreting occupation-related prompts, with bias levels varying by model size.

Contribution

The study provides empirical evidence of gender bias in TTS models' interpretation of occupation prompts and compares bias across different model sizes.

Findings

01

Models exhibit gender bias in occupation prompts

02

Bias varies with model size

03

Bias tends to reinforce stereotypes

Abstract

Recent advancements in controllable expressive speech synthesis, especially in text-to-speech (TTS) models, have allowed for the generation of speech with specific styles guided by textual descriptions, known as style prompts. While this development enhances the flexibility and naturalness of synthesized speech, there remains a significant gap in understanding how these models handle vague or abstract style prompts. This study investigates the potential gender bias in how models interpret occupation-related prompts, specifically examining their responses to instructions like "Act like a nurse". We explore whether these models exhibit tendencies to amplify gender stereotypes when interpreting such prompts. Our experimental results reveal the model's tendency to exhibit gender bias for certain occupations. Moreover, models of different sizes show varying degrees of this bias across these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Gender Bias in Instruction-Guided Speech Synthesis Models· underline

Taxonomy

TopicsSpeech and dialogue systems · Natural Language Processing Techniques