Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models

Yi-Cheng Lin; Tzu-Quan Lin; Chih-Kai Yang; Ke-Han Lu; Wei-Chih Chen; Chun-Yi Kuan; Hung-yi Lee

arXiv:2407.06957·eess.AS·May 22, 2025

Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models

Yi-Cheng Lin, Tzu-Quan Lin, Chih-Kai Yang, Ke-Han Lu, Wei-Chih Chen, Chun-Yi Kuan, Hung-yi Lee

PDF

Open Access 1 Repo

TL;DR

This paper investigates semantic gender bias in Speech Integrated Large Language Models across multiple tasks, revealing language-dependent bias variations and emphasizing the need for diverse evaluation methods to ensure fairness.

Contribution

Introduces a curated bias evaluation toolkit and dataset for assessing gender bias in SILLMs across four semantic tasks, highlighting bias variability and evaluation challenges.

Findings

01

Bias levels vary with language and task

02

Multiple evaluation methods are necessary for comprehensive bias assessment

03

Bias can amplify existing societal stereotypes in speech models

Abstract

Speech Integrated Large Language Models (SILLMs) combine large language models with speech perception to perform diverse tasks, such as emotion recognition to speaker verification, demonstrating universal audio understanding capability. However, these models may amplify biases present in training data, potentially leading to biased access to information for marginalized groups. This work introduces a curated spoken bias evaluation toolkit and corresponding dataset. We evaluate gender bias in SILLMs across four semantic-related tasks: speech-to-text translation (STT), spoken coreference resolution (SCR), spoken sentence continuation (SSC), and spoken question answering (SQA). Our analysis reveals that bias levels are language-dependent and vary with different evaluation methods. Our findings emphasize the necessity of employing multiple approaches to comprehensively assess biases in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dlion168/Listen-and-Speak-Fairly
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection