Jailbreak-AudioBench: In-Depth Evaluation and Analysis of Jailbreak Threats for Large Audio Language Models

Hao Cheng; Erjia Xiao; Jing Shao; Yichi Wang; Le Yang; Chao Shen; Philip Torr; Jindong Gu; Renjing Xu

arXiv:2501.13772 · cs.SD · January 13, 2026

Open Access

TL;DR

This paper introduces Jailbreak-AudioBench, a comprehensive evaluation framework for assessing vulnerabilities of Large Audio-Language Models to jailbreak attacks, including a dataset, toolbox, and benchmark for safety analysis.

Contribution

It provides the first in-depth evaluation and benchmark for audio-specific jailbreak threats in Large Audio-Language Models, including tools and datasets for future research.

Findings

01. Evaluated multiple state-of-the-art LALMs for jailbreak vulnerabilities.

02. Established the most comprehensive audio jailbreak benchmark to date.

03. Facilitated future research on safety and defense mechanisms for LALMs.

Abstract

Large Language Models (LLMs) demonstrate impressive zero-shot performance across a wide range of natural language processing tasks. Integrating various modality encoders further expands their capabilities, giving rise to Multimodal Large Language Models (MLLMs) that process not only text but also visual and auditory inputs. However, these advanced capabilities may also pose significant safety problems, as models can be exploited to generate harmful or inappropriate content through jailbreak attacks. While prior work has extensively explored how manipulating textual or visual inputs can circumvent safeguards in LLMs and MLLMs, the vulnerability of Large Audio-Language Models (LALMs) to audio-specific jailbreaks remains largely underexplored. To address this gap, we introduce Jailbreak-AudioBench, which consists of the Toolbox, curated Dataset, and comprehensive…

Taxonomy

Topics: Computational and Text Analysis Methods · Speech Recognition and Synthesis · Music and Audio Processing