Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models
Yanyun Wang, Yu Huang, Zi Liang, Xixin Wu, Li Liu

TL;DR
This paper introduces a novel acoustic interference attack that exploits intrinsic paralinguistic features in audio to bypass safety measures in large audio language models, revealing a fundamental vulnerability.
Contribution
It proposes a new paradigm shifting from content injection to safety interference using acoustic latent semantics, and develops a universal attack method effective across multiple models.
Findings
AIA achieves state-of-the-art attack success rates on 10 LALMs.
Interference audio can bypass safety without content modification.
Analysis uncovers the fundamental vulnerability in cross-modal alignment.
Abstract
The integration of audio modality into Large Audio Language Models (LALMs) significantly expands their attack surface. Existing jailbreak paradigms predominantly treat audio as a carrier for malicious payloads, relying on semantic optimization, acoustic parameter control, or additive perturbation to embed harmful content into the audio signal. In this work, we challenge this necessity and propose a new paradigm in which the role of audio shifts from content injection to safety alignment interference. We reveal that LALM safety alignment can be compromised solely by specific Acoustic Latent Semantics (ALS), the underlying paralinguistic features intrinsic to the priors of audio generative models. Distinct from previous works that leverage explicit acoustic parameters to merely style malicious audio, we demonstrate that interference audio, benign in content but infused with specific ALS,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
