Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models

Yanyun Wang; Yu Huang; Zi Liang; Xixin Wu; Li Liu

arXiv:2605.18168·cs.CR·May 19, 2026

Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models

Yanyun Wang, Yu Huang, Zi Liang, Xixin Wu, Li Liu

PDF

TL;DR

This paper introduces a novel acoustic interference attack that exploits intrinsic paralinguistic features in audio to bypass safety measures in large audio language models, revealing a fundamental vulnerability.

Contribution

It proposes a new paradigm shifting from content injection to safety interference using acoustic latent semantics, and develops a universal attack method effective across multiple models.

Findings

01

AIA achieves state-of-the-art attack success rates on 10 LALMs.

02

Interference audio can bypass safety without content modification.

03

Analysis uncovers the fundamental vulnerability in cross-modal alignment.

Abstract

The integration of audio modality into Large Audio Language Models (LALMs) significantly expands their attack surface. Existing jailbreak paradigms predominantly treat audio as a carrier for malicious payloads, relying on semantic optimization, acoustic parameter control, or additive perturbation to embed harmful content into the audio signal. In this work, we challenge this necessity and propose a new paradigm in which the role of audio shifts from content injection to safety alignment interference. We reveal that LALM safety alignment can be compromised solely by specific Acoustic Latent Semantics (ALS), the underlying paralinguistic features intrinsic to the priors of audio generative models. Distinct from previous works that leverage explicit acoustic parameters to merely style malicious audio, we demonstrate that interference audio, benign in content but infused with specific ALS,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.