Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation

Yuan Gan; Jiaxu Miao; Yunze Wang; Yi Yang

arXiv:2506.01591·cs.GR·June 3, 2025

Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation

Yuan Gan, Jiaxu Miao, Yunze Wang, Yi Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Silencer, a two-stage method that uses adversarial perturbations to prevent LDM-based talking-head models from being controlled by audio signals, enhancing privacy and security.

Contribution

The paper proposes a novel two-stage approach with nullifying and anti-purification losses to effectively protect portraits from audio-based manipulation in LDM-driven talking-head generation.

Findings

01

Silencer significantly reduces audio control in generated videos.

02

The method maintains high visual quality of protected portraits.

03

Experiments show robustness against advanced manipulation techniques.

Abstract

Advances in talking-head animation based on Latent Diffusion Models (LDM) enable the creation of highly realistic, synchronized videos. These fabricated videos are indistinguishable from real ones, increasing the risk of potential misuse for scams, political manipulation, and misinformation. Hence, addressing these ethical concerns has become a pressing issue in AI security. Recent proactive defense studies focused on countering LDM-based models by adding perturbations to portraits. However, these methods are ineffective at protecting reference portraits from advanced image-to-video animation. The limitations are twofold: 1) they fail to prevent images from being manipulated by audio signals, and 2) diffusion-based purification techniques can effectively eliminate protective perturbations. To address these challenges, we propose Silencer, a two-stage method designed to proactively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuangan/silencer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic Technology and Sound Studies · Music and Audio Processing

MethodsDiffusion