How to Backdoor Consistency Models?
Chengen Wang, Murat Kantarcioglu

TL;DR
This paper investigates the vulnerability of consistency models, a new class of image generation models, to backdoor attacks, demonstrating that they can be compromised using inconspicuous triggers without degrading overall image quality.
Contribution
First study to reveal backdoor vulnerabilities in consistency models, introducing a novel, stealthy trigger-based attack that maintains high image quality and attack specificity.
Findings
Consistency models are vulnerable to backdoor attacks.
Inconspicuous Gaussian noise triggers can activate backdoor targets.
The attack maintains high image quality and stealthiness.
Abstract
Consistency models are a new class of models that generate images by directly mapping noise to data, allowing for one-step generation and significantly accelerating the sampling process. However, their robustness against adversarial attacks has not yet been thoroughly investigated. In this work, we conduct the first study on the vulnerability of consistency models to backdoor attacks. While previous research has explored backdoor attacks on diffusion models, those studies have primarily focused on conventional diffusion models, employing a customized backdoor training process and objective, whereas consistency models have distinct training processes and objectives. Our proposed framework demonstrates the vulnerability of consistency models to backdoor attacks. During image generation, poisoned consistency models produce images with a Fr\'echet Inception Distance (FID) comparable to that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Approaches in Technology and Social Development · Complex Systems and Decision Making
MethodsConsistency Models · Diffusion
