Breaking the Stealth-Potency Trade-off in Clean-Image Backdoors with Generative Trigger Optimization

Binyan Xu; Fan Yang; Di Tang; Xilin Dai; Kehuan Zhang

arXiv:2511.07210·cs.CV·November 12, 2025

Breaking the Stealth-Potency Trade-off in Clean-Image Backdoors with Generative Trigger Optimization

Binyan Xu, Fan Yang, Di Tang, Xilin Dai, Kehuan Zhang

PDF

Open Access 1 Video

TL;DR

This paper introduces GCB, a novel framework using generative models to create stealthy triggers for clean-image backdoors, significantly reducing accuracy loss and demonstrating versatility across multiple datasets and tasks.

Contribution

The paper proposes a new generative trigger optimization method that minimizes accuracy degradation and enhances stealthiness in clean-image backdoor attacks.

Findings

01

Achieves less than 1% drop in clean accuracy

02

Successfully applies to six datasets, five architectures, and four tasks

03

Demonstrates robustness against existing defenses

Abstract

Clean-image backdoor attacks, which use only label manipulation in training datasets to compromise deep neural networks, pose a significant threat to security-critical applications. A critical flaw in existing methods is that the poison rate required for a successful attack induces a proportional, and thus noticeable, drop in Clean Accuracy (CA), undermining their stealthiness. This paper presents a new paradigm for clean-image attacks that minimizes this accuracy degradation by optimizing the trigger itself. We introduce Generative Clean-Image Backdoors (GCB), a framework that uses a conditional InfoGAN to identify naturally occurring image features that can serve as potent and stealthy triggers. By ensuring these triggers are easily separable from benign task-related features, GCB enables a victim model to learn the backdoor from an extremely small set of poisoned examples, resulting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Breaking the Stealth-Potency Trade-off in Clean-Image Backdoors with Generative Trigger Optimization· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Advanced Malware Detection Techniques