Spy-Watermark: Robust Invisible Watermarking for Backdoor Attack

Ruofei Wang; Renjie Wan; Zongyu Guo; Qing Guo; Rui Huang

arXiv:2401.02031·cs.CV·January 5, 2024·1 cites

Spy-Watermark: Robust Invisible Watermarking for Backdoor Attack

Ruofei Wang, Renjie Wan, Zongyu Guo, Qing Guo, Rui Huang

PDF

Open Access 1 Repo

TL;DR

Spy-Watermark introduces a robust, invisible watermark-based backdoor attack that remains effective under data corruption and defenses, outperforming existing methods in robustness and stealthiness across multiple datasets.

Contribution

The paper presents a novel latent domain watermarking technique for backdoor attacks that is highly resistant to data collapse and defenses, improving robustness and stealthiness.

Findings

01

Outperforms ten state-of-the-art methods in robustness.

02

Effective across CIFAR10, GTSRB, and ImageNet datasets.

03

Demonstrates high resilience against data corruption and defenses.

Abstract

Backdoor attack aims to deceive a victim model when facing backdoor instances while maintaining its performance on benign data. Current methods use manual patterns or special perturbations as triggers, while they often overlook the robustness against data corruption, making backdoor attacks easy to defend in practice. To address this issue, we propose a novel backdoor attack method named Spy-Watermark, which remains effective when facing data collapse and backdoor defense. Therein, we introduce a learnable watermark embedded in the latent domain of images, serving as the trigger. Then, we search for a watermark that can withstand collapse during image decoding, cooperating with several anti-collapse operations to further enhance the resilience of our trigger against data corruption. Extensive experiments are conducted on CIFAR10, GTSRB, and ImageNet datasets, demonstrating that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rfww/spy-watermark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection