Generating Image Adversarial Examples by Embedding Digital Watermarks
Yuexin Xiang, Tiantian Li, Wei Ren, Tianqing Zhu, Kim-Kwang Raymond, Choo

TL;DR
This paper introduces a novel digital watermark-based approach to generate adversarial examples that effectively fool deep neural networks, achieving high success rates and efficiency on standard datasets.
Contribution
It presents a new watermark embedding method for creating adversarial images, combining improved DWT and DCT watermarking algorithms, with demonstrated high attack success rates.
Findings
Attack success rate up to 98.71% on CIFAR-10
Average attack time of 1.17 seconds per image
Effective even with Gaussian noise watermarks
Abstract
With the increasing attention to deep neural network (DNN) models, attacks are also upcoming for such models. For example, an attacker may carefully construct images in specific ways (also referred to as adversarial examples) aiming to mislead the DNN models to output incorrect classification results. Similarly, many efforts are proposed to detect and mitigate adversarial examples, usually for certain dedicated attacks. In this paper, we propose a novel digital watermark-based method to generate image adversarial examples to fool DNN models. Specifically, partial main features of the watermark image are embedded into the host image almost invisibly, aiming to tamper with and damage the recognition capabilities of the DNN models. We devise an efficient mechanism to select host images and watermark images and utilize the improved discrete wavelet transform (DWT) based Patchwork…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
MethodsDiscrete Cosine Transform
