Invisible Watermarks: Attacks and Robustness
Dongjun Hwang, Sungwon Woo, Tom Gao, Raymond Luo, Sunghwan Baek

TL;DR
This paper studies invisible watermarks on generated images. It introduces a watermark remover network for images carrying both image-space and latent-space watermarks, and localized blurring attacks that degrade image quality less than full-image blurring.
Contribution
It proposes a novel watermark remover network that preserves one watermark modality while removing the other and introduces localized blurring attacks based on GradCAM heatmaps.
Findings
The watermark remover slightly improves decoding performance.
Localized blurring attacks cause less image degradation than uniform blurring.
The methods enhance the robustness of invisible watermarks against attacks.
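The comparison between localized and uniform blurring can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a grayscale image in [0, 1], a synthetic Gaussian heatmap standing in for a GradCAM heatmap, a simple mean filter as the blur, and PSNR as the quality measure. The names box_blur, localized_blur, psnr, and the threshold parameter are hypothetical.

```python
import numpy as np

def box_blur(img, k=5):
    """Blur a 2-D image with a k x k mean filter (k odd, edge-padded)."""
    pad = k // 2
    padded = np.pad(img, pad, mode="edge")
    out = np.zeros_like(img, dtype=np.float64)
    h, w = img.shape
    # Sum the k*k shifted copies of the image, then average.
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + h, dx:dx + w]
    return out / (k * k)

def localized_blur(img, heatmap, threshold=0.5, k=5):
    """Blur only the pixels whose heatmap value is at or above threshold."""
    mask = (heatmap >= threshold).astype(np.float64)
    return mask * box_blur(img, k) + (1.0 - mask) * img

def psnr(ref, test, peak=1.0):
    """Peak signal-to-noise ratio in dB; higher means less degradation."""
    mse = np.mean((ref - test) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(peak ** 2 / mse)

# Synthetic stand-ins (assumptions): a random image and a Gaussian bump
# in place of a real GradCAM heatmap.
rng = np.random.default_rng(0)
img = rng.random((64, 64))
yy, xx = np.mgrid[0:64, 0:64]
heatmap = np.exp(-((yy - 32) ** 2 + (xx - 32) ** 2) / (2 * 10.0 ** 2))

uniform = box_blur(img)               # full-image attack
local = localized_blur(img, heatmap)  # attack restricted to the hot region

print("PSNR uniform:  ", psnr(img, uniform))
print("PSNR localized:", psnr(img, local))
```

Because the localized variant leaves pixels outside the mask untouched, its per-pixel error never exceeds that of the uniform blur, so its PSNR against the original is at least as high. Whether the watermark is still removed depends on how well the heatmap covers the regions the decoder relies on, which this sketch does not model.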
Abstract
As generative AI continues to become more accessible, the case for robust detection of generated images in order to combat misinformation is stronger than ever. Invisible watermarking methods act as identifiers of generated content, embedding image- and latent-space messages that are robust to many forms of perturbation. The majority of current research investigates full-image attacks against images with a single watermarking method applied. We introduce novel improvements to watermarking robustness while minimizing the degradation of image quality during attacks. First, we examine the application of both image-space and latent-space watermarking methods to a single image, and propose a custom watermark remover network that preserves one of the watermarking modalities while completely removing the other during decoding. Then, we investigate localized blurring attacks (LBA) on…
Taxonomy
Topics: Digital and Cyber Forensics · Advanced Malware Detection Techniques
Methods: Heatmap
