Compositional Adversarial Training for Robust Visual Watermarking
Anirudh Satheesh, Michael-Andrei Panaitescu-Liess, Andrew Xu, Georgios Milis, Heng Huang, Zikui Cai, Furong Huang

TL;DR
This paper introduces Compositional Adversarial Training (CAT), a new framework for improving the robustness of visual watermarking by training against adaptive, compositional adversaries, leading to significant performance gains.
Contribution
The paper proposes a novel differentiable adversarial training method that models attack composition, enhancing watermark robustness against complex, adaptive attacks.
Findings
CAT outperforms random-augmentation baselines on multiple benchmarks.
Watermark capacity improved by up to 63.5% in single-step attacks.
TPR@FPR=1% increased by 12% on difficult transformations.
Abstract
Robust watermarking is typically trained with random post-processing augmentation, but random sampling under-covers the combinatorial space of realistic attack pipelines and rarely encounters the rare compositions that actually break detection. This leads to unstable training and poor sample efficiency. We instead formulate watermark robustness as a min-max problem over a structured space of compositional transformations. We propose Compositional Adversarial Training (CAT), a plug-in framework that learns a sequential differentiable adversary that observes the current watermarked image and selects an attack family at each step to maximally disrupt message recovery. CAT combines a straight-through Gumbel-Softmax attack selection with entropy regularization, allowing the backward pass to be end-to-end differentiable and aggregate gradient information across attack families, yielding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
