TL;DR
This paper introduces END$^2$, a dual-decoder watermarking framework that effectively handles non-differentiable distortions by using two decoders and a cosine similarity alignment, outperforming existing methods.
Contribution
The paper proposes a novel dual-decoder architecture with feature alignment to improve robustness against non-differentiable distortions in watermarking.
Findings
Outperforms state-of-the-art algorithms under various distortions.
Effective even without differentiability constraints.
Easy to implement across END architectures.
Abstract
DNN-based watermarking methods have rapidly advanced, with the ``Encoder-Noise Layer-Decoder'' (END) framework being the most widely used. To ensure end-to-end training, the noise layer in the framework must be differentiable. However, real-world distortions are often non-differentiable, leading to challenges in end-to-end training. Existing solutions only treat the distortion perturbation as additive noise, which does not fully integrate the effect of distortion in training. To better incorporate non-differentiable distortions into training, we propose a novel dual-decoder architecture (END). Unlike conventional END architecture, our method employs two structurally identical decoders: the Teacher Decoder, processing pure watermarked images, and the Student Decoder, handling distortion-perturbed images. The gradient is backpropagated only through the Teacher Decoder branch to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
