RoWSFormer: A Robust Watermarking Framework with Swin Transformer for   Enhanced Geometric Attack Resilience

Weitong Chen; Yuheng Li

arXiv:2409.14829·cs.MM·September 24, 2024

RoWSFormer: A Robust Watermarking Framework with Swin Transformer for Enhanced Geometric Attack Resilience

Weitong Chen, Yuheng Li

PDF

Open Access

TL;DR

RoWSFormer introduces a Swin Transformer-based watermarking framework that significantly enhances robustness against geometric attacks while maintaining high imperceptibility and extraction accuracy.

Contribution

The paper proposes a novel Swin Transformer-based watermarking framework with specialized blocks to improve robustness against geometric distortions.

Findings

01

Outperforms existing methods in robustness against geometric attacks.

02

Achieves over 6 dB PSNR improvement under geometric distortions.

03

Maintains over 97% extraction accuracy under various attacks.

Abstract

In recent years, digital watermarking techniques based on deep learning have been widely studied. To achieve both imperceptibility and robustness of image watermarks, most current methods employ convolutional neural networks to build robust watermarking frameworks. However, despite the success of CNN-based watermarking models, they struggle to achieve robustness against geometric attacks due to the limitations of convolutional neural networks in capturing global and long-range relationships. To address this limitation, we propose a robust watermarking framework based on the Swin Transformer, named RoWSFormer. Specifically, we design the Locally-Channel Enhanced Swin Transformer Block as the core of both the encoder and decoder. This block utilizes the self-attention mechanism to capture global and long-range information, thereby significantly improving adaptation to geometric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Vehicle License Plate Recognition

MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Dropout · Stochastic Depth