Domain Generalized Recaptured Screen Image Identification Using SWIN Transformer
Preeti Mehta, Aman Sagar, Suchi Kumari

TL;DR
This paper introduces DAST-DG, a novel SWIN transformer-based framework with data augmentation for domain-generalized recaptured screen image identification, effectively handling scale variations and domain shifts.
Contribution
It proposes a cascaded data augmentation and SWIN transformer approach that improves domain generalization in recaptured image identification, outperforming existing methods.
Findings
Achieves approximately 82% accuracy on high-variance datasets
Attains 95% precision in recaptured image detection
Outperforms state-of-the-art methods across multiple databases
Abstract
An increasing number of classification approaches have been developed to address the issue of image rebroadcast and recapturing, a standard attack strategy in insurance frauds, face spoofing, and video piracy. However, most of them neglected scale variations and domain generalization scenarios, performing poorly in instances involving domain shifts, typically made worse by inter-domain and cross-domain scale variances. To overcome these issues, we propose a cascaded data augmentation and SWIN transformer domain generalization framework (DAST-DG) in the current research work Initially, we examine the disparity in dataset representation. A feature generator is trained to make authentic images from various domains indistinguishable. This process is then applied to recaptured images, creating a dual adversarial learning setup. Extensive experiments demonstrate that our approach is practical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Image Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Stochastic Depth · Multi-Head Attention · Residual Connection · Swin Transformer
