Swin Transformer for Robust Differentiation of Real and Synthetic   Images: Intra- and Inter-Dataset Analysis

Preetu Mehta; Aman Sagar; Suchi Kumari

arXiv:2409.04734·cs.CV·September 10, 2024

Swin Transformer for Robust Differentiation of Real and Synthetic Images: Intra- and Inter-Dataset Analysis

Preetu Mehta, Aman Sagar, Suchi Kumari

PDF

Open Access

TL;DR

This paper introduces a Swin Transformer-based model that effectively distinguishes CGI from real images with high accuracy, demonstrating robustness across multiple datasets and generalization to different domains.

Contribution

The study presents a novel application of Swin Transformer architecture for CGI detection, achieving high accuracy and robustness in intra- and inter-dataset evaluations.

Findings

01

Achieved 97-99% accuracy across datasets

02

Demonstrated strong domain generalization

03

Validated robustness in intra- and inter-dataset tests

Abstract

\textbf{Purpose} This study aims to address the growing challenge of distinguishing computer-generated imagery (CGI) from authentic digital images in the RGB color space. Given the limitations of existing classification methods in handling the complexity and variability of CGI, this research proposes a Swin Transformer-based model for accurate differentiation between natural and synthetic images. \textbf{Methods} The proposed model leverages the Swin Transformer's hierarchical architecture to capture local and global features crucial for distinguishing CGI from natural images. The model's performance was evaluated through intra-dataset and inter-dataset testing across three distinct datasets: CiFAKE, JSSSTU, and Columbia. The datasets were tested individually (D1, D2, D3) and in combination (D1+D2+D3) to assess the model's robustness and domain generalization capabilities.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Dropout · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Adam