Enhancing Image Authenticity Detection: Swin Transformers and Color   Frame Analysis for CGI vs. Real Images

Preeti Mehta; Aman Sagar; Suchi Kumari

arXiv:2409.04742·cs.CV·September 10, 2024

Enhancing Image Authenticity Detection: Swin Transformers and Color Frame Analysis for CGI vs. Real Images

Preeti Mehta, Aman Sagar, Suchi Kumari

PDF

Open Access

TL;DR

This paper introduces a novel method combining Swin Transformers and color frame analysis to improve the accuracy, speed, and robustness of distinguishing CGI images from real photographs, addressing challenges posed by highly realistic CGI.

Contribution

It presents a new approach that leverages Swin Transformers and color frame preprocessing to outperform existing methods in image authenticity detection without handcrafted features.

Findings

01

Achieves state-of-the-art accuracy in CGI vs. real image classification.

02

Demonstrates improved processing speed and robustness against image manipulations.

03

Effectively detects highly realistic CGI images in various manipulated conditions.

Abstract

The rapid advancements in computer graphics have greatly enhanced the quality of computer-generated images (CGI), making them increasingly indistinguishable from authentic images captured by digital cameras (ADI). This indistinguishability poses significant challenges, especially in an era of widespread misinformation and digitally fabricated content. This research proposes a novel approach to classify CGI and ADI using Swin Transformers and preprocessing techniques involving RGB and CbCrY color frame analysis. By harnessing the capabilities of Swin Transformers, our method foregoes handcrafted features instead of relying on raw pixel data for model training. This approach achieves state-of-the-art accuracy while offering substantial improvements in processing speed and robustness against joint image manipulations such as noise addition, blurring, and JPEG compression. Our findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCurrency Recognition and Detection

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings