Enhancing Image Authenticity Detection: Swin Transformers and Color Frame Analysis for CGI vs. Real Images
Preeti Mehta, Aman Sagar, Suchi Kumari

TL;DR
This paper introduces a novel method combining Swin Transformers and color frame analysis to improve the accuracy, speed, and robustness of distinguishing CGI images from real photographs, addressing challenges posed by highly realistic CGI.
Contribution
It presents a new approach that leverages Swin Transformers and color frame preprocessing to outperform existing methods in image authenticity detection without handcrafted features.
Findings
Achieves state-of-the-art accuracy in CGI vs. real image classification.
Demonstrates improved processing speed and robustness against image manipulations.
Effectively detects highly realistic CGI images in various manipulated conditions.
Abstract
The rapid advancements in computer graphics have greatly enhanced the quality of computer-generated images (CGI), making them increasingly indistinguishable from authentic images captured by digital cameras (ADI). This indistinguishability poses significant challenges, especially in an era of widespread misinformation and digitally fabricated content. This research proposes a novel approach to classify CGI and ADI using Swin Transformers and preprocessing techniques involving RGB and CbCrY color frame analysis. By harnessing the capabilities of Swin Transformers, our method foregoes handcrafted features instead of relying on raw pixel data for model training. This approach achieves state-of-the-art accuracy while offering substantial improvements in processing speed and robustness against joint image manipulations such as noise addition, blurring, and JPEG compression. Our findings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCurrency Recognition and Detection
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
