Detect Fake with Fake: Leveraging Synthetic Data-driven Representation   for Synthetic Image Detection

Hina Otake; Yoshihiro Fukuhara; Yoshiki Kubotani; Shigeo Morishima

arXiv:2409.08884·cs.CV·September 16, 2024

Detect Fake with Fake: Leveraging Synthetic Data-driven Representation for Synthetic Image Detection

Hina Otake, Yoshihiro Fukuhara, Yoshiki Kubotani, Shigeo Morishima

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that vision transformers trained solely on synthetic data can effectively detect fake images, outperforming models trained on real data, with significant improvements on unseen GAN models.

Contribution

It shows synthetic data-driven representations, especially with vision transformers, are effective for fake image detection without using real images during training.

Findings

01

Vision transformers trained on synthetic data can distinguish fake images from real ones.

02

Using SynCLR backbone improves detection performance over CLIP.

03

The method achieves +10.32 mAP and +4.73% accuracy on unseen GAN models.

Abstract

Are general-purpose visual representations acquired solely from synthetic data useful for detecting fake images? In this work, we show the effectiveness of synthetic data-driven representations for synthetic image detection. Upon analysis, we find that vision transformers trained by the latest visual representation learners with synthetic data can effectively distinguish fake from real images without seeing any real images during pre-training. Notably, using SynCLR as the backbone in a state-of-the-art detection method demonstrates a performance improvement of +10.32 mAP and +4.73% accuracy over the widely used CLIP, when tested on previously unseen GAN models. Code is available at https://github.com/cvpaperchallenge/detect-fake-with-fake.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cvpaperchallenge/detect-fake-with-fake
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Advanced Steganography and Watermarking Techniques

MethodsContrastive Language-Image Pre-training