Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models
Chung-Ting Tsai, Ching-Yun Ko, I-Hsin Chung, Yu-Chiang Frank Wang,, Pin-Yu Chen

TL;DR
This paper investigates how vision foundation models can be leveraged for training-free detection of AI-generated fake images, analyzing factors affecting detection performance and proposing methods to improve robustness across different image types.
Contribution
It provides a comprehensive analysis of detection performance across model backbones and perturbation types, introduces Contrastive Blur and MINDER to enhance detection accuracy, and offers insights into model robustness for deepfake detection.
Findings
Detection performance correlates with model robustness.
Self-supervised models yield more reliable representations.
Gaussian blur improves facial image detection.
Abstract
The rapid advancement of generative models has introduced serious risks, including deepfake techniques for facial synthesis and editing. Traditional approaches rely on training classifiers and enhancing generalizability through various feature extraction techniques. Meanwhile, training-free detection methods address issues like limited data and overfitting by directly leveraging statistical properties from vision foundation models to distinguish between real and fake images. The current leading training-free approach, RIGID, utilizes DINOv2 sensitivity to perturbations in image space for detecting fake images, with fake image embeddings exhibiting greater sensitivity than those of real images. This observation prompts us to investigate how detection performance varies across model backbones, perturbation types, and datasets. Our experiments reveal that detection performance is closely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · COVID-19 diagnosis using AI · Medical Imaging and Analysis
