Training-free Detection of AI-generated images via Cropping Robustness
Sungik Choi, Hankook Lee, Moontae Lee

TL;DR
This paper introduces WaRPAD, a training-free method for detecting AI-generated images by leveraging self-supervised models' sensitivity to cropping and resizing, demonstrating robustness and broad applicability across models and datasets.
Contribution
We propose WaRPAD, a novel training-free detection algorithm that uses self-supervised models' responses to cropping and resizing to identify AI-generated images.
Findings
WaRPAD achieves competitive detection performance across diverse datasets.
The method is robust to test-time image corruptions.
WaRPAD is applicable across various self-supervised models.
Abstract
AI-generated image detection has become crucial with the rapid advancement of vision-generative models. Instead of training detectors tailored to specific datasets, we study a training-free approach leveraging self-supervised models without requiring prior data knowledge. These models, pre-trained with augmentations like RandomResizedCrop, learn to produce consistent representations across varying resolutions. Motivated by this, we propose WaRPAD, a training-free AI-generated image detection algorithm based on self-supervised models. Since neighborhood pixel differences in images are highly sensitive to resizing operations, WaRPAD first defines a base score function that quantifies the sensitivity of image embeddings to perturbations along high-frequency directions extracted via Haar wavelet decomposition. To simulate robustness against cropping augmentation, we rescale each image to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
