Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection
Anirudh Sundara Rajan, Yong Jae Lee

TL;DR
This paper introduces Stay Positive, an algorithm that enhances fake image detection by focusing solely on generative artifacts, thereby reducing false positives caused by spurious patterns and improving robustness.
Contribution
The paper proposes a novel method to train detectors that ignore real image artifacts, improving generalization and robustness in AI-generated image detection.
Findings
Detectors trained with Stay Positive are less affected by spurious correlations.
The method improves detection of inpainted real images.
Enhanced robustness to post-processing effects.
Abstract
Detecting AI generated images is a challenging yet essential task. A primary difficulty arises from the detectors tendency to rely on spurious patterns, such as compression artifacts, which can influence its decisions. These issues often stem from specific patterns that the detector associates with the real data distribution, making it difficult to isolate the actual generative traces. We argue that an image should be classified as fake if and only if it contains artifacts introduced by the generative model. Based on this premise, we propose Stay Positive, an algorithm designed to constrain the detectors focus to generative artifacts while disregarding those associated with real data. Experimental results demonstrate that detectors trained with Stay Positive exhibit reduced susceptibility to spurious correlations, leading to improved generalization and robustness to post processing.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Advanced Steganography and Watermarking Techniques
MethodsFocus
