Leveraging Arbitrary Data Sources for AI-Generated Image Detection Without Sacrificing Generalization
Qinghui He, Haifeng Zhang, Xiuli Bi, Bo Liu, Chi-Man Pun, Bin Xiao

TL;DR
The paper introduces a novel detection framework that leverages arbitrary data sources and intrinsic image features to improve generalization in identifying AI-generated images across unseen models.
Contribution
It proposes a single-class attribution model that enhances class separation and generalization without relying on generator-specific artifacts.
Findings
Outperforms existing detectors by up to 7.21% in accuracy.
Achieves better cross-model generalization, reducing reliance on specific generative artifacts.
Effectively utilizes any single-class training set to build a stable decision boundary.
Abstract
The accelerating advancement of generative models has introduced new challenges for detecting AI-generated images, especially in real-world scenarios where novel generation techniques emerge rapidly. Existing learning paradigms are likely to make classifiers data-dependent, resulting in narrow decision margins and, consequently, limited generalization ability to unseen generative models. We observe that both real and generated images intend to form clustered low-dimensional manifolds within high-level feature spaces extracted by pre-trained visual encoders. Building on this observation, we propose a single-class attribution modeling framework that first amplifies the intrinsic differences between real and generated images by constructing a compact attribution space from any single-class training set, either composed of real images or generated ones, and then establishes a more stable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
