Leveraging Arbitrary Data Sources for AI-Generated Image Detection Without Sacrificing Generalization

Qinghui He; Haifeng Zhang; Xiuli Bi; Bo Liu; Chi-Man Pun; Bin Xiao

arXiv:2603.00717·cs.CV·March 30, 2026

Leveraging Arbitrary Data Sources for AI-Generated Image Detection Without Sacrificing Generalization

Qinghui He, Haifeng Zhang, Xiuli Bi, Bo Liu, Chi-Man Pun, Bin Xiao

PDF

TL;DR

The paper introduces a novel detection framework that leverages arbitrary data sources and intrinsic image features to improve generalization in identifying AI-generated images across unseen models.

Contribution

It proposes a single-class attribution model that enhances class separation and generalization without relying on generator-specific artifacts.

Findings

01

Outperforms existing detectors by up to 7.21% in accuracy.

02

Achieves better cross-model generalization, reducing reliance on specific generative artifacts.

03

Effectively utilizes any single-class training set to build a stable decision boundary.

Abstract

The accelerating advancement of generative models has introduced new challenges for detecting AI-generated images, especially in real-world scenarios where novel generation techniques emerge rapidly. Existing learning paradigms are likely to make classifiers data-dependent, resulting in narrow decision margins and, consequently, limited generalization ability to unseen generative models. We observe that both real and generated images intend to form clustered low-dimensional manifolds within high-level feature spaces extracted by pre-trained visual encoders. Building on this observation, we propose a single-class attribution modeling framework that first amplifies the intrinsic differences between real and generated images by constructing a compact attribution space from any single-class training set, either composed of real images or generated ones, and then establishes a more stable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.