Combating Pattern and Content Bias: Adversarial Feature Learning for Generalized AI-Generated Image Detection
Haifeng Zhang, Qinghui He, Xiuli Bi, Bo Liu, Chi-Man Pun, Bin Xiao

TL;DR
This paper introduces MAFL, an adversarial feature learning framework that reduces bias in AI-generated image detection, improving cross-model generalization and performance with limited training data.
Contribution
The proposed MAFL framework effectively suppresses pattern and content biases, enhancing detection accuracy and generalization across different generative models.
Findings
Outperforms state-of-the-art methods by 10.89% in accuracy.
Achieves over 80% detection accuracy with only 320 training images.
Effectively captures shared generative features across models.
Abstract
In recent years, the rapid development of generative artificial intelligence technology has significantly lowered the barrier to creating high-quality fake images, posing a serious challenge to information authenticity and credibility. Existing generated image detection methods typically enhance generalization through model architecture or network design. However, their generalization performance remains susceptible to data bias, as the training data may drive models to fit specific generative patterns and content rather than the common features shared by images from different generative models (asymmetric bias learning). To address this issue, we propose a Multi-dimensional Adversarial Feature Learning (MAFL) framework. The framework adopts a pretrained multimodal image encoder as the feature extraction backbone, constructs a real-fake feature learning network, and designs an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
