TL;DR
This paper introduces a novel framework combining reconstruction and GAN-based upsampling artifacts, with a specialized fusion method, to improve generalizable AI-generated image detection across diverse methods.
Contribution
It proposes a Separate Expert Fusion framework that leverages domain-specific experts and decoupled fusion to enhance detection robustness against various generative models.
Findings
Outperforms existing methods on 13 benchmarks.
Effectively captures diverse artifact patterns.
Improves generalization across multiple generative techniques.
Abstract
As the misuse of AI-generated images grows, generalizable image detection techniques are urgently needed. Recent state-of-the-art (SOTA) methods adopt aligned training datasets to reduce content, size, and format biases, empowering models to capture robust forgery cues. A common strategy is to employ reconstruction techniques, e.g., VAE and DDIM, which show remarkable results in diffusion-based methods. However, such reconstruction-based approaches typically introduce limited and homogeneous artifacts, which cannot fully capture diverse generative patterns, such as GAN-based methods. To complement reconstruction-based fake images with aligned yet diverse artifact patterns, we propose a GAN-based upsampling approach that mimics GAN-generated fake patterns while preserving content, size, and format alignment. This naturally results in two aligned but distinct types of fake images.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
