Aggregating Diverse Cue Experts for AI-Generated Image Detection
Lei Tan, Shuwei Li, Mohan Kankanhalli, Robby T. Tan

TL;DR
This paper introduces MCAN, a multi-cue aggregation network that combines spatial, frequency, and chromaticity cues to improve the detection of AI-generated images, achieving superior cross-model generalization.
Contribution
The paper proposes a novel unified framework that integrates diverse cues using a mixture-of-encoders, enhancing robustness and generalization in AI-generated image detection.
Findings
Outperforms state-of-the-art methods by up to 7.4% in accuracy.
Demonstrates strong generalization across multiple datasets.
Effectively leverages chromatic inconsistency cues for detection.
Abstract
The rapid emergence of image synthesis models poses challenges to the generalization of AI-generated image detectors. However, existing methods often rely on model-specific features, leading to overfitting and poor generalization. In this paper, we introduce the Multi-Cue Aggregation Network (MCAN), a novel framework that integrates different yet complementary cues in a unified network. MCAN employs a mixture-of-encoders adapter to dynamically process these cues, enabling more adaptive and robust feature representation. Our cues include the input image itself, which represents the overall content, and high-frequency components that emphasize edge details. Additionally, we introduce a Chromatic Inconsistency (CI) cue, which normalizes intensity values and captures noise information introduced during the image acquisition process in real images, making these noise patterns more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
