When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection

Chao Shuai; Zhenguang Liu; Shaojing Fan; Bin Gong; Weichen Lian; Xiuli Bi; Zhongjie Ba; Kui Ren

arXiv:2603.09242·cs.CV·March 11, 2026



Open Access

TL;DR

This paper introduces Geometric Semantic Decoupling (GSD), a method that improves the generalization of AI-generated image detectors by removing their reliance on semantic priors, making them more robust to unseen generation methods.

Contribution

The paper identifies semantic fallback as a key failure mechanism in detectors built on Vision Foundation Models (VFMs) and proposes GSD, a parameter-free module that explicitly removes semantic components from learned representations to improve detection robustness.

Findings

01 GSD significantly improves cross-dataset detection performance.

02 The method enhances robustness to unseen manipulations.

03 It generalizes beyond faces to general scene images.

Abstract

AI-generated image detection has become increasingly important with the rapid advancement of generative AI. However, detectors built on Vision Foundation Models (VFMs, e.g., CLIP) often struggle to generalize to images created using unseen generation pipelines. We identify, for the first time, a key failure mechanism, termed semantic fallback, where VFM-based detectors rely on dominant pre-trained semantic priors (such as identity) rather than forgery-specific traces under distribution shifts. To address this issue, we propose Geometric Semantic Decoupling (GSD), a parameter-free module that explicitly removes semantic components from learned representations by leveraging a frozen VFM as a semantic guide with a trainable VFM as an artifact detector. GSD estimates semantic directions from batch-wise statistics and projects them out via a geometric constraint,…
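
The projection idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract is truncated and does not say how the semantic directions are estimated, so the sketch assumes they are the top principal directions of a batch of frozen-VFM features. The function names, the number of directions `k`, and the random arrays standing in for features are all hypothetical.

```python
import numpy as np

def estimate_semantic_directions(frozen_feats, k=4):
    """Assumed estimator: top-k principal directions of the frozen features.

    frozen_feats: (B, D) array standing in for frozen-VFM embeddings.
    Returns a (k, D) array with orthonormal rows.
    """
    centered = frozen_feats - frozen_feats.mean(axis=0, keepdims=True)
    # Rows of vt are orthonormal right singular vectors, sorted by singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:k]

def project_out(feats, directions):
    """Remove the part of each feature that lies in the span of `directions`.

    Assumes `directions` has orthonormal rows, so the projection onto the
    orthogonal complement is feats - (feats @ V.T) @ V.
    """
    coeffs = feats @ directions.T          # (B, k) coordinates along each direction
    return feats - coeffs @ directions     # (B, D) features with those parts removed

# Random stand-ins for a batch of 32 features of dimension 64 (hypothetical sizes).
rng = np.random.default_rng(0)
frozen = rng.normal(size=(32, 64))      # frozen VFM: semantic guide
trainable = rng.normal(size=(32, 64))   # trainable VFM: artifact detector

dirs = estimate_semantic_directions(frozen, k=4)
decoupled = project_out(trainable, dirs)
```

After the projection, `decoupled` has no component along any of the estimated directions, and the step itself introduces no learned weights, which is consistent with the abstract's description of the module as parameter-free. Whether the paper uses SVD, a fixed `k`, or a different geometric constraint cannot be determined from the truncated abstract.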


Taxonomy

Topics: Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning