Decoupling Semantics and Fingerprints: A Universal Representation for AI-Generated Image Detection
Zhiyuan Wang (1), Yanxiang Chen (2), Pengcheng Zhao (3), Yunfeng Diao (2), Xin Liao (4)((1) Hefei University of Technology, (2) Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education / School of Computer Science

TL;DR
This paper introduces ODP-Net, a novel method that disentangles universal forgery traces from generator-specific fingerprints and semantics, improving AI-generated image detection across unseen architectures.
Contribution
It proposes a spectral analysis-based orthogonal decomposition approach to structurally disentangle factors, enabling better generalization in forgery detection.
Findings
ODP-Net outperforms existing methods on unseen architectures.
Spectral analysis reveals disjoint frequency subspaces for different features.
Explicit disentanglement improves detection robustness.
Abstract
Detecting AI-generated images across unseen architectures remains challenging, as existing models often overfit to generator-specific fingerprints and semantic content rather than learning universal forgery traces. We attribute this failure to feature entanglement: detectors learn these factors as a single entangled representation, where universal forgery traces are inextricably confounded with both generator-specific fingerprints and semantic content. Crucially, our spectral analysis reveals that this entanglement is avoidable: distinct generator-specific fingerprints (e.g., GAN stripes vs. Diffusion Model spots) occupy disjoint frequency subspaces and coexist as independent superpositions. Leveraging this physical orthogonality, we propose the Orthogonal Decomposition and Purification Network (ODP-Net) to structurally disentangle these factors. Specifically, ODP-Net employs (1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
