Decoupling Semantics and Fingerprints: A Universal Representation for AI-Generated Image Detection

Zhiyuan Wang (1); Yanxiang Chen (2); Pengcheng Zhao (3); Yunfeng Diao (2); Xin Liao (4)((1) Hefei University of Technology; (2) Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology); Ministry of Education / School of Computer Science; Information Engineering; Hefei University of Technology / Intelligent Interconnected Systems Laboratory of Anhui Province (Hefei University of Technology); (3) School of Computer Science; Nanjing Audit University; (4) College of Cyber Science; Technology; Hunan University)

arXiv:2605.07074·cs.CV·May 18, 2026

Decoupling Semantics and Fingerprints: A Universal Representation for AI-Generated Image Detection

Zhiyuan Wang (1), Yanxiang Chen (2), Pengcheng Zhao (3), Yunfeng Diao (2), Xin Liao (4)((1) Hefei University of Technology, (2) Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education / School of Computer Science

PDF

TL;DR

This paper introduces ODP-Net, a novel method that disentangles universal forgery traces from generator-specific fingerprints and semantics, improving AI-generated image detection across unseen architectures.

Contribution

It proposes a spectral analysis-based orthogonal decomposition approach to structurally disentangle factors, enabling better generalization in forgery detection.

Findings

01

ODP-Net outperforms existing methods on unseen architectures.

02

Spectral analysis reveals disjoint frequency subspaces for different features.

03

Explicit disentanglement improves detection robustness.

Abstract

Detecting AI-generated images across unseen architectures remains challenging, as existing models often overfit to generator-specific fingerprints and semantic content rather than learning universal forgery traces. We attribute this failure to feature entanglement: detectors learn these factors as a single entangled representation, where universal forgery traces are inextricably confounded with both generator-specific fingerprints and semantic content. Crucially, our spectral analysis reveals that this entanglement is avoidable: distinct generator-specific fingerprints (e.g., GAN stripes vs. Diffusion Model spots) occupy disjoint frequency subspaces and coexist as independent superpositions. Leveraging this physical orthogonality, we propose the Orthogonal Decomposition and Purification Network (ODP-Net) to structurally disentangle these factors. Specifically, ODP-Net employs (1)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.