Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection

Yawen Yang; Feng Li; Shuqi Kong; Yunfeng Diao; Xinjian Gao; Zenglin Shi; and Meng Wang

arXiv:2603.10598·cs.CV·March 23, 2026

Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection

Yawen Yang, Feng Li, Shuqi Kong, Yunfeng Diao, Xinjian Gao, Zenglin Shi, and Meng Wang

PDF

Open Access

TL;DR

This paper introduces a novel method called latent transition discrepancy (LTD) that leverages inter-layer consistency differences in latent representations to improve the detection of synthetic images, enhancing generalization and robustness.

Contribution

The paper presents a new approach that captures inter-layer transition discrepancies to distinguish real from synthetic images, outperforming existing methods in accuracy and generalizability.

Findings

01

LTD exceeds baseline models by 14.35% in mean accuracy across datasets.

02

LTD outperforms recent state-of-the-art detection methods.

03

The approach demonstrates superior robustness and generalization in synthetic image detection.

Abstract

Recent rapid advancement of generative models has significantly improved the fidelity and accessibility of AI-generated synthetic images. While enabling various innovative applications, the unprecedented realism of these synthetics makes them increasingly indistinguishable from authentic photographs, posing serious security risks, such as media credibility and content manipulation. Although extensive efforts have been dedicated to detecting synthetic images, most existing approaches suffer from poor generalization to unseen data due to their reliance on model-specific artifacts or low-level statistical cues. In this work, we identify a previously unexplored distinction that real images maintain consistent semantic attention and structural coherence in their latent representations, exhibiting more stable feature transitions across network layers, whereas synthetic ones present…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis