Adaptive Forensic Feature Refinement via Intrinsic Importance Perception

Jiazhen Yang; Junjun Zheng; Kejia Chen; Xiangheng Kong; Jie Lei; Zunlei Feng; Bingde Hu; Yang Gao

arXiv:2604.16879·cs.CV·April 23, 2026

Adaptive Forensic Feature Refinement via Intrinsic Importance Perception

Jiazhen Yang, Junjun Zheng, Kejia Chen, Xiangheng Kong, Jie Lei, Zunlei Feng, Bingde Hu, Yang Gao

PDF

TL;DR

This paper introduces I2P, a novel SID framework that adaptively identifies critical feature layers and constrains updates to preserve pretrained structure, enhancing cross-distribution generalization in synthetic image detection.

Contribution

The paper proposes a joint optimization approach for VFM adaptation in SID, explicitly modeling the optimal representational hierarchy and constraining parameter updates.

Findings

01

I2P effectively identifies the most discriminative feature layers for SID.

02

The method improves generalization to unknown generation sources.

03

Experimental results demonstrate enhanced detection accuracy across diverse datasets.

Abstract

With the rapid development of generative models and multimodal content editing technologies, the key challenge faced by synthetic image detection (SID) lies in cross-distribution generalization to unknown generation sources. In recent years, visual foundation models (VFM), which acquire rich visual priors through large scale image-text alignment pretraining, have become a promising technical route for improving the generalization ability of SID. However, existing VFM-based methods remain relatively coarse-grained in their adaptation strategies. They typically either directly use the final layer representations of VFM or simply fuse multi layer features, lacking explicit modeling of the optimal representational hierarchy for transferable forgery cues. Meanwhile, although directly fine-tuning VFM can enhance task adaptation, it may also damage the cross-modal pretrained structure that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.