DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection

Jiazhen Yan; Ziqiang Li; Fan Wang; Boyu Wang; Ziwen He; Zhangjie Fu

arXiv:2511.13108·cs.CV·May 18, 2026

DGS-Net: Distillation-Guided Gradient Surgery for CLIP Fine-Tuning in AI-Generated Image Detection

Jiazhen Yan, Ziqiang Li, Fan Wang, Boyu Wang, Ziwen He, Zhangjie Fu

PDF

TL;DR

DGS-Net is a novel framework that enhances CLIP fine-tuning for AI-generated image detection by preserving pre-trained knowledge and suppressing irrelevant features through gradient-space decomposition.

Contribution

The paper introduces a gradient-space decomposition method and a distillation-guided approach to improve CLIP fine-tuning for synthetic image detection, reducing catastrophic forgetting.

Findings

01

Outperforms state-of-the-art methods by an average of 6.6 in detection accuracy.

02

Demonstrates superior generalization across 50 generative models.

03

Effectively preserves pre-trained priors while suppressing irrelevant features.

Abstract

The rapid progress of generative models such as GANs and diffusion models has led to the widespread proliferation of AI-generated images, raising concerns about misinformation, privacy violations, and trust erosion in digital media. Although large-scale multimodal models like CLIP offer strong transferable representations for detecting synthetic content, fine-tuning them often induces catastrophic forgetting, which degrades pre-trained priors and limits cross-domain generalization. To address this issue, we propose the Distillation-guided Gradient Surgery Network (DGS-Net), a novel framework that preserves transferable pre-trained priors while suppressing task-irrelevant components. Specifically, we introduce a gradient-space decomposition that separates harmful and beneficial descent directions during optimization. By projecting task gradients onto the orthogonal complement of harmful…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.