Rethinking Cross-Generator Image Forgery Detection through DINOv3
Zhenglin Huang, Jason Li, Haiquan Wen, Tianxiao Li, Xi Yang, Lu Qi, Bei Peng, Xiaowei Huang, Ming-Hsuan Yang, Guangliang Cheng

TL;DR
This paper demonstrates that frozen visual foundation models like DINOv3 can effectively detect cross-generator image forgeries by leveraging global low-frequency cues, without additional training, providing a universal and interpretable detection baseline.
Contribution
It reveals that DINOv3 inherently captures transferable authenticity cues and introduces a simple token-ranking method to enhance detection accuracy without fine-tuning.
Findings
DINOv3 shows strong cross-generator detection ability without fine-tuning.
Global, low-frequency structures serve as transferable authenticity cues.
Token selection improves detection accuracy across datasets.
Abstract
As generative models become increasingly diverse and powerful, cross-generator detection has emerged as a new challenge. Existing detection methods often memorize artifacts of specific generative models rather than learning transferable cues, leading to substantial failures on unseen generators. Surprisingly, this work finds that frozen visual foundation models, especially DINOv3, already exhibit strong cross-generator detection capability without any fine-tuning. Through systematic studies on frequency, spatial, and token perspectives, we observe that DINOv3 tends to rely on global, low-frequency structures as weak but transferable authenticity cues instead of high-frequency, generator-specific artifacts. Motivated by this insight, we introduce a simple, training-free token-ranking strategy followed by a lightweight linear probe to select a small subset of authenticity-relevant tokens.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
