DINOv3 Beats Specialized Detectors: A Simple Foundation Model Baseline for Image Forensics

Jieming Yu; Qiuxiao Feng; Zhuohan Wang; Xiaochen Ma

arXiv:2604.16083·cs.CV·April 20, 2026

DINOv3 Beats Specialized Detectors: A Simple Foundation Model Baseline for Image Forensics

Jieming Yu, Qiuxiao Feng, Zhuohan Wang, Xiaochen Ma

PDF

1 Repo

TL;DR

This paper introduces a simple yet effective baseline for image forensics using DINOv3 with LoRA adaptation, achieving state-of-the-art results across multiple benchmarks with fewer trainable parameters.

Contribution

The authors demonstrate that a DINOv3-based model with LoRA adaptation outperforms specialized detectors, providing a robust, lightweight baseline for image forgery localization.

Findings

01

Improves average pixel-level F1 by 17 points over previous state-of-the-art.

02

LoRA outperforms full fine-tuning across all backbone scales.

03

Achieves high robustness to noise, JPEG compression, and blurring.

Abstract

With the rapid advancement of deep generative models, realistic fake images have become increasingly accessible, yet existing localization methods rely on complex designs and still struggle to generalize across manipulation types and imaging conditions. We present a simple but strong baseline based on DINOv3 with LoRA adaptation and a lightweight convolutional decoder. Under the CAT-Net protocol, our best model improves average pixel-level F1 by 17.0 points over the previous state of the art on four standard benchmarks using only 9.1\,M trainable parameters on top of a frozen ViT-L backbone, and even our smallest variant surpasses all prior specialized methods. LoRA consistently outperforms full fine-tuning across all backbone scales. Under the data-scarce MVSS-Net protocol, LoRA reaches an average F1 of 0.774 versus 0.530 for the strongest prior method, while full fine-tuning becomes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Irennnne/DINOv3-IML
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.