Venus-DeFakerOne: Unified Fake Image Detection & Localization

GuangJian Team

arXiv:2605.14091·cs.CV·May 15, 2026

Venus-DeFakerOne: Unified Fake Image Detection & Localization

GuangJian Team

PDF

1 Models

TL;DR

DeFakerOne is a unified foundation model that advances fake image detection and localization across diverse forgery scenarios, outperforming existing methods and demonstrating robustness and scalability.

Contribution

It introduces a data-centric, unified model integrating InternVL2 and SAM2 for joint detection and localization, addressing cross-domain artifacts transfer and interference.

Findings

01

Achieves state-of-the-art performance on 39 detection benchmarks.

02

Outperforms baselines in forgery localization on 9 benchmarks.

03

Shows robustness against real-world perturbations and advanced generators.

Abstract

In recent years, the rapid evolution of generative AI has fundamentally reshaped the paradigm of image forgery, breaking the traditional boundaries between document editing, natural image manipulation, DeepFake generation, and full-image AIGC synthesis. Despite this shift toward unified forgery generation, existing research in Fake Image Detection and Localization (FIDL) remains fragmented. This creates a mismatch between increasingly unified forgery generation mechanisms and the domain-specific detection paradigm. Bridging this mismatch poses two key challenges for FIDL: understanding cross-domain artifacts transfer and interference, and building a high-capacity unified foundation model for joint detection and localization. To address these challenges, we propose DeFakerOne, a data-centric, unified FIDL foundation model integrating InternVL2 and SAM2. DeFakerOne enables simultaneous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
venus-guangjian/Venus-DeFakerOne
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.