TL;DR
DeFakerOne is a unified foundation model that advances fake image detection and localization across diverse forgery scenarios, outperforming existing methods and demonstrating robustness and scalability.
Contribution
It introduces a data-centric, unified model integrating InternVL2 and SAM2 for joint detection and localization, addressing cross-domain artifacts transfer and interference.
Findings
Achieves state-of-the-art performance on 39 detection benchmarks.
Outperforms baselines in forgery localization on 9 benchmarks.
Shows robustness against real-world perturbations and advanced generators.
Abstract
In recent years, the rapid evolution of generative AI has fundamentally reshaped the paradigm of image forgery, breaking the traditional boundaries between document editing, natural image manipulation, DeepFake generation, and full-image AIGC synthesis. Despite this shift toward unified forgery generation, existing research in Fake Image Detection and Localization (FIDL) remains fragmented. This creates a mismatch between increasingly unified forgery generation mechanisms and the domain-specific detection paradigm. Bridging this mismatch poses two key challenges for FIDL: understanding cross-domain artifacts transfer and interference, and building a high-capacity unified foundation model for joint detection and localization. To address these challenges, we propose DeFakerOne, a data-centric, unified FIDL foundation model integrating InternVL2 and SAM2. DeFakerOne enables simultaneous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
