Robust Deepfake Detection: Mitigating Spatial Attention Drift via Calibrated Complementary Ensembles

Minh-Khoa Le-Phan; Minh-Hoang Le; Trong-Le Do; Minh-Triet Tran

arXiv:2604.25889·cs.CV·April 29, 2026

Robust Deepfake Detection: Mitigating Spatial Attention Drift via Calibrated Complementary Ensembles

Minh-Khoa Le-Phan, Minh-Hoang Le, Trong-Le Do, Minh-Triet Tran

PDF

1 Repo

TL;DR

This paper introduces a multi-stream ensemble framework with a degradation engine to improve deepfake detection robustness against real-world degradations, achieving state-of-the-art zero-shot generalization.

Contribution

It proposes a novel foundation-driven forensic framework combining specialized streams and calibrated voting to mitigate spatial attention drift in deepfake detection.

Findings

01

Achieves highly stable zero-shot generalization in deepfake detection.

02

Suppresses background attention drift effectively.

03

Secures Fourth Place in the NTIRE 2026 Challenge.

Abstract

Current deepfake detection models achieve state-of-the-art performance on pristine academic datasets but suffer severe spatial attention drift under real-world compound degradations, such as blurring and severe lossy compression. To address this vulnerability, we propose a foundation-driven forensic framework that integrates an extreme compound degradation engine with a structurally constrained, multi-stream architecture. During training, our degradation pipeline systematically destroys high-frequency artifacts, optimizing the DINOv2-Giant backbone to extract invariant geometric and semantic priors. We then process images through three specialized pathways: a Global Texture stream, a Localized Facial stream, and a Hybrid Semantic Fusion stream incorporating CLIP. Through analyzing spatial attribution via Score-CAM and feature stability using Cosine Similarity, we quantitatively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

khoalephanminh/ntire26-deepfake-challenge
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.