LOGER: Local--Global Ensemble for Robust Deepfake Detection in the Wild
Fei Wu, Dagong Lu, Mufeng Yao, Xinlei Xu, Fengjun Guo

TL;DR
LOGER is a novel ensemble framework combining global and local analysis to improve robustness in deepfake detection across diverse real-world conditions.
Contribution
It introduces a local-global ensemble approach with heterogeneous backbones and patch-level modeling, enhancing detection robustness and generalization.
Findings
LOGER achieves 2nd place in NTIRE 2026 Deepfake Detection Challenge.
It demonstrates strong robustness across multiple benchmarks.
The method effectively handles diverse manipulation techniques and degradations.
Abstract
Robust deepfake detection in the wild remains challenging due to the ever-growing variety of manipulation techniques and uncontrolled real-world degradations. Forensic cues for deepfake detection reside at two complementary levels: global-level anomalies in semantics and statistics that require holistic image understanding, and local-level forgery traces concentrated in manipulated regions that are easily diluted by global averaging. Since no single backbone or input scale can effectively cover both levels, we propose LOGER, a LOcal--Global Ensemble framework for Robust deepfake detection. The global branch employs heterogeneous vision foundation model backbones at multiple resolutions to capture holistic anomalies with diverse visual priors. The local branch performs patch-level modeling with a Multiple Instance Learning top- aggregation strategy that selectively pools only the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
