Attack-Aware Deepfake Detection under Counter-Forensic Manipulations
Noor Fatima, Hasan Faraz Khan, Muzammil Behzad

TL;DR
This paper introduces an attack-aware deepfake detection system that combines red-team training and randomized test-time defenses to achieve robustness, well-calibrated probabilities, and interpretable heatmaps in realistic scenarios.
Contribution
It presents a novel two-stream architecture with integrated attack simulation and test-time defenses, improving robustness and interpretability of deepfake detection under various manipulations.
Findings
Near-perfect ranking across attack types
Low calibration error and minimal abstention risk
Robust performance under low-light and heavy compression conditions
Abstract
This work presents an attack-aware deepfake and image-forensics detector designed for robustness, well-calibrated probabilities, and transparent evidence under realistic deployment conditions. The method combines red-team training with randomized test-time defense in a two-stream architecture, where one stream encodes semantic content using a pretrained backbone and the other extracts forensic residuals, fused via a lightweight residual adapter for classification, while a shallow Feature Pyramid Network style head produces tamper heatmaps under weak supervision. Red-team training applies worst-of-K counter-forensics per batch, including JPEG realign and recompress, resampling warps, denoise-to-regrain operations, seam smoothing, small color and gamma shifts, and social-app transcodes, while test-time defense injects low-cost jitters such as resize and crop phase changes, mild gamma…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
