Attack-Aware Deepfake Detection under Counter-Forensic Manipulations

Noor Fatima; Hasan Faraz Khan; Muzammil Behzad

arXiv:2512.22303·cs.CV·December 30, 2025

Attack-Aware Deepfake Detection under Counter-Forensic Manipulations

Noor Fatima, Hasan Faraz Khan, Muzammil Behzad

PDF

Open Access

TL;DR

This paper introduces an attack-aware deepfake detection system that combines red-team training and randomized test-time defenses to achieve robustness, well-calibrated probabilities, and interpretable heatmaps in realistic scenarios.

Contribution

It presents a novel two-stream architecture with integrated attack simulation and test-time defenses, improving robustness and interpretability of deepfake detection under various manipulations.

Findings

01

Near-perfect ranking across attack types

02

Low calibration error and minimal abstention risk

03

Robust performance under low-light and heavy compression conditions

Abstract

This work presents an attack-aware deepfake and image-forensics detector designed for robustness, well-calibrated probabilities, and transparent evidence under realistic deployment conditions. The method combines red-team training with randomized test-time defense in a two-stream architecture, where one stream encodes semantic content using a pretrained backbone and the other extracts forensic residuals, fused via a lightweight residual adapter for classification, while a shallow Feature Pyramid Network style head produces tamper heatmaps under weak supervision. Red-team training applies worst-of-K counter-forensics per batch, including JPEG realign and recompress, resampling warps, denoise-to-regrain operations, seam smoothing, small color and gamma shifts, and social-app transcodes, while test-time defense injects low-cost jitters such as resize and crop phase changes, mild gamma…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning