2D-Malafide: Adversarial Attacks Against Face Deepfake Detection Systems
Chiara Galdi, Michele Panariello, Massimiliano Todisco, Nicholas Evans

TL;DR
This paper presents 2D-Malafide, a lightweight adversarial attack using 2D convolutional filters that significantly reduces the effectiveness of face deepfake detection systems, revealing their vulnerability.
Contribution
The paper introduces 2D-Malafide, a novel attack method leveraging 2D convolutional filters to craft transferable adversarial perturbations against deepfake detectors.
Findings
2D-Malafide significantly degrades detection accuracy
Attack effectiveness increases with larger filter sizes
Transferability of attacks across different images is demonstrated
Abstract
We introduce 2D-Malafide, a novel and lightweight adversarial attack designed to deceive face deepfake detection systems. Building upon the concept of 1D convolutional perturbations explored in the speech domain, our method leverages 2D convolutional filters to craft perturbations which significantly degrade the performance of state-of-the-art face deepfake detectors. Unlike traditional additive noise approaches, 2D-Malafide optimises a small number of filter coefficients to generate robust adversarial perturbations which are transferable across different face images. Experiments, conducted using the FaceForensics++ dataset, demonstrate that 2D-Malafide substantially degrades detection performance in both white-box and black-box settings, with larger filter sizes having the greatest impact. Additionally, we report an explainability analysis using GradCAM which illustrates how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
