On the Vulnerability of DeepFake Detectors to Attacks Generated by   Denoising Diffusion Models

Marija Ivanovska; Vitomir \v{S}truc

arXiv:2307.05397·cs.CV·October 31, 2023·2 cites

On the Vulnerability of DeepFake Detectors to Attacks Generated by Denoising Diffusion Models

Marija Ivanovska, Vitomir \v{S}truc

PDF

Open Access

TL;DR

This paper reveals that state-of-the-art deepfake detectors are vulnerable to attacks generated by denoising diffusion models, which can evade detection without perceptible image changes, highlighting a need for more robust detection methods.

Contribution

The study introduces a novel diffusion-based attack method on deepfake detectors and evaluates its effectiveness, exposing weaknesses in current detection approaches.

Findings

01

Single diffusion step significantly reduces detection likelihood.

02

Detectors trained on diffusion-based deepfakes have limited generalizability.

03

Diffusion-based attacks can evade detection without perceptible image quality loss.

Abstract

The detection of malicious deepfakes is a constantly evolving problem that requires continuous monitoring of detectors to ensure they can detect image manipulations generated by the latest emerging models. In this paper, we investigate the vulnerability of single-image deepfake detectors to black-box attacks created by the newest generation of generative methods, namely Denoising Diffusion Models (DDMs). Our experiments are run on FaceForensics++, a widely used deepfake benchmark consisting of manipulated images generated with various techniques for face identity swapping and face reenactment. Attacks are crafted through guided reconstruction of existing deepfakes with a proposed DDM approach for face restoration. Our findings indicate that employing just a single denoising diffusion step in the reconstruction process of a deepfake can significantly reduce the likelihood of detection,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis

MethodsDiffusion