Exploring Adversarial Fake Images on Face Manifold

Dongze Li; Wei Wang; Hongxing Fan; Jing Dong

arXiv:2101.03272·cs.CV·May 20, 2021

Exploring Adversarial Fake Images on Face Manifold

Dongze Li, Wei Wang, Hongxing Fan, Jing Dong

PDF

Open Access

TL;DR

This paper introduces a method to generate adversarial fake face images by searching on the face manifold, effectively fooling deepfake detection models while preserving image quality.

Contribution

It proposes a novel latent space optimization approach to create adversarial face images that bypass forensic detectors without adding suspicious noise.

Findings

01

Adversarial images reduce detection accuracy from over 90% to nearly 0%.

02

Manipulating style and noise vectors affects attack success rate.

03

Generated images mainly alter facial textures and attributes.

Abstract

Images synthesized by powerful generative adversarial network (GAN) based methods have drawn moral and privacy concerns. Although image forensic models have reached great performance in detecting fake images from real ones, these models can be easily fooled with a simple adversarial attack. But, the noise adding adversarial samples are also arousing suspicion. In this paper, instead of adding adversarial noise, we optimally search adversarial points on face manifold to generate anti-forensic fake face images. We iteratively do a gradient-descent with each small step in the latent space of a generative model, e.g. Style-GAN, to find an adversarial latent vector, which is similar to norm-based adversarial attack but in latent space. Then, the generated fake images driven by the adversarial latent vectors with the help of GANs can defeat main-stream forensic models. For examples, they make…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning

MethodsDepthwise Convolution · Pointwise Convolution · Sigmoid Activation · (FiLe@Against@Claim)How do I file a claim against Expedia? · Batch Normalization · Dense Connections · Convolution · Depthwise Separable Convolution · Global Average Pooling · RMSProp