Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces
Fahad Shamshad, Koushik Srivatsan, Karthik Nandakumar

TL;DR
This paper introduces a method to generate adversarial fake face images with specific attributes using StyleGAN, effectively fooling forensic classifiers while remaining undetectable to humans.
Contribution
It presents a novel framework for attribute-conditioned adversarial face generation leveraging StyleGAN and meta-learning for transferability.
Findings
Successfully fools forensic classifiers with attribute-specific fake faces.
Generated images remain realistic and undetectable to human scrutiny.
Method demonstrates transferability across different forensic models.
Abstract
The ability of generative models to produce highly realistic synthetic face images has raised security and ethical concerns. As a first line of defense against such fake faces, deep learning based forensic classifiers have been developed. While these forensic models can detect whether a face image is synthetic or real with high accuracy, they are also vulnerable to adversarial attacks. Although such attacks can be highly successful in evading detection by forensic classifiers, they introduce visible noise patterns that are detectable through careful human scrutiny. Additionally, these attacks assume access to the target model(s) which may not always be true. Attempts have been made to directly perturb the latent space of GANs to produce adversarial fake faces that can circumvent forensic classifiers. In this work, we go one step further and show that it is possible to successfully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis
MethodsR1 Regularization · Dense Connections · HuMan(Expedia)||How do I get a human at Expedia? · Feedforward Network · Convolution · Adaptive Instance Normalization · StyleGAN
