Evading Forensic Classifiers with Attribute-Conditioned Adversarial   Faces

Fahad Shamshad; Koushik Srivatsan; Karthik Nandakumar

arXiv:2306.13091·cs.CV·June 23, 2023

Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces

Fahad Shamshad, Koushik Srivatsan, Karthik Nandakumar

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to generate adversarial fake face images with specific attributes using StyleGAN, effectively fooling forensic classifiers while remaining undetectable to humans.

Contribution

It presents a novel framework for attribute-conditioned adversarial face generation leveraging StyleGAN and meta-learning for transferability.

Findings

01

Successfully fools forensic classifiers with attribute-specific fake faces.

02

Generated images remain realistic and undetectable to human scrutiny.

03

Method demonstrates transferability across different forensic models.

Abstract

The ability of generative models to produce highly realistic synthetic face images has raised security and ethical concerns. As a first line of defense against such fake faces, deep learning based forensic classifiers have been developed. While these forensic models can detect whether a face image is synthetic or real with high accuracy, they are also vulnerable to adversarial attacks. Although such attacks can be highly successful in evading detection by forensic classifiers, they introduce visible noise patterns that are detectable through careful human scrutiny. Additionally, these attacks assume access to the target model(s) which may not always be true. Attempts have been made to directly perturb the latent space of GANs to produce adversarial fake faces that can circumvent forensic classifiers. In this work, we go one step further and show that it is possible to successfully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

koushiksrivats/face_attribute_attack
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis

MethodsR1 Regularization · Dense Connections · HuMan(Expedia)||How do I get a human at Expedia? · Feedforward Network · Convolution · Adaptive Instance Normalization · StyleGAN