Adversarial examples for generative models

Jernej Kos; Ian Fischer; Dawn Song

arXiv:1702.06832·stat.ML·February 23, 2017·32 cites

Adversarial examples for generative models

Jernej Kos, Ian Fischer, Dawn Song

PDF

Open Access 1 Repo

TL;DR

This paper investigates methods to generate adversarial examples targeting deep generative models like VAE and VAE-GAN, demonstrating three attack strategies on models trained on datasets like MNIST, SVHN, and CelebA.

Contribution

It introduces three novel attack techniques for deep generative models, expanding adversarial research beyond classification tasks.

Findings

01

Demonstrated attacks on multiple datasets

02

Showed manipulation of latent representations

03

Highlighted vulnerabilities of generative models

Abstract

We explore methods of producing adversarial examples on deep generative models such as the variational autoencoder (VAE) and the VAE-GAN. Deep learning architectures are known to be vulnerable to adversarial examples, but previous work has focused on the application of adversarial examples to classification tasks. Deep generative models have recently become popular due to their ability to model input data distributions and generate realistic examples from those distributions. We present three classes of attacks on the VAE and VAE-GAN architectures and demonstrate them against networks trained on MNIST, SVHN and CelebA. Our first attack leverages classification-based adversaries by attaching a classifier to the trained encoder of the target generative model, which can then be used to indirectly manipulate the latent representation. Our second attack directly uses the VAE loss function to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rohban-lab/Salehi_submitted_2020
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Forensic and Genetic Research

MethodsSolana Customer Service Number +1-833-534-1729 · USD Coin Customer Service Number +1-833-534-1729