Adversarial examples for generative models
Jernej Kos, Ian Fischer, Dawn Song

TL;DR
This paper investigates methods to generate adversarial examples targeting deep generative models like VAE and VAE-GAN, demonstrating three attack strategies on models trained on datasets like MNIST, SVHN, and CelebA.
Contribution
It introduces three novel attack techniques for deep generative models, expanding adversarial research beyond classification tasks.
Findings
Demonstrated attacks on multiple datasets
Showed manipulation of latent representations
Highlighted vulnerabilities of generative models
Abstract
We explore methods of producing adversarial examples on deep generative models such as the variational autoencoder (VAE) and the VAE-GAN. Deep learning architectures are known to be vulnerable to adversarial examples, but previous work has focused on the application of adversarial examples to classification tasks. Deep generative models have recently become popular due to their ability to model input data distributions and generate realistic examples from those distributions. We present three classes of attacks on the VAE and VAE-GAN architectures and demonstrate them against networks trained on MNIST, SVHN and CelebA. Our first attack leverages classification-based adversaries by attaching a classifier to the trained encoder of the target generative model, which can then be used to indirectly manipulate the latent representation. Our second attack directly uses the VAE loss function to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Forensic and Genetic Research
MethodsSolana Customer Service Number +1-833-534-1729 · USD Coin Customer Service Number +1-833-534-1729
