Semantic Perturbations with Normalizing Flows for Improved Generalization
Oguz Kaan Yuksel, Sebastian U. Stich, Martin Jaggi, Tatjana Chavdarova

TL;DR
This paper introduces a novel data augmentation method using normalizing flows to generate semantically meaningful perturbations in the latent space, improving neural network generalization especially in low-data regimes.
Contribution
It leverages the reversible structure of normalizing flows for unsupervised, on-manifold perturbations, achieving state-of-the-art results on CIFAR-10/100 datasets.
Findings
Achieves 96.6% accuracy on CIFAR-10 with ResNet-18.
Outperforms existing augmentation methods, especially with limited data.
First to demonstrate latent-space perturbations improving real-world dataset performance.
Abstract
Data augmentation is a widely adopted technique for avoiding overfitting when training deep neural networks. However, this approach requires domain-specific knowledge and is often limited to a fixed set of hard-coded transformations. Recently, several works proposed to use generative models for generating semantically meaningful perturbations to train a classifier. However, because accurate encoding and decoding are critical, these methods, which use architectures that approximate the latent-variable inference, remained limited to pilot studies on small datasets. Exploiting the exactly reversible encoder-decoder structure of normalizing flows, we perform on-manifold perturbations in the latent space to define fully unsupervised data augmentations. We demonstrate that such perturbations match the performance of advanced data augmentation techniques -- reaching 96.6% test accuracy for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Model Reduction and Neural Networks
