Viewmaker Networks: Learning Views for Unsupervised Representation Learning
Alex Tamkin, Mike Wu, Noah Goodman

TL;DR
Viewmaker networks are generative models that automatically learn to produce effective data views for unsupervised representation learning, reducing the need for manual design of data augmentations across various domains.
Contribution
We introduce viewmaker networks that learn to generate useful views for unsupervised learning, eliminating manual augmentation design and improving transfer performance across multiple data modalities.
Findings
Learned views enable comparable transfer accuracy to traditional augmentations on CIFAR-10.
Significantly outperform baseline augmentations on speech and sensor data.
Improve robustness to image corruptions when combined with handcrafted views.
Abstract
Many recent methods for unsupervised representation learning train models to be invariant to different "views," or distorted versions of an input. However, designing these views requires considerable trial and error by human experts, hindering widespread adoption of unsupervised representation learning methods across domains and modalities. To address this, we propose viewmaker networks: generative models that learn to produce useful views from a given input. Viewmakers are stochastic bounded adversaries: they produce views by generating and then adding an -bounded perturbation to the input, and are trained adversarially with respect to the main encoder network. Remarkably, when pretraining on CIFAR-10, our learned views enable comparable transfer accuracy to the well-tuned SimCLR augmentations -- despite not including transformations like cropping or color jitter. Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsViewmaker Network · 1x1 Convolution · Average Pooling · Batch Normalization · Residual Connection · Residual Block · Bottleneck Residual Block · Max Pooling · Convolution · Global Average Pooling
