Viewmaker Networks: Learning Views for Unsupervised Representation   Learning

Alex Tamkin; Mike Wu; Noah Goodman

arXiv:2010.07432·cs.LG·March 30, 2021·23 cites

Viewmaker Networks: Learning Views for Unsupervised Representation Learning

Alex Tamkin, Mike Wu, Noah Goodman

PDF

Open Access 1 Repo 1 Video

TL;DR

Viewmaker networks are generative models that automatically learn to produce effective data views for unsupervised representation learning, reducing the need for manual design of data augmentations across various domains.

Contribution

We introduce viewmaker networks that learn to generate useful views for unsupervised learning, eliminating manual augmentation design and improving transfer performance across multiple data modalities.

Findings

01

Learned views enable comparable transfer accuracy to traditional augmentations on CIFAR-10.

02

Significantly outperform baseline augmentations on speech and sensor data.

03

Improve robustness to image corruptions when combined with handcrafted views.

Abstract

Many recent methods for unsupervised representation learning train models to be invariant to different "views," or distorted versions of an input. However, designing these views requires considerable trial and error by human experts, hindering widespread adoption of unsupervised representation learning methods across domains and modalities. To address this, we propose viewmaker networks: generative models that learn to produce useful views from a given input. Viewmakers are stochastic bounded adversaries: they produce views by generating and then adding an $ℓ_{p}$ -bounded perturbation to the input, and are trained adversarially with respect to the main encoder network. Remarkably, when pretraining on CIFAR-10, our learned views enable comparable transfer accuracy to the well-tuned SimCLR augmentations -- despite not including transformations like cropping or color jitter. Furthermore,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alextamkin/viewmaker
pytorchOfficial

Videos

Viewmaker Networks: Learning Views for Unsupervised Representation Learning· slideslive

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsViewmaker Network · 1x1 Convolution · Average Pooling · Batch Normalization · Residual Connection · Residual Block · Bottleneck Residual Block · Max Pooling · Convolution · Global Average Pooling