Learning Self-Regularized Adversarial Views for Self-Supervised Vision   Transformers

Tao Tang; Changlin Li; Guangrun Wang; Kaicheng Yu; Xiaojun Chang,; Xiaodan Liang

arXiv:2210.08458·cs.CV·October 18, 2022·1 cites

Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers

Tao Tang, Changlin Li, Guangrun Wang, Kaicheng Yu, Xiaojun Chang,, Xiaodan Liang

PDF

Open Access 1 Repo

TL;DR

AutoView introduces a self-regularized adversarial approach for view generation in self-supervised vision transformers, significantly reducing search costs and improving downstream task performance.

Contribution

It proposes a novel AutoView method that learns views and network parameters simultaneously, with a self-regularized loss and a curated search space for self-supervised learning.

Findings

01

Achieves +10.2% k-NN accuracy improvement on ImageNet.

02

Outperforms manually tuned view policies by up to +1.3% k-NN accuracy.

03

Enhances downstream task performance and model robustness.

Abstract

Automatic data augmentation (AutoAugment) strategies are indispensable in supervised data-efficient training protocols of vision transformers, and have led to state-of-the-art results in supervised learning. Despite the success, its development and application on self-supervised vision transformers have been hindered by several barriers, including the high search cost, the lack of supervision, and the unsuitable search space. In this work, we propose AutoView, a self-regularized adversarial AutoAugment method, to learn views for self-supervised vision transformers, by addressing the above barriers. First, we reduce the search cost of AutoView to nearly zero by learning views and network parameters simultaneously in a single forward-backward step, minimizing and maximizing the mutual information among different augmented views, respectively. Then, to avoid information collapse caused by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trent-tangtao/autoview
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Domain Adaptation and Few-Shot Learning

MethodsTanh Activation · Sigmoid Activation · k-Nearest Neighbors · Long Short-Term Memory · AutoAugment