Train a One-Million-Way Instance Classifier for Unsupervised Visual   Representation Learning

Yu Liu; Lianghua Huang; Pan Pan; Bin Wang; Yinghui Xu; Rong Jin

arXiv:2102.04848·cs.CV·February 10, 2021·1 cites

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

PDF

Open Access 1 Video

TL;DR

This paper introduces a scalable unsupervised visual representation learning method using a one-million-way instance classifier, addressing large-scale training challenges and achieving competitive results on ImageNet and downstream tasks.

Contribution

The work proposes novel techniques for large-scale instance classification, including hybrid parallel training, raw-feature weight initialization, and label smoothing for hard classes.

Findings

01

Achieves competitive performance on ImageNet linear evaluation.

02

Outperforms or matches state-of-the-art unsupervised methods.

03

Demonstrates effectiveness on multiple downstream visual tasks.

Abstract

This paper presents a simple unsupervised visual representation learning method with a pretext task of discriminating all images in a dataset using a parametric, instance-level classifier. The overall framework is a replica of a supervised classification model, where semantic classes (e.g., dog, bird, and ship) are replaced by instance IDs. However, scaling up the classification task from thousands of semantic labels to millions of instance labels brings specific challenges including 1) the large-scale softmax computation; 2) the slow convergence due to the infrequent visiting of instance samples; and 3) the massive number of negative classes that can be noisy. This work presents several novel techniques to handle these difficulties. First, we introduce a hybrid parallel training framework to make large-scale training feasible. Second, we present a raw-feature initialization mechanism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning· underline

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsAverage Pooling · 1x1 Convolution · Batch Normalization · Kaiming Initialization · Dense Connections · Residual Connection · Global Average Pooling · Color Jitter · Bottleneck Residual Block · Convolution