Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning
Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

TL;DR
This paper introduces a scalable unsupervised visual representation learning method using a one-million-way instance classifier, addressing large-scale training challenges and achieving competitive results on ImageNet and downstream tasks.
Contribution
The work proposes novel techniques for large-scale instance classification, including hybrid parallel training, raw-feature weight initialization, and label smoothing for hard classes.
Findings
Achieves competitive performance on ImageNet linear evaluation.
Outperforms or matches state-of-the-art unsupervised methods.
Demonstrates effectiveness on multiple downstream visual tasks.
Abstract
This paper presents a simple unsupervised visual representation learning method with a pretext task of discriminating all images in a dataset using a parametric, instance-level classifier. The overall framework is a replica of a supervised classification model, where semantic classes (e.g., dog, bird, and ship) are replaced by instance IDs. However, scaling up the classification task from thousands of semantic labels to millions of instance labels brings specific challenges including 1) the large-scale softmax computation; 2) the slow convergence due to the infrequent visiting of instance samples; and 3) the massive number of negative classes that can be noisy. This work presents several novel techniques to handle these difficulties. First, we introduce a hybrid parallel training framework to make large-scale training feasible. Second, we present a raw-feature initialization mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsAverage Pooling · 1x1 Convolution · Batch Normalization · Kaiming Initialization · Dense Connections · Residual Connection · Global Average Pooling · Color Jitter · Bottleneck Residual Block · Convolution
