Billion-scale semi-supervised learning for image classification
I. Zeki Yalniz, Herv\'e J\'egou, Kan Chen, Manohar Paluri, Dhruv, Mahajan

TL;DR
This paper demonstrates that semi-supervised learning with a billion unlabelled images significantly improves image classification accuracy, achieving state-of-the-art results on ImageNet using standard architectures.
Contribution
It introduces a large-scale semi-supervised learning pipeline leveraging a billion unlabelled images, with analysis and recommendations for high-accuracy models.
Findings
ResNet-50 achieves 81.2% top-1 accuracy on ImageNet.
Large unlabelled datasets substantially boost model performance.
The approach benefits image, video, and fine-grained classification tasks.
Abstract
This paper presents a study of semi-supervised learning with large convolutional networks. We propose a pipeline, based on a teacher/student paradigm, that leverages a large collection of unlabelled images (up to 1 billion). Our main goal is to improve the performance for a given target architecture, like ResNet-50 or ResNext. We provide an extensive analysis of the success factors of our approach, which leads us to formulate some recommendations to produce high-accuracy models for image classification with semi-supervised learning. As a result, our approach brings important gains to standard architectures for image, video and fine-grained classification. For instance, by leveraging one billion unlabelled images, our learned vanilla ResNet-50 achieves 81.2% top-1 accuracy on the ImageNet benchmark.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗kadirnar/timm_model_listmodel· ♡ 1♡ 1
- 🤗timm/resnet18.fb_ssl_yfcc100m_ft_in1kmodel· 314 dl· ♡ 1314 dl♡ 1
- 🤗timm/resnet18.fb_swsl_ig1b_ft_in1kmodel· 62k dl62k dl
- 🤗timm/resnet50.fb_ssl_yfcc100m_ft_in1kmodel· 268 dl268 dl
- 🤗timm/resnet50.fb_swsl_ig1b_ft_in1kmodel· 132k dl132k dl
- 🤗timm/resnext50_32x4d.fb_ssl_yfcc100m_ft_in1kmodel· 68 dl· ♡ 168 dl♡ 1
- 🤗timm/resnext50_32x4d.fb_swsl_ig1b_ft_in1kmodel· 1.2k dl1.2k dl
- 🤗timm/resnext101_32x4d.fb_ssl_yfcc100m_ft_in1kmodel· 130 dl· ♡ 1130 dl♡ 1
- 🤗timm/resnext101_32x4d.fb_swsl_ig1b_ft_in1kmodel· 472 dl472 dl
- 🤗timm/resnext101_32x8d.fb_ssl_yfcc100m_ft_in1kmodel· 58 dl58 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsAverage Pooling · ResNeXt Block · Grouped Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Max Pooling · 1x1 Convolution · ResNeXt · SGD with Momentum
