ScaleNet: An Unsupervised Representation Learning Method for Limited Information
Huili Huang, M. Mahdi Roozbahani

TL;DR
ScaleNet is an unsupervised learning method that uses multi-scale images and rotation prediction to improve CNN performance with limited data, outperforming previous methods like RotNet on CIFAR-10 and ImageNet.
Contribution
The paper introduces ScaleNet, a novel unsupervised learning approach leveraging multi-scale images and rotation tasks to enhance CNN feature learning with limited data.
Findings
ScaleNet outperforms RotNet by ~7% on CIFAR-10.
Transferred parameters from ScaleNet improve ImageNet classification by ~6%.
ScaleNet enhances other models like SimCLR by learning effective features.
Abstract
Although large-scale labeled data are essential for deep convolutional neural networks (ConvNets) to learn high-level semantic visual representations, it is time-consuming and impractical to collect and annotate large-scale datasets. A simple and efficient unsupervised representation learning method named ScaleNet based on multi-scale images is proposed in this study to enhance the performance of ConvNets when limited information is available. The input images are first resized to a smaller size and fed to the ConvNet to recognize the rotation degree. Next, the ConvNet learns the rotation-prediction task for the original size images based on the parameters transferred from the previous model. The CIFAR-10 and ImageNet datasets are examined on different architectures such as AlexNet and ResNet50 in this study. The current study demonstrates that specific image features, such as Harris…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Kaiming Initialization · 1x1 Convolution · Average Pooling · Residual Connection · Softmax · Bottleneck Residual Block · Residual Block · Max Pooling
