NASiam: Efficient Representation Learning using Neural Architecture Search for Siamese Networks
Alexandre Heuillet, Hedi Tabia, Hichem Arioui

TL;DR
NASiam introduces a differentiable neural architecture search method to optimize Siamese network components for self-supervised learning, achieving strong image representations efficiently on various datasets.
Contribution
It is the first to apply differentiable NAS to improve Siamese network architectures for contrastive learning, enhancing performance while maintaining simplicity.
Findings
Achieves competitive results on CIFAR-10, CIFAR-100, and ImageNet.
Uses only a few GPU hours for architecture search.
Discovers architectures that prevent collapsing behavior.
Abstract
Siamese networks are one of the most trending methods to achieve self-supervised visual representation learning (SSL). Since hand labeling is costly, SSL can play a crucial part by allowing deep learning to train on large unlabeled datasets. Meanwhile, Neural Architecture Search (NAS) is becoming increasingly important as a technique to discover novel deep learning architectures. However, early NAS methods based on reinforcement learning or evolutionary algorithms suffered from ludicrous computational and memory costs. In contrast, differentiable NAS, a gradient-based approach, has the advantage of being much more efficient and has thus retained most of the attention in the past few years. In this article, we present NASiam, a novel approach that uses for the first time differentiable NAS to improve the multilayer perceptron projector and predictor (encoder/predictor pair) architectures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsBitcoin Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Normalized Temperature-scaled Cross Entropy Loss · Dense Connections · Residual Block · Random Gaussian Blur · Random Resized Crop · Kaiming Initialization · Convolution
