Siamese-NAS: Using Trained Samples Efficiently to Find Lightweight Neural Architecture by Prior Knowledge
Yu-Ming Zhang, Jun-Wei Hsieh, Chun-Chieh Lee, Kuo-Chin Fan

TL;DR
This paper introduces Siamese-NAS, a predictor-based neural architecture search method utilizing prior knowledge through Estimation Code, achieving state-of-the-art results in lightweight CNN design with limited samples and computational resources.
Contribution
It proposes Siamese-Predictor with Estimation Code for efficient NAS, surpassing current predictors and enabling lightweight CNN architecture search with minimal training samples.
Findings
Siamese-Predictor outperforms SOTA predictors on NASBench-201.
The Estimation Code enhances sampling efficiency and accuracy.
The Tiny-NanoBench search space facilitates finding lightweight architectures with few FLOPs.
Abstract
In the past decade, many architectures of convolution neural networks were designed by handcraft, such as Vgg16, ResNet, DenseNet, etc. They all achieve state-of-the-art level on different tasks in their time. However, it still relies on human intuition and experience, and it also takes so much time consumption for trial and error. Neural Architecture Search (NAS) focused on this issue. In recent works, the Neural Predictor has significantly improved with few training architectures as training samples. However, the sampling efficiency is already considerable. In this paper, our proposed Siamese-Predictor is inspired by past works of predictor-based NAS. It is constructed with the proposed Estimation Code, which is the prior knowledge about the training procedure. The proposed Siamese-Predictor gets significant benefits from this idea. This idea causes it to surpass the current SOTA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Neural Networks and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Average Pooling · Batch Normalization · 1x1 Convolution · Dense Connections · Dropout · Max Pooling · Softmax · Global Average Pooling
