Siamese-NAS: Using Trained Samples Efficiently to Find Lightweight   Neural Architecture by Prior Knowledge

Yu-Ming Zhang; Jun-Wei Hsieh; Chun-Chieh Lee; Kuo-Chin Fan

arXiv:2210.00546·cs.CV·October 4, 2022

Siamese-NAS: Using Trained Samples Efficiently to Find Lightweight Neural Architecture by Prior Knowledge

Yu-Ming Zhang, Jun-Wei Hsieh, Chun-Chieh Lee, Kuo-Chin Fan

PDF

Open Access 1 Repo

TL;DR

This paper introduces Siamese-NAS, a predictor-based neural architecture search method utilizing prior knowledge through Estimation Code, achieving state-of-the-art results in lightweight CNN design with limited samples and computational resources.

Contribution

It proposes Siamese-Predictor with Estimation Code for efficient NAS, surpassing current predictors and enabling lightweight CNN architecture search with minimal training samples.

Findings

01

Siamese-Predictor outperforms SOTA predictors on NASBench-201.

02

The Estimation Code enhances sampling efficiency and accuracy.

03

The Tiny-NanoBench search space facilitates finding lightweight architectures with few FLOPs.

Abstract

In the past decade, many architectures of convolution neural networks were designed by handcraft, such as Vgg16, ResNet, DenseNet, etc. They all achieve state-of-the-art level on different tasks in their time. However, it still relies on human intuition and experience, and it also takes so much time consumption for trial and error. Neural Architecture Search (NAS) focused on this issue. In recent works, the Neural Predictor has significantly improved with few training architectures as training samples. However, the sampling efficiency is already considerable. In this paper, our proposed Siamese-Predictor is inspired by past works of predictor-based NAS. It is constructed with the proposed Estimation Code, which is the prior knowledge about the training procedure. The proposed Siamese-Predictor gets significant benefits from this idea. This idea causes it to surpass the current SOTA…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

D0352276/Siamese-NAS
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Neural Networks and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Average Pooling · Batch Normalization · 1x1 Convolution · Dense Connections · Dropout · Max Pooling · Softmax · Global Average Pooling