Accelerating Multi-Objective Neural Architecture Search by Random-Weight   Evaluation

Shengran Hu; Ran Cheng; Cheng He; Zhichao Lu; Jing Wang; Miao Zhang

arXiv:2110.05242·cs.LG·October 12, 2021

Accelerating Multi-Objective Neural Architecture Search by Random-Weight Evaluation

Shengran Hu, Ran Cheng, Cheng He, Zhichao Lu, Jing Wang, Miao Zhang

PDF

Open Access

TL;DR

This paper introduces Random-Weight Evaluation (RWE), a fast and cost-effective performance metric for neural architecture search that evaluates CNNs by training only the last layer, significantly reducing computational costs.

Contribution

The paper proposes RWE, a novel performance estimation method for NAS that trains only the last layer, enabling rapid evaluation of CNN architectures.

Findings

01

RWE achieves state-of-the-art performance in multi-objective NAS.

02

The method transfers well from CIFAR-10 to ImageNet.

03

Ablation studies confirm RWE's effectiveness over existing metrics.

Abstract

For the goal of automated design of high-performance deep convolutional neural networks (CNNs), Neural Architecture Search (NAS) methodology is becoming increasingly important for both academia and industries.Due to the costly stochastic gradient descent (SGD) training of CNNs for performance evaluation, most existing NAS methods are computationally expensive for real-world deployments. To address this issue, we first introduce a new performance estimation metric, named Random-Weight Evaluation (RWE) to quantify the quality of CNNs in a cost-efficient manner. Instead of fully training the entire CNN, the RWE only trains its last layer and leaves the remainders with randomly initialized weights, which results in a single network evaluation in seconds.Second, a complexity metric is adopted for multi-objective NAS to balance the model size and performance. Overall, our proposed method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning