Accelerating Multi-Objective Neural Architecture Search by Random-Weight Evaluation
Shengran Hu, Ran Cheng, Cheng He, Zhichao Lu, Jing Wang, Miao Zhang

TL;DR
This paper introduces Random-Weight Evaluation (RWE), a fast and cost-effective performance metric for neural architecture search that evaluates CNNs by training only the last layer, significantly reducing computational costs.
Contribution
The paper proposes RWE, a novel performance estimation method for NAS that trains only the last layer, enabling rapid evaluation of CNN architectures.
Findings
RWE achieves state-of-the-art performance in multi-objective NAS.
The method transfers well from CIFAR-10 to ImageNet.
Ablation studies confirm RWE's effectiveness over existing metrics.
Abstract
For the goal of automated design of high-performance deep convolutional neural networks (CNNs), Neural Architecture Search (NAS) methodology is becoming increasingly important for both academia and industries.Due to the costly stochastic gradient descent (SGD) training of CNNs for performance evaluation, most existing NAS methods are computationally expensive for real-world deployments. To address this issue, we first introduce a new performance estimation metric, named Random-Weight Evaluation (RWE) to quantify the quality of CNNs in a cost-efficient manner. Instead of fully training the entire CNN, the RWE only trains its last layer and leaves the remainders with randomly initialized weights, which results in a single network evaluation in seconds.Second, a complexity metric is adopted for multi-objective NAS to balance the model size and performance. Overall, our proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
