WPNAS: Neural Architecture Search by jointly using Weight Sharing and Predictor
Ke Lin, Yong A, Zhuoxin Gan, Yingying Jiang

TL;DR
This paper introduces WPNAS, a neural architecture search method that combines weight sharing and a predictor to evaluate architectures more accurately, leading to state-of-the-art results across multiple datasets and search spaces.
Contribution
It proposes a unified framework that jointly uses weight sharing and a predictor, including a novel weakly weight sharing technique with HyperNet to improve evaluation accuracy.
Findings
Achieves state-of-the-art performance on CIFAR-10, CIFAR-100, and ImageNet.
Effectively combines direct evaluation, prediction, and cost for architecture assessment.
Introduces a weakly weight sharing method to mitigate weight sharing side effects.
Abstract
Weight sharing based and predictor based methods are two major types of fast neural architecture search methods. In this paper, we propose to jointly use weight sharing and predictor in a unified framework. First, we construct a SuperNet in a weight-sharing way and probabilisticly sample architectures from the SuperNet. To increase the correctness of the evaluation of architectures, besides direct evaluation using the inherited weights, we further apply a few-shot predictor to assess the architecture on the other hand. The final evaluation of the architecture is the combination of direct evaluation, the prediction from the predictor and the cost of the architecture. We regard the evaluation as a reward and apply a self-critical policy gradient approach to update the architecture probabilities. To further reduce the side effects of weight sharing, we propose a weakly weight sharing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Brain Tumor Detection and Classification
MethodsDifferentiable Architecture Search
