L$^{2}$NAS: Learning to Optimize Neural Architectures via Continuous-Action Reinforcement Learning
Keith G. Mills, Fred X. Han, Mohammad Salameh, Seyed Saeed Changiz, Rezaei, Linglong Kong, Wei Lu, Shuo Lian, Shangling Jui, Di Niu

TL;DR
L$^{2}$NAS introduces a reinforcement learning approach to neural architecture search that learns to optimize hyperparameters efficiently, achieving state-of-the-art results and demonstrating transferability across datasets.
Contribution
The paper presents L$^{2}$NAS, a novel actor-critic reinforcement learning method for neural architecture search that improves over gradient-based approaches.
Findings
Achieves state-of-the-art results on NAS-Bench-201 and other search spaces.
Demonstrates transferability of search policies across datasets.
Efficient training via a quantile-driven actor-critic framework.
Abstract
Neural architecture search (NAS) has achieved remarkable results in deep neural network design. Differentiable architecture search converts the search over discrete architectures into a hyperparameter optimization problem which can be solved by gradient descent. However, questions have been raised regarding the effectiveness and generalizability of gradient methods for solving non-convex architecture hyperparameter optimization problems. In this paper, we propose LNAS, which learns to intelligently optimize and update architecture hyperparameters via an actor neural network based on the distribution of high-performing architectures in the search history. We introduce a quantile-driven training procedure which efficiently trains LNAS in an actor-critic framework via continuous-action reinforcement learning. Experiments show that LNAS achieves state-of-the-art results on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDepthwise Convolution · ReLU6 · Pointwise Convolution · Batch Normalization · Depthwise Separable Convolution · Average Pooling · Hard Swish · Sigmoid Activation · Dropout · Global Average Pooling
