FNAS: Uncertainty-Aware Fast Neural Architecture Search
Jihao Liu, Ming Zhang, Yangting Sun, Boxiao Liu, Guanglu, Song, Yu Liu, Hongsheng Li

TL;DR
This paper introduces FNAS, a novel neural architecture search method that significantly accelerates RL-based NAS by leveraging architecture and parameter transfer, uncertainty-aware critics, and block-level weight sharing, achieving about 10x speedup.
Contribution
FNAS is the first to incorporate block-level weight sharing in RL-based NAS and introduces an uncertainty-aware critic and a knowledge pool to drastically reduce search time.
Findings
FNAS accelerates RL-based NAS by approximately 10 times.
The method achieves better performance on various vision tasks.
It reduces search time from 20,000 GPU hours to 2,000 GPU hours.
Abstract
Reinforcement learning (RL)-based neural architecture search (NAS) generally guarantees better convergence yet suffers from the requirement of huge computational resources compared with gradient-based approaches, due to the rollout bottleneck -- exhaustive training for each sampled generation on proxy tasks. In this paper, we propose a general pipeline to accelerate the convergence of the rollout process as well as the RL process in NAS. It is motivated by the interesting observation that both the architecture and the parameter knowledge can be transferred between different experiments and even different tasks. We first introduce an uncertainty-aware critic (value function) in Proximal Policy Optimization (PPO) to utilize the architecture knowledge in previous experiments, which stabilizes the training process and reduces the searching time by 4 times. Further, an architecture knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
