FNAS: Uncertainty-Aware Fast Neural Architecture Search

Jihao Liu; Ming Zhang; Yangting Sun; Boxiao Liu; Guanglu; Song; Yu Liu; Hongsheng Li

arXiv:2105.11694·cs.LG·May 28, 2021·5 cites

FNAS: Uncertainty-Aware Fast Neural Architecture Search

Jihao Liu, Ming Zhang, Yangting Sun, Boxiao Liu, Guanglu, Song, Yu Liu, Hongsheng Li

PDF

Open Access

TL;DR

This paper introduces FNAS, a novel neural architecture search method that significantly accelerates RL-based NAS by leveraging architecture and parameter transfer, uncertainty-aware critics, and block-level weight sharing, achieving about 10x speedup.

Contribution

FNAS is the first to incorporate block-level weight sharing in RL-based NAS and introduces an uncertainty-aware critic and a knowledge pool to drastically reduce search time.

Findings

01

FNAS accelerates RL-based NAS by approximately 10 times.

02

The method achieves better performance on various vision tasks.

03

It reduces search time from 20,000 GPU hours to 2,000 GPU hours.

Abstract

Reinforcement learning (RL)-based neural architecture search (NAS) generally guarantees better convergence yet suffers from the requirement of huge computational resources compared with gradient-based approaches, due to the rollout bottleneck -- exhaustive training for each sampled generation on proxy tasks. In this paper, we propose a general pipeline to accelerate the convergence of the rollout process as well as the RL process in NAS. It is motivated by the interesting observation that both the architecture and the parameter knowledge can be transferred between different experiments and even different tasks. We first introduce an uncertainty-aware critic (value function) in Proximal Policy Optimization (PPO) to utilize the architecture knowledge in previous experiments, which stabilizes the training process and reduces the searching time by 4 times. Further, an architecture knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning