BN-NAS: Neural Architecture Search with Batch Normalization

Boyu Chen; Peixia Li; Baopu Li; Chen Lin; Chuming Li; Ming Sun; Junjie Yan; Wanli Ouyang

arXiv:2108.07375·cs.CV·August 18, 2021·1 cites

TL;DR

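BN-NAS speeds up neural architecture search in two ways: a Batch Normalization (BN) based indicator predicts subnet performance at a very early training stage, and only the BN parameters are trained during supernet training. Together these cut supernet training time by more than 10 times and subnet evaluation time by more than 600,000 times without losing accuracy.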

Contribution

The paper proposes BN-NAS, a method that accelerates NAS by using BN-based indicators for early performance prediction and training only BN parameters during supernet training.
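
The "training only BN parameters" part can be illustrated with a minimal PyTorch sketch. It is not the authors' code: the helper name `freeze_all_but_bn`, the toy network standing in for a supernet, and the optimizer settings are all assumptions made for illustration, and the sketch assumes the non-BN weights simply stay at their initial values.

```python
import torch
from torch import nn


def freeze_all_but_bn(model: nn.Module) -> list:
    """Hypothetical helper: leave only BN affine parameters trainable."""
    bn_types = (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)
    # Freeze every parameter first (convolutions, linear layers, ...).
    for param in model.parameters():
        param.requires_grad_(False)
    trainable = []
    for module in model.modules():
        if isinstance(module, bn_types) and module.affine:
            # weight is the BN scale (gamma), bias is the BN shift (beta).
            module.weight.requires_grad_(True)
            module.bias.requires_grad_(True)
            trainable += [module.weight, module.bias]
    return trainable


# Toy stand-in for a supernet (assumption, not the paper's search space).
toy = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1, bias=False),
    nn.BatchNorm2d(8),
    nn.ReLU(),
    nn.Conv2d(8, 4, kernel_size=3, padding=1, bias=False),
    nn.BatchNorm2d(4),
)

bn_params = freeze_all_but_bn(toy)
# The optimizer only ever sees the BN parameters.
optimizer = torch.optim.SGD(bn_params, lr=0.1, momentum=0.9)

conv_before = toy[0].weight.clone()
last_gamma_before = toy[4].weight.clone()

# One training step on random data with a placeholder loss.
x = torch.randn(4, 3, 8, 8)
loss = toy(x).pow(2).mean()
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

After the step, the convolution weights are unchanged while the BN scale of the last layer has moved. The BN running statistics are buffers rather than parameters, so they still update in training mode.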

Findings

01  Supernet training time reduced by over 10 times.

02  Subnet evaluation time shortened by more than 600,000 times.

03  Maintains accuracy despite significant speedups.

Abstract

We present neural architecture search with Batch Normalization (BN-NAS) to accelerate neural architecture search (NAS). BN-NAS can significantly reduce the time required for model training and evaluation in NAS. Specifically, for fast evaluation, we propose a BN-based indicator that predicts subnet performance at a very early training stage. The BN-based indicator also lets us improve training efficiency by training only the BN parameters during supernet training. This is based on our observation that training the whole supernet is not necessary, while training only the BN parameters accelerates network convergence for network architecture search. Extensive experiments show that our method shortens supernet training time by more than 10 times and subnet evaluation time by more than 600,000 times without losing accuracy.
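
To make the idea of a BN-based indicator concrete, here is a minimal pure-Python sketch. The abstract does not give the formula, so the scoring rule used here is an assumption: each candidate operation is scored by the mean absolute value of its learned BN scale parameters, and a subnet is scored by summing the scores of the operations it selects. The function names, operation names, and numbers are hypothetical.

```python
from itertools import product
from statistics import fmean


def op_score(gammas):
    """Assumed indicator: mean absolute BN scale of one candidate operation."""
    return fmean(abs(g) for g in gammas)


def subnet_score(supernet_gammas, choices):
    """Sum the per-operation scores of the operation chosen in each layer."""
    return sum(
        op_score(supernet_gammas[layer][op]) for layer, op in enumerate(choices)
    )


# Made-up BN scale values for a 2-layer supernet with 2 candidate ops per layer.
supernet_gammas = [
    {"conv3x3": [0.9, -1.1, 1.0], "conv5x5": [0.2, 0.1, -0.3]},
    {"conv3x3": [0.5, 0.5, -0.5], "conv5x5": [1.2, -0.8, 1.0]},
]

# Enumerate every subnet and rank by the indicator. No forward pass or
# validation data is needed, only the BN parameters already in the supernet.
candidates = list(product(*(layer.keys() for layer in supernet_gammas)))
ranked = sorted(
    candidates,
    key=lambda choices: subnet_score(supernet_gammas, choices),
    reverse=True,
)
best = ranked[0]
```

Because scoring a subnet under this assumption is only a handful of arithmetic operations on stored parameters, it is far cheaper than evaluating each subnet on a validation set, which is the kind of saving the abstract describes.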

Code & Models

Repositories

bychen515/bnnas (PyTorch, official)

Taxonomy

Topics: Advanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

Methods: Batch Normalization