BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage   Models

Jiahui Yu; Pengchong Jin; Hanxiao Liu; Gabriel Bender; Pieter-Jan; Kindermans; Mingxing Tan; Thomas Huang; Xiaodan Song; Ruoming Pang; Quoc Le

arXiv:2003.11142·cs.CV·July 20, 2020·38 cites

BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models

Jiahui Yu, Pengchong Jin, Hanxiao Liu, Gabriel Bender, Pieter-Jan, Kindermans, Mingxing Tan, Thomas Huang, Xiaodan Song, Ruoming Pang, Quoc Le

PDF

Open Access 1 Repo

TL;DR

BigNAS introduces a single training process for a family of neural network models that achieves high accuracy without post-processing, simplifying neural architecture search and surpassing state-of-the-art models.

Contribution

The paper presents BigNAS, a novel approach that trains a single shared model to produce multiple high-accuracy architectures without additional retraining or post-processing.

Findings

01

Achieves top-1 accuracy from 76.5% to 80.9% on ImageNet.

02

Surpasses state-of-the-art models like EfficientNets and Once-for-All.

03

Eliminates the need for retraining or post-processing of models.

Abstract

Neural architecture search (NAS) has shown promising results discovering models that are both accurate and fast. For NAS, training a one-shot model has become a popular strategy to rank the relative quality of different architectures (child models) using a single set of shared weights. However, while one-shot model weights can effectively rank different network architectures, the absolute accuracies from these shared weights are typically far below those obtained from stand-alone training. To compensate, existing methods assume that the weights must be retrained, finetuned, or otherwise post-processed after the search is completed. These steps significantly increase the compute requirements and complexity of the architecture search and model deployment. In this work, we propose BigNAS, an approach that challenges the conventional wisdom that post-processing of the weights is necessary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

GAIA-vision/GAIA-cv
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification