RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning

Hao Tan; Ran Cheng; Shihua Huang; Cheng He; Changxiao Qiu; Fan Yang,; Ping Luo

arXiv:2009.06193·cs.CV·July 14, 2021

RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning

Hao Tan, Ran Cheng, Shihua Huang, Cheng He, Changxiao Qiu, Fan Yang,, Ping Luo

PDF

2 Repos

TL;DR

RelativeNAS introduces a novel neural architecture search method that efficiently combines fast and slow learners, achieving state-of-the-art results with significantly reduced computation time and versatile transferability to various vision tasks.

Contribution

It proposes a pairwise joint learning approach for NAS that leverages low-fidelity estimates, enabling faster and more efficient architecture discovery.

Findings

01

Achieves 24.88% top-1 error on ImageNet, outperforming DARTS and AmoebaNet-B.

02

Completes search in only nine hours on a single GPU, much faster than previous methods.

03

Discovered architectures transfer effectively to object detection, segmentation, and keypoint detection with competitive results.

Abstract

Despite the remarkable successes of Convolutional Neural Networks (CNNs) in computer vision, it is time-consuming and error-prone to manually design a CNN. Among various Neural Architecture Search (NAS) methods that are motivated to automate designs of high-performance CNNs, the differentiable NAS and population-based NAS are attracting increasing interests due to their unique characters. To benefit from the merits while overcoming the deficiencies of both, this work proposes a novel NAS method, RelativeNAS. As the key to efficient search, RelativeNAS performs joint learning between fast-learners (i.e. networks with relatively higher accuracy) and slow-learners in a pairwise manner. Moreover, since RelativeNAS only requires low-fidelity performance estimation to distinguish each pair of fast-learner and slow-learner, it saves certain computation costs for training the candidate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDifferentiable Neural Architecture Search · Differentiable Architecture Search · Average Pooling · Softmax · Max Pooling · Convolution · Spatially Separable Convolution · AmoebaNet