TL;DR
RelativeNAS introduces a novel neural architecture search method that efficiently combines fast and slow learners, achieving state-of-the-art results with significantly reduced computation time and versatile transferability to various vision tasks.
Contribution
It proposes a pairwise joint learning approach for NAS that leverages low-fidelity estimates, enabling faster and more efficient architecture discovery.
Findings
Achieves 24.88% top-1 error on ImageNet, outperforming DARTS and AmoebaNet-B.
Completes search in only nine hours on a single GPU, much faster than previous methods.
Discovered architectures transfer effectively to object detection, segmentation, and keypoint detection with competitive results.
Abstract
Despite the remarkable successes of Convolutional Neural Networks (CNNs) in computer vision, it is time-consuming and error-prone to manually design a CNN. Among various Neural Architecture Search (NAS) methods that are motivated to automate designs of high-performance CNNs, the differentiable NAS and population-based NAS are attracting increasing interests due to their unique characters. To benefit from the merits while overcoming the deficiencies of both, this work proposes a novel NAS method, RelativeNAS. As the key to efficient search, RelativeNAS performs joint learning between fast-learners (i.e. networks with relatively higher accuracy) and slow-learners in a pairwise manner. Moreover, since RelativeNAS only requires low-fidelity performance estimation to distinguish each pair of fast-learner and slow-learner, it saves certain computation costs for training the candidate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDifferentiable Neural Architecture Search · Differentiable Architecture Search · Average Pooling · Softmax · Max Pooling · Convolution · Spatially Separable Convolution · AmoebaNet
