BNAS-v2: Memory-efficient and Performance-collapse-prevented Broad Neural Architecture Search
Zixiang Ding, Yaran Chen, Nannan Li, Dongbin Zhao

TL;DR
BNAS-v2 advances neural architecture search by employing continuous relaxation and confidence-based optimization to improve efficiency and prevent performance collapse, achieving state-of-the-art results on CIFAR-10 and ImageNet.
Contribution
It introduces a gradient-based NAS method with continuous relaxation and confidence learning rate to enhance efficiency and stability, addressing performance collapse issues.
Findings
BNAS-v2 achieves 0.05 GPU days on CIFAR-10, 4x faster than BNAS.
BNAS-v2 attains 0.19 GPU days on ImageNet, demonstrating high efficiency.
The proposed CLR effectively alleviates performance collapse in differentiable NAS.
Abstract
In this paper, we propose BNAS-v2 to further improve the efficiency of NAS, embodying both superiorities of BCNN simultaneously. To mitigate the unfair training issue of BNAS, we employ continuous relaxation strategy to make each edge of cell in BCNN relevant to all candidate operations for over-parameterized BCNN construction. Moreover, the continuous relaxation strategy relaxes the choice of a candidate operation as a softmax over all predefined operations. Consequently, BNAS-v2 employs the gradient-based optimization algorithm to simultaneously update every possible path of over-parameterized BCNN, rather than the single sampled one as BNAS. However, continuous relaxation leads to another issue named performance collapse, in which those weight-free operations are prone to be selected by the search strategy. For this consequent issue, two solutions are given: 1) we propose Confident…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Domain Adaptation and Few-Shot Learning
MethodsDifferentiable Neural Architecture Search · Softmax
