Rethinking Architecture Selection in Differentiable NAS
Ruochen Wang, Minhao Cheng, Xiangning Chen, Xiaocheng Tang, Cho-Jui, Hsieh

TL;DR
This paper critically examines the common practice of using architecture parameter magnitudes in differentiable NAS, revealing their unreliability, and proposes a perturbation-based method that improves architecture selection and mitigates known failure modes.
Contribution
It introduces a perturbation-based architecture selection method that directly measures operation influence, improving NAS results and addressing DARTS failure modes.
Findings
Magnitude of architecture parameters does not reflect operation contribution.
Perturbation-based selection yields better architectures.
Failure modes of DARTS are alleviated with the new method.
Abstract
Differentiable Neural Architecture Search is one of the most popular Neural Architecture Search (NAS) methods for its search efficiency and simplicity, accomplished by jointly optimizing the model weight and architecture parameters in a weight-sharing supernet via gradient-based algorithms. At the end of the search phase, the operations with the largest architecture parameters will be selected to form the final architecture, with the implicit assumption that the values of architecture parameters reflect the operation strength. While much has been discussed about the supernet's optimization, the architecture selection process has received little attention. We provide empirical and theoretical analysis to show that the magnitude of architecture parameters does not necessarily indicate how much the operation contributes to the supernet's performance. We propose an alternative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Neural Networks and Applications
MethodsDifferentiable Architecture Search · Differentiable Neural Architecture Search
