ZARTS: On Zero-order Optimization for Neural Architecture Search
Xiaoxing Wang, Wenxuan Guo, Junchi Yan, Jianlin Su, Xiaokang Yang

TL;DR
ZARTS introduces a zero-order optimization approach for neural architecture search, avoiding the biased gradients of differentiable methods like DARTS, leading to more robust and accurate architectures.
Contribution
The paper proposes ZARTS, a zero-order NAS method that outperforms DARTS in robustness and accuracy by eliminating the need for gradient-based approximations.
Findings
ZARTS achieves state-of-the-art accuracy on CIFAR-10 and ImageNet.
ZARTS demonstrates superior robustness over DARTS across multiple benchmarks.
MGS, a zero-order method, balances accuracy and computational efficiency.
Abstract
Differentiable architecture search (DARTS) has been a popular one-shot paradigm for NAS due to its high efficiency. It introduces trainable architecture parameters to represent the importance of candidate operations and proposes first/second-order approximation to estimate their gradients, making it possible to solve NAS by gradient descent algorithm. However, our in-depth empirical results show that the approximation will often distort the loss landscape, leading to the biased objective to optimize and in turn inaccurate gradient estimation for architecture parameters. This work turns to zero-order optimization and proposes a novel NAS scheme, called ZARTS, to search without enforcing the above approximation. Specifically, three representative zero-order optimization methods are introduced: RS, MGS, and GLD, among which MGS performs best by balancing the accuracy and speed. Moreover,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsDifferentiable Architecture Search
