iDARTS: Improving DARTS by Node Normalization and Decorrelation Discretization
Huiqun Wang, Ruijie Yang, Di Huang, Yunhong Wang

TL;DR
The paper introduces iDARTS, an improved neural architecture search method that stabilizes DARTS by node normalization and decorrelated discretization, leading to faster, more robust, and generalizable architecture search results.
Contribution
iDARTS proposes novel node normalization and decorrelated discretization techniques to address instability issues in DARTS, enhancing search stability and performance.
Findings
Achieves 2.25% error on CIFAR-10 within 0.2 GPU-days
Attains 24.7% error on ImageNet within 1.9 GPU-days
Demonstrates improved robustness and generalization over existing DARTS methods
Abstract
Differentiable ARchiTecture Search (DARTS) uses a continuous relaxation of network representation and dramatically accelerates Neural Architecture Search (NAS) by almost thousands of times in GPU-day. However, the searching process of DARTS is unstable, which suffers severe degradation when training epochs become large, thus limiting its application. In this paper, we claim that this degradation issue is caused by the imbalanced norms between different nodes and the highly correlated outputs from various operations. We then propose an improved version of DARTS, namely iDARTS, to deal with the two problems. In the training phase, it introduces node normalization to maintain the norm balance. In the discretization phase, the continuous architecture is approximated based on the similarity between the outputs of the node and the decorrelated operations rather than the values of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsDifferentiable Architecture Search
