DropNAS: Grouped Operation Dropout for Differentiable Architecture Search
Weijun Hong, Guilin Li, Weinan Zhang, Ruiming Tang, Yunhe Wang,, Zhenguo Li, Yong Yu

TL;DR
DropNAS introduces a grouped operation dropout method to address co-adaptation and Matthew Effect issues in differentiable neural architecture search, leading to improved performance and robustness.
Contribution
The paper proposes DropNAS, a novel grouped operation dropout technique that fixes co-adaptation and Matthew Effect problems in DARTS-based NAS.
Findings
DropNAS achieves 2.26% test error on CIFAR-10.
DropNAS attains 16.39% on CIFAR-100.
DropNAS reaches 23.4% top-1 accuracy on ImageNet.
Abstract
Neural architecture search (NAS) has shown encouraging results in automating the architecture design. Recently, DARTS relaxes the search process with a differentiable formulation that leverages weight-sharing and SGD where all candidate operations are trained simultaneously. Our empirical results show that such procedure results in the co-adaption problem and Matthew Effect: operations with fewer parameters would be trained maturely earlier. This causes two problems: firstly, the operations with more parameters may never have the chance to express the desired function since those with less have already done the job; secondly, the system will punish those underperforming operations by lowering their architecture parameter, and they will get smaller loss gradients, which causes the Matthew Effect. In this paper, we systematically study these problems and propose a novel grouped operation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsDifferentiable Architecture Search · Dropout · Stochastic Gradient Descent
