Progressive DARTS: Bridging the Optimization Gap for NAS in the Wild
Xin Chen, Lingxi Xie, Jun Wu, Qi Tian

TL;DR
This paper introduces Progressive DARTS, a method that reduces the optimization gap in neural architecture search, enabling better transferability of architectures across diverse vision tasks with reduced search time.
Contribution
It proposes a progressive search strategy that gradually increases network depth to improve architecture transferability and performance in NAS, addressing the optimization gap in DARTS.
Findings
P-DARTS reduces search cost to 7 hours on a single GPU.
Achieves state-of-the-art transfer performance on multiple vision benchmarks.
Improves architecture robustness across different tasks.
Abstract
With the rapid development of neural architecture search (NAS), researchers found powerful network architectures for a wide range of vision tasks. However, it remains unclear if the searched architecture can transfer across different types of tasks as manually designed ones did. This paper puts forward this problem, referred to as NAS in the wild, which explores the possibility of finding the optimal architecture in a proxy dataset and then deploying it to mostly unseen scenarios. We instantiate this setting using a currently popular algorithm named differentiable architecture search (DARTS), which often suffers unsatisfying performance while being transferred across different tasks. We argue that the accuracy drop originates from the formulation that uses a super-network for search but a sub-network for re-training. The different properties of these stages have resulted in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsDifferentiable Architecture Search · Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
