Making Differentiable Architecture Search less local
Erik Bodin, Federico Tomasi, Zhenwen Dai

TL;DR
This paper proposes a more global optimization scheme for differentiable neural architecture search (DARTS) to mitigate performance collapse caused by poor local optima, leading to better architectures.
Contribution
It introduces a global optimization approach for DARTS that improves search outcomes without altering the original problem formulation.
Findings
Discover architectures with better test performance
Achieve architectures with fewer parameters
Reduce the occurrence of performance collapse
Abstract
Neural architecture search (NAS) is a recent methodology for automating the design of neural network architectures. Differentiable neural architecture search (DARTS) is a promising NAS approach that dramatically increases search efficiency. However, it has been shown to suffer from performance collapse, where the search often leads to detrimental architectures. Many recent works try to address this issue of DARTS by identifying indicators for early stopping, regularising the search objective to reduce the dominance of some operations, or changing the parameterisation of the search problem. In this work, we hypothesise that performance collapses can arise from poor local optima around typical initial architectures and weights. We address this issue by developing a more global optimisation scheme that is able to better explore the space without changing the DARTS problem formulation. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Domain Adaptation and Few-Shot Learning
MethodsDifferentiable Architecture Search
