Making Differentiable Architecture Search less local

Erik Bodin; Federico Tomasi; Zhenwen Dai

arXiv:2104.10450 · cs.LG · April 22, 2021

TL;DR

This paper proposes a more global optimization scheme for differentiable neural architecture search (DARTS) to mitigate performance collapse caused by poor local optima, leading to better architectures.

Contribution

It introduces a global optimization approach for DARTS that improves search outcomes without altering the original problem formulation.

Findings

01. Discover architectures with better test performance

02. Achieve architectures with fewer parameters

03. Reduce the occurrence of performance collapse

Abstract

Neural architecture search (NAS) is a recent methodology for automating the design of neural network architectures. Differentiable neural architecture search (DARTS) is a promising NAS approach that dramatically increases search efficiency. However, it has been shown to suffer from performance collapse, where the search often leads to detrimental architectures. Many recent works try to address this issue of DARTS by identifying indicators for early stopping, regularising the search objective to reduce the dominance of some operations, or changing the parameterisation of the search problem. In this work, we hypothesise that performance collapses can arise from poor local optima around typical initial architectures and weights. We address this issue by developing a more global optimisation scheme that is able to better explore the space without changing the DARTS problem formulation. Our…
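
For readers unfamiliar with the DARTS formulation that the abstract refers to, the following is a minimal sketch of its continuous relaxation: each edge computes a softmax-weighted mixture of candidate operations, and the architecture parameters are later discretised by keeping the strongest operation. The operation set `CANDIDATE_OPS`, the scalar input, and the function names are hypothetical stand-ins chosen for illustration; this is not the authors' code, and it does not implement the more global optimisation scheme proposed in the paper.

```python
import math

def softmax(alphas):
    """Numerically stable softmax over a list of architecture parameters."""
    m = max(alphas)
    exps = [math.exp(a - m) for a in alphas]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical candidate operations acting on a scalar input (assumption:
# real search spaces use convolutions, pooling, and so on over tensors).
CANDIDATE_OPS = {
    "skip_connect": lambda x: x,
    "zero": lambda x: 0.0,
    "scale_2x": lambda x: 2.0 * x,  # stand-in for a parametric operation
}

def mixed_op(x, alphas):
    """Continuous relaxation: softmax-weighted sum of all candidate operations."""
    weights = softmax(alphas)
    return sum(w * op(x) for w, op in zip(weights, CANDIDATE_OPS.values()))

def discretise(alphas):
    """Simplified discretisation: keep the operation with the largest alpha."""
    names = list(CANDIDATE_OPS)
    best = max(range(len(alphas)), key=lambda i: alphas[i])
    return names[best]

# A typical initialisation gives every operation equal weight.
uniform_alphas = [0.0, 0.0, 0.0]
mixed_output = mixed_op(3.0, uniform_alphas)
chosen = discretise([0.1, -1.0, 2.0])
```

Because the search starts from near-uniform architecture parameters and follows gradients from there, the final discretised choice depends on the region around that starting point, which is the setting in which the abstract's hypothesis about poor local optima applies.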

Taxonomy

Topics: Advanced Neural Network Applications · Neural Networks and Applications · Domain Adaptation and Few-Shot Learning

Methods: Differentiable Architecture Search