Neural Architecture Search as Sparse Supernet

Yan Wu; Aoming Liu; Zhiwu Huang; Siwei Zhang; Luc Van Gool

arXiv:2007.16112·cs.CV·April 1, 2021·1 cites

Neural Architecture Search as Sparse Supernet

Yan Wu, Aoming Liu, Zhiwu Huang, Siwei Zhang, Luc Van Gool

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel approach to Neural Architecture Search by modeling it as a sparse supernet with a continuous architecture representation, enabling automatic mixed-path architecture optimization.

Contribution

It proposes a new sparse supernet model with a continuous architecture representation and a hierarchical optimization algorithm for efficient NAS.

Findings

01

Successfully searches for compact neural architectures.

02

Demonstrates effectiveness on CNN and RNN search tasks.

03

Achieves general and powerful architectures.

Abstract

This paper aims at enlarging the problem of Neural Architecture Search (NAS) from Single-Path and Multi-Path Search to automated Mixed-Path Search. In particular, we model the NAS problem as a sparse supernet using a new continuous architecture representation with a mixture of sparsity constraints. The sparse supernet enables us to automatically achieve sparsely-mixed paths upon a compact set of nodes. To optimize the proposed sparse supernet, we exploit a hierarchical accelerated proximal gradient algorithm within a bi-level optimization framework. Extensive experiments on Convolutional Neural Network and Recurrent Neural Network search demonstrate that the proposed method is capable of searching for compact, general and powerful neural architectures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Neural Architecture Search as Sparse Supernet· underline

Taxonomy

TopicsAdvanced Neural Network Applications · Infrastructure Maintenance and Monitoring · Robotic Path Planning Algorithms