DSNAS: Direct Neural Architecture Search without Parameter Retraining

Shoukang Hu; Sirui Xie; Hehui Zheng; Chunxiao Liu; Jianping Shi,; Xunying Liu; Dahua Lin

arXiv:2002.09128·cs.LG·April 2, 2020·26 cites

DSNAS: Direct Neural Architecture Search without Parameter Retraining

Shoukang Hu, Sirui Xie, Hehui Zheng, Chunxiao Liu, Jianping Shi,, Xunying Liu, Dahua Lin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces DSNAS, a novel differentiable neural architecture search method that directly finds deployable models without retraining, significantly reducing search time while maintaining competitive accuracy.

Contribution

The paper proposes a new end-to-end NAS framework that optimizes architecture and parameters simultaneously, enabling direct deployment without retraining.

Findings

01

Achieves 74.4% accuracy on ImageNet with 420 GPU hours

02

Reduces total NAS time by over 34% compared to two-stage methods

03

Produces models that can be deployed directly without parameter retraining

Abstract

If NAS methods are solutions, what is the problem? Most existing NAS methods require two-stage parameter optimization. However, performance of the same architecture in the two stages correlates poorly. In this work, we propose a new problem definition for NAS, task-specific end-to-end, based on this observation. We argue that given a computer vision task for which a NAS method is expected, this definition can reduce the vaguely-defined NAS evaluation to i) accuracy of this task and ii) the total computation consumed to finally obtain a model with satisfying accuracy. Seeing that most existing methods do not solve this problem directly, we propose DSNAS, an efficient differentiable NAS framework that simultaneously optimizes architecture and parameters with a low-biased Monte Carlo estimate. Child networks derived from DSNAS can be deployed directly without parameter retraining.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SNAS-Series/SNAS-Series
pytorchOfficial

Videos

DSNAS: Direct Neural Architecture Search Without Parameter Retraining· youtube

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsDifferentiable Neural Architecture Search