DSNAS: Direct Neural Architecture Search without Parameter Retraining
Shoukang Hu, Sirui Xie, Hehui Zheng, Chunxiao Liu, Jianping Shi,, Xunying Liu, Dahua Lin

TL;DR
This paper introduces DSNAS, a novel differentiable neural architecture search method that directly finds deployable models without retraining, significantly reducing search time while maintaining competitive accuracy.
Contribution
The paper proposes a new end-to-end NAS framework that optimizes architecture and parameters simultaneously, enabling direct deployment without retraining.
Findings
Achieves 74.4% accuracy on ImageNet with 420 GPU hours
Reduces total NAS time by over 34% compared to two-stage methods
Produces models that can be deployed directly without parameter retraining
Abstract
If NAS methods are solutions, what is the problem? Most existing NAS methods require two-stage parameter optimization. However, performance of the same architecture in the two stages correlates poorly. In this work, we propose a new problem definition for NAS, task-specific end-to-end, based on this observation. We argue that given a computer vision task for which a NAS method is expected, this definition can reduce the vaguely-defined NAS evaluation to i) accuracy of this task and ii) the total computation consumed to finally obtain a model with satisfying accuracy. Seeing that most existing methods do not solve this problem directly, we propose DSNAS, an efficient differentiable NAS framework that simultaneously optimizes architecture and parameters with a low-biased Monte Carlo estimate. Child networks derived from DSNAS can be deployed directly without parameter retraining.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
DSNAS: Direct Neural Architecture Search Without Parameter Retraining· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsDifferentiable Neural Architecture Search
