You Only Search Once: Single Shot Neural Architecture Search via Direct Sparse Optimization
Xinbang Zhang, Zehao Huang, Naiyan Wang

TL;DR
This paper introduces DSO-NAS, a novel neural architecture search method that uses sparse optimization and pruning, enabling efficient and differentiable search directly applicable to large datasets like ImageNet.
Contribution
The paper proposes a new NAS approach based on sparse regularization and pruning, avoiding evolutionary or reinforcement learning methods, and achieves competitive results efficiently.
Findings
Achieved 2.84% test error on CIFAR-10.
Achieved 25.4% test error on ImageNet with 600M FLOPs.
Completed search on ImageNet in 18 hours using 8 GPUs.
Abstract
Recently Neural Architecture Search (NAS) has aroused great interest in both academia and industry, however it remains challenging because of its huge and non-continuous search space. Instead of applying evolutionary algorithm or reinforcement learning as previous works, this paper proposes a Direct Sparse Optimization NAS (DSO-NAS) method. In DSO-NAS, we provide a novel model pruning view to NAS problem. In specific, we start from a completely connected block, and then introduce scaling factors to scale the information flow between operations. Next, we impose sparse regularizations to prune useless connections in the architecture. Lastly, we derive an efficient and theoretically sound optimization method to solve it. Our method enjoys both advantages of differentiability and efficiency, therefore can be directly applied to large datasets like ImageNet. Particularly, On CIFAR-10…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
