OStr-DARTS: Differentiable Neural Architecture Search based on Operation Strength
Le Yang, Ziwei Zheng, Yizeng Han, Shiji Song, Gao Huang, Fan Li

TL;DR
This paper introduces OStr-DARTS, which replaces the magnitude-based operation selection used in DARTS with a criterion based on operation strength, addressing the degeneration issue in differentiable NAS without altering supernet optimization.
Contribution
It proposes a new operation importance criterion based on operation strength, improving DARTS stability and performance.
Findings
Addresses degeneration issue in DARTS
Operation strength criterion improves architecture quality
Effective on NAS-Bench-201 and DARTS spaces
Abstract
Differentiable architecture search (DARTS) has emerged as a promising technique for effective neural architecture search, and it mainly contains two steps to find the high-performance architecture: First, the DARTS supernet that consists of mixed operations will be optimized via gradient descent. Second, the final architecture will be built by the selected operations that contribute the most to the supernet. Although DARTS improves the efficiency of NAS, it suffers from the well-known degeneration issue which can lead to deteriorating architectures. Existing works mainly attribute the degeneration issue to the failure of its supernet optimization, while little attention has been paid to the selection method. In this paper, we cease to apply the widely-used magnitude-based selection method and propose a novel criterion based on operation strength that estimates the importance of an…
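The two steps described in the abstract can be illustrated with a minimal sketch of the standard DARTS baseline: a mixed operation that blends candidate operations with softmax weights over architecture parameters, followed by magnitude-based selection that keeps the operation with the largest parameter. This covers only the magnitude-based method the paper moves away from; the operation strength criterion is not shown because the abstract is truncated before defining it. The candidate operations, the `ALPHAS` values, and all function names are hypothetical toy choices for illustration, not taken from the paper.

```python
import math


def softmax(alphas):
    """Numerically stable softmax over a list of architecture parameters."""
    m = max(alphas)
    exps = [math.exp(a - m) for a in alphas]
    total = sum(exps)
    return [e / total for e in exps]


def mixed_operation(x, ops, alphas):
    """Step 1 (supernet edge): softmax-weighted sum of all candidate ops."""
    weights = softmax(alphas)
    return sum(w * op(x) for w, op in zip(weights, ops))


def select_by_magnitude(op_names, alphas):
    """Step 2 (baseline selection): keep the op with the largest alpha."""
    best = max(range(len(alphas)), key=lambda i: alphas[i])
    return op_names[best]


# Hypothetical toy candidate set on scalars (assumption, not from the paper).
OPS = {
    "skip_connect": lambda x: x,
    "double": lambda x: 2.0 * x,
    "zero": lambda x: 0.0,
}

# Hypothetical architecture parameters after supernet training (assumption).
ALPHAS = [1.2, 0.3, -0.5]

if __name__ == "__main__":
    out = mixed_operation(3.0, list(OPS.values()), ALPHAS)
    print("mixed output:", out)
    print("selected op:", select_by_magnitude(list(OPS), ALPHAS))
```

In this toy setting the selection depends only on the size of each architecture parameter, which is the property the abstract identifies as receiving little scrutiny in prior work on degeneration.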
Taxonomy
Topics: Neural Networks and Applications · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
Methods: Softmax · Attention Is All You Need · Differentiable Architecture Search
