MSR-DARTS: Minimum Stable Rank of Differentiable Architecture Search
Kengo Machida, Kuniaki Uto, Koichi Shinoda, Taiji Suzuki

TL;DR
MSR-DARTS introduces a novel architecture search method that selects convolution operators based on minimum stable rank to improve generalization, demonstrating superior performance on CIFAR-10 and ImageNet datasets.
Contribution
It replaces the traditional architecture optimization in DARTS with a selection process based on stable rank, enhancing model generalization.
Findings
Achieves 2.54% error on CIFAR-10 with 4.0M parameters
Attains 23.9% top-1 error on ImageNet
Operates within 0.3 GPU-days
Abstract
In neural architecture search (NAS), differentiable architecture search (DARTS) has recently attracted much attention due to its high efficiency. It defines an over-parameterized network with mixed edges, each of which represents all operator candidates, and jointly optimizes the weights of the network and its architecture in an alternating manner. However, this method finds a model with the weights converging faster than the others, and such a model with fastest convergence often leads to overfitting. Accordingly, the resulting model cannot always be well-generalized. To overcome this problem, we propose a method called minimum stable rank DARTS (MSR-DARTS), for finding a model with the best generalization error by replacing architecture optimization with the selection process using the minimum stable rank criterion. Specifically, a convolution operator is represented by a matrix, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsDifferentiable Architecture Search · Convolution
