$\beta$-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search
Peng Ye, Tong He, Baopu Li, Tao Chen, Lei Bai, Wanli Ouyang

TL;DR
This paper introduces Beta-Decay regularization and flooding regularization to improve the stability, generalization, and robustness of differentiable neural architecture search methods like DARTS, making the search process more reliable and the architectures more transferable.
Contribution
The paper proposes Beta-Decay regularization for stability and generalization in DARTS, and flooding regularization to enhance proxy robustness, with comprehensive theoretical analysis and extensive experiments.
Findings
Beta-Decay stabilizes the NAS search process.
Beta-Decay improves transferability of searched architectures.
Flooding regularization enhances robustness across proxies.
Abstract
Neural Architecture Search has attracted increasing attention in recent years. Among them, differential NAS approaches such as DARTS, have gained popularity for the search efficiency. However, they still suffer from three main issues, that are, the weak stability due to the performance collapse, the poor generalization ability of the searched architectures, and the inferior robustness to different kinds of proxies. To solve the stability and generalization problems, a simple-but-effective regularization method, termed as Beta-Decay, is proposed to regularize the DARTS-based NAS searching process (i.e., -DARTS). Specifically, Beta-Decay regularization can impose constraints to keep the value and variance of activated architecture parameters from being too large, thereby ensuring fair competition among architecture parameters and making the supernet less sensitive to the impact of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsDifferentiable Architecture Search
