$\Lambda$-DARTS: Mitigating Performance Collapse by Harmonizing Operation Selection among Cells
Sajad Movahedi, Melika Adabinejad, Ayyoob Imani, Arezou Keshavarz,, Mostafa Dehghani, Azadeh Shakery, Babak N. Araabi

TL;DR
This paper analyzes the causes of performance collapse in DARTS, a neural architecture search method, and proposes $\\Lambda$-DARTS with regularization to harmonize operation selection and prevent collapse.
Contribution
The paper provides a novel theoretical analysis of DARTS' convergence issues and introduces two regularization terms to mitigate performance collapse.
Findings
$\\Lambda$-DARTS prevents performance collapse across multiple datasets.
Theoretical analysis links collapse to convergence at softmax saturation points.
Experimental results validate the effectiveness of the proposed regularization.
Abstract
Differentiable neural architecture search (DARTS) is a popular method for neural architecture search (NAS), which performs cell-search and utilizes continuous relaxation to improve the search efficiency via gradient-based optimization. The main shortcoming of DARTS is performance collapse, where the discovered architecture suffers from a pattern of declining quality during search. Performance collapse has become an important topic of research, with many methods trying to solve the issue through either regularization or fundamental changes to DARTS. However, the weight-sharing framework used for cell-search in DARTS and the convergence of architecture parameters has not been analyzed yet. In this paper, we provide a thorough and novel theoretical and empirical analysis on DARTS and its point of convergence. We show that DARTS suffers from a specific structural flaw due to its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsDifferentiable Architecture Search · Softmax
