Heterogeneous Learning Rate Scheduling for Neural Architecture Search on Long-Tailed Datasets
Chenxia Tang

TL;DR
This paper introduces an adaptive learning rate scheduling method for neural architecture search on long-tailed datasets, improving stability and performance when applying DARTS to imbalanced data.
Contribution
We propose a novel adaptive learning rate strategy for DARTS that enhances its applicability to long-tailed datasets, addressing issues caused by traditional re-sampling techniques.
Findings
Our method achieves comparable accuracy to standard DARTS on long-tailed CIFAR-10.
Re-sampling methods can harm DARTS performance in imbalanced scenarios.
Adaptive learning rate scheduling prevents disruption of learned representations.
Abstract
In this paper, we attempt to address the challenge of applying Neural Architecture Search (NAS) algorithms, specifically the Differentiable Architecture Search (DARTS), to long-tailed datasets where class distribution is highly imbalanced. We observe that traditional re-sampling and re-weighting techniques, which are effective in standard classification tasks, lead to performance degradation when combined with DARTS. To mitigate this, we propose a novel adaptive learning rate scheduling strategy tailored for the architecture parameters of DARTS when integrated with the Bilateral Branch Network (BBN) for handling imbalanced datasets. Our approach dynamically adjusts the learning rate of the architecture parameters based on the training epoch, preventing the disruption of well-trained representations in the later stages of training. Additionally, we explore the impact of branch mixing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Stochastic Gradient Optimization Techniques · Machine Learning and Data Classification
MethodsGumbel Softmax · Differentiable Neural Architecture Search · Differentiable Architecture Search
