Partial Connection Based on Channel Attention for Differentiable Neural Architecture Search
Yu Xue, Jiafeng Qin

TL;DR
This paper introduces ADARTS, a neural architecture search method that uses channel attention to select important features, improving search efficiency, memory usage, and stability, leading to better-performing models on CIFAR datasets.
Contribution
The paper proposes a novel partial channel connection method based on channel attention for DARTS, addressing issues of unfair operation competition and memory inefficiency.
Findings
Achieved 2.46% error on CIFAR-10
Achieved 17.06% error on CIFAR-100
Effectively reduces skip connections and improves network performance
Abstract
Differentiable neural architecture search (DARTS), as a gradient-guided search method, greatly reduces the cost of computation and speeds up the search. In DARTS, the architecture parameters are introduced to the candidate operations, but the parameters of some weight-equipped operations may not be trained well in the initial stage, which causes unfair competition between candidate operations. The weight-free operations appear in large numbers which results in the phenomenon of performance crash. Besides, a lot of memory will be occupied during training supernet which causes the memory utilization to be low. In this paper, a partial channel connection based on channel attention for differentiable neural architecture search (ADARTS) is proposed. Some channels with higher weights are selected through the attention mechanism and sent into the operation space while the other channels are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDifferentiable Architecture Search
