Partial Connection Based on Channel Attention for Differentiable Neural   Architecture Search

Yu Xue; Jiafeng Qin

arXiv:2208.00791·cs.LG·August 2, 2022

Partial Connection Based on Channel Attention for Differentiable Neural Architecture Search

Yu Xue, Jiafeng Qin

PDF

TL;DR

This paper introduces ADARTS, a neural architecture search method that uses channel attention to select important features, improving search efficiency, memory usage, and stability, leading to better-performing models on CIFAR datasets.

Contribution

The paper proposes a novel partial channel connection method based on channel attention for DARTS, addressing issues of unfair operation competition and memory inefficiency.

Findings

01

Achieved 2.46% error on CIFAR-10

02

Achieved 17.06% error on CIFAR-100

03

Effectively reduces skip connections and improves network performance

Abstract

Differentiable neural architecture search (DARTS), as a gradient-guided search method, greatly reduces the cost of computation and speeds up the search. In DARTS, the architecture parameters are introduced to the candidate operations, but the parameters of some weight-equipped operations may not be trained well in the initial stage, which causes unfair competition between candidate operations. The weight-free operations appear in large numbers which results in the phenomenon of performance crash. Besides, a lot of memory will be occupied during training supernet which causes the memory utilization to be low. In this paper, a partial channel connection based on channel attention for differentiable neural architecture search (ADARTS) is proposed. Some channels with higher weights are selected through the attention mechanism and sent into the operation space while the other channels are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDifferentiable Architecture Search