PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture   Search

Yuhui Xu; Lingxi Xie; Xiaopeng Zhang; Xin Chen; Guo-Jun Qi; Qi Tian,; Hongkai Xiong

arXiv:1907.05737·cs.CV·April 8, 2020·389 cites

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian,, Hongkai Xiong

PDF

Open Access 5 Repos

TL;DR

PC-DARTS introduces a memory-efficient neural architecture search method that samples a subset of channels during search, reducing computational costs while maintaining high performance, demonstrated on CIFAR10 and ImageNet datasets.

Contribution

The paper proposes a novel partial channel sampling strategy with edge normalization to improve efficiency and stability in differentiable architecture search.

Findings

01

Achieves 2.57% error on CIFAR10 in 0.1 GPU-days.

02

Attains 24.2% top-1 error on ImageNet with 3.8 GPU-days.

03

Reduces memory and computational overhead compared to original DARTS.

Abstract

Differentiable architecture search (DARTS) provided a fast solution in finding effective network architectures, but suffered from large memory and computing overheads in jointly training a super-network and searching for an optimal architecture. In this paper, we present a novel approach, namely, Partially-Connected DARTS, by sampling a small part of super-network to reduce the redundancy in exploring the network space, thereby performing a more efficient search without comprising the performance. In particular, we perform operation search in a subset of channels while bypassing the held out part in a shortcut. This strategy may suffer from an undesired inconsistency on selecting the edges of super-net caused by sampling different channels. We alleviate it using edge normalization, which adds a new set of edge-level parameters to reduce uncertainty in search. Thanks to the reduced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Differentiable Architecture Search