Layer-compensated Pruning for Resource-constrained Convolutional Neural   Networks

Ting-Wu Chin; Cha Zhang; Diana Marculescu

arXiv:1810.00518·cs.CV·October 19, 2018·38 cites

Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks

Ting-Wu Chin, Cha Zhang, Diana Marculescu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a layer-compensated pruning method for resource-constrained CNNs that combines filter pruning decisions into a global ranking, leveraging meta-learning to improve efficiency and accuracy.

Contribution

It proposes a novel layer-compensated pruning algorithm that unifies pruning decisions and employs meta-learning for better solutions, reducing accuracy loss and meta-learning time.

Findings

01

Reduces accuracy gap from 0.9% to 0.7%

02

Achieves 8x faster meta-learning process

03

Effective on ResNet and MobileNetV2 across multiple datasets

Abstract

Resource-efficient convolution neural networks enable not only the intelligence on edge devices but also opportunities in system-level optimization such as scheduling. In this work, we aim to improve the performance of resource-constrained filter pruning by merging two sub-problems commonly considered, i.e., (i) how many filters to prune for each layer and (ii) which filters to prune given a per-layer pruning budget, into a global filter ranking problem. Our framework entails a novel algorithm, dubbed layer-compensated pruning, where meta-learning is involved to determine better solutions. We show empirically that the proposed algorithm is superior to prior art in both effectiveness and efficiency. Specifically, we reduce the accuracy gap between the pruned and original networks from 0.9% to 0.7% with 8x reduction in time needed for meta-learning, i.e., from 1 hour down to 7 minutes. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cmu-enyac/LeGR
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Image Enhancement Techniques

MethodsPruning · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Batch Normalization · Inverted Residual Block · Average Pooling · 1x1 Convolution · Tether Customer Service Number +1-833-534-1729 · Convolution