Data-Driven Sparse Structure Selection for Deep Neural Networks

Zehao Huang; Naiyan Wang

arXiv:1707.01213·cs.CV·September 6, 2018·32 cites

Data-Driven Sparse Structure Selection for Deep Neural Networks

Zehao Huang, Naiyan Wang

PDF

Open Access 2 Repos

TL;DR

This paper introduces an end-to-end framework for learning and pruning deep neural networks by applying sparsity regularization to scaling factors, enabling automatic structure selection and model compression.

Contribution

It proposes a novel parameter-based sparsity regularization method with a modified APG optimizer for efficient network pruning without extensive trials.

Findings

01

Effective pruning of CNNs with minimal fine-tuning

02

Automatic adaptive depth and width selection

03

Comparable or superior performance to existing methods

Abstract

Deep convolutional neural networks have liberated its extraordinary power on various tasks. However, it is still very challenging to deploy state-of-the-art models into real-world applications due to their high computational complexity. How can we design a compact and effective network without massive experiments and expert knowledge? In this paper, we propose a simple and effective framework to learn and prune deep models in an end-to-end manner. In our framework, a new type of parameter -- scaling factor is first introduced to scale the outputs of specific structures, such as neurons, groups or residual blocks. Then we add sparsity regularizations on these factors, and solve this optimization problem by a modified stochastic Accelerated Proximal Gradient (APG) method. By forcing some of the factors to zero, we can safely remove the corresponding structures, thus prune the unimportant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM