Compression of Deep Convolutional Neural Networks under Joint Sparsity   Constraints

Yoojin Choi; Mostafa El-Khamy; Jungwon Lee

arXiv:1805.08303·cs.CV·October 30, 2018·5 cites

Compression of Deep Convolutional Neural Networks under Joint Sparsity Constraints

Yoojin Choi, Mostafa El-Khamy, Jungwon Lee

PDF

Open Access

TL;DR

This paper introduces a universal CNN compression framework that produces a single sparse model adaptable to different hardware platforms and convolution methods, significantly reducing model size and computational cost.

Contribution

The proposed method creates a universal compressed CNN model with joint sparsity in spatial and Winograd domains, enabling deployment without re-training and further complexity reduction.

Findings

01

Achieved up to 47.7x compression on AlexNet

02

Reduced computational cost by up to 23.5x

03

Enabled universal deployment across platforms

Abstract

We consider the optimization of deep convolutional neural networks (CNNs) such that they provide good performance while having reduced complexity if deployed on either conventional systems utilizing spatial-domain convolution or lower complexity systems designed for Winograd convolution. Furthermore, we explore the universal quantization and compression of these networks. In particular, the proposed framework produces one compressed model whose convolutional filters can be made sparse either in the spatial domain or in the Winograd domain. Hence, one compressed model can be deployed universally on any platform, without need for re-training on the deployed platform, and the sparsity of its convolutional filters can be exploited for further complexity reduction in either domain. To get a better compression ratio, the sparse model is compressed in the spatial domain which has a less number…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Sparse and Compressive Sensing Techniques · Image and Signal Denoising Methods

Methods1x1 Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax · How do I speak to a person at Expedia?-/+/ · Convolution