Compressing Deep Convolutional Networks using Vector Quantization

Yunchao Gong; Liu Liu; Ming Yang; Lubomir Bourdev

arXiv:1412.6115·cs.CV·December 22, 2014·1.0k cites

Compressing Deep Convolutional Networks using Vector Quantization

Yunchao Gong, Liu Liu, Ming Yang, Lubomir Bourdev

PDF

Open Access

TL;DR

This paper introduces a vector quantization approach to significantly compress deep CNN models, especially dense layers, enabling deployment on resource-limited devices with minimal accuracy loss.

Contribution

It demonstrates that vector quantization methods outperform matrix factorization for CNN compression, achieving 16-24x size reduction with only 1% accuracy loss.

Findings

01

Vector quantization outperforms matrix factorization in CNN compression.

02

Achieved 16-24x compression with 1% accuracy loss on ImageNet.

03

Simple k-means clustering effectively balances model size and accuracy.

Abstract

Deep convolutional neural networks (CNN) has become the most promising method for object recognition, repeatedly demonstrating record breaking results for image classification and object detection in recent years. However, a very deep CNN generally involves many layers with millions of parameters, making the storage of the network model to be extremely large. This prohibits the usage of deep CNNs on resource limited hardware, especially cell phones or other embedded devices. In this paper, we tackle this model storage issue by investigating information theoretical vector quantization methods for compressing the parameters of CNNs. In particular, we have found in terms of compressing the most storage demanding dense connected layers, vector quantization methods have a clear gain over existing matrix factorization methods. Simply applying k-means clustering to the weights or conducting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

Methodsk-Means Clustering