Deep $k$-Means: Re-Training and Parameter Sharing with Harder Cluster   Assignments for Compressing Deep Convolutions

Junru Wu; Yue Wang; Zhenyu Wu; Zhangyang Wang; Ashok Veeraraghavan,; Yingyan Lin

arXiv:1806.09228·cs.LG·June 26, 2018·84 cites

Deep $k$-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions

Junru Wu, Yue Wang, Zhenyu Wu, Zhangyang Wang, Ashok Veeraraghavan,, Yingyan Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces Deep $k$-Means, a method that compresses CNNs by clustering weights with a spectral regularization to make hard assignments during re-training, reducing energy consumption without accuracy loss.

Contribution

It proposes a novel spectral $k$-means regularization for CNN weight clustering and introduces improved energy estimation metrics for hardware implementation.

Findings

01

Achieves high compression ratios with no accuracy loss.

02

Reduces energy consumption effectively on CNN hardware.

03

Demonstrates promising results across multiple CNN models.

Abstract

The current trend of pushing CNNs deeper with convolutions has created a pressing demand to achieve higher compression gains on CNNs where convolutions dominate the computation and parameter amount (e.g., GoogLeNet, ResNet and Wide ResNet). Further, the high energy consumption of convolutions limits its deployment on mobile devices. To this end, we proposed a simple yet effective scheme for compressing convolutions though applying k-means clustering on the weights, compression is achieved through weight-sharing, by only recording $K$ cluster centers and weight assignment indexes. We then introduced a novel spectrally relaxed $k$ -means regularization, which tends to make hard assignments of convolutional layer weights to $K$ learned cluster centers during re-training. We additionally propose an improved set of metrics to estimate energy consumption of CNN hardware implementations, whose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Sandbox3aster/Deep-K-Means
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques

MethodsAverage Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · Dropout · Dense Connections · Softmax · GoogLeNet · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution