Deep $k$-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions
Junru Wu, Yue Wang, Zhenyu Wu, Zhangyang Wang, Ashok Veeraraghavan,, Yingyan Lin

TL;DR
This paper introduces Deep $k$-Means, a method that compresses CNNs by clustering weights with a spectral regularization to make hard assignments during re-training, reducing energy consumption without accuracy loss.
Contribution
It proposes a novel spectral $k$-means regularization for CNN weight clustering and introduces improved energy estimation metrics for hardware implementation.
Findings
Achieves high compression ratios with no accuracy loss.
Reduces energy consumption effectively on CNN hardware.
Demonstrates promising results across multiple CNN models.
Abstract
The current trend of pushing CNNs deeper with convolutions has created a pressing demand to achieve higher compression gains on CNNs where convolutions dominate the computation and parameter amount (e.g., GoogLeNet, ResNet and Wide ResNet). Further, the high energy consumption of convolutions limits its deployment on mobile devices. To this end, we proposed a simple yet effective scheme for compressing convolutions though applying k-means clustering on the weights, compression is achieved through weight-sharing, by only recording cluster centers and weight assignment indexes. We then introduced a novel spectrally relaxed -means regularization, which tends to make hard assignments of convolutional layer weights to learned cluster centers during re-training. We additionally propose an improved set of metrics to estimate energy consumption of CNN hardware implementations, whose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques
MethodsAverage Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · Dropout · Dense Connections · Softmax · GoogLeNet · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution
