Compressing Convolutional Neural Networks

Wenlin Chen; James T. Wilson; Stephen Tyree; Kilian Q. Weinberger,; Yixin Chen

arXiv:1506.04449·cs.LG·June 16, 2015·32 cites

Compressing Convolutional Neural Networks

Wenlin Chen, James T. Wilson, Stephen Tyree, Kilian Q. Weinberger,, Yixin Chen

PDF

Open Access

TL;DR

This paper introduces FreshNets, a novel CNN compression method that leverages frequency domain analysis and hashing to significantly reduce model size while maintaining performance.

Contribution

The paper proposes a new architecture, FreshNets, that exploits frequency domain properties and hashing to compress CNNs more effectively than existing methods.

Findings

01

FreshNets achieve higher compression ratios with minimal accuracy loss.

02

Frequency-based hashing effectively captures redundancy in CNN weights.

03

Fewer parameters are allocated to high-frequency components, optimizing compression.

Abstract

Convolutional neural networks (CNN) are increasingly used in many areas of computer vision. They are particularly attractive because of their ability to "absorb" great quantities of labeled data through millions of parameters. However, as model sizes increase, so do the storage and memory requirements of the classifiers. We present a novel network architecture, Frequency-Sensitive Hashed Nets (FreshNets), which exploits inherent redundancy in both convolutional layers and fully-connected layers of a deep learning model, leading to dramatic savings in memory and storage consumption. Based on the key observation that the weights of learned convolutional filters are typically smooth and low-frequency, we first convert filter weights to the frequency domain with a discrete cosine transform (DCT) and use a low-cost hash function to randomly group frequency parameters into hash buckets. All…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications

MethodsDiscrete Cosine Transform