TL;DR
This paper introduces an Ising energy model-based pruning method for CNNs that effectively reduces over 50% of parameters with minimal accuracy loss, improving network efficiency.
Contribution
It presents a novel Ising energy model framework for pruning CNNs, focusing on reducing redundancy and inactive units, which is a new approach in neural network pruning.
Findings
Achieves over 50% parameter pruning rate.
Maintains less than 10% Top-1 accuracy drop.
Maintains less than 5% Top-5 accuracy drop.
Abstract
Pruning is one of the major methods to compress deep neural networks. In this paper, we propose an Ising energy model within an optimization framework for pruning convolutional kernels and hidden units. This model is designed to reduce redundancy between weight kernels and detect inactive kernels/hidden units. Our experiments using ResNets, AlexNet, and SqueezeNet on CIFAR-10 and CIFAR-100 datasets show that the proposed method on average can achieve a pruning rate of more than of the trainable parameters with approximately and drop of Top-1 and Top-5 classification accuracy, respectively.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Xavier Initialization · Max Pooling · Softmax · Dropout · Residual Connection · Fire Module · Global Average Pooling
