2PFPCE: Two-Phase Filter Pruning Based on Conditional Entropy
Chuhan Min, Aosen Wang, Yiran Chen, Wenyao Xu, Xin Chen

TL;DR
This paper introduces 2PFPCE, a two-phase filter pruning method based on conditional entropy, to significantly reduce CNN computational costs with minimal accuracy loss, suitable for resource-limited applications.
Contribution
The paper proposes a novel two-phase filter pruning framework using conditional entropy and a maximum-entropy filter freeze technique, improving compression and inference speed of CNNs.
Findings
Achieves 10x FLOPs reduction on VGG-16.
Reduces inference time by 46%.
Maintains 98% of original accuracy.
Abstract
Deep Convolutional Neural Networks~(CNNs) offer remarkable performance of classifications and regressions in many high-dimensional problems and have been widely utilized in real-word cognitive applications. However, high computational cost of CNNs greatly hinder their deployment in resource-constrained applications, real-time systems and edge computing platforms. To overcome this challenge, we propose a novel filter-pruning framework, two-phase filter pruning based on conditional entropy, namely \textit{2PFPCE}, to compress the CNN models and reduce the inference time with marginal performance degradation. In our proposed method, we formulate filter pruning process as an optimization problem and propose a novel filter selection criteria measured by conditional entropy. Based on the assumption that the representation of neurons shall be evenly distributed, we also develop a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
MethodsPruning
