Learning Instance-wise Sparsity for Accelerating Deep Models
Chuanjian Liu, Yunhe Wang, Kai Han, Chunjing Xu, Chang Xu

TL;DR
This paper introduces an instance-wise feature pruning method that accelerates deep neural networks by selectively eliminating informative features at intermediate layers, based on feature decay regularization and layer variability analysis.
Contribution
It proposes a novel data-aware feature pruning approach that improves inference speed without sacrificing accuracy by identifying and removing less informative features per instance.
Findings
Significant acceleration in inference time on benchmark datasets.
Maintains high accuracy despite feature pruning.
Effective layer selection based on coefficient of variation.
Abstract
Exploring deep convolutional neural networks of high efficiency and low memory usage is very essential for a wide variety of machine learning tasks. Most of existing approaches used to accelerate deep models by manipulating parameters or filters without data, e.g., pruning and decomposition. In contrast, we study this problem from a different perspective by respecting the difference between data. An instance-wise feature pruning is developed by identifying informative features for different instances. Specifically, by investigating a feature decay regularization, we expect intermediate feature maps of each instance in deep neural networks to be sparse while preserving the overall network performance. During online inference, subtle features of input images extracted by intermediate layers of a well-trained neural network can be eliminated to accelerate the subsequent calculations. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image and Signal Denoising Methods · Advanced Image Processing Techniques
MethodsPruning
