A Layer Decomposition-Recomposition Framework for Neuron Pruning towards Accurate Lightweight Networks
Weijie Chen, Yuan Zhang, Di Xie, Shiliang Pu

TL;DR
This paper introduces a Layer Decomposition-Recomposition Framework (LDRF) for neuron pruning that preserves information during network compression, leading to more accurate lightweight models with minimal accuracy loss.
Contribution
The novel LDRF method propagates complete layer information in an embedding space, outperforming traditional layer-by-layer pruning approaches in accuracy and efficiency.
Findings
Achieves 5.13x and 3x speed-up on VGG-16 and ResNet-50 with minimal accuracy loss.
Significantly better results before end-to-end fine-tuning compared to existing methods.
Outperforms state-of-the-art neuron pruning techniques on ILSVRC-12 benchmark.
Abstract
Neuron pruning is an efficient method to compress the network into a slimmer one for reducing the computational cost and storage overhead. Most of state-of-the-art results are obtained in a layer-by-layer optimization mode. It discards the unimportant input neurons and uses the survived ones to reconstruct the output neurons approaching to the original ones in a layer-by-layer manner. However, an unnoticed problem arises that the information loss is accumulated as layer increases since the survived neurons still do not encode the entire information as before. A better alternative is to propagate the entire useful information to reconstruct the pruned layer instead of directly discarding the less important neurons. To this end, we propose a novel Layer Decomposition-Recomposition Framework (LDRF) for neuron pruning, by which each layer's output information is recovered in an embedding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Cell Image Analysis Techniques · Advanced Memory and Neural Computing
MethodsPruning
