TL;DR
Net-Trim is a fast, convex optimization-based method for pruning trained neural networks, which maintains internal responses and offers theoretical guarantees on sample complexity and sparsity, enabling efficient network compression.
Contribution
Introduces Net-Trim, a convex post-processing technique for neural network pruning with theoretical analysis and an ADMM-based implementation for efficiency.
Findings
Net-Trim effectively prunes networks while preserving responses.
The method requires O(s log N/s) samples for s-sparse layer reconstruction.
The approach is scalable and theoretically grounded, similar to sparse regression techniques.
Abstract
We develop a fast, tractable technique called Net-Trim for simplifying a trained neural network. The method is a convex post-processing module, which prunes (sparsifies) a trained network layer by layer, while preserving the internal responses. We present a comprehensive analysis of Net-Trim from both the algorithmic and sample complexity standpoints, centered on a fast, scalable convex optimization program. Our analysis includes consistency results between the initial and retrained models before and after Net-Trim application and guarantees on the number of training samples needed to discover a network that can be expressed using a certain number of nonzero terms. Specifically, if there is a set of weights that uses at most terms that can re-create the layer outputs from the layer inputs, we can find these weights from samples, where is the input size.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
