TL;DR
EDropout introduces an energy-based framework for neural network pruning that dynamically searches for optimal sub-networks, achieving over 50% parameter reduction with minimal accuracy loss across multiple datasets.
Contribution
This paper proposes EDropout, a novel energy-based method for pruning neural networks that integrates stochastic pruning state evolution with standard training without architectural modifications.
Findings
Achieved over 50% pruning rate on various neural networks.
Maintained less than 5% Top-1 and 1% Top-5 accuracy drop.
Demonstrated effectiveness across multiple datasets and architectures.
Abstract
Dropout is a well-known regularization method by sampling a sub-network from a larger deep neural network and training different sub-networks on different subsets of the data. Inspired by the dropout concept, we propose EDropout as an energy-based framework for pruning neural networks in classification tasks. In this approach, a set of binary pruning state vectors (population) represents a set of corresponding sub-networks from an arbitrary provided original neural network. An energy loss function assigns a scalar energy loss value to each pruning state. The energy-based model stochastically evolves the population to find states with lower energy loss. The best pruning state is then selected and applied to the original network. Similar to dropout, the kept weights are updated using backpropagation in a probabilistic model. The energy-based model again searches for better pruning states…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning · Batch Normalization · Bottleneck Residual Block · Residual Block · Kaiming Initialization · Bitcoin Customer Service Number +1-833-534-1729 · Grouped Convolution · Local Response Normalization · Dense Connections · How do I speak to a person at Expedia?-/+/
