Neural Network Pruning Through Constrained Reinforcement Learning
Shehryar Malik, Muhammad Umair Haider, Omer Iqbal, Murtaza Taj

TL;DR
This paper introduces a constrained reinforcement learning approach for neural network pruning that respects computational budgets and adapts to arbitrary, possibly non-differentiable functions, outperforming existing methods.
Contribution
It presents a novel pruning methodology using constrained reinforcement learning that handles non-differentiable functions and respects computational constraints.
Findings
Reduced 83-92.90% of parameters on VGG variants with maintained or improved accuracy.
Achieved 75.09% parameter reduction on ResNet18 without accuracy loss.
Outperformed state-of-the-art pruning methods on standard datasets.
Abstract
Network pruning reduces the size of neural networks by removing (pruning) neurons such that the performance drop is minimal. Traditional pruning approaches focus on designing metrics to quantify the usefulness of a neuron which is often quite tedious and sub-optimal. More recent approaches have instead focused on training auxiliary networks to automatically learn how useful each neuron is however, they often do not take computational limitations into account. In this work, we propose a general methodology for pruning neural networks. Our proposed methodology can prune neural networks to respect pre-defined computational budgets on arbitrary, possibly non-differentiable, functions. Furthermore, we only assume the ability to be able to evaluate these functions for different inputs, and hence they do not need to be fully specified beforehand. We achieve this by proposing a novel pruning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsPruning · Softmax · Convolution · Dropout · Max Pooling · Dense Connections
