Exchangeability in Neural Network and its Application to Dynamic Pruning
Pu (Luke) Yi, Tianlang Chen, Yifan Yang, Sara Achour

TL;DR
This paper introduces ExPrune, a dynamic pruning method based on exchangeability theory that reduces neural network FLOPs with minimal accuracy loss across various models and can complement static pruning.
Contribution
ExPrune is a novel, theory-grounded dynamic pruning technique that operates without altering model architecture or training, enabling efficient partial computation on a per-input basis.
Findings
Achieves 10.98-17.33% FLOPs reduction with negligible accuracy drop.
Achieves 21.61-27.16% FLOPs reduction with at most 1% accuracy drop.
Provides additional FLOPs reduction on already pruned models.
Abstract
Modern neural networks (NN) contain an ever-growing number of parameters, substantially increasing the memory and computational cost of inference. Researchers have explored various ways to reduce the inference cost of NNs by reducing the model size before deployment and dynamically pruning the inference computation at runtime. In this work, we present ExPrune, a general, dynamic pruning optimization that enables multi-granularity partial computation on a per-input basis. ExPrune requires no change to the model architecture or the training algorithm. ExPrune is based on our theoretical results that the relationship between certain model parameters and intermediate values can be described by a statistical property called exchangeability. By identifying exchangeable parameters and values in the model, we are able to first partially evaluate the network, analyze the statistics of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Pruning
