Model compression as constrained optimization, with application to neural nets. Part V: combining compressions
Miguel \'A. Carreira-Perpi\~n\'an, Yerlan Idelbayev

TL;DR
This paper introduces a constrained optimization framework for combining different neural network compression techniques, demonstrating that hybrid approaches can outperform individual methods depending on the model architecture.
Contribution
It formulates a novel optimization approach for combining compression methods and provides algorithms to learn optimal combinations, showing improved neural network compression results.
Findings
Combining compression methods yields better models than single techniques.
Different neural network architectures benefit from different compression combinations.
Hybrid compression can achieve significant size reduction without accuracy loss.
Abstract
Model compression is generally performed by using quantization, low-rank approximation or pruning, for which various algorithms have been researched in recent years. One fundamental question is: what types of compression work better for a given model? Or even better: can we improve by combining compressions in a suitable way? We formulate this generally as a problem of optimizing the loss but where the weights are constrained to equal an additive combination of separately compressed parts; and we give an algorithm to learn the corresponding parts' parameters. Experimentally with deep neural nets, we observe that 1) we can find significantly better models in the error-compression space, indicating that different compression types have complementary benefits, and 2) the best type of combination depends exquisitely on the type of neural net. For example, we can compress ResNets and AlexNet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Algorithms · Medical Image Segmentation Techniques
MethodsMax Pooling · Convolution · Dense Connections · Softmax · Dropout
