Weight Fixing Networks
Christopher Subia-Waud, Srinandan Dasmahapatra

TL;DR
This paper introduces Weight Fixing Networks (WFN), a novel method for lossless neural network compression that significantly reduces unique weights and entropy while maintaining performance.
Contribution
The paper proposes WFN, a new approach to achieve lossless compression of neural networks by minimizing unique weights and entropy, outperforming state-of-the-art quantisation methods.
Findings
56x fewer unique weights on Imagenet
1.9x lower weight-space entropy than SOTA
Lossless compression with maintained performance
Abstract
Modern iterations of deep learning models contain millions (billions) of unique parameters, each represented by a b-bit number. Popular attempts at compressing neural networks (such as pruning and quantisation) have shown that many of the parameters are superfluous, which we can remove (pruning) or express with less than b-bits (quantisation) without hindering performance. Here we look to go much further in minimising the information content of networks. Rather than a channel or layer-wise encoding, we look to lossless whole-network quantisation to minimise the entropy and number of unique parameters in a network. We propose a new method, which we call Weight Fixing Networks (WFN) that we design to realise four model outcome objectives: i) very few unique weights, ii) low-entropy weight encodings, iii) unique weight values which are amenable to energy-saving versions of hardware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Brain Tumor Detection and Classification
MethodsPruning
