Weight Compander: A Simple Weight Reparameterization for Regularization
Rinor Cakaj, Jens Mehnert, Bin Yang

TL;DR
This paper introduces weight compander, a simple reparameterization technique for weights in neural networks that enhances regularization by controlling weight magnitudes, leading to better generalization and feature extraction.
Contribution
The paper proposes a novel weight reparameterization method called weight compander, which improves regularization by implicitly restricting weight magnitudes and can be combined with other techniques.
Findings
Improves neural network generalization performance.
Encourages more feature extraction by increasing weight redundancy.
Compatible with existing regularization methods.
Abstract
Regularization is a set of techniques that are used to improve the generalization ability of deep neural networks. In this paper, we introduce weight compander (WC), a novel effective method to improve generalization by reparameterizing each weight in deep neural networks using a nonlinear function. It is a general, intuitive, cheap and easy to implement method, which can be combined with various other regularization techniques. Large weights in deep neural networks are a sign of a more complex network that is overfitted to the training data. Moreover, regularized networks tend to have a greater range of weights around zero with fewer weights centered at zero. We introduce a weight reparameterization function which is applied to each weight and implicitly reduces overfitting by restricting the magnitude of the weights while forcing them away from zero at the same time. This leads to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsALIGN
