TL;DR
This paper introduces class-wise regularizers for deep network representations that improve classification performance by explicitly shaping features per class, demonstrating significant gains on benchmark datasets.
Contribution
The paper proposes two novel class-wise regularizers, cw-CR and cw-VR, which explicitly utilize class information to enhance feature independence and compactness, outperforming traditional regularizers.
Findings
Significant performance improvements over baseline and L1/L2 regularization on 21 out of 22 tasks.
cw-VR achieved the best results on 13 tasks, including ResNet-32/110.
Regularizers are simple, efficient, and effectively shape representations as visualized.
Abstract
Statistical characteristics of deep network representations, such as sparsity and correlation, are known to be relevant to the performance and interpretability of deep learning. When a statistical characteristic is desired, often an adequate regularizer can be designed and applied during the training phase. Typically, such a regularizer aims to manipulate a statistical characteristic over all classes together. For classification tasks, however, it might be advantageous to enforce the desired characteristic per class such that different classes can be better distinguished. Motivated by the idea, we design two class-wise regularizers that explicitly utilize class information: class-wise Covariance Regularizer (cw-CR) and class-wise Variance Regularizer (cw-VR). cw-CR targets to reduce the covariance of representations calculated from the same class samples for encouraging feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
