Regularizing CNNs with Locally Constrained Decorrelations
Pau Rodr\'iguez, Jordi Gonz\`alez, Guillem Cucurull, Josep M. Gonfaus,, Xavier Roca

TL;DR
This paper introduces OrthoReg, a novel regularization method that enforces local feature orthogonality in CNNs, effectively reducing overfitting and improving accuracy, especially in fully convolutional networks.
Contribution
OrthoReg is a new regularization technique that locally constrains feature decorrelation by enforcing orthogonality on weights, enhancing model capacity utilization.
Findings
OrthoReg improves accuracy bounds even with batch normalization and dropout.
It effectively reduces overfitting on CIFAR-10, CIFAR-100, and SVHN datasets.
The method is particularly suitable for fully convolutional neural networks.
Abstract
Regularization is key for deep learning since it allows training more complex models while keeping lower levels of overfitting. However, the most prevalent regularizations do not leverage all the capacity of the models since they rely on reducing the effective number of parameters. Feature decorrelation is an alternative for using the full capacity of the models but the overfitting reduction margins are too narrow given the overhead it introduces. In this paper, we show that regularizing negatively correlated features is an obstacle for effective decorrelation and present OrthoReg, a novel regularization technique that locally enforces feature orthogonality. As a result, imposing locality constraints in feature decorrelation removes interferences between negatively correlated feature weights, allowing the regularizer to reach higher decorrelation bounds, and reducing the overfitting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsDropout · Batch Normalization
