Regularizing CNNs with Locally Constrained Decorrelations

Pau Rodríguez; Jordi Gonzàlez; Guillem Cucurull; Josep M. Gonfaus; Xavier Roca

arXiv:1611.01967 · cs.LG · March 16, 2017 · 84 cites

TL;DR

This paper introduces OrthoReg, a novel regularization method that enforces local feature orthogonality in CNNs, effectively reducing overfitting and improving accuracy, especially in fully convolutional networks.

Contribution

OrthoReg is a new regularization technique that decorrelates features under a locality constraint by enforcing orthogonality directly on the weights, so the model can use more of its capacity.

Findings

01 OrthoReg improves accuracy even in networks that already use batch normalization and dropout.

02 It reduces overfitting on the CIFAR-10, CIFAR-100, and SVHN datasets.

03 The method is particularly suitable for fully convolutional neural networks.

Abstract

Regularization is key for deep learning since it allows training more complex models while keeping lower levels of overfitting. However, the most prevalent regularizations do not leverage all the capacity of the models since they rely on reducing the effective number of parameters. Feature decorrelation is an alternative for using the full capacity of the models but the overfitting reduction margins are too narrow given the overhead it introduces. In this paper, we show that regularizing negatively correlated features is an obstacle for effective decorrelation and present OrthoReg, a novel regularization technique that locally enforces feature orthogonality. As a result, imposing locality constraints in feature decorrelation removes interferences between negatively correlated feature weights, allowing the regularizer to reach higher decorrelation bounds, and reducing the overfitting…
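
The abstract's central claim, that penalizing negatively correlated features gets in the way of decorrelation, can be illustrated with a small NumPy sketch. This page does not give the OrthoReg loss, so both penalty forms below are assumptions made for illustration: `global_penalty` is a plain squared-cosine decorrelation term, and `local_penalty` is a softplus-style term with a hypothetical sharpness parameter `gamma` that fades out as the cosine similarity between two filters decreases. Each row of `weights` is assumed to be one flattened filter.

```python
import numpy as np


def cosine_matrix(weights):
    """Pairwise cosine similarity between the rows (filters) of `weights`."""
    norms = np.linalg.norm(weights, axis=1, keepdims=True)
    unit = weights / np.maximum(norms, 1e-12)  # guard against zero-norm rows
    return unit @ unit.T


def global_penalty(weights):
    """Squared-cosine penalty (assumed form): positive and negative
    correlations are punished equally."""
    cos = cosine_matrix(weights)
    off_diag = cos - np.diag(np.diag(cos))  # ignore each filter's self-similarity
    return 0.5 * np.sum(off_diag ** 2)


def local_penalty(weights, gamma=10.0):
    """Softplus-style penalty (assumed form): large for positively correlated
    pairs, close to zero for orthogonal or negatively correlated pairs."""
    cos = cosine_matrix(weights)
    mask = ~np.eye(len(weights), dtype=bool)  # select off-diagonal pairs only
    return np.sum(np.logaddexp(0.0, gamma * (cos[mask] - 1.0)))


# Three toy pairs of 2-D filters.
aligned = np.array([[1.0, 0.0], [1.0, 0.0]])      # cosine = +1
opposed = np.array([[1.0, 0.0], [-1.0, 0.0]])     # cosine = -1
orthogonal = np.array([[1.0, 0.0], [0.0, 1.0]])   # cosine = 0

for name, w in [("aligned", aligned), ("opposed", opposed), ("orthogonal", orthogonal)]:
    print(name, global_penalty(w), local_penalty(w))
```

Under these assumptions, the global term treats the aligned and opposed pairs identically, while the local term stays close to zero for the opposed pair and is large only for the aligned one. A convolutional kernel would first need to be reshaped so that each output filter becomes one row, which is also an assumption of this sketch.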

Code & Models

Repositories

prlz77/orthoreg
torch

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

Methods: Dropout · Batch Normalization