Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality

Yue Song; Nicu Sebe; Wei Wang

arXiv:2207.02119·cs.CV·July 6, 2022·1 cites

Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality

Yue Song, Nicu Sebe, Wei Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces methods to improve the covariance conditioning of the SVD meta-layer in neural networks by enforcing orthogonality, leading to better training stability and generalization, validated through experiments on visual recognition tasks.

Contribution

The paper proposes the Nearest Orthogonal Gradient and Optimal Learning Rate methods to enhance covariance conditioning without harming performance, advancing orthogonal treatments in neural network layers.

Findings

01

Improved covariance conditioning with orthogonality techniques.

02

Enhanced generalization in visual recognition tasks.

03

Orthogonal weight combinations further boost performance.

Abstract

Inserting an SVD meta-layer into neural networks is prone to make the covariance ill-conditioned, which could harm the model in the training stability and generalization abilities. In this paper, we systematically study how to improve the covariance conditioning by enforcing orthogonality to the Pre-SVD layer. Existing orthogonal treatments on the weights are first investigated. However, these techniques can improve the conditioning but would hurt the performance. To avoid such a side effect, we propose the Nearest Orthogonal Gradient (NOG) and Optimal Learning Rate (OLR). The effectiveness of our methods is validated in two applications: decorrelated Batch Normalization (BN) and Global Covariance Pooling (GCP). Extensive experiments on visual recognition demonstrate that our methods can simultaneously improve the covariance conditioning and generalization. Moreover, the combinations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kingjamessong/orthoimprovecond
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Brain Tumor Detection and Classification

MethodsBatch Normalization · ZCA Whitening · Decorrelated Batch Normalization