Orthogonal SVD Covariance Conditioning and Latent Disentanglement
Yue Song, Nicu Sebe, Wei Wang

TL;DR
This paper introduces methods to improve covariance conditioning in neural networks by enforcing orthogonality, enhancing training stability, generalization, and latent disentanglement across various applications.
Contribution
It proposes the Nearest Orthogonal Gradient and Optimal Learning Rate techniques to improve covariance conditioning without harming performance.
Findings
Enhanced covariance conditioning and generalization in visual recognition tasks.
Improved latent disentanglement in generative models.
Orthogonal treatments combined with weight orthogonality boost performance.
Abstract
Inserting an SVD meta-layer into neural networks is prone to make the covariance ill-conditioned, which could harm the model in the training stability and generalization abilities. In this paper, we systematically study how to improve the covariance conditioning by enforcing orthogonality to the Pre-SVD layer. Existing orthogonal treatments on the weights are first investigated. However, these techniques can improve the conditioning but would hurt the performance. To avoid such a side effect, we propose the Nearest Orthogonal Gradient (NOG) and Optimal Learning Rate (OLR). The effectiveness of our methods is validated in two applications: decorrelated Batch Normalization (BN) and Global Covariance Pooling (GCP). Extensive experiments on visual recognition demonstrate that our methods can simultaneously improve covariance conditioning and generalization. The combinations with orthogonal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsZCA Whitening · Decorrelated Batch Normalization · Batch Normalization
