Riemannian batch normalization for SPD neural networks

Daniel Brooks; Olivier Schwander; Frederic Barbaresco; Jean-Yves Schneider; Matthieu Cord

arXiv:1909.02414·cs.LG·September 13, 2019·56 cites

Open Access

TL;DR

This paper introduces a Riemannian batch normalization technique for SPD neural networks, leveraging geometric operations on the manifold to improve classification performance and robustness across diverse data types.

Contribution

It proposes a novel Riemannian batch normalization layer and a manifold-constrained gradient descent algorithm for SPD matrices, enhancing deep learning on structured data.
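To illustrate what a manifold-constrained update on an SPD parameter can look like, here is a minimal sketch. It assumes the affine-invariant Riemannian metric and a plain exponential-map step; the function names (`spd_gradient_step`, `_sym_fn`) and the learning rate are hypothetical, and this is not necessarily the exact algorithm derived in the paper.

```python
import numpy as np

def _sym_fn(S, fn):
    """Apply a scalar function to the eigenvalues of a symmetric matrix."""
    w, V = np.linalg.eigh(0.5 * (S + S.T))
    return (V * fn(w)) @ V.T

def spd_gradient_step(G, euclid_grad, lr=0.1):
    """One descent step that keeps G symmetric positive definite.

    Assumption: affine-invariant metric, where the Riemannian gradient is
    G sym(grad) G and the exponential map at G is
    G^{1/2} expm(G^{-1/2} V G^{-1/2}) G^{1/2}.
    """
    G_half = _sym_fn(G, np.sqrt)
    sym_grad = 0.5 * (euclid_grad + euclid_grad.T)
    # Riemannian gradient expressed in whitened coordinates at G.
    W = G_half @ sym_grad @ G_half
    # Move along the geodesic in the descent direction.
    return G_half @ _sym_fn(-lr * W, np.exp) @ G_half
```

Because the update is a congruence of a matrix exponential, the result stays SPD for any step size, which an ordinary Euclidean step `G - lr * grad` does not guarantee.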

Findings

01. Improved classification accuracy across multiple datasets

02. Enhanced robustness to limited data scenarios

03. Effective integration of Riemannian geometry in deep learning

Abstract

Covariance matrices have attracted attention for machine learning applications due to their capacity to capture interesting structure in the data. The main challenge is that one needs to take into account the particular geometry of the Riemannian manifold of symmetric positive definite (SPD) matrices to which they belong. In the context of deep networks, several architectures for these matrices have recently been proposed. In our article, we introduce a Riemannian batch normalization (batchnorm) algorithm, which generalizes the one used in Euclidean nets. This novel layer makes use of geometric operations on the manifold, notably the Riemannian barycenter, parallel transport and non-linear structured matrix transformations. We derive a new manifold-constrained gradient descent algorithm working in the space of SPD matrices, which makes it possible to learn the batchnorm layer. We validate our proposed…
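The centering part of such a layer can be sketched as follows. This is a minimal illustration that assumes the affine-invariant metric, under which the Riemannian barycenter can be estimated by a fixed-point (Karcher flow) iteration and transporting a batch to the identity reduces to a congruence by the inverse square root of the barycenter. The names `karcher_mean` and `center_batch`, the iteration count, and the tolerance are hypothetical choices, not taken from the paper.

```python
import numpy as np

def _sym_fn(S, fn):
    """Apply a scalar function to the eigenvalues of a symmetric matrix."""
    w, V = np.linalg.eigh(0.5 * (S + S.T))
    return (V * fn(w)) @ V.T

def karcher_mean(batch, n_iter=50, tol=1e-10):
    """Estimate the Riemannian barycenter of a batch of SPD matrices."""
    G = batch.mean(axis=0)  # arithmetic mean as the starting point
    for _ in range(n_iter):
        G_half = _sym_fn(G, np.sqrt)
        G_inv_half = _sym_fn(G, lambda w: 1.0 / np.sqrt(w))
        # Average the batch in the tangent space at G (whitened coordinates).
        T = np.mean(
            [_sym_fn(G_inv_half @ S @ G_inv_half, np.log) for S in batch],
            axis=0,
        )
        # Map the averaged tangent vector back onto the manifold.
        G = G_half @ _sym_fn(T, np.exp) @ G_half
        if np.linalg.norm(T) < tol:
            break
    return G

def center_batch(batch):
    """Transport the batch so that its barycenter becomes the identity."""
    G = karcher_mean(batch)
    G_inv_half = _sym_fn(G, lambda w: 1.0 / np.sqrt(w))
    return np.array([G_inv_half @ S @ G_inv_half for S in batch])
```

For commuting inputs the barycenter is the element-wise geometric mean of the eigenvalues, which gives a quick sanity check: the barycenter of `diag(1, 4)` and `diag(4, 1)` is `2 * I`. A learnable bias could be applied afterwards by the reverse congruence with the square root of an SPD parameter.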

Taxonomy

Topics: Neural Networks and Applications

Methods: Batch Normalization