RMLR: Extending Multinomial Logistic Regression into General Geometries
Ziheng Chen, Yue Song, Rui Wang, Xiaojun Wu, Nicu Sebe

TL;DR
This paper introduces RMLR, a flexible framework for multinomial logistic regression on general Riemannian geometries, broadening applicability beyond specific geometric constraints.
Contribution
It develops a general RMLR framework applicable to various geometries, including SPD manifolds and rotation groups, with multiple metric-based models.
Findings
Effective classification on SPD manifolds with five new MLR families.
Successful application to rotation matrices using Lie MLR.
Validated framework's broad applicability through extensive experiments.
Abstract
Riemannian neural networks, which extend deep learning techniques to Riemannian spaces, have gained significant attention in machine learning. To better classify the manifold-valued features, researchers have started extending Euclidean multinomial logistic regression (MLR) into Riemannian manifolds. However, existing approaches suffer from limited applicability due to their strong reliance on specific geometric properties. This paper proposes a framework for designing Riemannian MLR over general geometries, referred to as RMLR. Our framework only requires minimal geometric properties, thus exhibiting broad applicability and enabling its use with a wide range of geometries. Specifically, we showcase our framework on the Symmetric Positive Definite (SPD) manifold and special orthogonal group, i.e., the set of rotation matrices. On the SPD manifold, we develop five families of SPD MLRs…
Peer Reviews
Decision·NeurIPS 2024 poster
1. The proposed RMLR framework (Thm. 3.3) can be easily implemented in different geometries. For a specific geometry, one only needs to put the involved operators into Eq. 11. 2. The proposed RMLR not only generalizes the Euclidean MLR, but also incorporates several previous MLRs, such as gyro SPD MLR, gyro SPSD MLR, and flat SPD MLR (Tab. 1). Besides, it further can deal with the geometry which is non-flat or agnostic to gyro structure. 3. A complete study of 5 families of deformed SPD metri
1. More details on the optimization for learning the parameters of the MLR should be presented. 1. For the SPD manifold, there are at most three hyperparameters: $\theta, \alpha, \beta$. Although these indicate the generality of the proposed framework, how to select the parameter should also be discussed from a practical view.
Code & Models
Videos
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications · Data Mining Algorithms and Applications
MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training · Logistic Regression
