An Infinitesimal Probabilistic Model for Principal Component Analysis of Manifold Valued Data
Stefan Sommer

TL;DR
This paper introduces a probabilistic, intrinsic approach to PCA on manifolds using stochastic development, allowing analysis of nonlinear manifold data without linearization, and incorporates curvature effects.
Contribution
It generalizes PCA to manifold data through a probabilistic model based on stochastic processes and connections, avoiding linearization and enabling global principal component transport.
Findings
Curvature affects the integrability of principal subspaces.
Stochastic flows offer an alternative to explicit subspace construction.
The model enables inference and prediction on embedded surfaces.
Abstract
We provide a probabilistic and infinitesimal view of how the principal component analysis procedure (PCA) can be generalized to analysis of nonlinear manifold valued data. Starting with the probabilistic PCA interpretation of the Euclidean PCA procedure, we show how PCA can be generalized to manifolds in an intrinsic way that does not resort to linearization of the data space. The underlying probability model is constructed by mapping a Euclidean stochastic process to the manifold using stochastic development of Euclidean semimartingales. The construction uses a connection and bundles of covariant tensors to allow global transport of principal eigenvectors, and the model is thereby an example of how principal fiber bundles can be used to handle the lack of global coordinate system and orientations that characterizes manifold valued statistics. We show how curvature implies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPrincipal Components Analysis
