Geodesic Sinkhorn for Fast and Accurate Optimal Transport on Manifolds
Guillaume Huguet, Alexander Tong, Mar\'ia Ramos Zapatero, Christopher, J. Tape, Guy Wolf, Smita Krishnaswamy

TL;DR
This paper introduces Geodesic Sinkhorn, a fast and accurate optimal transport method on manifolds that leverages heat kernel diffusion, reducing computational complexity from quadratic to near-linear, and is applied to high-dimensional biological data analysis.
Contribution
The paper presents Geodesic Sinkhorn, a novel optimal transport algorithm on manifolds using heat kernel approximation, significantly improving computational efficiency over existing methods.
Findings
Achieves $O(n \, \log n)$ computation complexity.
Effectively computes barycenters of high-dimensional single-cell data.
Identifies optimal transport paths related to treatment effects.
Abstract
Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require computations. In addition, Sinkhorn-based methods commonly use an Euclidean ground distance between datapoints. However, with the prevalence of manifold structured scientific data, it is often desirable to consider geodesic ground distance. Here, we tackle both issues by proposing Geodesic Sinkhorn -- based on diffusing a heat kernel on a manifold graph. Notably, Geodesic Sinkhorn requires only computation, as we approximate the heat kernel with Chebyshev polynomials based on the sparse graph Laplacian. We apply our method to the computation of barycenters of several distributions of high dimensional single cell data from patient samples undergoing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Point processes and geometric inequalities · Markov Chains and Monte Carlo Methods
