Linearized Optimal Transport for Analysis of High-Dimensional Point-Cloud and Single-Cell Data
Tianxiang Wang, Yingtong Ke, Dhananjay Bhaskar, Smita Krishnaswamy, Alexander Cloninger

TL;DR
This paper introduces a linear optimal transport framework to embed high-dimensional single-cell data into a fixed-dimensional space, enabling interpretable classification, comparison, and synthetic data generation while preserving biological distributional structure.
Contribution
It adapts the Linear Optimal Transport method for high-dimensional biological point clouds, providing a linear, interpretable, and generative embedding that bridges predictive accuracy and biological insight.
Findings
Accurate classification of COVID-19 patient states using LOT embeddings.
Synthetic data generation for patient-derived organoids.
LOT barycenters represent combined cellular profiles for drug testing.
Abstract
Single-cell technologies generate high-dimensional point clouds of cells, enabling detailed characterization of complex patient states and treatment responses. Yet each patient is represented by an irregular point cloud rather than a simple vector, making it difficult to directly quantify and compare biological differences between individuals. Nonlinear methods such as kernels and neural networks achieve predictive accuracy but act as black boxes, offering little biological interpretability. To address these limitations, we adapt the Linear Optimal Transport (LOT) framework to this setting, embedding irregular point clouds into a fixed-dimensional Euclidean space while preserving distributional structure. This embedding provides a principled linear representation that preserves optimal transport geometry while enabling downstream analysis. It also forms a registration between any two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
