Fast distance computation of multivariate distributions via nonparanormal transport
Edward Shao, Junyoung Park, Naresh Punjabi, Hui Jiang, and Irina Gaynanova

TL;DR
This paper introduces the Nonparanormal Transport (NPT) metric, a fast and scalable method for computing distances between multivariate distributions, enabling efficient analysis in high-dimensional distributional data.
Contribution
The paper proposes the NPT metric, a closed-form, computationally efficient alternative to Wasserstein distance for multivariate distributions, suitable for large-scale data analysis.
Findings
NPT is at least 1000 times faster than existing Wasserstein variants.
NPT closely approximates Wasserstein distance in simulations.
Application to sleep study data demonstrates NPT's practical utility.
Abstract
With the increasing availability of data objects in the form of probability distributions, there is a growing need for statistical methods tailored to distributional data. Distance measures, especially the pairwise distance matrix between data objects, provide the foundation for a wide range of modern data analysis methods, such as clustering, multidimensional scaling, and distance-based regression, among others. The Wasserstein distance is commonly used with distributional data due to its compelling optimal transport property. However, while the Wasserstein distance can be efficiently computed for univariate distributions, its application to multivariate distributions is limited due to high computational costs. To address these scalability issues, we introduce the Nonparanormal Transport (NPT) metric, a closed-form distance based on the flexible nonparanormal distribution family for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsObstructive Sleep Apnea Research · Random Matrices and Applications · Advanced Neuroimaging Techniques and Applications
