Projected Statistical Methods for Distributional Data on the Real Line with the Wasserstein Metric
Matteo Pegoraro, Mario Beraha

TL;DR
This paper introduces a new projected statistical framework for analyzing distributional data on the real line using the Wasserstein metric, focusing on PCA and regression with improved computational efficiency.
Contribution
It develops a novel projected approach leveraging the Wasserstein space's structure, enabling fast empirical methods and extending PCA and regression techniques for distributional data.
Findings
Projected PCA achieves similar accuracy with less computation.
Projected regression demonstrates high flexibility under misspecification.
Theoretical properties, including asymptotic consistency, are established.
Abstract
We present a novel class of projected methods, to perform statistical analysis on a data set of probability distributions on the real line, with the 2-Wasserstein metric. We focus in particular on Principal Component Analysis (PCA) and regression. To define these models, we exploit a representation of the Wasserstein space closely related to its weak Riemannian structure, by mapping the data to a suitable linear space and using a metric projection operator to constrain the results in the Wasserstein space. By carefully choosing the tangent point, we are able to derive fast empirical methods, exploiting a constrained B-spline approximation. As a byproduct of our approach, we are also able to derive faster routines for previous work on PCA for distributions. By means of simulation studies, we compare our approaches to previously proposed methods, showing that our projected PCA has similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Probabilistic and Robust Engineering Design
MethodsPrincipal Components Analysis
