Efficient Distribution Learning with Error Bounds in Wasserstein Distance
Eduardo Figueiredo, Steven Adams, Luca Laurenti

TL;DR
This paper introduces a new algorithmic framework for approximating unknown probability distributions using Wasserstein distance, providing non-asymptotic error bounds and outperforming existing methods on benchmarks.
Contribution
The paper presents a novel approach combining optimal transport, nonlinear optimization, and clustering to efficiently learn distributions with guaranteed error bounds in Wasserstein distance.
Findings
Outperforms state-of-the-art methods on benchmark datasets
Returns smaller support approximations with tighter error bounds
Provides a tractable optimization-based method for distribution learning
Abstract
The Wasserstein distance has emerged as a key metric to quantify distances between probability distributions, with applications in various fields, including machine learning, control theory, decision theory, and biological systems. Consequently, learning an unknown distribution with non-asymptotic and easy-to-compute error bounds in Wasserstein distance has become a fundamental problem in many fields. In this paper, we devise a novel algorithmic and theoretical framework to approximate an unknown probability distribution from a finite set of samples by an approximate discrete distribution while bounding the Wasserstein distance between and . Our framework leverages optimal transport, nonlinear optimization, and concentration inequalities. In particular, we show that, even if is unknown, the Wasserstein…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Geometric Analysis and Curvature Flows
