Stochastic Optimization for Large-scale Optimal Transport
Genevay Aude (MOKAPLAN, CEREMADE), Marco Cuturi, Gabriel Peyr\'e, (MOKAPLAN, CEREMADE), Francis Bach (SIERRA, LIENS)

TL;DR
This paper introduces stochastic optimization algorithms for large-scale optimal transport problems, enabling efficient computation on high-dimensional distributions without discretization, and demonstrating superior performance over existing methods.
Contribution
The paper presents novel stochastic algorithms for optimal transport that handle arbitrary distributions and outperform current state-of-the-art solvers in various settings.
Findings
Incremental stochastic schemes outperform Sinkhorn's algorithm in discrete comparisons.
A semi-discrete stochastic gradient method improves over discretization approaches.
A new RKHS-based stochastic gradient method uniquely solves continuous density comparisons.
Abstract
Optimal transport (OT) defines a powerful framework to compare probability distributions in a geometrically faithful way. However, the practical impact of OT is still limited because of its computational burden. We propose a new class of stochastic optimization algorithms to cope with large-scale problems routinely encountered in machine learning applications. These methods are able to manipulate arbitrary distributions (either discrete or continuous) by simply requiring to be able to draw samples from them, which is the typical setup in high-dimensional learning problems. This alleviates the need to discretize these densities, while giving access to provably convergent methods that output the correct distance without discretization error. These algorithms rely on two main ideas: (a) the dual OT problem can be re-cast as the maximization of an expectation ; (b) entropic regularization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference
