Learning Probability Measures with respect to Optimal Transport Metrics

Guillermo D. Canas; Lorenzo Rosasco

arXiv:1209.1077·cs.LG·September 6, 2012·30 cites

Learning Probability Measures with respect to Optimal Transport Metrics

Guillermo D. Canas, Lorenzo Rosasco

PDF

Open Access

TL;DR

This paper investigates estimating probability measures on manifolds using optimal transport metrics, linking it to quantization and learning theory, and providing new bounds for k-means performance and convergence rates.

Contribution

It establishes a novel connection between optimal transport, quantization, and learning theory, deriving new probabilistic bounds and convergence rates for measure estimation.

Findings

01

New probabilistic bounds for k-means in measure learning

02

Lower bounds on convergence rates of empirical measures

03

Bounds applicable to a wide class of measures

Abstract

We study the problem of estimating, in the sense of optimal transport metrics, a measure which is assumed supported on a manifold embedded in a Hilbert space. By establishing a precise connection between optimal transport metrics, optimal quantization, and learning theory, we derive new probabilistic bounds for the performance of a classic algorithm in unsupervised learning (k-means), when used to produce a probability measure derived from the data. In the course of the analysis, we arrive at new lower bounds, as well as probabilistic upper bounds on the convergence rate of the empirical law of large numbers, which, unlike existing bounds, are applicable to a wide class of measures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Markov Chains and Monte Carlo Methods · Bayesian Modeling and Causal Inference