Sample Complexity of Sinkhorn divergences

Aude Genevay; L\'enaic Chizat; Francis Bach; Marco Cuturi; Gabriel; Peyr\'e

arXiv:1810.02733·math.ST·October 16, 2019·AISTATS·142 cites

Sample Complexity of Sinkhorn divergences

Aude Genevay, L\'enaic Chizat, Francis Bach, Marco Cuturi, Gabriel, Peyr\'e

PDF

Open Access

TL;DR

This paper investigates the sample complexity of Sinkhorn divergences, a regularized optimal transport measure, providing bounds on approximation error, optimizer boundedness, and the first sample complexity rate, bridging OT and MMD.

Contribution

It derives a new sample complexity bound for Sinkhorn divergences, connecting OT and MMD, and analyzes their approximation error and optimizer properties.

Findings

01

Sample complexity of SDs scales as 1/√n, similar to MMD.

02

Bound on approximation error of SDs relative to OT depending on regularization.

03

Optimizers of regularized OT are bounded in a Sobolev (RKHS) ball, independent of measures.

Abstract

Optimal transport (OT) and maximum mean discrepancies (MMD) are now routinely used in machine learning to compare probability measures. We focus in this paper on \emph{Sinkhorn divergences} (SDs), a regularized variant of OT distances which can interpolate, depending on the regularization strength $ε$ , between OT ( $ε = 0$ ) and MMD ( $ε = \infty$ ). Although the tradeoff induced by that regularization is now well understood computationally (OT, SDs and MMD require respectively $O (n^{3} lo g n)$ , $O (n^{2})$ and $n^{2}$ operations given a sample size $n$ ), much less is known in terms of their \emph{sample complexity}, namely the gap between these quantities, when evaluated using finite samples \emph{vs.} their respective densities. Indeed, while the sample complexity of OT and MMD stand at two extremes, $1/ n^{1/ d}$ for OT in dimension $d$ and $1/ n$ for MMD, that for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProbabilistic and Robust Engineering Design · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms