Sample Complexity of Sinkhorn divergences
Aude Genevay, L\'enaic Chizat, Francis Bach, Marco Cuturi, Gabriel, Peyr\'e

TL;DR
This paper investigates the sample complexity of Sinkhorn divergences, a regularized optimal transport measure, providing bounds on approximation error, optimizer boundedness, and the first sample complexity rate, bridging OT and MMD.
Contribution
It derives a new sample complexity bound for Sinkhorn divergences, connecting OT and MMD, and analyzes their approximation error and optimizer properties.
Findings
Sample complexity of SDs scales as 1/√n, similar to MMD.
Bound on approximation error of SDs relative to OT depending on regularization.
Optimizers of regularized OT are bounded in a Sobolev (RKHS) ball, independent of measures.
Abstract
Optimal transport (OT) and maximum mean discrepancies (MMD) are now routinely used in machine learning to compare probability measures. We focus in this paper on \emph{Sinkhorn divergences} (SDs), a regularized variant of OT distances which can interpolate, depending on the regularization strength , between OT () and MMD (). Although the tradeoff induced by that regularization is now well understood computationally (OT, SDs and MMD require respectively , and operations given a sample size ), much less is known in terms of their \emph{sample complexity}, namely the gap between these quantities, when evaluated using finite samples \emph{vs.} their respective densities. Indeed, while the sample complexity of OT and MMD stand at two extremes, for OT in dimension and for MMD, that for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms
