On integral probability metrics, \phi-divergences and binary classification
Bharath K. Sriperumbudur, Kenji Fukumizu, Arthur Gretton, Bernhard, Sch\"olkopf, Gert R. G. Lanckriet

TL;DR
This paper explores integral probability metrics (IPMs), their relation to phi-divergences, and introduces their practical estimation and interpretation via binary classification, enhancing their applicability in statistical analysis.
Contribution
The work establishes the conditions under which IPMs and phi-divergences intersect, develops consistent empirical estimators for IPMs, and links IPMs to binary classification risk, broadening their practical use.
Findings
Total variation is the only non-trivial phi-divergence that is also an IPM.
Empirical estimators for IPMs are consistent with favorable convergence rates.
IPMs can be interpreted through binary classification, relating distribution distances to classifier risk.
Abstract
A class of distance measures on probabilities -- the integral probability metrics (IPMs) -- is addressed: these include the Wasserstein distance, Dudley metric, and Maximum Mean Discrepancy. IPMs have thus far mostly been used in more abstract settings, for instance as theoretical tools in mass transportation problems, and in metrizing the weak topology on the set of all Borel probability measures defined on a metric space. Practical applications of IPMs are less common, with some exceptions in the kernel machines literature. The present work contributes a number of novel properties of IPMs, which should contribute to making IPMs more widely used in practice, for instance in areas where -divergences are currently popular. First, to understand the relation between IPMs and -divergences, the necessary and sufficient conditions under which these classes intersect are derived:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Advanced Statistical Methods and Models · Fuzzy Systems and Optimization
