Hilbert space embeddings and metrics on probability measures
Bharath K. Sriperumbudur, Arthur Gretton, Kenji Fukumizu, Bernhard, Sch\"olkopf, Gert R. G. Lanckriet

TL;DR
This paper explores the properties of Hilbert space embeddings of probability measures, focusing on conditions for these embeddings to form a true metric and their relation to other probability metrics.
Contribution
It provides clear, practical conditions for when the kernel-induced distance is a true metric and analyzes the topology induced by this metric on probability measures.
Findings
Bounded continuous strictly positive definite kernels are characteristic.
Translation-invariant kernels are characteristic if their Fourier transform's support is entire.
The metric $oldsymbol{oldsymbol{ extgamma_k}}$ can metrize the weak topology under certain conditions.
Abstract
A Hilbert space embedding for probability measures has recently been proposed, with applications including dimensionality reduction, homogeneity testing, and independence testing. This embedding represents any probability measure as a mean element in a reproducing kernel Hilbert space (RKHS). A pseudometric on the space of probability measures can be defined as the distance between distribution embeddings: we denote this as , indexed by the kernel function that defines the inner product in the RKHS. We present three theoretical properties of . First, we consider the question of determining the conditions on the kernel for which is a metric: such are denoted {\em characteristic kernels}. Unlike pseudometrics, a metric is zero only when two distributions coincide, thus ensuring the RKHS embedding maps all distributions uniquely (i.e., the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Advanced Neuroimaging Techniques and Applications · Bayesian Methods and Mixture Models
