Kernel Embeddings and the Separation of Measure Phenomenon
Leonardo V. Santoro, Kartik G. Waghmare, Victor M. Panaretos

TL;DR
The paper proves that kernel covariance embeddings enable perfect separation of different continuous probability distributions, simplifying the distinction between measures in high-dimensional spaces.
Contribution
It establishes that testing measure equality is equivalent to testing Gaussian singularity in RKHS, revealing a fundamental separation phenomenon.
Findings
Kernel embeddings achieve perfect measure separation.
Singular Gaussian measures are supported on separate affine subspaces.
Small distribution perturbations are magnified in Gaussian embeddings.
Abstract
We prove that kernel covariance embeddings lead to information-theoretically perfect separation of distinct continuous probability distributions. In statistical terms, we establish that testing for the \emph{equality} of two non-atomic (Borel) probability measures on a locally compact uncountable Polish space is \emph{equivalent} to testing for the \emph{singularity} between two centered Gaussian measures on a reproducing kernel Hilbert space. The corresponding Gaussians are defined via the notion of kernel covariance embedding of a probability measure, and the Hilbert space is that generated by the embedding kernel. Distinguishing singular Gaussians is structurally simpler from an information-theoretic perspective than non-parametric two-sample testing, particularly in complex or high-dimensional domains. This is because singular Gaussians are supported on essentially separate and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
