Comparing distributions: $\ell_1$ geometry improves kernel two-sample testing
M. Scetbon, G. Varoquaux

TL;DR
This paper introduces an $ ext{L}^p$-based approach, especially $ ext{L}^1$, for kernel two-sample testing that improves detection power and computational efficiency by leveraging the geometry of distribution representations.
Contribution
It demonstrates that $ ext{L}^p$ distances with $p eq 2$ provide well-behaved metrics for distribution comparison and introduces a scalable $ ext{L}^1$-based test with finite-dimensional approximation.
Findings
$ ext{L}^1$ geometry improves detection power for analytic kernels.
Proposed method is faster and more powerful than state-of-the-art quadratic-time tests.
Experiments show better power/time tradeoff and sometimes superior detection than existing methods.
Abstract
Are two sets of observations drawn from the same distribution? This problem is a two-sample test. Kernel methods lead to many appealing properties. Indeed state-of-the-art approaches use the distance between kernel-based distribution representatives to derive their test statistics. Here, we show that distances (with ) between these distribution representatives give metrics on the space of distributions that are well-behaved to detect differences between distributions as they metrize the weak convergence. Moreover, for analytic kernels, we show that the geometry gives improved testing power for scalable computational procedures. Specifically, we derive a finite dimensional approximation of the metric given as the norm of a vector which captures differences of expectations of analytic functions evaluated at spatial locations or frequencies (i.e,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Statistical Methods and Inference · Machine Learning and Data Classification
