Kernel Selection is Model Selection: A Unified Complexity-Penalized Approach for MMD Two-Sample Tests

Yijin Ni; Xiaoming Huo

arXiv:2605.06883·stat.ML·May 11, 2026

Kernel Selection is Model Selection: A Unified Complexity-Penalized Approach for MMD Two-Sample Tests

Yijin Ni, Xiaoming Huo

PDF

TL;DR

This paper introduces CP-MMD, a new complexity-penalized criterion for kernel selection in MMD two-sample tests, enabling continuous, grid-free optimization that maximizes test power while maintaining Type-I error control.

Contribution

It formulates kernel selection as a model selection problem and develops CP-MMD, which accounts for optimization complexity and supports continuous kernel parameter search.

Findings

01

CP-MMD achieves higher or comparable test power to state-of-the-art methods.

02

It enables grid-free kernel selection over continuous parameter spaces.

03

CP-MMD maintains unconditional Type-I error control.

Abstract

The Maximum Mean Discrepancy (MMD) is a cornerstone statistic for nonparametric two-sample testing, but its test power is dictated entirely by the chosen kernel. Because any fixed kernel inherently fails to distinguish certain distributions, the kernel must be dynamically optimized. However, data-driven optimization violates the foundational i.i.d. assumption, forcing a strict trade-off in existing frameworks. Ratio criteria ignore this dependence, inducing overfitting and variance collapse on rich kernel classes. Conversely, aggregation methods bypass the dependence using finite grids, but this strategy cannot scale to continuous search spaces like deep kernels. To break this dichotomy, we establish data-driven kernel selection as a model selection problem. We propose Complexity-Penalized MMD (CP-MMD), a criterion derived by applying the two-sample uniform concentration inequality of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.