On the Decreasing Power of Kernel and Distance based Nonparametric Hypothesis Tests in High Dimensions
Sashank J. Reddi, Aaditya Ramdas, Barnab\'as P\'oczos, Aarti Singh and, Larry Wasserman

TL;DR
This paper investigates the limitations of kernel and distance-based nonparametric hypothesis tests in high-dimensional settings, revealing that their power diminishes polynomially with increasing dimension against fair alternatives.
Contribution
It distinguishes between estimation and testing hardness, introduces a notion of fair alternatives, and demonstrates the polynomial decay of test power in high dimensions.
Findings
Test power drops polynomially with dimension
Kernel bandwidth selection impacts test performance
High-dimensional testing is fundamentally harder
Abstract
This paper is about two related decision theoretic problems, nonparametric two-sample testing and independence testing. There is a belief that two recently proposed solutions, based on kernels and distances between pairs of points, behave well in high-dimensional settings. We identify different sources of misconception that give rise to the above belief. Specifically, we differentiate the hardness of estimation of test statistics from the hardness of testing whether these statistics are zero or not, and explicitly discuss a notion of "fair" alternative hypotheses for these problems as dimension increases. We then demonstrate that the power of these tests actually drops polynomially with increasing dimension against fair alternatives. We end with some theoretical insights and shed light on the \textit{median heuristic} for kernel bandwidth selection. Our work advances the current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
