Compress Then Test: Powerful Kernel Testing in Near-linear Time
Carles Domingo-Enrich, Raaz Dwivedi, Lester Mackey

TL;DR
This paper introduces Compress Then Test (CTT), a kernel testing framework that achieves near-linear time complexity by sample compression, maintaining high power and providing significant speed-ups over existing methods.
Contribution
The paper presents a novel sample compression-based kernel testing method, CTT, that retains optimal detection power while drastically reducing computational complexity.
Findings
CTT achieves 20-200x speed-ups over existing methods.
CTT maintains the same detection boundary as quadratic-time tests.
The framework is effective on real and simulated data.
Abstract
Kernel two-sample testing provides a powerful framework for distinguishing any pair of distributions based on sample points. However, existing kernel tests either run in time or sacrifice undue power to improve runtime. To address these shortcomings, we introduce Compress Then Test (CTT), a new framework for high-powered kernel testing based on sample compression. CTT cheaply approximates an expensive test by compressing each point sample into a small but provably high-fidelity coreset. For standard kernels and subexponential distributions, CTT inherits the statistical behavior of a quadratic-time test -- recovering the same optimal detection boundary -- while running in near-linear time. We couple these advances with cheaper permutation testing, justified by new power analyses; improved time-vs.-quality guarantees for low-rank approximation; and a fast aggregation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification
MethodsTest
