
TL;DR
Kernel thinning is a novel distribution compression method that significantly reduces sample size while maintaining integration accuracy, outperforming i.i.d. sampling and standard thinning, with broad applicability to various kernels and distributions.
Contribution
We introduce kernel thinning, a new procedure that compresses distribution approximations efficiently, providing theoretical guarantees and practical benefits over existing sampling methods.
Findings
Achieves $O(n^{-1/2})$ integration error for compactly supported distributions.
Outperforms i.i.d. sampling with $ ext{Omega}(n^{-1/4})$ error.
Provides near-optimal $L^ ext{infinity}$ coresets in quadratic time.
Abstract
We introduce kernel thinning, a new procedure for compressing a distribution more effectively than i.i.d. sampling or standard thinning. Given a suitable reproducing kernel and time, kernel thinning compresses an -point approximation to into a -point approximation with comparable worst-case integration error across the associated reproducing kernel Hilbert space. The maximum discrepancy in integration error is in probability for compactly supported and for sub-exponential on . In contrast, an equal-sized i.i.d. sample from suffers integration error. Our sub-exponential guarantees resemble the classical quasi-Monte Carlo error rates for uniform on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Approximation and Integration · Statistical Methods and Inference
