Efficient Numerical Integration in Reproducing Kernel Hilbert Spaces via Leverage Scores Sampling
Antoine Chatalic, Nicolas Schreuder, Ernesto De Vito, Lorenzo Rosasco

TL;DR
This paper introduces an efficient sampling-based method for numerical integration in reproducing kernel Hilbert spaces, reducing computational cost while maintaining optimal convergence rates, applicable to kernel mean embeddings and distribution testing.
Contribution
It proposes a novel subsampling procedure using leverage scores for RKHS integration, achieving optimal rates with fewer function evaluations and broad applicability.
Findings
Achieves standard $n^{-1/2}$ convergence rate with fewer evaluations.
Rates adapt to the integrand's smoothness, matching optimal rates.
Numerical experiments demonstrate improved efficiency-accuracy tradeoff.
Abstract
In this work we consider the problem of numerical integration, i.e., approximating integrals with respect to a target probability measure using only pointwise evaluations of the integrand. We focus on the setting in which the target distribution is only accessible through a set of i.i.d. observations, and the integrand belongs to a reproducing kernel Hilbert space. We propose an efficient procedure which exploits a small i.i.d. random subset of samples drawn either uniformly or using approximate leverage scores from the initial observations. Our main result is an upper bound on the approximation error of this procedure for both sampling strategies. It yields sufficient conditions on the subsample size to recover the standard (optimal) rate while reducing drastically the number of functions evaluations, and thus the overall computational cost. Moreover, we obtain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Approximation and Integration · Statistical Methods and Inference
MethodsSparse Evolutionary Training · Focus
