Monte Carlo wavelets: a randomized approach to frame discretization
Zeljko Kereta, Stefano Vigogna, Valeriya Naumova, Lorenzo Rosasco,, Ernesto De Vito

TL;DR
This paper introduces Monte Carlo wavelets, a stochastic discretization method for continuous wavelets on general domains, leveraging spectral calculus and concentration of measure to ensure convergence and establish rates.
Contribution
It presents a novel randomized approach to discretize continuous wavelets called Monte Carlo wavelets, with theoretical guarantees of convergence.
Findings
Convergence of Monte Carlo wavelet discretization established
Derived convergence rates under regularity assumptions
Applicable to general domains using reproducing kernel Hilbert spaces
Abstract
In this paper we propose and study a family of continuous wavelets on general domains, and a corresponding stochastic discretization that we call Monte Carlo wavelets. First, using tools from the theory of reproducing kernel Hilbert spaces and associated integral operators, we define a family of continuous wavelets by spectral calculus. Then, we propose a stochastic discretization based on Monte Carlo estimates of integral operators. Using concentration of measure results, we establish the convergence of such a discretization and derive convergence rates under natural regularity assumptions.
| symbol | definition |
|---|---|
| inner product in a Hilbert space | |
| norm of a Hilbert space, or the operator norm | |
| Hilbert-Schmidt norm | |
| linear span of the set | |
| topological closure of the set | |
| orthogonal projection onto the closed subspace | |
| support of the measure | |
| Dirac measure at | |
| for some constant | |
| -th component of the vector | |
| -th component of the matrix |
| method | qualification | |
|---|---|---|
| Tikhonov regularization | ||
| Iterated Tikhonov | ||
| Landweber iteration | ||
| Asymptotic regularization |
| method | rate |
|---|---|
| Tikhonov regularization | |
| Iterated Tikhonov | |
| Landweber iteration | |
| Asymptotic regularization |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Monte Carlo wavelets: a randomized approach to frame discretization
Zeljko Kereta
Stefano Vigogna
Valeriya Naumova
Lorenzo Rosasco
Ernesto De Vito
Abstract
In this paper we propose and study a family of continuous wavelets on general domains, and a corresponding stochastic discretization that we call Monte Carlo wavelets. First, using tools from the theory of reproducing kernel Hilbert spaces and associated integral operators, we define a family of continuous wavelets by spectral calculus. Then, we propose a stochastic discretization based on Monte Carlo estimates of integral operators. Using concentration of measure results, we establish the convergence of such a discretization and derive convergence rates under natural regularity assumptions.
1 Introduction
To construct a discrete frame it is common practice to first construct a frame on a continuous parameter (measure) space, where the mathematical properties are nice and the structure is rich, and then obtain a discrete frame by carefully selecting a discrete subset of parameters. The discretization problem has been studied extensively and in several settings [5, 2, 7, 6]. Although theoretically relevant in establishing density theorems and complete characterizations, some general discretization procedures require assumptions which might be difficult to confirm, and they are often not constructive. On the other hand, discrete parameter selection in constructive discretization methods is usually sensitive, and it does not generalize trivially from the Euclidean space to more general geometries [1]. In the following, we propose a different approach based on random sampling.
First, we consider a general domain with an associated positive definite kernel, and exploit the theory of reproducing kernel Hilbert spaces (RKHS) and corresponding integral operators to define a family of continuous wavelets. Here, we borrow ideas from [8, 9, 2], and develop them using the properties of an RKHS. Then, we propose a stochastic discretization approach which replaces constructive discrete parameter selection with random sampling. We note that the corresponding discrete frame is actually only finite-dimensional. In particular, it does not solve the discretization problem in the classical sense. Indeed, each discrete frame is a frame only up to a certain sampling resolution. The frame dimensions increase with the number of samples, and our main technical contribution is proving convergence to the continuous frame as the number of samples goes to infinity. Further, we derive convergence rates under natural regularity assumptions [4]. Our analysis combines classical tools from approximation theory and spectral calculus with results of concentration of measure to deal with random sampling.
The paper is organized as follows. In Section 2 we introduce the general framework in which we will conduct our analysis. This will allow us to state the continuous and discrete settings within a unified formalism. The key ingredient is a reproducing kernel and the associated integral operator. Section 3 contains the general frame construction based on spectral filters and eigenfunctions of the integral operator. In Section 4 we consider a specific class of filters, stemming from the theory of inverse problems, for which we will derive explicit approximation rates. In Section 5 we specialize the construction of Section 3 in a continuous and a discrete frame, regarding the latter as a Monte Carlo estimate of the former. Our main result, Theorem 4, establishes quantitative consistency of the Monte Carlo estimate on a class of Sobolev-regular signals. In Section 6 we provide an implementable formula to compute our Monte Carlo wavelets from the eigendecomposition of a sample kernel matrix.
2 Setting
Let be a locally compact, second countable topological space endowed with a Borel probability measure . We fix a continuous positive semi-definite kernel
[TABLE]
and let be the corresponding reproducing kernel Hilbert space
[TABLE]
with and inner product given by . Assuming , we define the integral operator
[TABLE]
Under these assumptions, is a positive self-adjoint trace class operator with spectrum . Let
[TABLE]
be the spectral decomposition of over the orthonormal basis . The subspace
[TABLE]
splits as .
As examples of this setting, we can think of as , or non-Euclidean domains such as a compact connected Riemannian manifold or a weighted graph. In all these cases, we can take as the heat kernel associated with the proper notion of Laplacian, be it the Laplace-Beltrami operator or the graph Laplacian.
3 Frame construction
To construct our frame, we take inspiration from [8] and references therein. We start by defining filters on the spectrum of . Let be a family of bounded measurable functions satisfying
[TABLE]
is then a positive, self-adjoint operator on with , for every . We can thus define
[TABLE]
Note that, if , then since . Moreover, using self-adjointness of , the reproducing property (2), and expression (4), we can rewrite as
[TABLE]
This reveals (6) as a generalization of classical wavelets, where ’s correspond to eigenfunctions of the Laplacian. We may therefore interpret and as location and scale parameters, respectively.
The following proposition shows that the family defined through (6) is a Parseval frame on .
Proposition 1**.**
For every ,
[TABLE]
Proof.
Let . Since is self-adjoint, we have . Hence, integrating over we get
[TABLE]
Summation over along with (5) thus gives
[TABLE]
The assertion follows since is dense in . ∎
4 Examples of spectral filters
Let be a family of functions such that
[TABLE]
We can define filters obeying (5) by means of (8), as
[TABLE]
The qualification of a spectral function is a constant such that, for all ,
[TABLE]
As instances of , we can take spectral functions in Table 1 (see also [3]).
These functions play a fundamental role in ill-posed inverse problems regularization, where the error of a given regularizer decays with a rate governed by its qualification.
5 Consistency
From now on, assume , whence by (4). Therefore, (6) is a frame on the whole space , thanks to Proposition 1. Suppose we are given independent and identically distributed samples
[TABLE]
Then we can consider the empirical distribution
[TABLE]
define the empirical integral operator
[TABLE]
and repeat the construction of Section 3, to get
[TABLE]
In view of Proposition 1, (11) defines a discrete Parseval frame on
[TABLE]
Since (10) is a Monte Carlo estimate of (3), we call (11) a family of Monte Carlo wavelets. Note that, as we take more and more samples, we obtain a sequence of discrete Parseval frames on a sequence of nested subspaces of increasing dimension:
[TABLE]
Our goal is to establish consistency of the Monte Carlo estimate (11), that is, to see whether, and in what sense, (11) tends to (6) as .
Now, let
[TABLE]
be the frame operator at scale and its empirical counterpart. Proposition 1 gives a resolution of the identity, which can be approximated by a truncated empirical version:
[TABLE]
We split the resolution error of (13) on a signal into the approximation and the sampling error,
[TABLE]
and derive quantitative bounds for each term (Propositions 2 and 3). Tuning the resolution in terms of the sample size will then yield our consistency result in Theorem 4.
For what concerns the approximation error, note that Proposition 1 already implies
[TABLE]
by dominated convergence. However, in order to obtain a convergence rate, we need to assume some regularity of the signal . Specifically, we will assume the following Sobolev smoothness condition [4]:
[TABLE]
Proposition 2** (Approximation error).**
Let as in (8). Suppose has qualification . Then, for every ,
[TABLE]
where .
Proof.
By Lemma 5 we have , hence
[TABLE]
Thus,
[TABLE]
Bound on the sampling error follows by concentration of the empirical integral operator on the continuous operator .
Proposition 3** (Sampling error).**
Let be as in (8). Suppose is Lipschitz continuous on with Lipschitz constant bounded by . Then, for every , with probability higher than ,
[TABLE]
Proof.
Using Lemma 6 we have
[TABLE]
Thanks to the reproducing property (2), we can write
[TABLE]
This allows for the concentration in Lemma 7, which yields
[TABLE]
with probability higher than . ∎
We are finally ready to state the consistency of our Monte Carlo discretization.
Theorem 4**.**
Let as in (8). Suppose has qualification , and is Lipschitz continuous on with Lipschitz constant bounded by . Then, for every , with probability higher than ,
[TABLE]
where .
Proof.
We split in (14), and combine the bounds in Propositions 2 and 3. In order to balance the two terms, we choose , which concludes the proof. ∎
In particular, in view of Lemma 8, we can use the spectral filters defined in Table 1, for which we obtain the following rates:
6 Numerical implementation
To implement the system of Monte Carlo wavelets in (11) we exploit equation (7), that is,
[TABLE]
where is the spectral decomposition of . In particular, we need to compute eigenvalues and eigenvectors of , translating the calculations from to and back [10]. To this end, we introduce the restriction map
[TABLE]
whose adjoint defines the out-of-sample extension
[TABLE]
Let
[TABLE]
be the (normalized) sample kernel matrix. Now and , hence and share the same eigenvalues. Moreover, the eigenvectors of are related to the ones of by the singular value decomposition of :
[TABLE]
Thus, we have
[TABLE]
which evaluated at gives
[TABLE]
We therefore obtain
[TABLE]
In summary, given a kernel (1), filters and samples (9), a numerical implementation of the Monte Carlo wavelets (11) can be performed from the sample kernel matrix (16) alone following the steps below:
compute eigenvalues and eigenvectors of ; 2. 2.
apply equation (17).
Lemmata
Lemma 5**.**
Let , and be defined as in (3), (5) and (12). Then .
Proof.
For every we have
[TABLE]
Thus,
[TABLE]
Since , the claim follows by polarization. ∎
Lemma 6**.**
Let be self-adjoint operators on a Hilbert space , and let be a Lipschitz continuous function with Lipschitz constant . Then
[TABLE]
Proof.
Let and be orthonormal bases of such that and . Then
[TABLE]
Lemma 7** ([10, Theorem 7]).**
The operators and defined in (3) and (10) are Hilbert-Schmidt and
[TABLE]
with probability higher than .
Lemma 8**.**
Let be as in Table 1. The function is Lipschitz continuous on , with Lipschitz constant bounded by .
Proof.
The claim follows by a direct computation of the derivative of . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Christ. A T ( b ) 𝑇 𝑏 T(b) theorem with remarks on analytic capacity and the Cauchy integral. Colloq. Math. , 60/61(2):601–628, 1990.
- 2[2] T. Coulhon, G. Kerkyacharian, and P. P. Petrushev. Heat kernel generated frames in the setting of Dirichlet spaces. J Fourier Anal Appl , 18(5):995–1066, 2012.
- 3[3] H. W. Engl, M. Hanke, and A. Neubauer. Regularization of inverse problems , volume 375 of Mathematics and its Applications . Kluwer Academic Publishers Group, Dordrecht, 1996.
- 4[4] H. G. Feichtinger, H. Führ, and I. Z. Pesenson. Geometric space–frequency analysis on manifolds. J Fourier Anal Appl , 22(6):1294–1355, 2016.
- 5[5] M. Fornasier and H. Rauhut. Continuous frames, function spaces, and the discretization problem. J Fourier Anal Appl , 11(3):245–287, 2005.
- 6[6] D. Freeman and D. Speegle. The discretization problem for continuous frames. Advances in Mathematics , 345:784 – 813, 2019.
- 7[7] H. Führ, K. Gröchenig, A. Haimi, A. Klotz, and J. L. Romero. Density of sampling and interpolation in reproducing kernel Hilbert spaces. J. Lond. Math. Soc. (2) , 96(3):663–686, 2017.
- 8[8] F. Göbel, G. Blanchard, and U. von Luxburg. Construction of tight frames on graphs and application to denoising. In W. K. Härdle and X. Shen H.-S. Lu, editors, Handbook of Big Data Analytics , chapter 20, pages 503–522. Springer, 2018.
