Approximation by translates of a single function of functions in space induced by the convolution with a given function
Dinh D\~ung, Charles A. Micchelli, Vu Nhat Huy

TL;DR
This paper investigates how well functions can be approximated using linear combinations of translates of a single function, providing bounds on approximation rates in various $L_p$ spaces for periodic functions induced by convolution.
Contribution
It introduces new methods for approximation and establishes upper and lower bounds for the approximation rates in $L_p$ spaces, advancing understanding of convolution-induced function classes.
Findings
Established upper bounds for $L_p$ approximation convergence rates.
Derived lower bounds for the best approximation in the case $p=2$.
Proposed methods for approximation of convolution-induced function classes.
Abstract
We study approximation by arbitrary linear combinations of translates of a single function of periodic functions. We construct some methods of this approximation for functions in a class induced by the convolution with a given function, and prove upper bounds of -the approximation convergence rate by these methods, when , for , and lower bounds of the quantity of best approximation of this class by arbitrary linear combinations of translates of arbitrary function, for the particular case .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Approximation and Integration · Approximation Theory and Sequence Spaces · Advanced Harmonic Analysis Research
Approximation by translates of a single function
of functions in space induced by the convolution with a given function
Dinh Dũnga111Corresponding author. Email: [email protected]., Charles A. Micchellib and Vu Nhat Huyc
a Vietnam National University, Information Technology Institute
144 Xuan Thuy, Hanoi, Vietnam
bDepartment of Mathematics and Statistics, SUNY Albany
Albany, 12222, USA
c College of Science, Vietnam National University
334 Nguyen Trai, Thanh Xuan, Ha Noi
(September 26, 2016 -- Version 2.0)
Abstract
We study approximation by arbitrary linear combinations of translates of a single function of periodic functions. We construct some methods of this approximation for functions in a class induced by the convolution with a given function, and prove upper bounds of -the approximation convergence rate by these methods, when , for , and lower bounds of the quantity of best approximation of this class by arbitrary linear combinations of translates of arbitrary function, for the particular case .
Keywords: Function spaces induced by the convolution with a given function; Reproducing kernel Hilbert space; Approximation by arbitrary linear combinations of translates of a single function.
Mathematics Subject Classifications: (2010) 41A46; 41A63; 42A99.
1 Introduction
The purpose of this paper is to improve and extend the ideas in the recent papers [2, 3] on approximation by translates of the multivariate Korobov function. The motivation for the results given in [2, 3], and those presented here come from Machine Learning, since certain cases of our results here relate to approximation of a function by sections of a reproducing kernel corresponding to specific Hilbert space of functions. This relationship to Machine Learning is described in the papers [2, 5] and is not reviewed in detail here. Nonetheless, in this regard, we recall that the observation presented in [5] provide necessary and sufficient conditions for sections of a reproducing kernel to be dense in continuous functions in the corresponding Hilbert space. This result begs the question of the convergence rate of approximation by sections of a reproducing kernel. We refer the reader to [2, 3] for detailed survey and bibliography on the problems considered in the present paper. Here, in this paper, we introduce a weighted Hilbert space of multivariate periodic functions and provide insights into this question. The results presented here also extend other norms on multivariate periodic functions and these results are presented separately in this paper.
We shall begin the study of this problem with a description of the notation used throughout the paper. In this regard, we merely follow closely the presentation in [2, 3]. The -dimensional torus denoted by is the cross product of copies of the interval with the identification of the end points. When , we simply denote the -torus by . Functions on are identified with functions on which are periodic in each variable. We shall denote by , the space of integrable functions on equipped with the norm
[TABLE]
and we shall only consider only real valued functions on . However, all the results in this paper are true in the complex setting. Also, we will use the Fourier series of a real valued function in complex form.
For vectors and in we use for the inner product of with . Given any integrable function on and any lattice vector , we let denote the -th Fourier coefficient of defined by the equation
[TABLE]
Frequently, we use the superscript notation to denote the cross product of copies of a given set in .
Let and be defined be the equation . Assume that belongs to and can be represented as the Fourier series
[TABLE]
in distributional sense for some sequence with nonzero components. Notice that if is absolutely summable, the function is continuous on . In the case that we merely write for the univariate function . The special case when , and is given by the equation
[TABLE]
In this case, the function corresponds to the Korobov function which was the focus of study in [2]. In general, we introduce a subspace of defined as
[TABLE]
with norm
[TABLE]
where we denote the convolution of any two functions and on , as , and as usual, define it at by equation
[TABLE]
whenever the integrand is in .
The space is particularly interesting as it has an interpretation in Machine Learning which is described in detail in the papers [2, 5]. As in the paper [2] we are concerned with the following concept. Let be a prescribed subset of and be a given function on . Set . We are interested in the approximation in -norm of all functions by arbitrary linear combinations of translates of the function , that is, by the functions in the set and measure the error in terms of the quantity
[TABLE]
The aim of the present paper is to investigate the convergence rate, when , of , where is the unit ball in . We shall also obtain a lower bound for the convergence rate as of the quantity
[TABLE]
which gives information about the best choice of .
This paper is organized in the following manner. In Section two we introduce the method of approximation used throughout the paper and provide error estimates for both the univariate and multivariate cases. In Section three we apply these results to the problem described earlier, in particular, of approximating periodic functions by sections of reproducing kernels. We continue this line of investigation in Section four by relying upon observations of V. Maiorov [4] as a means to establish lower bounds of approximation.
2 A linear method of univariate approximation
In this section, we introduce a method of approximation induced by translates of the function defined in equation (1.1) in the univariate case. We do this in some greater generality. To the end, we start with the functions of the form given in equation (1.1). We introduce a trigonometric polynomial defined at as
[TABLE]
that is, for . For a function represented as , , we define the operator
[TABLE]
where and Our goal is to obtain an estimate for the error of approximating a function by a linear combination of translates of the function . For the moment, we assume that and put
[TABLE]
2.1 Error estimates for functions in the space
Theorem 2.1
We put
[TABLE]
where , the unique integer in such that the number is an integer, and . Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Proof. We define the kernel for as
[TABLE]
and easily obtain from our definition (2.3) the equation
[TABLE]
We now use equation (1.1), the definition of the trigonometric polynomial given in equation (2.2) and the easily verified fact, for , that
[TABLE]
to conclude that
[TABLE]
We again use the formula for the function given in equation (1.1) to get that
[TABLE]
For we have that and the above expression becomes
[TABLE]
From this equation and the definition of we deduce the formula
[TABLE]
By the triangle inequality we have
[TABLE]
Parseval’s identity gives us the equation
[TABLE]
Hence, by our assumption on and the inequality
[TABLE]
we obtain that
[TABLE]
Next, by using Parseval’s identity again, we have that
[TABLE]
This, together with (2.4) and (2.5), proves the theorem.
Definition 2.2
The sequence will be called a nondecreasing-type sequence if there exists a positive constant such that for all satisfying the inequality .
Theorem 2.3
Let \Big{\{}\frac{|\beta_{k}|}{|\alpha_{k}|}:k\in{\mathbb{Z}}\Big{\}} and be nondecreasing-type sequences. Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Proof. From our hypothesis we have that ,
[TABLE]
and for all with some positive constants , and . From these inequalities and Theorem 2.1, the proof of the result is complete.
From this theorem we have the following result.
Corollary 2.4
Let for all , and \Big{\{}\frac{|\lambda_{k}|}{k^{r}}:k\in{\mathbb{Z}}\Big{\}} be a nondecreasing-type sequence for some . Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Proof. We see from the hypothesis that
[TABLE]
for some positive constant and then from which it follows that
[TABLE]
for all . Hence, we conclude that
[TABLE]
Note that, since we have and then by applying Theorem 2.3 we complete the proof.
Corollary 2.5
Let for all , and be a nondecreasing-type sequence. Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Proof. The inclusions and the equations for all yield that for all with a positive constants . Hence, by applying Theorem 2.3 we prove the corollary.
Similarly to the proof of Corollary 2.4 we can prove the following fact.
Corollary 2.6
Let for all and \Big{\{}\frac{|\lambda_{k}|}{k^{r}}:k\in{\mathbb{Z}}\Big{\}} be a nondecreasing-type sequence . Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Corollary 2.7
Let and for all , and . Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Remark 2.8
Note that under the assumptions of Corollary 2.5, is the reproducing kernel for the Hilbert space . This means, for every function and , we have that
[TABLE]
where denotes the inner product on the Hilbert space . It is known that the linear span of the set of functions is dense in the Hilbert space . Under a certain restriction on the sequence , Corollaries 2.5, 2.6 and 2.7 give an explicit rate of the error of the linear approximation of by the function belonging . For a definitive treatment of reproducing kernels, see, for example, [1]. Corollary 2.7 has been proven as Theorem 2.10 in [2] where is the Korobov space .
2.2 Error estimates for functions in the space
For this purpose, we define, for , the quantity
[TABLE]
where is defined as in Theorem 2.1, and for the sequence .
Now, we are ready to state the the following result.
Theorem 2.9
If then there exists a positive constant such that for all and , we have that
[TABLE]
and consequently,
[TABLE]
Proof. In order to prove (2.7) we need an auxiliary result which is a direct corollary of the well-known Marcinkiewicz multiplier theorem, see, e.g., [2, Lemma 2.7]. For and an integrable function on we introduce the trigonometric polynomial
[TABLE]
Then there exists an absolute positive constant such that for all and we have that
[TABLE]
In a completely similar way as in the proof of Theorem 2.1, we can establish the formula
[TABLE]
for a function represented as , .
The next step is to decompose each sum above into two parts. Specifically, we write the first sum above as
[TABLE]
We call the the first sum in equation (2.10) and the other . Now, we readily rewrite in the form
[TABLE]
We shall express the right hand side of equation (2.11) in an alternate form by using summation by parts. For this purpose, we introduce the modified difference operator defined on vectors as
[TABLE]
With this notation in hand and the fact that, for we have that , we conclude that
[TABLE]
Consequently, according to (2.8) and the Hölder inequality, there exists a positive constant such that
[TABLE]
From this inequality and the definition of , given in equation (2.6), we conclude that
[TABLE]
A bound on follows by a similar argument and yields the inequality
[TABLE]
There still remains the task of bounding the second sum in equation (2.9). As before, we split it into two parts
[TABLE]
and call the first sum above and the second sum . As before, summation by parts yields the alternate form
[TABLE]
from which we deduce that
[TABLE]
Therefore, by (2.6) we obtain that
[TABLE]
and, in a similar way, we prove that
[TABLE]
Combining our remarks above proves the result.
Now, we are ready to state the the following result.
Theorem 2.10
Let and , . Assume that and are nondecreasing, positive sequences. Then there exists a positive constant such that for all and , we have that
[TABLE]
and consequently,
[TABLE]
Proof. Since be a nondecreasing sequence, we have that for all and then it follows that
[TABLE]
From the inequalities we have that
[TABLE]
Hence, we obtain that
[TABLE]
and also that
[TABLE]
Then, it follows from the hypothesis that and are both nondecreasing, positive sequences, from which we deduce for all that
[TABLE]
Consequently, we conclude that
[TABLE]
From inequalities (2.13), (2.14) and Theorem 2.9 we confirm (2.12) which completes the proof the theorem.
Remark 2.11
Note that for the sequence defined as
[TABLE]
* becomes the Korobov space and from Theorem 2.10 we derive the following estimate*
[TABLE]
which have been proven is in [2].
From Theorem 2.10 we immediately derive the following corollary.
Corollary 2.12
Let , for all where . Then there exists a positive constant such that for all and , we have that
[TABLE]
and consequently,
[TABLE]
Definition 2.13
Let . A function will be called a mask of type if is an even function, twice continuously differentiable such that for , for some function , where for some constant for all and . A sequence will be called a sequence mask of type if there exists a mask of type such that for all .
In the next two theorems and their proofs we use the abbreviated notation: and .
Theorem 2.14
Let and the sequence be a sequence mask of type . Then there exists a positive constant such that for all and ,
[TABLE]
and consequently,
[TABLE]
Proof. According the hypothesis we have, by Theorem 2.10, that
[TABLE]
where
[TABLE]
Note that for
[TABLE]
where . Therefore, we have for , that
[TABLE]
Consequently, we conclude that
[TABLE]
We also have that
[TABLE]
where We complete the proof by using equations (2.15) and (2.16).
Definition 2.15
A function will be called a function of exponent-type if is two times continuously differentiable and there exists a positive constant such that for some decreasing function The sequence will be called a sequence mask of exponent-type if there exists a function of exponent-type such that for all .
Theorem 2.16
Let and the sequence be a sequence mask of type , the sequence be sequence mask of exponent-type. Then there exists a positive constant such that for all and ,
[TABLE]
and consequently,
[TABLE]
Proof. We have proven in Theorem 2.14 that
[TABLE]
Also, we have that
[TABLE]
and so we obtain that
[TABLE]
The proof is complete.
3 Multivariate Approximation
3.1 Error estimates for functions in the space
Definition 3.1
For we define
[TABLE]
We introduce the trigonometric polynomial defined at as
[TABLE]
where we set for .
For a function represented as , , we define the operator
[TABLE]
where and We put
[TABLE]
where and is the unique vector in such that and for all , and .
Definition 3.2
The sequence will be called a non decreasing-type sequence if for all satisfying the inequalities , .
In a similar way to the proof of Theorems 2.9 and 2.10 we can prove the following two theorems.
Theorem 3.3
There exists a positive constant such that for all and , we have that
[TABLE]
and consequently,
[TABLE]
Theorem 3.4
Let for all with a positive constants and be a non decreasing-type sequence. Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
From the above corollary we have the following result.
Corollary 3.5
Let \Big{\{}\frac{|\beta_{\bf k}|}{|{\bf k}|^{r}|\lambda_{\bf k}|}:{\bf k}\in{\mathbb{Z}}^{d}\Big{\}} be a non decreasing-type sequence for some . Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Proof. We see from the hypothesis that for all with , we have that
[TABLE]
and then it follows that
[TABLE]
Hence, we conclude that
[TABLE]
and so
[TABLE]
Note that for the series is convergent which completes the proof of the corollary.
Corollary 3.6
Let for all , and the sequence \Big{\{}\frac{|\lambda_{\bf k}|}{|{\bf k}|^{r}}:{\bf k}\in{\mathbb{Z}}^{d}\Big{\}} be non decreasing-type for some . Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
Corollary 3.7
Let for all ; the sequence \Big{\{}\frac{|\lambda_{\bf k}|}{|{\bf k}|^{r}}:{\bf k}\in{\mathbb{Z}}^{d}\Big{\}} is nondecreasing-type for some . Then there exists a positive constant such that for all and we have that
[TABLE]
and consequently,
[TABLE]
3.2 Convergence rate
Theorem 3.8
Let be a nondecreasing function such that for all . If for all then there exist positive constants and such that for all ,
[TABLE]
Proof. Let be any natural number and set
[TABLE]
where denoted by the number of elements of a finite set . Then, for some positive constants and , we have that
[TABLE]
Let be any natural number satisfying . We define the positive integers
[TABLE]
and as the largest natural number satisfying the inequality
[TABLE]
where . Then it follows from (3.1) that there exists a positive number independent of such that
[TABLE]
We consider the set of trigonometric polynomials
[TABLE]
where
[TABLE]
Let
[TABLE]
be any polynomial from . Since
[TABLE]
belongs to and consequently, . Take an arbitrary function from . Then it follows from a result in [4] that there exists a function and a positive number such that for any linear combination of translates of
[TABLE]
we have that
[TABLE]
Therefore, by using (3.2) and for all , there exists a positive constant independent of such that
[TABLE]
which proves the lower bound of the theorem.
By using Corollary 3.5 for and , there exists a positive constant independent of such that
[TABLE]
where and That gives us the inequality
[TABLE]
and then we get that
[TABLE]
Hence, by using for all , we obtain the remaining desired result
[TABLE]
The proof is complete.
Acknowledgments Dinh Dũng’s research is funded by Vietnam National Foundation for Science and Technology Development (NAFOSTED) under Grant No. 102.01-2017.05. Dinh Dũng and Vu Nhat Huy thank Vietnam Institute for Advanced Study in Mathematics (VIASM) for providing a fruitful research environment and working condition. In addition, Charles Micchelli wishes to acknowledge partial support from NSF under Grant DMS 1522339.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Aronszajn, Theory of reproducing kernels, Trans. Amer. Math. Soc. 68 (1950), 337-404.
- 2[2] Dinh Dũng and Charles A. Micchelli, Multivariate approximation by translates of the Korobov function on Smolyak grids, Journal of Complexity, 29 (2013), 424-437.
- 3[3] Dinh Dũng and Charles A. Micchelli, Corrigendum to ”Multivariate approximation by translates of the Korobov function on Smolyak grids” [J. Complexity 29 (2013) 424-437], Journal of Complexity 35 (2016), 124-125.
- 4[4] V. Maiorov, Almost optimal estimates for best approximation by translates on a torus, Constructive Approx. 21 (2005), 1–20.
- 5[5] C. A. Micchelli, Y. Xu, H. Zhang, Universal kernels, Journal of Machine Learning Research 7 (2006), 2651-2667.
