Polynomial Approximation of Anisotropic Analytic Functions of Several Variables
Andrea Bonito, Ronald DeVore, Diane Guignard, Peter Jantsch, Guergana, Petrova

TL;DR
This paper develops methods for approximating multivariate analytic functions, especially in high or infinite dimensions, using algebraic polynomials with optimal lower set structures, relevant for solving parametric PDEs.
Contribution
It introduces a framework for polynomial approximation of anisotropic functions in high dimensions, identifying optimal lower sets for certifiable error bounds, applicable even for small polynomial dimensions.
Findings
Optimal lower sets provide near-best approximation errors.
Results hold uniformly for all polynomial dimensions n ≥ 1.
Approximations are effective in high or infinite variable settings.
Abstract
Motivated by numerical methods for solving parametric partial differential equations, this paper studies the approximation of multivariate analytic functions by algebraic polynomials. We introduce various anisotropic model classes based on Taylor expansions, and study their approximation by finite dimensional polynomial spaces described by lower sets . Given a budget for the dimension of , we prove that certain lower sets , with cardinality , provide a certifiable approximation error that is in a certain sense optimal, and that these lower sets have a simple definition in terms of simplices. Our main goal is to obtain approximation results when the number of variables is large and even infinite, and so we concentrate almost exclusively on the case . We also emphasize obtaining results which hold for the full…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Polynomial Approximation of Anisotropic Analytic Functions of Several Variables
Andrea Bonito, Ronald DeVore, Diane Guignard, Peter Jantsch, and Guergana Petrova
This research was supported by the NSF grants DMS-1817691 (AB), DMS 15-21067 (RD-GP), DMS 18-17603 (RD-GP), ONR grants N00014-17-1-2908 (RD), N00014-16-1-2706 (RD); DG was supported by the Swiss National Science Foundation grant P2ELP2-175056 and IAMCS at TAMU, and PJ was supported by an NSF Fellowship DMS-1704121. A portion of this research was completed while RD (Simon Fellow), DG, and PJ were supported as visitors of the Isaac Newton Institute at Cambridge University.
Abstract
Motivated by numerical methods for solving parametric partial differential equations, this paper studies the approximation of multivariate analytic functions by algebraic polynomials. We introduce various anisotropic model classes based on Taylor expansions, and study their approximation by finite dimensional polynomial spaces described by lower sets . Given a budget for the dimension of , we prove that certain lower sets , with cardinality , provide a certifiable approximation error that is in a certain sense optimal, and that these lower sets have a simple definition in terms of simplices. Our main goal is to obtain approximation results when the number of variables is large and even infinite, and so we concentrate almost exclusively on the case . We also emphasize obtaining results which hold for the full range , rather than asymptotic results that only hold for sufficiently large. In applications, one typically wants small to comply with computational budgets.
**Mathematics Subject Classification ** 41A10, 41A58, 41A63, 65N15
1 Introduction
Polynomial and piecewise polynomial approximation are a staple in numerical analysis. For example, approximation by piecewise polynomials on simplicial partitions is the underpinning of Finite Element Methods. In that setting, one approximates the solution to a partial differential equation (PDE) on a domain , where is typically small (). The solution to the PDE typically has limited regularity, and the rate of approximation is of order , where is the number of degrees of freedom in the approximation and is small. This type of approximation is well-understood by means of theorems which relate the approximation order to the smoothness order of in certain Sobolev and Besov spaces (see [6, 7, 8, 9]). The approximation rate takes the form and therefore deteriorates as increases. This is commonly referred to as the curse of dimensionality.
The present paper is interested in a different setting that arises in other application areas, in particular when using numerical methods for solving stochastic or parametric PDEs. In that setting, one wishes to approximate the solution to the parametric PDE which depends on input parameters and takes values in a Banach space . The parameters come from a set where is large or even infinite. Hence, it is often crucial to perform a model reduction (dimension reduction) for the solution map of the parametric PDE. One possibility to obtain such dimension reduction is to approximate by Banach space valued polynomials in . The main property of that makes such an approximation possible is that under standard assumptions on the parametrized coefficients of the PDE, it is known that admits an analytic extension onto certain complex polydiscs that contain (see [13]). In other words, has a certain anisotropic analyticity. This motivates the study of approximation of anisotropic analytic functions by polynomials, which is the subject of the present paper. Although we are motivated by parametric PDE applications, we formulate and study this subject as purely a problem in multivariate approximation. In this way, we hope to draw the attention of the approximation community to this area of research.
For the most part, we are interested in the case of an infinite number of parameters, i.e., . This allows us to prove results which are immune to the dimension and is a common setting in parametric PDEs. Specifically, we take parameters in the set , where is the set of natural numbers. Sometimes we remark on the case with finite, in particular, when we wish to compare our results with other results in the literature established only for finite .
Let denote the set of all infinite sequences with entries , where only a finite number of the entries in are allowed to be nonzero. If is a finite subset of , we denote by the space of -valued polynomials spanned by the monomials , where the come from the set . Thus, any element of has the form
[TABLE]
where the coefficients come from . Here and throughout the paper, we use standard multivariate notation. In particular, . Since has only a finite number of nonzero entries, any such product is finite.
For any , and any finite set , we define the error of approximation of by polynomials in to be
[TABLE]
where consists of all functions on that are bounded mappings into .
If no conditions are imposed, then the potential sets may be quite complex and beyond the scope of numerical methods. For this reason, one usually imposes additional structure on these sets such as fixed total degree or fixed coordinate degree in the case is finite. We are especially interested in the case where the sets are lower sets, that is, sets with the property
[TABLE]
We consider the collection of lower sets with cardinality ,
[TABLE]
and for given compact class of functions in , and any finite set , we define
[TABLE]
and
[TABLE]
Notice that in this definition the set is allowed to depend on , but cannot change for the various . So, as formulated, this is a problem of finding the best linear space to use when approximating . Hence, the optimal performance satisfies
[TABLE]
where denotes the Kolmogorov -width of in . The sets are commonly called model classes. The case where the error of approximation is measured in , , is also interesting but not studied here.
Given the model class , we are interested in several fundamental issues:
- •
First, what can we say about the rate of decay of as increases? By now, there are several results in the literature that give upper bounds on for certain anisotropic analytic classes of the type analyzed in this paper. Most often these bounds have been developed in the setting where is a solution to a parametric PDE. So part of our effort is to separate out which of these results are simply a result of the analyticity of and do not use any additional properties of the PDE solution.
- •
A second important issue is the optimality of the known bounds for . Indeed, the typical results only give upper bounds for and sometimes only for sufficiently large.
- •
A third significant problem in this area of research is to give a recipe for finding good lower sets such that performs at or near . This can be a nontrivial issue in numerical applications since, given a budget , searching over all lower sets in to find a suitable is prohibitive.
As already noted, we are interested in model classes described by some form of anisotropic analyticity. We focus on valued functions that are analytic on a polydisc consisting of all complex sequences , with , . Here, is always a nondecreasing sequence of positive numbers with . The functions in have more smoothness in the variable as increases. In turn, the influence of this variable on the value of at a point in is weaker. Any function in has Taylor coefficients that are elements of . In §2, we introduce a variety of spaces , which differ in the assumptions imposed on the Taylor coefficients. These spaces are motivated by recent work (see [1, 15, 13]) in parametric PDEs.
The remainder of the paper concentrates on understanding the rate of decay of for these model classes and understanding how to choose lower sets of cardinality which attain . It turns out that the norm is difficult to work with and so we replace it by a certain surrogate majorant. In §3, we show that estimating the error of approximating in the surrogate norm and finding optimal lower sets in this norm has a simple solution. Namely, given any , the smallest lower set for which approximates to accuracy is given by the set of lattice points in a certain simplex determined by and . Therefore, understanding the rate of decay of is equivalent to counting the number of lattice points in these simplices.
Of course, counting lattice points in simplices is a well studied problem in number theory where several deep results are known. General results typically only hold for large when the number of lattice points can be estimated through the volume of the simplex. In numerical applications, the pre-asymptotic region is the most important since it corresponds to the only which can be implemented in computation. Therefore, we focus on counting lattice points when is small. For results of this type, one needs to have more specific information on the simplices, and therefore on the sequence . This leads us to consider specific anisotropic classes that arise in applications. These correspond to sequences which grow polynomially.
For , we define the sequence with , . The problem of counting lattice points in the simplex associated to this sequence is directly related to counting the number of multiplicative partitions of integers. One can therefore use the results in [10] to do exact and asymptotic analysis for the number of such lattice points. Exact counts on the number of multiplicative partitions of an integer are known for certain values of . Making such counts becomes numerically more intensive as increases.
It turns out that this situation can be alleviated some by slightly modifying the sequence . We modify this sequence to obtain a related sequence with the same asymptotic decay as . The advantage of this modification is that the number of lattice points of the modified sequence is related to the number of additive partitions of an integer rather than the multiplicative partitions. Finding additive partitions is somewhat easier numerically. We show how one can do an exact count of lattice points for in §5. In §5.2, we give an asymptotic analysis for this count by using known asymptotic bounds for the number of additive partitions of integers. In §6, we give some simple recipes for how to find the optimal for various sequences . Finally, in §7, we make some final remarks and compare out results with those in [20].
Let us close this introduction by mentioning that the results in this paper have a large intersection with several earlier papers. As we have already noted, our motivation for the introduction of the spaces stems from several works on parametric PDEs, see the survey [13] and the references therein. Let us also mention [5, 16] which study approximation of anisotropic analytic functions in a quite general tensor product framework. There are several papers, most notably [5, 20, 22], that realize that one method to construct approximations for solutions of parametric PDEs is related to counting lattice points. In [20], this count is done for certain non-simplicial sets as well. We touch more on some of these works later in the paper once our results are formulated.
2 Anisotropic analyticity and motivation
In this section, we introduce a variety of model classes based on some form of anisotropic analyticity. We recall that throughout this paper denotes a non-decreasing sequence of positive real numbers with , and . We call any sequence with these properties admissible. We recall the Banach spaces of all bounded complex valued sequences , with its usual norm . We let denote the unit ball of in this section.
Perhaps the most natural class of anisotropic analytic functions is the following. We start with the complex (open) polydisc , which consists of all , , for which , and define as the set of all for which , , . We then define
[TABLE]
as the set of all functions which are bounded on , continuous on , and holomorphic in each variable , , on . We can equip this space with the norm
[TABLE]
Because the sequence is non-decreasing, we see that functions in have more smoothness in the variable as increases. These spaces are analogous to Hardy spaces.
If , then the support of is finite and any has uniquely defined Taylor coefficients
[TABLE]
where again we are using standard multivariate notation. Note here that the definition of requires only the function , for a suitable finite value of . Since this is an analytic function of a finite number of variables, these coefficients are well defined from the usual theory of functions of a finite number of variables.
In what follows, we are interested in representing in a Taylor series expansion
[TABLE]
An important issue is the sense in which the above Taylor series converges. For this, we follow Section 3.1 of [13]. It is shown in that paper that any rearrangement of this series converges uniformly on whenever is in . This guarantees that there is a function defined on such that any rearrangement of the terms in the series in (2.1) converges in uniformly to . We call this type of convergence uniform unconditional.
Convergence of the Taylor series associated to does not guarantee that its limit is equal to . For this one requires additional structure. A sufficient condition is that has the following property:
Truncation Property: For all , we have
[TABLE]
This property is known to hold for the solutions to parametric PDEs.
Our first observation is that whenever is in , then has a bound on its Taylor coefficients.
Lemma 2.1**.**
If , then
- (i)
the Taylor coefficients of satisfy the bounds
[TABLE] 2. (ii)
if in addition, has the Truncation Property and is in , we have
[TABLE]
with uniform unconditional convergence of the series.
Proof: We use a slight modification of the proof of Lemma 3.14 in [13] accounting for the fact that the assumptions of the lemma do not guarantee that is holomorphic on an open set containing as is required in that lemma of [13]. If we fix and any sequence , we claim that
[TABLE]
Given , let contain the support of . We consider the function , where , and is zero otherwise. Then is the corresponding Taylor coefficient of and the bound (2.4) is derived from Cauchy’s formula as in [13]. Since this bound holds for any and the support of is finite, we obtain (2.3) by letting , for each in the support of . The uniqueness of again follows from the fact that has only a finite number of nonzero coordinates and is determined by restricting to the finite number of coordinates corresponding to where is nonzero. This proves (i).
For the proof of (ii), the assumption guarantees the convergence of the Taylor series and then the fact that its sum is follows easily (see Proposition 2.1.5 in [22]).
This lemma motivates the definition of the following class of functions.
**Definition of : ** We say that a function defined on and taking values in , is in the space if admits a representation
[TABLE]
with the convergence of the series uniform unconditional on , and where the are unique and satisfy
[TABLE]
Another type of restriction on functions , derived in the context of parametric PDEs (see [1]), is that
[TABLE]
This motivates the general definition of the following model classes.
Definition of : For any , we define the space , as the set of all which admit a representation
[TABLE]
with the convergence of the series uniform unconditional on , and where the are unique and satisfy
[TABLE]
Notice that these classes get smaller as decreases: when . We study the approximation of the model classes in this paper.
We could similarly define anisotropic spaces using other sequence norms in place of norms, for example, Lorentz space norms. However, we will not explore this in the present paper.
Remark 2.2**.**
We have introduced spaces of anisotropic analytic functions by imposing conditions on Taylor coefficients. One could replace the Taylor basis , , by other polynomial bases and define corresponding spaces of analytic functions. A particularly interesting case is when the polynomial basis consists of Legendre polynomials, since such expansions occur naturally in parametric PDEs (see [12]).
3 The approximation of functions in
In this section, we give first estimates for the error in approximating functions in by polynomials in , with a lower set. We follow the ideas in [15] which treats the case . Recall that in this paper we limit our discussion to the approximation of in the norm. This norm is not easy to access especially when is a general Banach space. However, if has a Taylor expansion , , then it has a simple majorant given by
[TABLE]
The surrogate norm is defined and finite only if has a Taylor expansion valid on and is in . We assume that this is the case in going further in this section. As we shall see below, this assumption is easy to verify when under suitable assumptions on . This leads us to consider the surrogate error
[TABLE]
and similarly
[TABLE]
for the surrogate performance on a compact set . Given any set , the polynomial
[TABLE]
provides an approximation to which satisfies
[TABLE]
We now describe a simple way to find a lower set from which gives the smallest surrogate error for the unit ball of , , among all lower sets from . Given the sequence and given any , we define
[TABLE]
Notice that has the following properties:
- •
whenever , since is non-decreasing, with and ;
- •
is a lower set, since ;
- •
whenever .
We define the sequence to be a decreasing rearrangement of the sequence . Then, . We further define
[TABLE]
as any lower set contained in with cardinality and which has the property that it contains all for which . Such a lower set can be obtained from by successively removing extreme points and thereby retaining the lower set property. Note that is not unique because of possible ties in the value of , .
For any admissible and any , we define
[TABLE]
While need not be unique, we always have a unique value for for all choices of .
Theorem 3.1**.**
For , we have:
- (i)
the set , defined in (3.5), minimizes over all lower sets , and
[TABLE] 2. (ii)
if and is the conjugate index to , i.e., , then
[TABLE]
Proof: Let us first make some remarks about the structure of that hold for any and any admissible . Given any , we know that and the Taylor coefficients satisfy
[TABLE]
where is the unit ball of the space . Conversely, let be a non-negative sequence and let with . If we define , , then the function will be in provided that is summable.
We first prove (ii) for a fixed and . We only discuss the case . The case is proved in a similar way. The two inequalities in (ii) are obvious from the definitions (1.4), (3.2), and (3.3), and so we only need to show that . Let with . It follows from (3.1) with and Hölder’s inequality that
[TABLE]
To prove that , we construct a function for which . First assume that is finite, so that there is a nonnegative sequence in the unit ball of for which . Then, as in our lead remarks, we let with and define . Then, we have
[TABLE]
is in . Note here we use the fact that is in . Since , we have finished the proof of (ii) in the case that is finite. If , the same argument as above shows that there is a for which is as large as we wish. Therefore (ii) holds in this case as well.
Now, consider the proof of (i). The inequalities stated in (i) are all obvious and so we need only show that minimizes over all lower set . To prove this, we first consider the case . If , then by our lead remarks
[TABLE]
Here, we use the fact that is summable because is in . The minimum of (3.9) over all is achieved by taking .
Finally, we have to prove (i) in the case . Let us first recall that there is an enumeration , , of all of the , such that , , and such that . We suppose is any lower set with . From the definition of , we have
[TABLE]
Using the same construction as in the proof of (ii), we can find with
[TABLE]
which thereby proves (i).
Corollary 3.2**.**
For , let be the conjugate index to . Then whenever is finite for some , we have
[TABLE]
In particular, we have
[TABLE]
Proof: The case where (resp. ) follows from (ii) of Theorem 3.1, since . So we can assume and . Since the sequence is non-increasing, we have
[TABLE]
Because of (ii) in Theorem 3.1, taking a -th root proves (3.11). To show (3.12), we use the fact that , along with the standard estimate
[TABLE]
Inserting these into (3.13), we obtain
[TABLE]
and the proof is complete.
Remark 3.3**.**
We can define the above space also in the case with finite. The results of this section hold equally well in this case.
4 The sequence
First, let us observe that in order for the from (3.6) to be finite, and therefore Theorem 3.1 to be meaningful, we need that the sequence , which is the same as asking that . The following lemma shows that this is the case if and only if .
Lemma 4.1**.**
Let . Then the sequence if and only if the sequence . Moreover, the two norms are related in the following way:
- (i)
when , we have 2. (ii)
when , we have
[TABLE]
Proof: The case is trivial. When , we have
[TABLE]
Taking logarithms, we have from the mean value theorem that
[TABLE]
where the , . Since , it follows that
[TABLE]
This proves item (ii) in the lemma, and likewise shows that the product in (4.1) converges if and only if .
Remark 4.2**.**
The upper bound established in the above lemma can be found in [15].
The error estimates derived in §3 for approximation by polynomials on lower sets depend crucially on the sequence , and are achieved by choosing the lower set . This leads to two central issues:
- (i)
establishing sharp a priori estimates for given the sequence ; 2. (ii)
efficient algorithms for generating the sets .
We discuss item (ii) in §6, and here we discuss first item (i). We begin this section with methods for bounding which hold for any admissible sequence .
Remark 4.3**.**
In order to compute or its asymptotic decay as , we study , . This function of takes integer values and increases as goes to zero. Hence, it is a piecewise constant function and is the decreasing sequence of the breakpoints of , where each value is repeated times and with .
Since , there is a such that , . It follows that any has support in . Moreover, if we write , then taking logarithms we see that if and only if satisfies
[TABLE]
Hence, if and only if is supported on , and is a lattice point in the simplex
[TABLE]
where
[TABLE]
Estimating the number of lattice points in such a simplex is a classical problem in number theory and combinatorics. Let us first note that the volume (measure) of is
[TABLE]
We recall the following general upper bound (see [4, 21]) for the number of such that :
[TABLE]
Note that the right side of (4.4) is inflated by a factor of ( when compared with the volume of . We use this result to prove the following lemma.
Lemma 4.4**.**
Let be any admissible sequence. Given , where , let be the last integer for which . Then, for the set of all such that , we have
[TABLE]
Proof: From (4.4) with , , we have
[TABLE]
which is equivalent to (4.5).
Let us make some remarks that will clarify when the bound in the lemma is effective and when it is deficient. First of all, if is finite and the sequence is fixed, then the set , , is the set of lattice points in the fixed simplex . If we let tend to infinity (which corresponds to ), we see that provided is large enough, and behaves like times the measures of . This is in agreement with the bound (4.5) because the inflation factor tends to one as . So this bound is good for finite , provided the error we seek is small. However, there is a transition before this asymptotic kicks in where the upper bound provided by the lemma is not effective.
To see this, we consider one example which is central to this paper. We consider the sequence , with finite. We take as our target error , i.e. . Then and the upper bound for provided by Lemma 4.4 is
[TABLE]
where we used the fact that
[TABLE]
Since is a concave function, we have
[TABLE]
Therefore, we have
[TABLE]
where we used Stirling’s formula. Thus, if we want an error in this particular example, the best bound that Lemma 4.4 can provide for the size of is exponential in . In contrast, in Lemma 5.3 from the following section, we give a much more favorable bound.
5 Analysis of when has polynomial growth
As we have just observed, the bounds of the previous section for are generally far from sharp. We can establish sharper bounds, and even compute exactly, if we have more information on the sequence . In this section, we give such an analysis when the sequence has polynomial growth.
Recall that for , we defined the sequence . In some parts of our analysis, it is useful to slightly modify this sequence. Accordingly, we introduce the following modified sequence , , defined as follows. If and , , then
[TABLE]
Note that the sequence increases like . Moreover, and for .
Given any , we want to determine the cardinality of the set or its counterpart , i.e., how many satisfy the inequality . According to Remark 4.3, the decay rate of can then be derived from this knowledge. Let us note that for these two sequences, we have
[TABLE]
and so it is enough to analyze the case . We therefore take in the estimates on cardinality that follow.
As decreases, the cardinality of increases. While it is interesting to understand how this cardinality grows asymptotically when tends to zero, in numerical scenarios it is important to keep this cardinality small.
5.1 Exact formulas for
Exact formulas for the cardinality of can be given in terms of the multiplicative partitions of natural numbers (see [10] and Remark 3.18 in [13]). In theory, these formulas allow the precise computation of provided that this cardinality is not too large. However, this computation is very intense and in fact, to our knowledge, has not been done. It turns out that these computations are simpler if one uses the sequence instead of . This stems from the fact that is always an integer power of two. For this reason, we focus on this sequence for the remainder of this section. We begin by showing how one can do an exact count of the multiindices in the simplex associated to .
For any , we define
[TABLE]
The set contains only the zero sequence and hence . We want to determine the cardinality of the sets , . This is the same as finding how many satisfy (3.4), since if we denote by
[TABLE]
we have that
[TABLE]
Let us first note that if has a nonzero component for some , then and so is not in . Hence, any is supported on . We decompose the set , and given any , we define
[TABLE]
which we think of as the energy of on . Therefore, for any , we have
[TABLE]
Note that there are only certain sequences which satisfy (5.5). We denote the collection of all such sequences by ,
[TABLE]
The sequences in are related to the additive partitions of , which are decompositions of into , where the and where the order of the appearance of an does not matter.
There is a one to one correspondence between the elements in and additive partitions of . Indeed, any additive partition of corresponds to a sequence
, where is the number of appearances of in . Conversely, any for which corresponds to the unique additive partition of , where appears times, appears times and so on. Thus is the additive partition number of .
The following theorem gives an exact count for the cardinality of , and hence the cardinality of the set .
Theorem 5.1**.**
For , the cardinality of is given by
[TABLE]
Moreover, for every ,
[TABLE]
where
Proof: For any fixed , we define
[TABLE]
Now, for each , we count all possible satisfying
[TABLE]
Since , the latter cardinality can be viewed as the number of ways one can place indistinguishable balls into distinguishable boxes so that some boxes can remain empty. The answer to this combinatorial problem is known to be (see [19]). Therefore, the cardinality of is the product of these binomial coefficients:
[TABLE]
Equation (5.6) now follows from the definitions of and and (5.8). The last statement in the theorem follows from (5.4) and (5.6).
Theorem 5.1 gives an exact formula for for any , since
[TABLE]
Note that the sequence is then given by and, for ,
[TABLE]
Moreover, since , , we similarly derive that
[TABLE]
and is then given by and, for ,
[TABLE]
In Table 1, we present the computed cardinality for values of in the range and .
Remark 5.2**.**
If we combine this theorem with Theorem 3.1 and (5.11), we determine the optimal error and best lower set for approximating any of the spaces , provided the error is measured in the surrogate norm rather than the true norm. Of course, it gives an upper bound on the performance in the norm, that is for we have
[TABLE]
and
[TABLE]
where the sequence is given by (5.11). The efficiency of the algorithm is determined by the cardinality of , given in Table 1. In particular, let us suppose the user desires to approximate a function in with accuracy . Because for according to (5.11), when , we need and thus a set of cardinality 4101 achieves this accuracy. Similarly, a sufficient cardinality for is 50 when ; 20 for ; 8 for .
In view of Remark 5.2, the behavior of the sequence dictates the error of approximation for . The values of are provided in Figure 1 for for the cases and .
5.2 The asymptotic behavior of
Theorem 5.1 gives an exact expression for which then can be used to determine for any and . We can also use this theorem to give bounds on the asymptotic decay of . We begin with a lemma.
Lemma 5.3**.**
For , , when , , and for every , we have the following two estimates:
- (i)
, 2. (ii)
, where and .
If we superimpose these inequalities we obtain
[TABLE]
Proof: Note that for the sequence given by (5.1), we have
[TABLE]
Therefore does not depend on , and in what follows we may take .
For the particular cases we readily check that
[TABLE]
To show (i) and (ii), we first prove
[TABLE]
For , we note that the binomial coefficient from (5.8) can be estimated
[TABLE]
since
[TABLE]
Therefore, for any sequence in , we have
[TABLE]
yielding the estimate
[TABLE]
As noted before, is the same as the number of additive partitions of the integer . The number has been exactly computed for small values of and there are bounds for for any . The following upper bound for can be found in [18]:
[TABLE]
Hence,
[TABLE]
and using Theorem 5.1, we obtain (5.13).
We can now use (5.13) to prove each of the inequalities (i) and (ii). To prove (ii), it is enough to show that
[TABLE]
The above relation is valid for and we now proceed by induction assuming that it has been proven for and verify the case . Using the induction hypothesis, we have
[TABLE]
where to derive the last inequality we used , , and the specific value of . This completes the proof of (ii).
We prove estimate (i) for in a similar way (the case clearly holds) showing by induction that
[TABLE]
The details are omitted.
To prove the superimposed estimate we note that
[TABLE]
On the interval , the function on the left is increasing and the function on the right is decreasing since , and the range of for which the inequality holds is . The proof is completed.
In Figure 2, we present the graphs of the exactly computed values of compared to the estimate from Lemma 5.3.
5.2.1 Bounds for the error .
In this section, we use Lemma 5.3 to give bounds on the decay of and . We start with the case .
Corollary 5.4**.**
If , then we have the following bounds
[TABLE]
Proof: We first consider the case when , . Let be the largest non-negative natural number satisfying
[TABLE]
It follows from Lemma 5.3 that . Relation (5.11) and the monotonicity of the sequence give which, according to Remark 5.2, leads to .
Let us define by the equation and give an upper bound for . Since the integer does not satisfy (5.18), we have
[TABLE]
and so
[TABLE]
Rearranging terms, we have
[TABLE]
Noticing that the left-hand side vanishes for
[TABLE]
we obtain the upper bound from which we get
[TABLE]
Therefore, we have the estimate
[TABLE]
which leads to
[TABLE]
Now, given any , we choose the largest such that . This implies that and , and so we derive
[TABLE]
as desired.
The next corollary treats the case of general .
Corollary 5.5**.**
Let and let be given by . For any , we have
[TABLE]
where is a constant depending only on and .
Proof: The first inequality is (ii) of Theorem 3.1. Next, let us denote by
[TABLE]
and observe that since is an increasing function of , we have
[TABLE]
Note that to complete the proof we need only show (5.19) in the case because then for ,
[TABLE]
where we have used the fact that the sequence is decreasing. Thus, we concentrate on the case and define
[TABLE]
Similarly to the function , we have that
[TABLE]
It follows from Corollary 5.4 that , , and using the above estimate we have
[TABLE]
Here, in the last inequality we have used the bound
[TABLE]
valid for every , which follows from the fact that
[TABLE]
The bound (5.21) gives
[TABLE]
which is (5.19) for , and therefore completes the proof of the Corollary.
According to (5.12), we can improve estimate (5.17) when is large. For this, we state the following two corollaries whose proofs will be given in the appendix.
Corollary 5.6**.**
Let be the largest natural number such that
[TABLE]
where is the constant of Lemma 5.3. Then
[TABLE]
Note that the dependence of as a function of in the above corollary is implicit. One may want to get an explicit version of that statement which is the next corollary.
Corollary 5.7**.**
If ,
[TABLE]
and therefore
[TABLE]
where with as in Lemma 5.3.
6 Finding the set
In this section, we describe a possible strategy to build the set for any given sequence and a given target accuracy . A second procedure (not given here) can then be used to find when we prescribe the cardinality of the set rather than the accuracy. Before we begin describing our algorithm, let us note that other procedures have been given for constructing (see e.g. [5, 22]).
As above, we consider to be a non-decreasing sequence such that and . Let us denote by the support of a multiindex , that is
[TABLE]
Recalling the definition of given in (3.4), we first notice that:
- •
whenever ;
- •
for every fixed , there is an index such that if , then ;
- •
if , then .
The lower set can be built using the iterative strategy described in the following Algorithm.
When implementing this algorithm in practice, we form a tree where each has possible children to be checked for admissibility. When a constructed is found to be inadmissible, then it is not included in . This stops the search down the entire subtree rooted at . If is found to be admissible then it is added to . In this way, each forms a level in the tree rooted with the zero sequence. When all elements of are exhausted, then the computation moves to processing elements in . If the current set being processed is empty, then the procedure is ended and . Finally, we mention that the set corresponds to the so-called reduced margin (see e.g. [11]) of the set .
Remark 6.1**.**
One can deduce that the number of computations needed to construct the set is of order , where , provided one imposes additional growth conditions on the sequence (for an analysis for another sorting algorithm see [5]). This would cover the sequences and for example.
7 Concluding Remarks
In this work, we discussed the approximation of Banach space valued functions with an infinite number of variables by polynomials on lower sets. We defined a family of model classes based on anisotropic analyticity, and derived bounds for the decay rate for the approximation of these model classes using multivariate polynomials. We considered only the case when the approximation error is measured in the norm, though it would be interesting to develop corresponding results when measuring the approximation error in norms. Already, several results in the case have been given in [15].
Another setting that arises in parametric PDEs is analytic functions which have Legendre expansions (instead of Taylor expansions) with bounds on the size of the Legendre coefficients (see [12]). It would be interesting to formally introduce and study the spaces (analogous to the ) associated to such expansions. The functions in these spaces would now be analytic on polyellipses.
Our main vehicle for deriving error estimates for these classes was to use a surrogate norm in place of the norm. We showed in Theorem 3.1 that for this surrogate norm, our estimates are optimal. It would be very interesting to understand what optimal results would look like in the original norm, i.e., to prove lower bounds for the approximation rate in the norm rather than the surrogate norm.
We concentrated on the sequences and , , since they comply with typical assumptions in applied settings. It is possible to extend these results to more general sequences which eventually behave asymptotically like or . However, the behavior of the sequence in the preasymptotic regime strongly effects the final decay rate bounds for . For instance, the value , representing the smoothness of in the direction , might remain close to for arbitrarily many before eventually growing to . It would be interesting to give bounds for other sequences with polynomial or even exponential growth.
Our formulation of the model classes and our approximation results have been strongly influenced by the works [1, 15, 20, 22]. The paper [16] has a significant intersection with our paper where results analogous to Corollary 5.5 in the case are proven.
We next ellaborate on the distinctions between our paper and the results given in [20]. In [20], the authors derive bounds for the approximation of parametric PDEs using Taylor and Legendre series. They work under the assumption that , and use analyticity of the parameter-to-PDE-solution map to derive certain upper bounds on the norms of the coefficients in the Legendre and Taylor series expansions of the solution . In the case of Taylor series, their analysis includes the case when , which corresponds to our model classes . We restrict our further comments to this case. Although their results are only stated for solutions to parametric PDEs, their proofs give the following estimates for functions in .
Theorem 7.1**.**
Let be a nondecreasing sequence with . Then for any , there exists an such that for all ,
[TABLE]
holds with .
If we specialize to the sequence , , then their result takes the form
[TABLE]
where has an absolute bound and actually grows with and . Note that the bound is subexponential in , and hence is better than the algebraic rate given in our estimates. The reason for this is the assumption that is finite. However, we must emphasize that the number grows exponentially in , and so this result can only be applied when is very large. We have concentrated on obtaining results that hold for all and all with no dependence on .
The reason for this restriction on in [20] is that their proof of this theorem utilizes bounds on the number of lattice points in the simplex . Their bound requires that this number behaves like . As discussed in the remarks following the proof of Lemma 4.4, this asymptotic count on the lattice points is effective only for small and in turn prohibitively large.
By contrast, our results given above apply for and any . When is finite we can always extend the sequence to an infinite sequence in an arbitrary way. In this way our results apply without any restrictions on the size of relative to .
8 Appendix: Proofs of Corollaries 5.6 and 5.7
Proof of Corollary 5.6: Let be the largest natural number satisfying (5.23). One can check that for , we have , and thus it follows from (ii) or Lemma 5.3 that
[TABLE]
which gives , and thus .
Proof of Corollary 5.7: To show (5.25), we proceed as follows. We consider first the case , . Let be the largest non-negative natural number satisfying
[TABLE]
and let be defined by the equation . Since , the largest that satisfies the above estimate is greater or equal to . Moreover, we can easily show that . Therefore, we use the fact that and that for , which gives
[TABLE]
Thus if , we have
[TABLE]
It follows (since ) that
[TABLE]
Therefore, (5.11) and the monotonicity of the sequence give
[TABLE]
which, according to Remark 5.2 leads to
[TABLE]
Now, if is such that , it follows that
[TABLE]
which is (5.25).
Acknowledgements The authors would like to acknowledge and thank Matthew Hielsberg for the help in carrying the numerical experiments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Bachmayr, A. Cohen and G. Migliorati. Sparse polynomial approximation of parametric elliptic PD Es. Part I: affine coefficients. ESAIM: Mathematical Modelling and Numerical Analysis, 51(1):321–339, 2017.
- 2[2] J. Beck, F. Nobile, L. Tamellini and R. Tempone. On the optimal polynomial approximation of stochastic PD Es by Galerkin and collocation methods. Mathematical Models and Methods in Applied Sciences, 22(9):1250023, 2012.
- 3[3] J. Beck, F. Nobile, L. Tamellini and R. Tempone. Convergence of quasi-optimal Stochastic Galerkin methods for a class of PDES with random coefficients. Computers and Mathematics with Applications, 67(4):732–751, 2014.
- 4[4] A. Beged-Dov. Lower and upper bounds for the number of lattice points in a simplex . SIAM Journal on Applied Mathematics, 22(1):106–108, 1972.
- 5[5] M. Bieri, R. Andreev, and C. Schwab. Sparse tensor discretizations of elliptic SPD Es . SIAM Journal of Scientific Computation 31(6), 4281–4304, 2009.
- 6[6] P. Binev, W. Dahmen and R. De Vore. Adaptive finite element methods with convergence rates. Numerische Mathematik, 97(2):219–268, 2004.
- 7[7] P. Binev, W. Dahmen, R. De Vore and P. Petrushev. Approximation classes for adaptive methods. Serdica Mathematical Journal, 28(4):391–416, 2002.
- 8[8] A. Bonito, R. De Vore and R. Nochetto. Adaptive finite element methods for elliptic problems with discontinuous coefficients. SIAM Journal on Numerical Analysis, 51(6):3106–3134, 2013.
