Polynomial Approximation of Anisotropic Analytic Functions of Several   Variables

Andrea Bonito; Ronald DeVore; Diane Guignard; Peter Jantsch; Guergana; Petrova

arXiv:1904.12105·math.NA·January 17, 2020

Polynomial Approximation of Anisotropic Analytic Functions of Several Variables

Andrea Bonito, Ronald DeVore, Diane Guignard, Peter Jantsch, Guergana, Petrova

PDF

TL;DR

This paper develops methods for approximating multivariate analytic functions, especially in high or infinite dimensions, using algebraic polynomials with optimal lower set structures, relevant for solving parametric PDEs.

Contribution

It introduces a framework for polynomial approximation of anisotropic functions in high dimensions, identifying optimal lower sets for certifiable error bounds, applicable even for small polynomial dimensions.

Findings

01

Optimal lower sets provide near-best approximation errors.

02

Results hold uniformly for all polynomial dimensions n ≥ 1.

03

Approximations are effective in high or infinite variable settings.

Abstract

Motivated by numerical methods for solving parametric partial differential equations, this paper studies the approximation of multivariate analytic functions by algebraic polynomials. We introduce various anisotropic model classes based on Taylor expansions, and study their approximation by finite dimensional polynomial spaces $P_{Λ}$ described by lower sets $Λ$ . Given a budget $n$ for the dimension of $P_{Λ}$ , we prove that certain lower sets $Λ_{n}$ , with cardinality $n$ , provide a certifiable approximation error that is in a certain sense optimal, and that these lower sets have a simple definition in terms of simplices. Our main goal is to obtain approximation results when the number of variables $d$ is large and even infinite, and so we concentrate almost exclusively on the case $d = \infty$ . We also emphasize obtaining results which hold for the full…

Tables1

Table 1. Table 1: Computed cardinality of Λ ( 2 − m s , ρ ∗ ( s ) ) Λ superscript 2 𝑚 𝑠 superscript 𝜌 𝑠 \Lambda(2^{-ms},\rho^{*}(s)) using Theorem 5.1 and the relation ( 5.10 ). When ρ = ρ ∗ ( s ) 𝜌 superscript 𝜌 𝑠 \rho=\rho^{*}(s) , the table gives the cardinality of the lower set needed to achieve accuracy 2 − m s superscript 2 𝑚 𝑠 2^{-ms} . Refer to Remark 5.2 to deduce estimates on the errors E n ( 𝒰 ρ ∗ ( s ) , 1 ) subscript 𝐸 𝑛 subscript 𝒰 superscript 𝜌 𝑠 1 E_{n}({\cal U}_{\rho^{*}(s),1}) .

$m$	$# Λ (2^{- m s}, ρ^{*} (s))$	$2^{- m s}$
$m$	$# Λ (2^{- m s}, ρ^{*} (s))$	$s = 1$	$s = 2$	$s = 3$	$s = 4$
$0$	$1$	$1$	$1$	$1$	$1$
$1$	$3$	$5.0000 \times 10^{- 1}$	$2.5000 \times 10^{- 1}$	$1.2500 \times 10^{- 1}$	$6.2500 \times 10^{- 2}$
$2$	$8$	$2.5000 \times 10^{- 1}$	$6.2500 \times 10^{- 2}$	$1.5625 \times 10^{- 2}$	$3.9062 \times 10^{- 3}$
$3$	$20$	$1.2500 \times 10^{- 1}$	$1.5625 \times 10^{- 2}$	$1.9531 \times 10^{- 3}$	$2.4414 \times 10^{- 4}$
$4$	$50$	$6.2500 \times 10^{- 2}$	$3.9062 \times 10^{- 3}$	$2.4414 \times 10^{- 4}$	$1.5259 \times 10^{- 5}$
$5$	$122$	$3.1250 \times 10^{- 2}$	$9.7656 \times 10^{- 4}$	$3.0518 \times 10^{- 5}$	$9.5367 \times 10^{- 7}$
$6$	$298$	$1.5625 \times 10^{- 2}$	$2.4414 \times 10^{- 4}$	$3.8147 \times 10^{- 6}$	$5.9605 \times 10^{- 8}$
$7$	$718$	$7.8125 \times 10^{- 3}$	$6.1035 \times 10^{- 5}$	$4.7684 \times 10^{- 7}$	$3.7253 \times 10^{- 9}$
$8$	$1723$	$3.9062 \times 10^{- 3}$	$1.5259 \times 10^{- 5}$	$5.9605 \times 10^{- 8}$	$2.3283 \times 10^{- 10}$
$9$	$4101$	$1.9531 \times 10^{- 3}$	$3.8147 \times 10^{- 6}$	$7.4506 \times 10^{- 9}$	$1.4552 \times 10^{- 11}$
$10$	$9712$	$9.7656 \times 10^{- 4}$	$9.5367 \times 10^{- 7}$	$9.3132 \times 10^{- 10}$	$9.0949 \times 10^{- 13}$

Equations250

P (y) = ν \in Λ \sum c_{ν} y^{ν},

P (y) = ν \in Λ \sum c_{ν} y^{ν},

E_{Λ} (u) := P \in P_{Λ} in f ∥ u - P ∥_{L_{\infty} (Y, X)}, and ∥ v ∥_{L_{\infty} (Y, X)} := y \in Y sup ∥ v (y) ∥_{X},

E_{Λ} (u) := P \in P_{Λ} in f ∥ u - P ∥_{L_{\infty} (Y, X)}, and ∥ v ∥_{L_{\infty} (Y, X)} := y \in Y sup ∥ v (y) ∥_{X},

if ν \in Λ, then μ \in Λ whenever μ_{j} \leq ν_{j}, j = 1, 2, \dots .

if ν \in Λ, then μ \in Λ whenever μ_{j} \leq ν_{j}, j = 1, 2, \dots .

L_{0} := \emptyset, L_{n} := {Λ \subset F : #Λ \leq n, Λ is a lower set}, n = 1, 2, \dots .

L_{0} := \emptyset, L_{n} := {Λ \subset F : #Λ \leq n, Λ is a lower set}, n = 1, 2, \dots .

E_{Λ} (K) := u \in K sup E_{Λ} (u),

E_{Λ} (K) := u \in K sup E_{Λ} (u),

E_{0} (K) := u \in K sup ∥ u ∥_{L_{\infty} (Y, X)}, E_{n} (K) := Λ \in L_{n} in f E_{Λ} (K), n = 1, 2, \dots .

E_{0} (K) := u \in K sup ∥ u ∥_{L_{\infty} (Y, X)}, E_{n} (K) := Λ \in L_{n} in f E_{Λ} (K), n = 1, 2, \dots .

d_{n} (K)_{L_{\infty} (Y, X)} \leq E_{n} (K),

d_{n} (K)_{L_{\infty} (Y, X)} \leq E_{n} (K),

H_{ρ} := H_{ρ} (X)

H_{ρ} := H_{ρ} (X)

∥ u ∥_{H_{ρ}} := z \in \overline{D}_{ρ} sup ∥ u (z) ∥_{X} .

∥ u ∥_{H_{ρ}} := z \in \overline{D}_{ρ} sup ∥ u (z) ∥_{X} .

t_{ν} := \frac{\partial ^{ν} u ( 0 )}{ν !}, ν \in F,

t_{ν} := \frac{\partial ^{ν} u ( 0 )}{ν !}, ν \in F,

u (z) = ν \in F \sum t_{ν} z^{ν} .

u (z) = ν \in F \sum t_{ν} z^{ν} .

u (z) = N \to \infty lim u (z_{1}, \dots, z_{N}, 0, \dots) .

u (z) = N \to \infty lim u (z_{1}, \dots, z_{N}, 0, \dots) .

∥ t_{ν} ∥_{X} \leq ∥ u ∥_{H_{ρ}} ρ^{- ν}, ν \in F .

∥ t_{ν} ∥_{X} \leq ∥ u ∥_{H_{ρ}} ρ^{- ν}, ν \in F .

u (z) = ν \in F \sum t_{ν} z^{ν}, ∥ z ∥_{ℓ_{\infty} (N)} \leq 1,

u (z) = ν \in F \sum t_{ν} z^{ν}, ∥ z ∥_{ℓ_{\infty} (N)} \leq 1,

∥ t_{ν} ∥_{X} \leq ∥ u ∥_{H_{ρ}} δ^{- ν} .

∥ t_{ν} ∥_{X} \leq ∥ u ∥_{H_{ρ}} δ^{- ν} .

u (y) = ν \in F \sum t_{ν} y^{ν}, y \in Y,

u (y) = ν \in F \sum t_{ν} y^{ν}, y \in Y,

∥ u ∥_{B_{ρ, \infty}} := ν \in F sup ρ^{ν} ∥ t_{ν} ∥_{X} < \infty.

∥ u ∥_{B_{ρ, \infty}} := ν \in F sup ρ^{ν} ∥ t_{ν} ∥_{X} < \infty.

ν \in F \sum [ρ^{ν} ∥ t_{ν} ∥_{X}]^{2} < \infty.

ν \in F \sum [ρ^{ν} ∥ t_{ν} ∥_{X}]^{2} < \infty.

u (y) = ν \in F \sum t_{ν} y^{ν}, y \in Y,

u (y) = ν \in F \sum t_{ν} y^{ν}, y \in Y,

∥ u ∥_{B_{ρ, p}} := (ν \in F \sum [ρ^{ν} ∥ t_{ν} ∥_{X}]^{p})^{1/ p} = ∥ (ρ^{ν} ∥ t_{ν} ∥_{X})_{ν \in F} ∥_{ℓ_{p} (F)} < \infty.

∥ u ∥_{B_{ρ, p}} := (ν \in F \sum [ρ^{ν} ∥ t_{ν} ∥_{X}]^{p})^{1/ p} = ∥ (ρ^{ν} ∥ t_{ν} ∥_{X})_{ν \in F} ∥_{ℓ_{p} (F)} < \infty.

∥ u ∥_{L_{\infty} (Y, X)} \leq ν \in F \sum ∥ t_{ν} ∥_{X} = ν \in F \sum ∥ t_{ν} (u) ∥_{X} =: ∥ u ∥^{*} .

∥ u ∥_{L_{\infty} (Y, X)} \leq ν \in F \sum ∥ t_{ν} ∥_{X} = ν \in F \sum ∥ t_{ν} (u) ∥_{X} =: ∥ u ∥^{*} .

E_{Λ}^{*} (u) := P \in P_{Λ} in f ∥ u - P ∥^{*} = ν \in / Λ \sum ∥ t_{ν} ∥_{X},

E_{Λ}^{*} (u) := P \in P_{Λ} in f ∥ u - P ∥^{*} = ν \in / Λ \sum ∥ t_{ν} ∥_{X},

E_{Λ}^{*} (K) := u \in K sup E_{Λ}^{*} (u),

E_{Λ}^{*} (K) := u \in K sup E_{Λ}^{*} (u),

T_{Λ} (y) := ν \in Λ \sum t_{ν} y^{ν}

T_{Λ} (y) := ν \in Λ \sum t_{ν} y^{ν}

E_{Λ} (u) \leq ∥ u - T_{Λ} ∥_{L_{\infty} (Y, X)} \leq ∥ u - T_{Λ} ∥^{*} = E_{Λ}^{*} (u) .

E_{Λ} (u) \leq ∥ u - T_{Λ} ∥_{L_{\infty} (Y, X)} \leq ∥ u - T_{Λ} ∥^{*} = E_{Λ}^{*} (u) .

Λ (ε, ρ) := {ν \in F : ρ^{- ν} \geq ε} = {ν \in F : ρ^{ν} \leq ε^{- 1}} .

Λ (ε, ρ) := {ν \in F : ρ^{- ν} \geq ε} = {ν \in F : ρ^{ν} \leq ε^{- 1}} .

Λ_{n} := Λ_{n, ρ}

Λ_{n} := Λ_{n, ρ}

δ_{n, q} := δ_{n, q} (ρ) := {(\sum_{ν \in / Λ_{n, ρ}} ρ^{- ν q})^{1/ q} = (\sum_{j > n} δ_{j}^{q})^{1/ q}, δ_{n + 1}, if 0 < q < \infty, q = \infty.

δ_{n, q} := δ_{n, q} (ρ) := {(\sum_{ν \in / Λ_{n, ρ}} ρ^{- ν q})^{1/ q} = (\sum_{j > n} δ_{j}^{q})^{1/ q}, δ_{n + 1}, if 0 < q < \infty, q = \infty.

E_{n} (U_{ρ, p}) \leq E_{Λ_{n, ρ}} (U_{ρ, p}) \leq E_{Λ_{n, ρ}}^{*} (U_{ρ, p});

E_{n} (U_{ρ, p}) \leq E_{Λ_{n, ρ}} (U_{ρ, p}) \leq E_{Λ_{n, ρ}}^{*} (U_{ρ, p});

E_{n} (U_{ρ, p}) \leq E_{Λ_{n, ρ}} (U_{ρ, p}) \leq E_{Λ_{n, ρ}}^{*} (U_{ρ, p}) = δ_{n, q} .

E_{n} (U_{ρ, p}) \leq E_{Λ_{n, ρ}} (U_{ρ, p}) \leq E_{Λ_{n, ρ}}^{*} (U_{ρ, p}) = δ_{n, q} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Polynomial Approximation of Anisotropic Analytic Functions of Several Variables

Andrea Bonito, Ronald DeVore, Diane Guignard, Peter Jantsch, and Guergana Petrova

This research was supported by the NSF grants DMS-1817691 (AB), DMS 15-21067 (RD-GP), DMS 18-17603 (RD-GP), ONR grants N00014-17-1-2908 (RD), N00014-16-1-2706 (RD); DG was supported by the Swiss National Science Foundation grant P2ELP2-175056 and IAMCS at TAMU, and PJ was supported by an NSF Fellowship DMS-1704121. A portion of this research was completed while RD (Simon Fellow), DG, and PJ were supported as visitors of the Isaac Newton Institute at Cambridge University.

Abstract

Motivated by numerical methods for solving parametric partial differential equations, this paper studies the approximation of multivariate analytic functions by algebraic polynomials. We introduce various anisotropic model classes based on Taylor expansions, and study their approximation by finite dimensional polynomial spaces ${\cal P}_{\Lambda}$ described by lower sets $\Lambda$ . Given a budget $n$ for the dimension of ${\cal P}_{\Lambda}$ , we prove that certain lower sets $\Lambda_{n}$ , with cardinality $n$ , provide a certifiable approximation error that is in a certain sense optimal, and that these lower sets have a simple definition in terms of simplices. Our main goal is to obtain approximation results when the number of variables $d$ is large and even infinite, and so we concentrate almost exclusively on the case $d=\infty$ . We also emphasize obtaining results which hold for the full range $n\geq 1$ , rather than asymptotic results that only hold for $n$ sufficiently large. In applications, one typically wants $n$ small to comply with computational budgets.

**Mathematics Subject Classification ** 41A10, 41A58, 41A63, 65N15

1 Introduction

Polynomial and piecewise polynomial approximation are a staple in numerical analysis. For example, approximation by piecewise polynomials on simplicial partitions is the underpinning of Finite Element Methods. In that setting, one approximates the solution $u$ to a partial differential equation (PDE) on a domain $D\subset\mathbb{R}^{d}$ , where $d$ is typically small ( $d=1,2,3$ ). The solution $u$ to the PDE typically has limited regularity, and the rate of approximation is of order $O(n^{-r})$ , where $n$ is the number of degrees of freedom in the approximation and $r$ is small. This type of approximation is well-understood by means of theorems which relate the approximation order $r$ to the smoothness order $s$ of $u$ in certain Sobolev and Besov spaces (see [6, 7, 8, 9]). The approximation rate takes the form $r=s/d$ and therefore deteriorates as $d$ increases. This is commonly referred to as the curse of dimensionality.

The present paper is interested in a different setting that arises in other application areas, in particular when using numerical methods for solving stochastic or parametric PDEs. In that setting, one wishes to approximate the solution $u$ to the parametric PDE which depends on input parameters $y$ and takes values in a Banach space $X$ . The parameters $y$ come from a set $Y\subset\mathbb{R}^{d}$ where $d$ is large or even infinite. Hence, it is often crucial to perform a model reduction (dimension reduction) for the solution map $y\rightarrow u(y)\in X$ of the parametric PDE. One possibility to obtain such dimension reduction is to approximate $u$ by Banach space valued polynomials in $y$ . The main property of $u$ that makes such an approximation possible is that under standard assumptions on the parametrized coefficients of the PDE, it is known that $u$ admits an analytic extension onto certain complex polydiscs that contain $Y$ (see [13]). In other words, $u$ has a certain anisotropic analyticity. This motivates the study of approximation of anisotropic analytic functions by polynomials, which is the subject of the present paper. Although we are motivated by parametric PDE applications, we formulate and study this subject as purely a problem in multivariate approximation. In this way, we hope to draw the attention of the approximation community to this area of research.

For the most part, we are interested in the case of an infinite number of parameters, i.e., $d=\infty$ . This allows us to prove results which are immune to the dimension $d$ and is a common setting in parametric PDEs. Specifically, we take parameters in the set $Y:=[-1,1]^{\mathbb{N}}$ , where $\mathbb{N}$ is the set of natural numbers. Sometimes we remark on the case $Y_{d}:=[-1,1]^{d}$ with $d$ finite, in particular, when we wish to compare our results with other results in the literature established only for finite $d$ .

Let ${\cal F}$ denote the set of all infinite sequences $\nu=(\nu_{1},\nu_{2},\dots)$ with entries $\nu_{j}\in\mathbb{N}_{0}:=\mathbb{N}\cup\{0\}$ , where only a finite number of the entries in $\nu$ are allowed to be nonzero. If $\Lambda\subset{\cal F}$ is a finite subset of ${\cal F}$ , we denote by ${\cal P}_{\Lambda}$ the space of $X$ -valued polynomials spanned by the monomials $y^{\nu}$ , where the $\nu$ come from the set $\Lambda$ . Thus, any element of ${\cal P}_{\Lambda}$ has the form

[TABLE]

where the coefficients $c_{\nu}$ come from $X$ . Here and throughout the paper, we use standard multivariate notation. In particular, $y^{\nu}:=y_{1}^{\nu_{1}}y_{2}^{\nu_{2}}\cdots$ . Since $\nu$ has only a finite number of nonzero entries, any such product is finite.

For any $u\in L_{\infty}(Y,X)$ , and any finite set $\Lambda\subset{\cal F}$ , we define the error of approximation of $u$ by polynomials in ${\cal P}_{\Lambda}$ to be

[TABLE]

where $L_{\infty}(Y,X)$ consists of all functions $v$ on $Y$ that are bounded mappings into $X$ .

If no conditions are imposed, then the potential sets $\Lambda$ may be quite complex and beyond the scope of numerical methods. For this reason, one usually imposes additional structure on these sets such as fixed total degree or fixed coordinate degree in the case $d$ is finite. We are especially interested in the case where the sets $\Lambda$ are lower sets, that is, sets $\Lambda$ with the property

[TABLE]

We consider the collection ${\cal L}_{n}$ of lower sets with cardinality $\leq n$ ,

[TABLE]

and for given compact class $K$ of functions in $L_{\infty}(Y,X)$ , and any finite set $\Lambda\subset{\cal F}$ , we define

[TABLE]

and

[TABLE]

Notice that in this definition the set $\Lambda$ is allowed to depend on $K$ , but cannot change for the various $u\in K$ . So, as formulated, this is a problem of finding the best linear space ${\cal P}_{\Lambda}$ to use when approximating $K$ . Hence, the optimal performance $E_{n}(K)$ satisfies

[TABLE]

where $d_{n}$ denotes the Kolmogorov $n$ -width of $K$ in $L_{\infty}(Y,X)$ . The sets $K$ are commonly called model classes. The case where the error of approximation is measured in $L_{q}(Y,X)$ , $q<\infty$ , is also interesting but not studied here.

Given the model class $K$ , we are interested in several fundamental issues:

•

First, what can we say about the rate of decay of $E_{n}(K)$ as $n$ increases? By now, there are several results in the literature that give upper bounds on $E_{n}(K)$ for certain anisotropic analytic classes $K$ of the type analyzed in this paper. Most often these bounds have been developed in the setting where $u$ is a solution to a parametric PDE. So part of our effort is to separate out which of these results are simply a result of the analyticity of $u$ and do not use any additional properties of the PDE solution.

•

A second important issue is the optimality of the known bounds for $E_{n}(K)$ . Indeed, the typical results only give upper bounds for $E_{n}(K)$ and sometimes only for $n$ sufficiently large.

•

A third significant problem in this area of research is to give a recipe for finding good lower sets $\Lambda_{n}\in{\cal L}_{n}$ such that $E_{\Lambda_{n}}(K)$ performs at or near $E_{n}(K)$ . This can be a nontrivial issue in numerical applications since, given a budget $n$ , searching over all lower sets in ${\cal L}_{n}$ to find a suitable $\Lambda_{n}$ is prohibitive.

As already noted, we are interested in model classes $K$ described by some form of anisotropic analyticity. We focus on $X$ valued functions that are analytic on a polydisc $D_{\rho}$ consisting of all complex sequences $z=(z_{1},z_{2},\dots)$ , with $|z_{j}|<\rho_{j}$ , $j=1,2,\dots$ . Here, $\rho=(\rho_{1},\rho_{2},\dots)$ is always a nondecreasing sequence of positive numbers with $\rho_{1}>1$ . The functions in $D_{\rho}$ have more smoothness in the variable $z_{j}$ as $j$ increases. In turn, the influence of this variable on the value of $u$ at a point in $Y$ is weaker. Any function in $D_{\rho}$ has Taylor coefficients that are elements of $X$ . In §2, we introduce a variety of spaces ${\cal B}_{\rho,p}$ , which differ in the assumptions imposed on the Taylor coefficients. These spaces are motivated by recent work (see [1, 15, 13]) in parametric PDEs.

The remainder of the paper concentrates on understanding the rate of decay of $E_{n}(K)$ for these model classes and understanding how to choose lower sets $\Lambda_{n}\subset{\cal F}$ of cardinality $n$ which attain $E_{n}(K)$ . It turns out that the $L_{\infty}(Y,X)$ norm is difficult to work with and so we replace it by a certain surrogate majorant. In §3, we show that estimating the error of approximating $K$ in the surrogate norm and finding optimal lower sets $\Lambda_{n}$ in this norm has a simple solution. Namely, given any $\varepsilon>0$ , the smallest lower set $\Lambda$ for which ${\cal P}_{\Lambda}$ approximates $K$ to accuracy $\varepsilon$ is given by the set of lattice points in a certain simplex $S=S(\varepsilon,\rho)$ determined by $\rho$ and $\varepsilon$ . Therefore, understanding the rate of decay of $E_{n}(K)$ is equivalent to counting the number of lattice points in these simplices.

Of course, counting lattice points in simplices is a well studied problem in number theory where several deep results are known. General results typically only hold for $n$ large when the number of lattice points can be estimated through the volume of the simplex. In numerical applications, the pre-asymptotic region is the most important since it corresponds to the only $n$ which can be implemented in computation. Therefore, we focus on counting lattice points when $n$ is small. For results of this type, one needs to have more specific information on the simplices, and therefore on the sequence $\rho$ . This leads us to consider specific anisotropic classes that arise in applications. These correspond to sequences $\rho$ which grow polynomially.

For $s>0$ , we define the sequence $\rho(s):=(\rho_{j}(s))_{j\geq 1}$ with $\rho_{j}(s):=(j+1)^{s}$ , $j\geq 1$ . The problem of counting lattice points in the simplex associated to this sequence is directly related to counting the number of multiplicative partitions of integers. One can therefore use the results in [10] to do exact and asymptotic analysis for the number of such lattice points. Exact counts on the number of multiplicative partitions of an integer $n$ are known for certain values of $n$ . Making such counts becomes numerically more intensive as $n$ increases.

It turns out that this situation can be alleviated some by slightly modifying the sequence $\rho(s)$ . We modify this sequence to obtain a related sequence $\rho^{*}(s)$ with the same asymptotic decay as $\rho(s)$ . The advantage of this modification is that the number of lattice points of the modified sequence is related to the number of additive partitions of an integer $n$ rather than the multiplicative partitions. Finding additive partitions is somewhat easier numerically. We show how one can do an exact count of lattice points for $\rho^{*}(s)$ in §5. In §5.2, we give an asymptotic analysis for this count by using known asymptotic bounds for the number of additive partitions of integers. In §6, we give some simple recipes for how to find the optimal $\Lambda_{n}$ for various sequences $\rho$ . Finally, in §7, we make some final remarks and compare out results with those in [20].

Let us close this introduction by mentioning that the results in this paper have a large intersection with several earlier papers. As we have already noted, our motivation for the introduction of the spaces ${\cal B}_{\rho,p}$ stems from several works on parametric PDEs, see the survey [13] and the references therein. Let us also mention [5, 16] which study approximation of anisotropic analytic functions in a quite general tensor product framework. There are several papers, most notably [5, 20, 22], that realize that one method to construct approximations for solutions of parametric PDEs is related to counting lattice points. In [20], this count is done for certain non-simplicial sets as well. We touch more on some of these works later in the paper once our results are formulated.

2 Anisotropic analyticity and motivation

In this section, we introduce a variety of model classes based on some form of anisotropic analyticity. We recall that throughout this paper $\rho=(\rho_{1},\rho_{2},\dots)$ denotes a non-decreasing sequence of positive real numbers with $\rho_{1}>1$ , and $\lim_{j\to\infty}\rho_{j}=\infty$ . We call any sequence with these properties admissible. We recall the Banach spaces $\ell_{\infty}(\mathbb{N})$ of all bounded complex valued sequences $(z_{j})_{j\geq 1}$ , with its usual norm $\|z\|_{\ell_{\infty}{(\mathbb{N})}}:=\sup_{j\geq 1}|z_{j}|$ . We let ${\cal U}$ denote the unit ball of $\ell_{\infty}(\mathbb{N})$ in this section.

Perhaps the most natural class of anisotropic analytic functions is the following. We start with the complex (open) polydisc $D_{\rho}$ , which consists of all $z=(z_{1},z_{2},\dots)$ , $z_{j}\in\mathbb{C}$ , for which $|z_{j}|<\rho_{j}$ , $j=1,2,\dots,$ and define $\overline{D}_{\rho}$ as the set of all $z=(z_{1},z_{2},\dots)$ for which $|z_{j}|\leq\rho_{j}$ , $z_{j}\in\mathbb{C}$ , $j=1,2,\dots$ . We then define

[TABLE]

as the set of all functions $u:\ell_{\infty}(\mathbb{N})\rightarrow X$ which are bounded on $\overline{D}_{\rho}$ , continuous on $D_{\rho}$ , and holomorphic in each variable $z_{j}$ , $j=1,2,\dots$ , on $D_{\rho}$ . We can equip this space with the norm

[TABLE]

Because the sequence $\rho$ is non-decreasing, we see that functions in ${\cal H}_{\rho}$ have more smoothness in the variable $z_{j}$ as $j$ increases. These spaces are analogous to Hardy spaces.

If $\nu\in{\cal F}$ , then the support of $\nu$ is finite and any $u\in{\cal H}_{\rho}$ has uniquely defined Taylor coefficients

[TABLE]

where again we are using standard multivariate notation. Note here that the definition of $t_{\nu}$ requires only the function $u(z_{1},\dots,z_{N},0,\dots)$ , for a suitable finite value of $N$ . Since this is an analytic function of a finite number of variables, these coefficients are well defined from the usual theory of functions of a finite number of variables.

In what follows, we are interested in representing $u$ in a Taylor series expansion

[TABLE]

An important issue is the sense in which the above Taylor series converges. For this, we follow Section 3.1 of [13]. It is shown in that paper that any rearrangement of this series converges uniformly on ${\cal U}$ whenever $(\|t_{\nu}\|_{X})_{\nu\in{\cal F}}$ is in $\ell_{1}({\cal F})$ . This guarantees that there is a function $v$ defined on ${\cal U}$ such that any rearrangement of the terms in the series in (2.1) converges in $X$ uniformly to $v$ . We call this type of convergence uniform unconditional.

Convergence of the Taylor series associated to $u$ does not guarantee that its limit is equal to $u$ . For this one requires additional structure. A sufficient condition is that $u$ has the following property:

Truncation Property: For all $z\in{\cal U}$ , we have

[TABLE]

This property is known to hold for the solutions to parametric PDEs.

Our first observation is that whenever $u$ is in ${\cal H}_{\rho}$ , then $u$ has a bound on its Taylor coefficients.

Lemma 2.1.

If $u\in{\cal H}_{\rho}$ , then

(i)

the Taylor coefficients $t_{\nu}\in X$ of $u$ satisfy the bounds

[TABLE] 2. (ii)

if in addition, $u$ has the Truncation Property and $(\|t_{\nu}\|_{X})_{\nu\in{\cal F}}$ is in $\ell_{1}({\cal F})$ , we have

[TABLE]

with uniform unconditional convergence of the series.

Proof: We use a slight modification of the proof of Lemma 3.14 in [13] accounting for the fact that the assumptions of the lemma do not guarantee that $u$ is holomorphic on an open set containing $\overline{D}_{\rho}$ as is required in that lemma of [13]. If we fix $\nu\in{\cal F}$ and any sequence $\delta<\rho$ , we claim that

[TABLE]

Given $\nu$ , let $\{1,\dots,J\}$ contain the support of $\nu$ . We consider the function $F(z_{1},\dots,z_{J}):=u(\hat{z})$ , where $\hat{z}_{j}:=z_{j}$ $j=1,\dots,J$ , and $\hat{z}_{j}$ is zero otherwise. Then $t_{\nu}$ is the corresponding Taylor coefficient of $F$ and the bound (2.4) is derived from Cauchy’s formula as in [13]. Since this bound holds for any $\delta<\rho$ and the support of $\nu$ is finite, we obtain (2.3) by letting $\delta_{j}\to\rho_{j}$ , for each $j$ in the support of $\nu$ . The uniqueness of $t_{\nu}$ again follows from the fact that $\nu$ has only a finite number of nonzero coordinates and $t_{\nu}$ is determined by restricting $u$ to the finite number of coordinates corresponding to where $\nu$ is nonzero. This proves (i).

For the proof of (ii), the assumption $(\|t_{\nu}\|_{X})_{\nu\in{\cal F}}\in\ell_{1}({\cal F})$ guarantees the convergence of the Taylor series and then the fact that its sum is $u$ follows easily (see Proposition 2.1.5 in [22]). $\Box$

This lemma motivates the definition of the following class of functions.

**Definition of ${\cal B}_{\rho,\infty}$ : ** We say that a function $u$ defined on $Y$ and taking values in $X$ , is in the space ${\cal B}_{\rho,\infty}:={\cal B}_{\rho,\infty}(Y)$ if $u$ admits a representation

[TABLE]

with the convergence of the series uniform unconditional on $Y$ , and where the $t_{\nu}=t_{\nu}(u)\in X$ are unique and satisfy

[TABLE]

Another type of restriction on functions $u$ , derived in the context of parametric PDEs (see [1]), is that

[TABLE]

This motivates the general definition of the following model classes.

Definition of ${\cal B}_{\rho,p}$ : For any $0<p\leq\infty$ , we define the space ${\cal B}_{\rho,p}$ , as the set of all $u\in L_{\infty}(Y,X)$ which admit a representation

[TABLE]

with the convergence of the series uniform unconditional on $Y$ , and where the $t_{\nu}=t_{\nu}(u)\in X$ are unique and satisfy

[TABLE]

Notice that these classes get smaller as $p$ decreases: ${\cal B}_{\rho,p}\subset{\cal B}_{\rho,q}$ when $p\leq q$ . We study the approximation of the model classes ${\cal B}_{\rho,p}$ in this paper.

We could similarly define anisotropic spaces using other sequence norms in place of $\ell_{p}$ norms, for example, Lorentz space norms. However, we will not explore this in the present paper.

Remark 2.2.

We have introduced spaces of anisotropic analytic functions by imposing conditions on Taylor coefficients. One could replace the Taylor basis $y^{\nu}$ , $\nu\in{\cal F}$ , by other polynomial bases and define corresponding spaces of analytic functions. A particularly interesting case is when the polynomial basis consists of Legendre polynomials, since such expansions occur naturally in parametric PDEs (see [12]).

3 The approximation of functions in ${\cal B}_{\rho,p}$

In this section, we give first estimates for the error in approximating functions in ${\cal B}_{\rho,p}$ by polynomials in ${\cal P}_{\Lambda}$ , with $\Lambda$ a lower set. We follow the ideas in [15] which treats the case $p=2$ . Recall that in this paper we limit our discussion to the approximation of $u$ in the $L_{\infty}(Y,X)$ norm. This norm is not easy to access especially when $X$ is a general Banach space. However, if $u$ has a Taylor expansion $u(y)=\sum_{\nu\in{\cal F}}t_{\nu}y^{\nu}$ , $y\in Y$ , then it has a simple majorant given by

[TABLE]

The surrogate norm $\|u\|^{*}$ is defined and finite only if $u$ has a Taylor expansion valid on $Y$ and $(\|t_{\nu}\|_{X})_{\nu\in{\cal F}}$ is in $\ell_{1}({\cal F})$ . We assume that this is the case in going further in this section. As we shall see below, this assumption is easy to verify when $u\in{\cal B}_{\rho,p}$ under suitable assumptions on $\rho$ . This leads us to consider the surrogate error

[TABLE]

and similarly

[TABLE]

for the surrogate performance on a compact set $K\subset L_{\infty}(Y,X)$ . Given any set $\Lambda$ , the polynomial

[TABLE]

provides an approximation to $u$ which satisfies

[TABLE]

We now describe a simple way to find a lower set from ${\cal L}_{n}$ which gives the smallest surrogate error for the unit ball ${\cal U}_{\rho,p}$ of ${\cal B}_{\rho,p}$ , $0<p\leq\infty$ , among all lower sets from ${\cal L}_{n}$ . Given the sequence $\rho$ and given any $\varepsilon>0$ , we define

[TABLE]

Notice that $\Lambda(\varepsilon,\rho)$ has the following properties:

•

$\#\Lambda(\varepsilon,\rho)<\infty$ whenever $\varepsilon>0$ , since $\rho$ is non-decreasing, with $\rho_{1}>1$ and $\lim_{j\to\infty}\rho_{j}=\infty$ ;

•

$\Lambda(\varepsilon,\rho)$ is a lower set, since $\mu\leq\nu\Rightarrow\rho^{-\nu}\leq\rho^{-\mu}$ ;

•

$\Lambda(\varepsilon,\rho)\subset\Lambda(\varepsilon^{\prime},\rho)$ whenever $\varepsilon^{\prime}\leq\varepsilon$ .

We define the sequence $(\delta_{n})_{n\geq 1}=(\delta_{n}(\rho))_{n\geq 1}$ to be a decreasing rearrangement of the sequence $(\rho^{-\nu})_{\nu\in{\cal F}}$ . Then, $\#\Lambda(\delta_{n},\rho)\geq n$ . We further define

[TABLE]

as any lower set contained in $\Lambda(\delta_{n},\rho)$ with cardinality $n$ and which has the property that it contains all $\nu$ for which $\rho^{-\nu}>\delta_{n}(\rho)$ . Such a lower set can be obtained from $\Lambda(\delta_{n},\rho)$ by successively removing extreme points and thereby retaining the lower set property. Note that $\Lambda_{n}$ is not unique because of possible ties in the value of $\rho^{-\nu}$ , $\nu\in{\cal F}$ .

For any admissible $\rho$ and any $n\geq 1$ , we define

[TABLE]

While $\Lambda_{n}$ need not be unique, we always have a unique value for $\delta_{n,q}(\rho)$ for all choices of $n,q,\rho$ .

Theorem 3.1.

For $0<p\leq\infty$ , we have:

(i)

the set $\Lambda_{n,\rho}$ , defined in (3.5), minimizes $E^{*}_{\Lambda}({{\cal U}_{\rho,p}})$ over all lower sets $\Lambda\in{\cal L}_{n}$ , and

[TABLE] 2. (ii)

if $p\geq 1$ and $q$ is the conjugate index to $p$ , i.e., $1/p+1/q=1$ , then

[TABLE]

Proof: Let us first make some remarks about the structure of ${\cal U}_{\rho,p}$ that hold for any $0<p\leq\infty$ and any admissible $\rho$ . Given any $u\in{\cal U}_{\rho,p}$ , we know that $u(y)=\sum_{\nu\in{\cal F}}t_{\nu}y^{\nu}$ and the Taylor coefficients $t_{\nu}$ satisfy

[TABLE]

where $U(\ell_{p}({\cal F}))$ is the unit ball of the space $\ell_{p}({\cal F})$ . Conversely, let $(\alpha_{\nu})_{\nu\in{\cal F}}\in U(\ell_{p}({\cal F}))$ be a non-negative sequence and let $g\in X$ with $\|g\|_{X}=1$ . If we define $t_{\nu}:=g\rho^{-\nu}\alpha_{\nu}$ , $\nu\in{\cal F}$ , then the function $u(y):=\sum_{\nu\in{\cal F}}t_{\nu}y^{\nu}$ will be in ${\cal U}_{\rho,p}$ provided that $(\rho^{-\nu}\alpha_{\nu})_{\nu\in{\cal F}}$ is summable.

We first prove (ii) for a fixed $n$ and $p$ . We only discuss the case $p>1$ . The case $p=1$ is proved in a similar way. The two inequalities in (ii) are obvious from the definitions (1.4), (3.2), and (3.3), and so we only need to show that $E^{*}_{\Lambda_{n,\rho}}({{\cal U}_{\rho,p}})=\delta_{n,q}$ . Let $u\in{\cal U}_{\rho,p}$ with $u(y)=\sum_{\nu\in{\cal F}}t_{\nu}y^{\nu}$ . It follows from (3.1) with $T_{\Lambda_{n,\rho}}(y):=\sum_{\nu\in\Lambda_{n,\rho}}t_{\nu}y^{\nu}$ and Hölder’s inequality that

[TABLE]

To prove that $E_{\Lambda_{n,\rho}}^{*}({\cal U}_{\rho,p})\geq\delta_{n,q}$ , we construct a function $\tilde{u}\in{\cal U}_{\rho,p}$ for which $E_{\Lambda_{n,\rho}}^{*}(\tilde{u})=\delta_{n,q}$ . First assume that $\delta_{n,q}$ is finite, so that there is a nonnegative sequence $(c_{\nu})_{\nu\in{\cal F}}$ in the unit ball of $\ell_{p}({\cal F})$ for which $\sum_{\nu\notin\Lambda_{n,\rho}}c_{\nu}\rho^{-\nu}=\delta_{n,q}$ . Then, as in our lead remarks, we let $g\in X$ with $\|g\|_{X}=1$ and define $t_{\nu}:=c_{\nu}\rho^{-\nu}g$ . Then, we have

[TABLE]

is in ${\cal U}_{\rho,p}$ . Note here we use the fact that $(\|t_{\nu}\|_{X})_{\nu\in{\cal F}}$ is in $\ell_{1}({\cal F})$ . Since $E_{\Lambda_{n,\rho}}^{*}(\tilde{u})=\delta_{n,q}$ , we have finished the proof of (ii) in the case that $\delta_{n,q}$ is finite. If $\delta_{n,q}=\infty$ , the same argument as above shows that there is a $\tilde{u}$ for which $E_{\Lambda_{n,\rho}}^{*}(\tilde{u})$ is as large as we wish. Therefore (ii) holds in this case as well.

Now, consider the proof of (i). The inequalities stated in (i) are all obvious and so we need only show that $\Lambda_{n,\rho}$ minimizes $E_{\Lambda}^{*}({\cal U}_{\rho,p})$ over all lower set $\Lambda\in{\cal L}_{n}$ . To prove this, we first consider the case $p\leq 1$ . If $\Lambda\in{\cal L}_{n}$ , then by our lead remarks

[TABLE]

Here, we use the fact that $(\rho^{-\nu}\alpha_{\nu})_{\nu\in{\cal F}}$ is summable because $(\alpha_{\nu})_{\nu\in{\cal F}}$ is in $\ell_{1}({\cal F})$ . The minimum of (3.9) over all $\Lambda$ is achieved by taking $\Lambda=\Lambda_{n,\rho}$ .

Finally, we have to prove (i) in the case $1<p\leq\infty$ . Let us first recall that there is an enumeration $\nu(n)$ , $n\geq 1$ , of all of the $\nu\in{\cal F}$ , such that $\delta_{n}=\rho^{-\nu(n)}$ , $n\geq 1$ , and such that $\Lambda_{n,\rho}=\{\nu(1),\dots,\nu(n)\}$ . We suppose $\Lambda$ is any lower set with $\#\Lambda=n$ . From the definition of $\Lambda_{n,\rho}$ , we have

[TABLE]

Using the same construction as in the proof of (ii), we can find $\tilde{u}\in{\cal U}_{\rho,p}$ with

[TABLE]

which thereby proves (i). $\Box$

Corollary 3.2.

For $1\leq p\leq\infty$ , let $q$ be the conjugate index to $p$ . Then whenever $\delta_{n,r}$ is finite for some $0<r<q$ , we have

[TABLE]

In particular, we have

[TABLE]

Proof: The case where $p=1$ (resp. $q=\infty$ ) follows from (ii) of Theorem 3.1, since $\delta_{n,\infty}=\delta_{n+1}$ . So we can assume $p>1$ and $q<\infty$ . Since the sequence $(\delta_{n})_{n\geq 1}$ is non-increasing, we have

[TABLE]

Because of (ii) in Theorem 3.1, taking a $q$ -th root proves (3.11). To show (3.12), we use the fact that $\delta_{n,r}\leq\|(\rho^{-\nu})_{\nu\in{\cal F}}\|_{\ell_{r}({\cal F})}$ , along with the standard estimate

[TABLE]

Inserting these into (3.13), we obtain

[TABLE]

and the proof is complete. $\Box$

Remark 3.3.

We can define the above space ${\cal B}_{\rho,p}$ also in the case $\rho=(\rho_{1},\dots,\rho_{d})$ with $d$ finite. The results of this section hold equally well in this case.

4 The sequence $\delta_{n}(\rho)$

First, let us observe that in order for the $\delta_{n,q}$ from (3.6) to be finite, and therefore Theorem 3.1 to be meaningful, we need that the sequence $(\rho^{-\nu})_{\nu\in{\cal F}}\in\ell_{q}({\cal F})$ , which is the same as asking that $(\delta_{n}(\rho))_{n\geq 1}\in\ell_{q}(\mathbb{N})$ . The following lemma shows that this is the case if and only if $(\rho_{j}^{-1})_{j\geq 1}\in\ell_{q}(\mathbb{N})$ .

Lemma 4.1.

Let $0<q\leq\infty$ . Then the sequence $(\rho^{-\nu})_{\nu\in{\cal F}}\in\ell_{q}({\cal F})$ if and only if the sequence $(\rho_{j}^{-1})_{j\geq 1}{\in}\ell_{q}(\mathbb{N})$ . Moreover, the two norms are related in the following way:

(i)

when $q=\infty$ , we have ${\|(\rho^{-\nu})_{\nu\in{\cal F}}\|}_{\ell_{\infty}{({\cal F})}}=1,$ 2. (ii)

when $0<q<\infty$ , we have

[TABLE]

Proof: The case $q=\infty$ is trivial. When $q<\infty$ , we have

[TABLE]

Taking logarithms, we have from the mean value theorem that

[TABLE]

where the $\xi_{j}\in(0,\rho_{j}^{-q})\subset(0,\rho_{1}^{-q})$ , $j=1,2,\dots$ . Since $1<(1-\xi_{j})^{-1}<(1-\rho_{1}^{-q})^{-1}$ , it follows that

[TABLE]

This proves item (ii) in the lemma, and likewise shows that the product in (4.1) converges if and only if $(\rho_{j}^{-1})_{j\geq 1}{\in}\ell_{q}{(\mathbb{N})}$ . $\Box$

Remark 4.2.

The upper bound established in the above lemma can be found in [15].

The error estimates derived in §3 for approximation by polynomials on lower sets depend crucially on the sequence $(\delta_{n}(\rho))_{n\geq 1}$ , and are achieved by choosing the lower set $\Lambda_{n}=\Lambda_{n,\rho}$ . This leads to two central issues:

(i)

establishing sharp a priori estimates for $\delta_{n}(\rho)$ given the sequence $\rho$ ; 2. (ii)

efficient algorithms for generating the sets $\Lambda_{n}$ .

We discuss item (ii) in §6, and here we discuss first item (i). We begin this section with methods for bounding $(\delta_{n}(\rho))_{n\geq 1}$ which hold for any admissible sequence $\rho$ .

Remark 4.3.

In order to compute $\delta_{n}(\rho)$ or its asymptotic decay as $n\to\infty$ , we study $\#\Lambda(\varepsilon,\rho)$ , $0<\varepsilon\leq 1$ . This function of $\varepsilon$ takes integer values and increases as $\varepsilon$ goes to zero. Hence, it is a piecewise constant function and $(\delta_{n}(\rho))_{n\geq 1}$ is the decreasing sequence of the breakpoints $\varepsilon_{1},\varepsilon_{2},\ldots,$ of $\#\Lambda(\varepsilon,\rho)$ , where each value $\varepsilon_{i}$ is repeated $\#\Lambda(\varepsilon_{i+1},\rho)-\#\Lambda(\varepsilon_{i},\rho)$ times and $\delta_{1}(\rho)=\ldots=\delta_{k_{1}}(\rho)=1$ with $k_{1}=\#\Lambda(1,\rho)$ .

Since $\displaystyle{\lim_{j\to\infty}\rho_{j}=+\infty}$ , there is a $D=D(\varepsilon)$ such that $\rho_{j}^{-1}<\varepsilon$ , $j>D$ . It follows that any $\nu\in\Lambda(\varepsilon,\rho)$ has support in $\{1,2,\dots,D\}$ . Moreover, if we write $\varepsilon=e^{-M}$ , then taking logarithms we see that $\nu\in\Lambda(\varepsilon,\rho)$ if and only if $\nu$ satisfies

[TABLE]

Hence, $\nu\in\Lambda(\varepsilon,\rho)$ if and only if $\nu$ is supported on $\{1,2,\dots,D\}$ , and $(\nu_{1},\dots,\nu_{D})$ is a lattice point in the simplex

[TABLE]

where

[TABLE]

Estimating the number of lattice points in such a simplex is a classical problem in number theory and combinatorics. Let us first note that the volume (measure) of $S$ is

[TABLE]

We recall the following general upper bound (see [4, 21]) for the number $\#\Lambda(S)$ of $\nu\in\mathbb{N}_{0}^{D}$ such that $\nu\in S$ :

[TABLE]

Note that the right side of (4.4) is inflated by a factor of ( $1+a)^{D}$ when compared with the volume of $S$ . We use this result to prove the following lemma.

Lemma 4.4.

Let $\rho$ be any admissible sequence. Given $\varepsilon=e^{-M}$ , where $M>0$ , let $D$ be the last integer $j$ for which $\rho_{j}\leq e^{M}$ . Then, for the set $\Lambda(\varepsilon,\rho)$ of all $\nu\in{\cal F}$ such that $\rho^{-\nu}\geq\varepsilon$ , we have

[TABLE]

Proof: From (4.4) with $a_{j}=\frac{M}{\ln\rho_{j}}$ , $j=1,\dots,D$ , we have

[TABLE]

which is equivalent to (4.5). $\Box$

Let us make some remarks that will clarify when the bound in the lemma is effective and when it is deficient. First of all, if $d$ is finite and the sequence $(\rho_{j})_{j=1}^{d}$ is fixed, then the set $\Lambda(\varepsilon,\rho)$ , $\varepsilon=e^{-M}$ , is the set of lattice points $\mathbb{N}^{d}_{0}/M$ in the fixed simplex $S^{*}:=S(1/\ln\rho_{1},\dots,1/\ln\rho_{d})$ . If we let $M$ tend to infinity (which corresponds to $\varepsilon\to 0$ ), we see that $D=d$ provided $M$ is large enough, and $\#\Lambda(e^{-M},\rho)$ behaves like $M^{d}$ times the measures of $S^{*}$ . This is in agreement with the bound (4.5) because the inflation factor $(1+L/M)^{D}=(1+L/M)^{d}$ tends to one as $M\to\infty$ . So this bound is good for finite $d$ , provided the error we seek is small. However, there is a transition before this asymptotic kicks in where the upper bound provided by the lemma is not effective.

To see this, we consider one example which is central to this paper. We consider the sequence $\rho:=(j+1)_{j=1}^{d}$ , with $d$ finite. We take as our target error $\varepsilon:=1/(d+1)$ , i.e. $M=\ln(d+1)$ . Then $D=d$ and the upper bound for $\#\Lambda(\varepsilon,\rho)$ provided by Lemma 4.4 is

[TABLE]

where we used the fact that

[TABLE]

Since $\ln(x)$ is a concave function, we have

[TABLE]

Therefore, we have

[TABLE]

where we used Stirling’s formula. Thus, if we want an error $\varepsilon=1/(d+1)$ in this particular example, the best bound that Lemma 4.4 can provide for the size of $\Lambda(\varepsilon,\rho)$ is exponential in $d$ . In contrast, in Lemma 5.3 from the following section, we give a much more favorable bound.

5 Analysis of $\delta_{n}(\rho)$ when $\rho$ has polynomial growth

As we have just observed, the bounds of the previous section for $\delta_{n}(\rho)$ are generally far from sharp. We can establish sharper bounds, and even compute $\delta_{n}(\rho)$ exactly, if we have more information on the sequence $\rho$ . In this section, we give such an analysis when the sequence $\rho$ has polynomial growth.

Recall that for $s>0$ , we defined the sequence $\rho(s):=((j+1)^{s})_{j\geq 1}$ . In some parts of our analysis, it is useful to slightly modify this sequence. Accordingly, we introduce the following modified sequence $\rho^{*}(s)$ , $s>0$ , defined as follows. If $I_{1}:=\{1,2\}$ and $I_{k}:=\{j:\ 2^{k-1}<j\leq 2^{k}\}$ , $k\geq 2$ , then

[TABLE]

Note that the sequence $\rho_{j}^{*}(s)$ increases like $j^{s}$ . Moreover, $\#I_{1}=2$ and $\#I_{k}=2^{k-1}$ for $k\geq 2$ .

Given any $\varepsilon$ , we want to determine the cardinality of the set $\Lambda(\varepsilon,\rho(s))$ or its counterpart $\Lambda(\varepsilon,\rho^{*}(s))$ , i.e., how many $\nu$ satisfy the inequality $[\rho^{*}(s)]^{-\nu}\geq\varepsilon$ . According to Remark 4.3, the decay rate of $\delta_{n}(\rho^{*}(s))$ can then be derived from this knowledge. Let us note that for these two sequences, we have

[TABLE]

and so it is enough to analyze the case $s=1$ . We therefore take $s=1$ in the estimates on cardinality that follow.

As $\varepsilon$ decreases, the cardinality of $\Lambda(\varepsilon,\rho(1))$ increases. While it is interesting to understand how this cardinality grows asymptotically when $\varepsilon$ tends to zero, in numerical scenarios it is important to keep this cardinality small.

5.1 Exact formulas for $\#\Lambda(\varepsilon,\rho^{*}(1))$

Exact formulas for the cardinality of $\Lambda(\varepsilon,\rho(1))$ can be given in terms of the multiplicative partitions of natural numbers (see [10] and Remark 3.18 in [13]). In theory, these formulas allow the precise computation of $\#\Lambda(\varepsilon,\rho(1))$ provided that this cardinality is not too large. However, this computation is very intense and in fact, to our knowledge, has not been done. It turns out that these computations are simpler if one uses the sequence $\rho^{*}(1)$ instead of $\rho(1)$ . This stems from the fact that $\rho^{*}(1)^{\nu}$ is always an integer power of two. For this reason, we focus on this sequence for the remainder of this section. We begin by showing how one can do an exact count of the multiindices in the simplex associated to $\rho^{*}(1)$ .

For any $m\in\mathbb{N}_{0}$ , we define

[TABLE]

The set $S_{0}$ contains only the zero sequence and hence $\#S_{0}=1$ . We want to determine the cardinality of the sets $S_{m}$ , $m\geq 1$ . This is the same as finding how many $\nu\in{\cal F}$ satisfy (3.4), since if we denote by

[TABLE]

we have that

[TABLE]

Let us first note that if $\nu$ has a nonzero component $\nu_{j}>0$ for some $j>2^{m}$ , then $\rho^{*}(1)^{\nu}>2^{m}$ and so $\nu$ is not in $S_{m}$ . Hence, any $\nu\in S_{m}$ is supported on $\{1,\dots,2^{m}\}$ . We decompose the set $\{1,\dots,2^{m}\}=\bigcup_{k=1}^{m}I_{k}$ , and given any $\nu$ , we define

[TABLE]

which we think of as the energy of $\nu$ on $I_{k}$ . Therefore, for any $\nu\in\ S_{m}$ , we have

[TABLE]

Note that there are only certain sequences $(N_{1},\dots,N_{m})$ which satisfy (5.5). We denote the collection of all such sequences by ${\cal Q}_{m}$ ,

[TABLE]

The sequences in ${\cal Q}_{m}$ are related to the additive partitions of $m$ , which are decompositions of $m\in\mathbb{N}$ into $m=m_{1}+\dots+m_{j}$ , where the $m_{j}\in\mathbb{N}$ and where the order of the appearance of an $m_{j}$ does not matter.

There is a one to one correspondence between the elements in ${\cal Q}_{m}$ and additive partitions of $m$ . Indeed, any additive partition $(m_{1},m_{2},\dots,m_{j})$ of $m$ corresponds to a sequence

$(N_{1},\dots,N_{k},\dots,N_{m})\in{\cal Q}_{m}$ , where $N_{k}$ is the number of appearances of $k$ in $(m_{1},\dots,m_{j})$ . Conversely, any $(N_{1},\dots,N_{m})$ for which $\sum_{k=1}^{m}kN_{k}=m$ corresponds to the unique additive partition of $m$ , where $1$ appears $N_{1}$ times, $2$ appears $N_{2}$ times and so on. Thus $q(m):=\#{\cal Q}_{m}$ is the additive partition number of $m$ .

The following theorem gives an exact count for the cardinality of $S_{m}$ , and hence the cardinality of the set $\Lambda(\varepsilon,\rho^{*}(1))$ .

Theorem 5.1.

For $m\geq 1$ , the cardinality of $S_{m}$ is given by

[TABLE]

Moreover, for every $\varepsilon>0$ ,

[TABLE]

where $m(\varepsilon):=\lfloor\log_{2}\left(\frac{1}{\varepsilon}\right)\rfloor.$

Proof: For any fixed $(N_{1},\dots,N_{m})\in{\cal Q}_{m}$ , we define

[TABLE]

Now, for each $k=1,\ldots,m$ , we count all possible $\nu$ satisfying

[TABLE]

Since $\nu_{j}\in\mathbb{N}_{0}$ , the latter cardinality can be viewed as the number of ways one can place $N_{k}$ indistinguishable balls into $\#I_{k}$ distinguishable boxes so that some boxes can remain empty. The answer to this combinatorial problem is known to be $\binom{N_{k}-1+\#I_{k}}{N_{k}}$ (see [19]). Therefore, the cardinality of $\Gamma(N_{1},\dots,N_{m})$ is the product of these binomial coefficients:

[TABLE]

Equation (5.6) now follows from the definitions of $S_{m}$ and ${\cal Q}_{m}$ and (5.8). The last statement in the theorem follows from (5.4) and (5.6). $\Box$

Theorem 5.1 gives an exact formula for $\#\Lambda(\varepsilon,\rho^{*}(1))$ for any $\varepsilon$ , since

[TABLE]

Note that the sequence $(\delta_{n}(\rho^{*}(1)))_{n\geq 1}$ is then given by $\delta_{1}(\rho^{*}(1))=1$ and, for $m=1,2,\ldots$ ,

[TABLE]

Moreover, since $\Lambda(\varepsilon^{s},\rho^{*}(s))=\Lambda(\varepsilon,\rho^{*}(1))$ , $s>0$ , we similarly derive that

[TABLE]

and $(\delta_{n}(\rho^{*}(s)))_{n\geq 1}$ is then given by $\delta_{1}(\rho^{*}(s))=1$ and, for $m=1,2,\ldots$ ,

[TABLE]

In Table 1, we present the computed cardinality $\#\Lambda(2^{-ms},\rho^{*}(s))$ for values of $m$ in the range $0\leq m\leq 10$ and $s=1,2,3,4$ .

Remark 5.2.

If we combine this theorem with Theorem 3.1 and (5.11), we determine the optimal error and best lower set for approximating any of the spaces ${\cal B}_{\rho^{*}(s),p}$ , provided the error is measured in the surrogate norm rather than the true $L_{\infty}(Y,X)$ norm. Of course, it gives an upper bound on the performance in the $L_{\infty}(Y,X)$ norm, that is for $n\geq 1$ we have

[TABLE]

and

[TABLE]

where the sequence $(\delta_{n})_{n\geq 1}=(\delta_{n}(\rho^{*}(s)))_{n\geq 1}$ is given by (5.11). The efficiency of the algorithm is determined by the cardinality of $\Lambda(2^{-ms},\rho^{*}(s))=\Lambda(2^{-m},\rho^{*}(1))$ , given in Table 1. In particular, let us suppose the user desires to approximate a function in ${\cal U}_{\rho^{*}(s),1}$ with accuracy $10^{-3}$ . Because $\delta_{n+1}=2^{-ms}$ for $n=\#\Lambda(2^{-m+1},\rho^{*}(1))$ according to (5.11), when $s=1$ , we need $m=10$ and thus a set $\Lambda$ of cardinality 4101 achieves this accuracy. Similarly, a sufficient cardinality for $\Lambda$ is 50 when $s=2$ ; 20 for $s=3$ ; 8 for $s=4$ .

In view of Remark 5.2, the behavior of the sequence $(\delta_{n}(\rho))_{n\geq 1}$ dictates the error of approximation for ${\cal B}_{\rho,p}$ . The values of $\delta_{n}(\rho(s))$ are provided in Figure 1 for $s=1,2,3,4$ for the cases $\rho(s)=\rho^{*}(s)$ and $\rho(s)=((j+1)^{s})_{j\geq 1}$ .

5.2 The asymptotic behavior of $\delta_{n}(\rho^{*}(s))$

Theorem 5.1 gives an exact expression for $\#\Lambda(\varepsilon,\rho^{*}(s))$ which then can be used to determine $\delta_{n}(\rho^{*}(s))$ for any $s$ and $n$ . We can also use this theorem to give bounds on the asymptotic decay of $\delta_{n}(\rho^{*}(s))$ . We begin with a lemma.

Lemma 5.3.

For $m=0$ , $\#\Lambda(2^{-ms},\rho^{*}(s))=1$ , when $m=1$ , $\#\Lambda(2^{-ms},\rho^{*}(s))=3$ , and for every $m\geq 1$ , we have the following two estimates:

(i)

$\#\Lambda(2^{-ms},\rho^{*}(s))\leq 2^{m+4\sqrt{m}}$ , 2. (ii)

$\#\Lambda(2^{-ms},\rho^{*}(s))\leq Cm^{-3/4}2^{m+c\sqrt{m}}$ , where $C:=(1-2^{-1/4})^{-1}$ and $c:=\pi\sqrt{\frac{2}{3}}(\ln 2)^{-1}<4$ .

If we superimpose these inequalities we obtain

[TABLE]

Proof: Note that for the sequence $\rho^{*}(s)$ given by (5.1), we have

[TABLE]

Therefore $\Lambda(2^{-ms},\rho^{*}(s))$ does not depend on $s$ , and in what follows we may take $s=1$ .

For the particular cases $m=0,1$ we readily check that

[TABLE]

To show (i) and (ii), we first prove

[TABLE]

For $k\geq 1$ , we note that the binomial coefficient from (5.8) can be estimated

[TABLE]

since

[TABLE]

Therefore, for any sequence $(N_{1},\dots,N_{m})$ in ${\cal Q}_{m}$ , we have

[TABLE]

yielding the estimate

[TABLE]

As noted before, $q(m)$ is the same as the number of additive partitions of the integer $m$ . The number $q(m)$ has been exactly computed for small values of $m$ and there are bounds for $q(m)$ for any $m$ . The following upper bound for $q(m)$ can be found in [18]:

[TABLE]

Hence,

[TABLE]

and using Theorem 5.1, we obtain (5.13).

We can now use (5.13) to prove each of the inequalities (i) and (ii). To prove (ii), it is enough to show that

[TABLE]

The above relation is valid for $m=1$ and we now proceed by induction assuming that it has been proven for $m$ and verify the case $m+1$ . Using the induction hypothesis, we have

[TABLE]

where to derive the last inequality we used $\sqrt{m}<\sqrt{m+1}$ , $1/m\leq 1$ , and the specific value of $C$ . This completes the proof of (ii).

We prove estimate (i) for $m\geq 2$ in a similar way (the case $m=1$ clearly holds) showing by induction that

[TABLE]

The details are omitted.

To prove the superimposed estimate we note that

[TABLE]

On the interval $[2,\infty)$ , the function on the left is increasing and the function on the right is decreasing since $c<4$ , and the range of $m$ for which the inequality holds is $2\leq m\leq 5$ . The proof is completed. $\Box$

In Figure 2, we present the graphs of the exactly computed values of $\#\Lambda(2^{-m},\rho^{*}(1))$ compared to the estimate from Lemma 5.3.

5.2.1 Bounds for the error $E_{n}({\cal U}_{\rho^{*}(s),p})$ .

In this section, we use Lemma 5.3 to give bounds on the decay of $\delta_{n}(\rho^{*}(s))$ and $E_{n}({\cal U}_{\rho^{*}(s),p})$ . We start with the case $p=1$ .

Corollary 5.4.

If $s>0$ , then we have the following bounds

[TABLE]

Proof: We first consider the case when $n=2^{k}$ , $k\geq 1$ . Let $m$ be the largest non-negative natural number satisfying

[TABLE]

It follows from Lemma 5.3 that $\#\Lambda(2^{-m},\rho^{*}(1))\leq 2^{m+4\sqrt{m}}\leq 2^{k}=n$ . Relation (5.11) and the monotonicity of the sequence $(\delta_{n}(\rho^{*}(s)))_{n\geq 1}$ give $\delta_{n}(\rho^{*}(s))\leq 2^{-ms}$ which, according to Remark 5.2, leads to $E_{n}({\cal U}_{\rho^{*}(s),1})\leq 2^{-ms}$ .

Let us define $\alpha$ by the equation $m=k-\alpha\sqrt{k}$ and give an upper bound for $\alpha$ . Since the integer $m+1=k+1-\alpha\sqrt{k}$ does not satisfy (5.18), we have

[TABLE]

and so

[TABLE]

Rearranging terms, we have

[TABLE]

Noticing that the left-hand side vanishes for

[TABLE]

we obtain the upper bound $\alpha<\alpha_{+}$ from which we get

[TABLE]

Therefore, we have the estimate

[TABLE]

which leads to

[TABLE]

Now, given any $n\geq 2$ , we choose the largest $k$ such that $2^{k}\leq n<2^{k+1}$ . This implies that $2^{-k}<2n^{-1}$ and $\sqrt{4+k}\leq\sqrt{4+\log_{2}n}$ , and so we derive

[TABLE]

as desired. $\Box$

The next corollary treats the case of general $p$ .

Corollary 5.5.

Let $1<p\leq\infty$ and let $q$ be given by $1/p+1/q=1$ . For any $s>1/q$ , we have

[TABLE]

where $C(q,s)$ is a constant depending only on $s$ and $q$ .

Proof: The first inequality is (ii) of Theorem 3.1. Next, let us denote by

[TABLE]

and observe that since $\varphi(x):=x^{\frac{4s\sqrt{4+\log_{2}x}}{\log_{2}x}}$ is an increasing function of $x>0$ , we have

[TABLE]

Note that to complete the proof we need only show (5.19) in the case $n=2^{N}$ because then for $2^{N}<k\leq 2^{N+1}$ ,

[TABLE]

where we have used the fact that the sequence $(\delta_{n,q}(\rho^{*}(s)))_{n\geq 1}$ is decreasing. Thus, we concentrate on the case $n=2^{N}$ and define

[TABLE]

Similarly to the function $\phi$ , we have that

[TABLE]

It follows from Corollary 5.4 that $\delta_{n}\leq 2^{-6s}\psi(n)$ , $n\geq 2$ , and using the above estimate we have

[TABLE]

Here, in the last inequality we have used the bound

[TABLE]

valid for every $m\geq 0$ , which follows from the fact that

[TABLE]

The bound (5.21) gives

[TABLE]

which is (5.19) for $n=2^{N}$ , and therefore completes the proof of the Corollary. $\Box$

According to (5.12), we can improve estimate (5.17) when $n$ is large. For this, we state the following two corollaries whose proofs will be given in the appendix.

Corollary 5.6.

Let $m=m(n)$ be the largest natural number such that

[TABLE]

where $C$ is the constant of Lemma 5.3. Then

[TABLE]

Note that the dependence of $m$ as a function of $n$ in the above corollary is implicit. One may want to get an explicit version of that statement which is the next corollary.

Corollary 5.7.

If $s>0$ ,

[TABLE]

and therefore

[TABLE]

where $\tilde{C}:=C(1-c/4)^{-3/4}$ with $C,c$ as in Lemma 5.3.

6 Finding the set $\Lambda(\varepsilon,\rho)$

In this section, we describe a possible strategy to build the set $\Lambda(\varepsilon,\rho)$ for any given sequence $\rho$ and a given target accuracy $\varepsilon$ . A second procedure (not given here) can then be used to find $\Lambda_{n,\rho}$ when we prescribe the cardinality $n$ of the set rather than the accuracy. Before we begin describing our algorithm, let us note that other procedures have been given for constructing $\Lambda(\varepsilon,\rho)$ (see e.g. [5, 22]).

As above, we consider $\rho=(\rho_{j})_{j\geq 1}$ to be a non-decreasing sequence such that $\rho_{1}>1$ and $\lim_{j\rightarrow\infty}\rho_{j}=\infty$ . Let us denote by ${\rm supp}(\nu)$ the support of a multiindex $\nu=(\nu_{1},\nu_{2},\ldots)$ , that is

[TABLE]

Recalling the definition of $\Lambda(\varepsilon,\rho)$ given in (3.4), we first notice that:

•

$\nu=0\in\Lambda(\varepsilon,\rho)$ whenever $\varepsilon\leq 1$ ;

•

for every fixed $\varepsilon$ , there is an index $D(\varepsilon)$ such that if $\nu\in\Lambda(\varepsilon,\rho)$ , then ${\rm supp}(\nu)\subset\{1,2,\ldots,D(\varepsilon)\}$ ;

•

if $\varepsilon_{1}\leq\varepsilon_{2}$ , then $D(\varepsilon_{2})\leq D(\varepsilon_{1})$ .

The lower set $\Lambda(\varepsilon,\rho)$ can be built using the iterative strategy described in the following Algorithm.

When implementing this algorithm in practice, we form a tree where each $\nu\in T_{i}$ has $D(\varepsilon)$ possible children $\mu$ to be checked for admissibility. When a constructed $\mu$ is found to be inadmissible, then it is not included in $T_{i+1}$ . This stops the search down the entire subtree rooted at $\mu$ . If $\mu$ is found to be admissible then it is added to $T_{i+1}$ . In this way, each $T_{i}$ forms a level in the tree rooted with the zero sequence. When all elements of $T_{i}$ are exhausted, then the computation moves to processing elements in $T_{i+1}$ . If the current set being processed is empty, then the procedure is ended and $\Lambda(\varepsilon,\rho)=\bigcup_{k=0}^{i}T_{k}$ . Finally, we mention that the set $T_{i+1}$ corresponds to the so-called reduced margin (see e.g. [11]) of the set $\bigcup_{k=0}^{i}T_{k}$ .

Remark 6.1.

One can deduce that the number of computations needed to construct the set $\Lambda(\varepsilon,\rho)$ is of order ${\cal O}(m\log m)$ , where $m=\#\Lambda(\varepsilon,\rho)$ , provided one imposes additional growth conditions on the sequence $(\rho_{j})_{j\geq 1}$ (for an analysis for another sorting algorithm see [5]). This would cover the sequences $\rho(s)$ and $\rho^{*}(s)$ for example.

7 Concluding Remarks

In this work, we discussed the approximation of Banach space valued functions with an infinite number of variables by polynomials on lower sets. We defined a family of model classes ${\cal B}_{\rho,p}$ based on anisotropic analyticity, and derived bounds for the decay rate for the approximation of these model classes using multivariate polynomials. We considered only the case when the approximation error is measured in the $L_{\infty}(Y,X)$ norm, though it would be interesting to develop corresponding results when measuring the approximation error in $L_{q}(Y,X)$ norms. Already, several results in the case $q=2$ have been given in [15].

Another setting that arises in parametric PDEs is analytic functions which have Legendre expansions (instead of Taylor expansions) with bounds on the size of the Legendre coefficients (see [12]). It would be interesting to formally introduce and study the spaces (analogous to the ${\cal B}_{\rho,p}$ ) associated to such expansions. The functions in these spaces would now be analytic on polyellipses.

Our main vehicle for deriving error estimates for these classes was to use a surrogate norm in place of the $L_{\infty}(Y,X)$ norm. We showed in Theorem 3.1 that for this surrogate norm, our estimates are optimal. It would be very interesting to understand what optimal results would look like in the original $L_{\infty}(Y,X)$ norm, i.e., to prove lower bounds for the approximation rate in the $L_{\infty}(Y,X)$ norm rather than the surrogate norm.

We concentrated on the sequences $\rho(s)$ and $\rho^{*}(s)$ , $s>0$ , since they comply with typical assumptions in applied settings. It is possible to extend these results to more general sequences $\rho$ which eventually behave asymptotically like $\rho(s)$ or $\rho^{*}(s)$ . However, the behavior of the sequence in the preasymptotic regime strongly effects the final decay rate bounds for $\delta_{n}(\rho)$ . For instance, the value $\rho_{j}$ , representing the smoothness of $u$ in the direction $j$ , might remain close to $1$ for arbitrarily many $j$ before eventually growing to $\infty$ . It would be interesting to give bounds for other sequences $\rho$ with polynomial or even exponential growth.

Our formulation of the model classes and our approximation results have been strongly influenced by the works [1, 15, 20, 22]. The paper [16] has a significant intersection with our paper where results analogous to Corollary 5.5 in the case $p=\infty$ are proven.

We next ellaborate on the distinctions between our paper and the results given in [20]. In [20], the authors derive bounds for the approximation of parametric PDEs using Taylor and Legendre series. They work under the assumption that $d<\infty$ , and use analyticity of the parameter-to-PDE-solution map to derive certain upper bounds on the norms of the coefficients in the Legendre and Taylor series expansions of the solution $u$ . In the case of Taylor series, their analysis includes the case when $\|t_{\nu}\|_{X}\leq M\rho^{-\nu}$ , which corresponds to our model classes ${\cal B}_{\rho,\infty}$ . We restrict our further comments to this case. Although their results are only stated for solutions to parametric PDEs, their proofs give the following estimates for functions in ${\cal B}_{\rho,\infty}$ .

Theorem 7.1.

Let $\rho=(\rho_{1},\dots,\rho_{d})$ be a nondecreasing sequence with $\rho_{1}>1$ . Then for any $\sigma>0$ , there exists an $n(d,\sigma)$ such that for all $n\geq n(d,\sigma)$ ,

[TABLE]

holds with $C_{\sigma}:=(4e+4\sigma e-2)\frac{e}{e-1}$ .

If we specialize to the sequence $\rho^{*}(s)$ , $s>0$ , then their result takes the form

[TABLE]

where $C$ has an absolute bound and $c(d,s)$ actually grows with $d$ and $s$ . Note that the bound is subexponential in $n$ , and hence is better than the algebraic rate given in our estimates. The reason for this is the assumption that $d$ is finite. However, we must emphasize that the number $n(d,\sigma)$ grows exponentially in $d$ , and so this result can only be applied when $n$ is very large. We have concentrated on obtaining results that hold for all $n$ and all $d$ with no dependence on $d$ .

The reason for this restriction on $n$ in [20] is that their proof of this theorem utilizes bounds on the number of lattice points $t\mathbb{N}^{d}$ in the simplex $S=S(1/\ln\rho_{1},\dots,1/\ln\rho_{d})$ . Their bound requires that this number behaves like $t^{-d}{\rm meas}(S)$ . As discussed in the remarks following the proof of Lemma 4.4, this asymptotic count on the lattice points is effective only for $t$ small and in turn $n$ prohibitively large.

By contrast, our results given above apply for $d=\infty$ and any $n$ . When $d$ is finite we can always extend the sequence to an infinite sequence in an arbitrary way. In this way our results apply without any restrictions on the size of $n$ relative to $d$ .

8 Appendix: Proofs of Corollaries 5.6 and 5.7

Proof of Corollary 5.6: Let $m=m(n)$ be the largest natural number satisfying (5.23). One can check that for $n\geq 2^{16}$ , we have $m(n)\geq 6$ , and thus it follows from (ii) or Lemma 5.3 that

[TABLE]

which gives $\delta_{n}(\rho^{*}(s))\leq 2^{-m(n)s}$ , and thus $E_{n}({\cal U}_{\rho^{*}(s),1})\leq 2^{-m(n)s}$ . $\Box$

Proof of Corollary 5.7: To show (5.25), we proceed as follows. We consider first the case $n=2^{k}$ , $k\geq 16$ . Let $m$ be the largest non-negative natural number satisfying

[TABLE]

and let $\beta$ be defined by the equation $m=k-\beta\sqrt{k}$ . Since $k\geq 16$ , the largest $m$ that satisfies the above estimate is greater or equal to $6$ . Moreover, we can easily show that $\beta\leq c$ . Therefore, we use the fact that $m=k-\beta\sqrt{k}\geq k-c\sqrt{k}$ and that $k-c\sqrt{k}\geq(1-c/4)k$ for $k\geq 16$ , which gives

[TABLE]

Thus if $C_{1}:=\log_{2}C$ , we have

[TABLE]

It follows (since $m\geq 6$ ) that

[TABLE]

Therefore, (5.11) and the monotonicity of the sequence $(\delta_{n}(\rho^{*}(s)))_{n\geq 1}$ give

[TABLE]

which, according to Remark 5.2 leads to

[TABLE]

Now, if $k\geq 16$ is such that $2^{k}\leq n<2^{k+1}$ , it follows that

[TABLE]

which is (5.25). $\Box$

Acknowledgements The authors would like to acknowledge and thank Matthew Hielsberg for the help in carrying the numerical experiments.

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Bachmayr, A. Cohen and G. Migliorati. Sparse polynomial approximation of parametric elliptic PD Es. Part I: affine coefficients. ESAIM: Mathematical Modelling and Numerical Analysis, 51(1):321–339, 2017.
2[2] J. Beck, F. Nobile, L. Tamellini and R. Tempone. On the optimal polynomial approximation of stochastic PD Es by Galerkin and collocation methods. Mathematical Models and Methods in Applied Sciences, 22(9):1250023, 2012.
3[3] J. Beck, F. Nobile, L. Tamellini and R. Tempone. Convergence of quasi-optimal Stochastic Galerkin methods for a class of PDES with random coefficients. Computers and Mathematics with Applications, 67(4):732–751, 2014.
4[4] A. Beged-Dov. Lower and upper bounds for the number of lattice points in a simplex . SIAM Journal on Applied Mathematics, 22(1):106–108, 1972.
5[5] M. Bieri, R. Andreev, and C. Schwab. Sparse tensor discretizations of elliptic SPD Es . SIAM Journal of Scientific Computation 31(6), 4281–4304, 2009.
6[6] P. Binev, W. Dahmen and R. De Vore. Adaptive finite element methods with convergence rates. Numerische Mathematik, 97(2):219–268, 2004.
7[7] P. Binev, W. Dahmen, R. De Vore and P. Petrushev. Approximation classes for adaptive methods. Serdica Mathematical Journal, 28(4):391–416, 2002.
8[8] A. Bonito, R. De Vore and R. Nochetto. Adaptive finite element methods for elliptic problems with discontinuous coefficients. SIAM Journal on Numerical Analysis, 51(6):3106–3134, 2013.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Polynomial Approximation of Anisotropic Analytic Functions of Several Variables

Abstract

1 Introduction

2 Anisotropic analyticity and motivation

Lemma 2.1**.**

Remark 2.2**.**

3 The approximation of functions in Bρ,p{\cal B}_{\rho,p}Bρ,p​

Theorem 3.1**.**

Corollary 3.2**.**

Remark 3.3**.**

4 The sequence δn(ρ)\delta_{n}(\rho)δn​(ρ)

Lemma 4.1**.**

Remark 4.2**.**

Remark 4.3**.**

Lemma 4.4**.**

5 Analysis of δn(ρ)\delta_{n}(\rho)δn​(ρ) when ρ\rhoρ has polynomial growth

5.1 Exact formulas for #Λ(ε,ρ∗(1))\#\Lambda(\varepsilon,\rho^{*}(1))#Λ(ε,ρ∗(1))

Theorem 5.1**.**

Remark 5.2**.**

5.2 The asymptotic behavior of δn(ρ∗(s))\delta_{n}(\rho^{*}(s))δn​(ρ∗(s))

Lemma 5.3**.**

5.2.1 Bounds for the error En(Uρ∗(s),p)E_{n}({\cal U}_{\rho^{*}(s),p})En​(Uρ∗(s),p​).

Corollary 5.4**.**

Corollary 5.5**.**

Corollary 5.6**.**

Corollary 5.7**.**

6 Finding the set Λ(ε,ρ)\Lambda(\varepsilon,\rho)Λ(ε,ρ)

Remark 6.1**.**

7 Concluding Remarks

Theorem 7.1**.**

8 Appendix: Proofs of Corollaries 5.6 and 5.7

Lemma 2.1.

Remark 2.2.

3 The approximation of functions in ${\cal B}_{\rho,p}$

Theorem 3.1.

Corollary 3.2.

Remark 3.3.

4 The sequence $\delta_{n}(\rho)$

Lemma 4.1.

Remark 4.2.

Remark 4.3.

Lemma 4.4.

5 Analysis of $\delta_{n}(\rho)$ when $\rho$ has polynomial growth

5.1 Exact formulas for $\#\Lambda(\varepsilon,\rho^{*}(1))$

Theorem 5.1.

Remark 5.2.

5.2 The asymptotic behavior of $\delta_{n}(\rho^{*}(s))$

Lemma 5.3.

5.2.1 Bounds for the error $E_{n}({\cal U}_{\rho^{*}(s),p})$ .

Corollary 5.4.

Corollary 5.5.

Corollary 5.6.

Corollary 5.7.

6 Finding the set $\Lambda(\varepsilon,\rho)$

Remark 6.1.

Theorem 7.1.