Bernstein-Walsh theory associated to convex bodies and applications to multivariate approximation theory
Len Bos, Norm Levenberg

TL;DR
This paper extends Bernstein-Walsh theorem to multivariate polynomial approximation on convex bodies, providing new insights and validating previous observations in multivariate approximation theory.
Contribution
It introduces a version of Bernstein-Walsh theorem for convex bodies in several complex variables, enhancing understanding of multivariate polynomial approximation.
Findings
Validated and clarified Trefethen's observations in multivariate approximation
Extended Bernstein-Walsh theorem to subclasses of polynomial spaces associated with convex bodies
Provided theoretical foundations for multivariate uniform polynomial approximation
Abstract
We prove a version of the Bernstein-Walsh theorem on uniform polynomial approximation of holomorphic functions on compact sets in several complex variables. Here we consider subclasses of the full polynomial space associated to a convex body P. As a consequence, we validate and clarify some observations of Trefethen in multivariate approximation theory.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometry and complex manifolds · Geometric Analysis and Curvature Flows · Holomorphic and Operator Theory
Bernstein-Walsh theory associated to convex bodies and applications to multivariate approximation theory
L. Bos and N. Levenberg
Abstract.
We prove a version of the Bernstein-Walsh theorem on uniform polynomial approximation of holomorphic functions on compact sets in several complex variables. Here we consider subclasses of the full polynomial space associated to a convex body . As a consequence, we validate and clarify some observations of Trefethen in multivariate approximation theory.
Key words and phrases:
convex body, Bernstein-Walsh, multivariate approximation
1991 Mathematics Subject Classification:
32U15, 32U20, 41A10
*Supported by Simons Foundation grant No. 354549
1. Introduction.
A standard theorem in several complex variables, quantifying the classical Oka-Weil theorem on polynomial approximation – which itself is the multivariate version of the classical Runge theorem for polynomial approximation in the complex plane – is the Bernstein-Walsh theorem:
Theorem 1.1**.**
Let be compact, nonpluripolar and polynomially convex with continuous. Let , and let . Let be continuous on . Then
[TABLE]
if and only if is the restriction to of a function holomorphic in .
Here for compact,
[TABLE]
where is a nonconstant holomorphic polynomial; and for a continuous complex-valued function on ,
[TABLE]
where is the space of holomorphic polynomials of degree at most . See [1] for a survey and history of Theorem 1.1 in both one and several complex variables.
In Trefethen [9], the author gives some evidence for why one might consider non-traditional notions of “degree” of a polynomial in the setting of multivariate approximation theory. More precisely, for certain functions on he compares the approximation numbers where “degree” has three possible meanings: total degree, Euclidean degree, or maximum degree. We clarify this distinction in a more general setting by describing generalizations of the extremal functions associated to subclasses of the full polynomial space . Given a convex body , following Bayraktar [2], we define a extremal function associated to . In Section 2 we list and prove some basic properties of these functions. We state and prove a generalization of Theorem 1.1 in this setting in Section 3. Section 4 recovers the Trefethen cases for by taking appropriate and provides explicit examples of functions comparing rates of approximation.
2. Background: extremal functions.
In what follows, we fix a convex body ; i.e., a compact, convex set in with non-empty interior . Standard examples include the case where
- (1)
is a non-degenerate convex polytope, i.e., the convex hull of a finite subset of in with ; 2. (2)
is the (nonnegative) portion of an ball in , .
As a particular case of (2), with we have where
[TABLE]
We will consider convex bodies with the property that
[TABLE]
Associated with , following [2], we consider the finite-dimensional polynomial spaces
[TABLE]
for . Here . In the case we have , the usual space of holomorphic polynomials of degree at most in . Clearly there exists a minimal positive integer such that . Thus
[TABLE]
We let dim. From (2.2),
[TABLE]
Note it follows from convexity of that
[TABLE]
It suffices to verify this for monomials . Then so and . Thus
[TABLE]
Recall the indicator function of a convex body is
[TABLE]
For the we consider, on with . Define the logarithmic indicator function
[TABLE]
Here for (the components need not be integers). From (2.1), we have
[TABLE]
We will use to define generalizations of the Lelong classes , the set of all plurisubharmonic (psh) functions on with the property that , and
[TABLE]
where is a constant depending on . We remark that, a priori, for a set , one defines the global extremal function
[TABLE]
It is a theorem, due to Siciak and to Zaharjuta (cf., Theorem 5.1.7 in [5]), that for compact, coincides with the function in (1.1). Moreover,
[TABLE]
precisely when is nonpluripolar; i.e., for such that plurisubharmonic on a neighborhood of with on implies .
Define
[TABLE]
and
[TABLE]
Then and . Given , the extremal function of is given by where
[TABLE]
For , we recover . We will restrict to the case where is compact. In this case, Bayraktar [2] proved a Siciak-Zaharjuta type theorem showing that can be obtained using polynomials. Note that for .
Proposition 2.1**.**
Let be compact and nonpluripolar. Then
[TABLE]
pointwise on where
[TABLE]
If is continuous, the convergence is locally uniform on .
It follows that where
[TABLE]
is the polynomial hull of . Also, either , which occurs if and only if is pluripolar, or .
Example 2.2**.**
Let , the unit torus in . Then
[TABLE]
This is Example 2.3 in [2]. There it is stated only for a convex polytope. We give an alternate proof in Proposition 2.4.
From Proposition 2.1 we have a Bernstein-Walsh inequality.
Proposition 2.3**.**
Let be nonpluripolar. Then for ,
[TABLE]
In particular, if is continuous, for
[TABLE]
is an open neighborhood of and for ,
[TABLE]
Many of the results in Chapter 5 of [5] remain valid for extremal functions. From the definition of and Example 2.2, we obtain:
- (1)
If are compact sets with and , then ; 2. (2)
for compact, is lower semicontinuous; 3. (3)
for compact, if then is continuous (on ); 4. (4)
for compact, where .
For , we say a compact set is regular if is continuous on ; i.e., if (equivalently, on ). Given a convex body , we call a compact set regular if is continuous on ; i.e., if . Since , for each we have
[TABLE]
[TABLE]
Hence . It follows that if is regular, then is regular for any convex body .
The proofs of Propositions 5.3.12, 5.3.14 and Corollary 5.3.13 in [5] carry over in this setting to show that is regular for a bounded open set with boundary. Hence for any compact set , one can find a decreasing sequence of bounded open sets with smooth boundaries (thus is regular) such that .
We end this section with a result on extremal functions for product sets. This will be useful in Section 4. Before proceeding we require a definition: we call a convex body a lower set if for each , whenever we have for all .
Proposition 2.4**.**
Let be a lower set and let be compact and nonpolar. Then
[TABLE]
Proof.
For simplicity, we do the case . Thus let be compact sets and let be the support function of . From properties (1) and (4) of extremal functions, we can assume that are regular in with and that is regular in .
To see that
[TABLE]
since , it suffices to show that . From the definition of ,
[TABLE]
which is an upper envelope of locally bounded above plurisubharmonic functions. As is convex and are continuous, is continuous. Finally, since as and as , it follows that .
To prove the reverse inequality, and hence (2.5), we modify the proof of Theorem 5.1.8 in [5]. Let be probability measures on such that and are Bernstein-Markov pairs: thus, given , there exists a positive constant such that
[TABLE]
for all . Any compact set admits a measure so that is a Bernstein-Markov pair; cf., [4]. Let with . Given an orthonormal basis in and in for the univariate polynomials, where deg and deg, we can write
[TABLE]
where
[TABLE]
for all . The lower set property of implies that for . Using (2.6),
[TABLE]
so that, using the univariate Bernstein-Walsh estimates, i.e., (2.4) for (),
[TABLE]
for all . Thus, using (2.2),
[TABLE]
Now
[TABLE]
so that, since
[TABLE]
we have
[TABLE]
[TABLE]
The result follows, using (2.3), upon letting .
∎
In particular, this gives (another) proof of Example 2.2.
Remark 2.5**.**
For , let . Let
[TABLE]
be the polar of and let be the dual norm defined by ; i.e.,
[TABLE]
Thus is the unit ball in this norm. Then we can write (2.5) as
[TABLE]
3. Bernstein-Walsh theorem.
As in the previous section, we fix a convex body . We prove a Bernstein-Walsh theorem in this setting. Given a compact set , for a continuous complex-valued function on we define
[TABLE]
Theorem 3.1**.**
Let be compact and regular. Let , and let . Let be continuous on .
- (1)
If is a lower set and is the restriction to of a function holomorphic in , then
[TABLE] 2. (2)
If , implies is the restriction to of a function holomorphic in .
Proof.
(2). Suppose
[TABLE]
for some . We show that if satisfies , then the series converges uniformly on compact subsets of to a holomorphic function which agrees with on . To this end, choose with ; by hypothesis the polynomials satisfy
[TABLE]
for some . Now let , and apply (2.4) to the polynomial to obtain
[TABLE]
[TABLE]
Since and were arbitrary numbers satisfying , we conclude that converges locally uniformly on to a holomorphic function . From (3.1), on . ∎
To verify (1) we will follow Bloom’s reasoning in [3]. Note that is a finite-dimensional complex vector space; we call its dimension (see Remark 2.3). We begin with the key lemma. Fix where is as in (2.1) and let be a basis for . For define
[TABLE]
Lemma 3.2**.**
Let be a lower set and let be holomorphic in a neighborhood of . Then for each positive integer , there exists such that for all ,
[TABLE]
where is a constant independent of .
Proof.
We show for , an integer multiple of , that there exists such that for all ,
[TABLE]
where is independent of . To this end, let via
[TABLE]
Then is a subvariety of and is a subvariety of the polydisk
[TABLE]
Choose so that is holomorphic on a neighborhood of . Let be holomorphic in a neighborhood of such that on . That such an exists follows from Theorem 8.2 in [6]; see Remark 3.3 below. Define
[TABLE]
Thus . Let
[TABLE]
be the Taylor series of about . By the Cauchy estimates on , for each multiindex we have
[TABLE]
Given a positive integer , we let
[TABLE]
be the Taylor polynomial of degree at most of at . Then for , let
[TABLE]
where . It follows that because and is a lower set. Since , we have
[TABLE]
To obtain the desired estimate, note first that
[TABLE]
Choose with and so that
[TABLE]
Then
[TABLE]
[TABLE]
where .
∎
Remark 3.3**.**
A version of this lemma was proved in [8] using the Oka extension theorem: instead of the mapping via , one considers via ; the Oka result provides the existence of holomorphic on a neighborhood of with on a neighborhood of (cf., section 3.3 of [6]). We need to avoid using this Oka map as not all powers of the coordinates of may be included in our spaces. Here, to apply the result in [6], we need to be one-to-one on . This follows since implies so that the coordinate functions belong to and form a basis for .
We want to construct polynomials so that for large the sets approximate the sublevel sets of . To this end, recall for compact, we defined
[TABLE]
From Proposition 2.1, we have for compact and nonpluripolar,
[TABLE]
pointwise on ; and if is continuous, the convergence is locally uniform on . We assume continuity of in Theorem 3.1. We will use Fekete points and Lagrange interpolating polynomials to prove our results. To this end, let be basis monomials for where dim and let be Fekete points of order for ; i.e.,
[TABLE]
is maximal among all tuples of points in . Then the fundamental Lagrange interpolating polynomials
[TABLE]
( in the th slot) form a basis for with the additional properties that
- (1)
; hence 2. (2)
for ; while 3. (3)
for .
This final property follows from the Lagrange interpolation formula: for any defined on , the Lagrange interpolating polynomial for with nodes is
[TABLE]
In particular, for ,
[TABLE]
Thus if, in addition, ,
[TABLE]
For we have
[TABLE]
Defining
[TABLE]
we have since . Note depends on while does not. From (3), we get a reverse-type inclusion for large:
Lemma 3.4**.**
Given , there exists such that for all ,
[TABLE]
Proof.
We have
[TABLE]
Thus if ,
[TABLE]
for where depends on . Since we assume is continuous, we can choose independent of for in a compact set; e.g., for . Now take .
∎
The following result proves the “if” direction of Theorem 3.1.
Proposition 3.5**.**
Let be a lower set and let be compact and regular. Let , and let be holomorphic on . Then for any the Lagrange interpolating polynomials for associated with a Fekete array for satisfy
[TABLE]
where is a constant independent of .
Proof.
Choose with . By Lemma 3.4 for all sufficiently large we have
[TABLE]
Here is defined in (3.2). Fix such an . By Lemma 3.2 there exists with
[TABLE]
for all . Let . Since ,
[TABLE]
But
[TABLE]
Recall that for all and . Thus, since for all , from (3.3),
[TABLE]
Thus for all sufficiently large,
[TABLE]
Now since (for all ), again by (3.3) we have
[TABLE]
Thus for large we obtain
[TABLE]
so that, indeed,
[TABLE]
∎
4. Applications.
In this section we make the connection between Theorem 3.1 and Trefethen’s work [9], where he introduces a new notion of degree for a polynomial , the Euclidean degree, which we may write as
[TABLE]
For a convex body, we may define an associated “norm” for via the Minkowski functional
[TABLE]
We remark that this defines a true norm on all of if is the positive “octant” of a centrally symmetric convex body i.e., . We may thus define a general degree associated to the convex body as
[TABLE]
Then
[TABLE]
For , if we let
[TABLE]
be the portion of an ball then we have, in the notation of [9],
[TABLE]
Further, if we let , then if , from (2.7),
[TABLE]
For the particular product set, where for that Trefethen considers, and hence we have
[TABLE]
Further, for continuous and complex-valued on we define (as before) the approximation numbers,
[TABLE]
Essentially, [9] compares approximation numbers for and in different dimensions and notes the different rates of decay for holomorphic functions. Our Theorem 3.1 explains this behavior, precisely and in greater generality.
Example 4.1**.**
Consider the multivariate Runge-type function
[TABLE]
where and (cf. (2.1) of [9]). This function is holomorphic except on its singular set
[TABLE]
an algebraic variety having no real points.
By Theorem 3.1, the approximation numbers decay like iff is holomorphic in the set
[TABLE]
In other words, decays like where
[TABLE]
It is easy to see that
[TABLE]
We note at this point the following elementary fact.
Lemma 4.2**.**
Suppose that Then
[TABLE]
We now compute the values of for Specifically
Lemma 4.3**.**
For (corresponding to the total degree case)
[TABLE]
Proof.
In this case
[TABLE]
Now, as is well known, the level sets of the univariate extremal function are confocal ellipses. Specifically, for
[TABLE]
is the ellipse with and The degenerate case with corresponds to
The interior and exterior of the ellipse are given by the sublevel and suplevel sets
[TABLE]
For convenience, set so that the singular set
[TABLE]
For the particular case of it is easy to check that and Hence if we have and iff It follows that for we have
[TABLE]
and iff Consequently, for a point on the singular set i.e., with (and hence ), implies that for some Thus the minimum of is as claimed, and is attained for ∎
Lemma 4.4**.**
(Characterization of Lagrange Critical Points) Suppose that (i.e., ). Then, in the minimization problem
[TABLE]
the critical points are characterized by the condition
[TABLE]
for every pair
Proof.
We consider the objective function
[TABLE]
where and separate the constraint into its real and imaginary parts as
[TABLE]
where we have written
Then we may calculate
[TABLE]
and
[TABLE]
If we write the Lagrange multiplier conditions for a critical point as
[TABLE]
then critical points are characterized by
[TABLE]
Treating the gradients as column vectors this latter condition may be expressed in matrix form as
[TABLE]
iff
[TABLE]
iff
[TABLE]
Consequently,
[TABLE]
for
We now proceed to calculate . To this end, write
[TABLE]
with Note that
[TABLE]
Then
[TABLE]
and, similarly,
[TABLE]
Hence,
[TABLE]
Therefore
[TABLE]
Substituting this into the critical point condition (4.4) and taking conjugates gives the result. ∎
Lemma 4.5**.**
For (corresponding to the tensor-product (max) degree case)
[TABLE]
Proof.
By Lemma 4.4 for critical points outside are characterized by such that and
[TABLE]
Squaring, we see that it is necessary that
[TABLE]
However,
[TABLE]
and so either or else
To complete the proof that the minimum of on the singular set is indeed as claimed, we proceed by induction on the dimension For dimension there is nothing to do. Hence, suppose that the result holds for any and dimension strictly less that We must show that it also holds in dimension First note that for and and hence To show the reverse inequality there are three possibilities to consider:
- (1)
is a critical point for which 2. (2)
is a critical point for which there is a pair such that 3. (3)
is a boundary point, i.e, for some
In case (1) we have i.e.,
[TABLE]
The value of in this case is
[TABLE]
But
[TABLE]
as the first term is the solution of the ODE and the second of the ODE with higher growth factor: Thus such critical points are not candidates for the minimum.
In case (2) we may suppose that we have Then for
[TABLE]
and so by the -dimensional case and the fact that
[TABLE]
Hence neither are such critical points candidates for the minimum.
Finally, for case (3), a boundary point has at least one of its coordinates in Without loss of generality we may assume that Then and iff iff with In other words is on the -dimensional singular set Hence, by the -dimensional case the minimum of (with ) is Clearly, this is minimized for in which case and we are done. ∎
Lemma 4.6**.**
Suppose that and let Then
[TABLE]
Proof.
We first prove the case. By Lemma 4.4 the critical points are characterized by the condition
[TABLE]
We may assume that
[TABLE]
so that
[TABLE]
But on the singular set so we have
[TABLE]
Setting we have
[TABLE]
which, upon dividing, becomes
[TABLE]
i.e., after dividing by
[TABLE]
Then, cross-multiplying, we have
[TABLE]
Since the discriminant of this quadratic is it follows that and hence as well.
We now analyze the various possibilities for the minimum. Consider first a boundary point where one of the coordinates, say is in In that case and
[TABLE]
But
[TABLE]
so that and
[TABLE]
which is clearly minimized when in which case
[TABLE]
Thus is the minimum value of over boundary points.
Consider now the critical points. As reported above, in this case we must have If say then just as in the boundary case
[TABLE]
and
[TABLE]
and so this case is not a candidate for the minimum. Hence we assume that both and Since the univariate extremal function is invariant under we may write with The critical point condition (4.6) then reduces to
[TABLE]
But the function
[TABLE]
is the product of two positive strictly decreasing functions; the function in parentheses being the average over the interval of a strictly decreasing function. Hence is also strictly decreasing and the critical point condition therefore requires that As indeed
At this critical point
[TABLE]
We claim that this latter quantity is greater than Indeed, taking logarithms, we claim that
[TABLE]
or, equivalently, that
[TABLE]
Consider
[TABLE]
for Then
[TABLE]
as the integrand is decreasing. Hence is increasing and, in particular, and the Lemma is proved for the case.
For (), the monotonicity of norms implies that
[TABLE]
and so
[TABLE]
From Lemma 4.5 and the case,
[TABLE]
and the result follows. ∎
As
[TABLE]
the approximation order of the Euclidean degree is considerably higher than for the total degree, while the use of tensor-product degree provides no additional advantage, as reported in [9].
It is also interesting to note that decreases to as the dimension increases to while for , is independent of the dimension indicating that the rate of polynomial approximation using the total degree degenerates for higher dimensions while for the Euclidean and tensor-product degree it does not.
However this is not a completely fair comparison. The dimension of the spaces are proportional (asymptotically) to the volume indeed,
[TABLE]
To equalize their dimensions we may scale by
[TABLE]
For example, for in the Euclidean case we have
[TABLE]
By Lemma 4.2 we then compare
[TABLE]
We note that for “small” () and so the Euclidean degree, even in the dimension normalized case, has a better approximation order than the total degree case, albeit with a lesser advantage. For example, for
[TABLE]
Further, for “large” (), so that then the total degree provides a better order of approximation.
Example 4.7**.**
Consider now the bivariate function
[TABLE]
for and This has a single real pole at and complex singular set
[TABLE]
Lemma 4.8**.**
We have
[TABLE]
Proof.
Consider first the case where
[TABLE]
We are claiming that
[TABLE]
As discussed for Example 1, the level set \bigl{|}z_{j}+\sqrt{z_{j}^{2}-1}\bigr{|}=\alpha is the ellipse with
[TABLE]
Now if then
[TABLE]
implies that and consequently that But then is such that
[TABLE]
i.e.,
Similarly, one may show that if then Consequently the minimum is for for example
[TABLE]
We next consider the case where and
[TABLE]
By calculations entirely analogous to those of the first example, the Lagrange multiplier critical points for are characterized by the condition that
[TABLE]
from which it follows upon squaring that Applying the constraint results in specific values for these critical points and their corresponding function values can be shown by elementary (but lengthy!) calculations to not be candidates for the minimum.
There remains the case of a boundary point, when of one of If then \log\bigl{|}z_{1}+\sqrt{z_{1}^{2}-1}\bigr{|}=0 and then for \log\bigl{|}\alpha-z_{1}+\sqrt{(\alpha-z_{1})^{2}+1}\bigr{|} is minimized by for which
[TABLE]
On the other hand, if then \log\bigl{|}z_{2}+\sqrt{z_{2}^{2}-1}\bigr{|}=0 and It is easy to see that then \log\bigl{|}z_{1}+\sqrt{z_{1}^{2}-1}\bigr{|} is minimized for i.e., But the ellipse with has semi-major axis for Hence and, in this case,
[TABLE]
and this is not a candidate for the minimum.
∎
Again we have (note that, by the monotonicity of norms, we have and so, at best, ). However the gain in approximation order is much less. Indeed, if we write then
[TABLE]
Further, if we normalize the area of to make the dimesnions of the spaces comparable, we obtain
[TABLE]
even for small In other words, the total degree is then, in this sense, the better option.
Example 4.9**.**
In Example 4.1, and from the numerical evidence, also in Example 4.7, it is the case that
[TABLE]
indicating that there is no advantage in using the tensor-product degree over the Euclidean degree. This is not always the case. Indeed, consider the function
[TABLE]
We report the numerical result that for we obtain
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. Bagby and N. Levenberg, Bernstein theorems, New Zealand J. Math. , 22 (1993), no. 1, 1-20.
- 2[2] T. Bayraktar, Zero distribution of random sparse polynomials, ar Xiv:1503.00630 v 4, to appear in Mich. Math. J.
- 3[3] T. Bloom, On the convergence of multivariable Lagrange interpolants, Constr. Approx. , 5 (1989), 415-435.
- 4[4] T. Bloom and N. Levenberg, Pluripotential energy and large deviation, Indiana Univ. Math. J. , 62 (2013), no. 2, 523-550.
- 5[5] M. Klimek, Pluripotential Theory , Oxford Univ. Press, 1991.
- 6[6] T. Nishino, Function theory in several complex variables , AMS, Providence, RI, 1996.
- 7[7] E. Saff and V. Totik, Logarithmic potentials with external fields , Springer-Verlag, Berlin, 1997.
- 8[8] J. Siciak, Extremal plurisubharmonic functions in ℂ N superscript ℂ 𝑁 {\mathbb{C}}^{N} , A nn. Polon. Math. 3 9 (1981), 175-211.
