Improved convergence analysis of Lasserre's measure-based upper bounds for polynomial minimization on compact sets
Lucas Slot, Monique Laurent

TL;DR
This paper extends the convergence rate analysis of Lasserre's polynomial hierarchy for minimizing polynomials over compact sets, achieving faster error bounds for broader classes of sets and measures.
Contribution
It provides new convergence rate bounds for Lasserre's hierarchy on convex bodies and with various reference measures, improving previous estimates.
Findings
Convergence rate of O(1/r^2) for hypercube with Chebyshev measure.
Error estimate of O(log r / r) for convex bodies under certain conditions.
Improved error bounds of O(log^2 r / r^2) for convex bodies with Lebesgue measure.
Abstract
We consider the problem of computing the minimum value of a polynomial over a compact set , which can be reformulated as finding a probability measure on minimizing . Lasserre showed that it suffices to consider such measures of the form , where is a sum-of-squares polynomial and is a given Borel measure supported on . By bounding the degree of by one gets a converging hierarchy of upper bounds for . When is the hypercube , equipped with the Chebyshev measure, the parameters are known to converge to at a rate in . We extend this error estimate to a wider class of convex bodies, while also allowing for a broader class of reference measures, including the Lebesgue measure. Our analysis applies to simplices, balls andā¦
| compact | reference | ||
| General | Borel | [21] | |
| Assumption 1 | Lebesgue | [12] | |
| Convex body | Lebesgue | [8] | |
| Hypersphere | Haar | [15] | |
| Hypersphere | Haar | [11] | |
| Hypercube | Chebyshev | [10] | |
| Hypercube | Thm. 3 | ||
| Ball | Thm. 4 | ||
| Simplex | Lebesgue | Thm. 9 | |
| Ball-like convex body | Lebesgue | Thm. 6 | |
| Global minimizer in the interior | Lebesgue | Thm. 5 | |
| Assumption 1 | Lebesgue | Thm. 10 | |
| Convex body | Thm. 11 |
| Name | Formula | |
|---|---|---|
| Linear | ||
| Quadratic | ||
| Booth | ||
| Matyas | ||
| Camel | ||
| Motzkin |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Improved convergence analysis of Lasserreās measureābased upper bounds for polynomial minimization on compact sets
Lucas Slot &Monique Laurent Centrum Wiskunde & Informatica (CWI), Amsterdam, [email protected]Centrum Wiskunde & Informatica (CWI), Amsterdam and Tilburg University, [email protected]
Abstract
We consider the problem of computing the minimum value of a polynomial over a compact set , which can be reformulated as finding a probability measure on minimizing . Lasserre showed that it suffices to consider such measures of the form , where is a sum-of-squares polynomial and is a given Borel measure supported on . By bounding the degree of by one gets a converging hierarchy of upper bounds ā for . When is the hypercube , equipped with the Chebyshev measure, the parameters are known to converge to at a rate in . We extend this error estimate to a wider class of convex bodies, while also allowing for a broader class of reference measures, including the Lebesgue measure. Our analysis applies to simplices, balls and convex bodies that locally look like a ball. In addition, we show an error estimate in when satisfies a minor geometrical condition, and in when is a convex body, equipped with the Lebesgue measure. This improves upon the currently best known error estimates in and for these two respective cases.
K****eywordsāpolynomial optimization Ā sum-of-squares polynomial Ā Lasserre hierarchy Ā semidefinite programming Ā needle polynomial
AMS subject classification 90C22; 90C26; 90C30
1 Introduction
1.1 Lasserreās measure-based hierarchy
Let be a compact set and let be a polynomial. We consider the minimization problem
[TABLE]
Computing is a hard problem in general, and some well-known problems from combinatorial optimization are among its special cases. For example, it is shown in [24, 13] that the stability number of a graph is given by
[TABLE]
where we take to be the standard simplex in .
Problem (1) may be reformulated as the problem of finding a probability measure on for which the integral is minimized. Indeed, for any such we have . On the other hand, if is a global minimizer of in , then we have , where is the Dirac measure centered at .
Lasserre [21] showed that it suffices to consider measures of the form , where is a sum-of-squares polynomial and is a (fixed) reference Borel measure supported by . That is, we may reformulate (1) as
[TABLE]
For each we may then obtain an upper bound for by limiting our choice of in (3) to polynomials of degree at most :
[TABLE]
Here, denotes the set of all sum-of-squares polynomials of degree at most . We shall also write for simplicity. As detecting sum-of-squares polynomials is possible using semidefinite programming, the program (4) can be modeled as an SDP [21]. Moreover, the special structure of this SDP allows a reformulation to an eigenvalue minimization problem [21], as will be briefly described below.
By definition, we have for all and
[TABLE]
In this paper we are interested in upper bounding the convergence rate of the sequence to in terms of . That is, we wish to find bounds in terms of for the parameter:
[TABLE]
often also denoted for simplicity when there is no ambiguity on .
1.2 Related work
Bounds on the parameter have been shown in the literature for several different sets of assumptions on and . Depending on these assumptions, two main strategies have been employed, which we now briefly discuss.
Algebraic analysis via an eigenvalue reformulation. The first strategy relies on a reformulation of the optimization problem (4) as an eigenvalue minimization problem (see [12, 21]). We describe it briefly, in the univariate case only, for simplicity and since this is the case we need. Let be the (unique) orthonormal basis of w.r.t. the inner product . For each , we then define the (generalized) truncated moment matrix of by setting
[TABLE]
It can be shown that , the smallest eigenvalue of the matrix . Any bounds on the eigenvalues of thus immediately translate to bounds on .
In [10], the authors determine the exact asymptotic behaviour of in the case that is a quadratic polynomial, and , known as the Chebyshev measure. Based on this, they show that and extend this result to arbitrary multivariate polynomials on the hypercube equipped with the product measure . In addition, they prove that for linear polynomials, which thus shows that in some sense quadratic convergence is the best we can hope for. (This latter result is shown in [10] for all measures with Jacobi weight on ).
The orthogonal polynomials corresponding to the measure on are the Chebyshev polynomials of the first kind, denoted by . They are well-studied objects (see, e.g., [25]). In particular, they satisfy the following three-term recurrence relation
[TABLE]
This imposes a large amount of structure on the matrix when is quadratic, which has been exploited in [10] to obtain information on its smallest eigenvalue.
The main disadvantage of the eigenvalue strategy is that it requires the moment matrix of to have a closed form expression which is sufficiently structured so as to allow for an analysis of its eigenvalues. Closed form expressions for the entries of the matrix are known only for special sets , such as the interval , the unit ball, the unit sphere, or the simplex, and only with respect to certain measures.
However, as we will see in this paper, the convergence analysis from [10] in for the interval equipped with the Chebyshev measure, can be transported to a large class of compact sets, such as the interval with more general measures, the ball, the simplex, and āball-likeā convex bodies.
Analysis via the construction of feasible solutions. A second strategy to bound the convergence rate of the parameters is to construct explicit sum-of-squares density functions for which the integral is close to . In contrast to the previous strategy, such constructions will only yield upper bounds on .
As noted earlier, the integral may be minimized by selecting the probability measure , the Dirac measure at a global minimizer of on . When the reference measure is the Lebesgue measure, it thus intuitively seems sensible to consider sum-of-squares densities that approximate the Dirac delta in some way.
This approach is followed in [12]. There, the authors consider truncated Taylor expansions of the Gaussian function , which they use to define the sum-of-squares polynomials
[TABLE]
Setting for carefully selected standard deviation , they show that when satisfies a minor geometrical assumption (AssumptionĀ 1 below), which holds, e.g., if is a convex body or if it is star-shaped with respect to a ball.
In subsequent work [8], the authors show that if is assumed to be a convex body, then a bound in may be obtained by setting . As explained in [8], the sum-of-squares density in this case can be seen as an approximation of the Boltzman density function for , which plays an important role in simulated annealing.
The advantage of this second strategy seems to be its applicability to a broad class of sets with respect to the natural Lebesgue measure. This generality, however, is so-far offset by significantly weaker bounds on . Another main contribution of this paper will be to show improved bounds on for this broad class of sets .
Analysis for the hypersphere. Tight results are known for polynomial minimization on the unit sphere , equipped with the Haar surface measure. Doherty and Wehner [15] have shown a convergence rate in , by using harmonic analysis on the sphere and connections to quantum information theory. In the very recent work [11], the authors show an improved convergence rate in , by using a reduction to the case of the interval and the above mentioned convergence rate in for this case. This reduction is based on replacing by an easy (linear) upper estimator. This idea was already exploited in [12, 8] (where a quadratic upper estimator was used) and we will also exploit it in this paper.
1.3 Our contribution
The contribution of this paper is showing improved bounds on the convergence rate of the parameters for a wide class of sets and measures . It is twofold.
Firstly, we extend the known bound from [10] in for the hypercube equipped with the Chebyshev measure, to a wider class of convex bodies. Our results hold for the ball , the simplex , and āball-likeā convex bodies (see Definition 3) equipped with the Lebesgue measure. For the ball and hypercube, they further hold for a wider class of measures; namely for the measures given by
[TABLE]
on the ball, and the measures
[TABLE]
on the hypercube. Note that for the hypercube, setting yields the Chebyshev measure, and that for both the ball and the hypercube, setting yields the Lebesgue measure. The rate also holds for any compact equipped with the Lebesgue measure under the assumption of existence of a global minimizer in the interior of . These results are presented in SectionĀ 3.
Secondly, we improve the known bounds in and for general compact sets (under AssumptionĀ 1) and convex bodies equipped with the Lebesgue measure, established in [12], [8], respectively. For general compact sets, we prove a bound in , and for convex bodies we show a bound in . These results are exposed in SectionĀ 4.
For our results in SectionĀ 3, we will use several tools that will enable us to reduce to the case of the interval equipped with the Chebyshev measure. These tools are presented in Sections 2 and 3. They include: (a) replacing by an affine linear image of it (SectionĀ 2.3); (b) replacing by an upper estimator (easier to analyze, obtained via Taylorās theorem) (SectionĀ 2.4); (c) transporting results between two comparable weight functions on and between two convex sets which look locally the same in the neighbourhood of a global minimizer (SectionsĀ 3.1, 3.2). In particular, the result of PropositionĀ 1 will play a key role in our treatment.
To establish our results in SectionĀ 4 we will follow the second strategy sketched above, namely we will define suitable sum-of-squares polynomials that approximate well the Dirac delta at a global minimizer. However, instead of using truncations of the Taylor expansion of the Gaussian function or of the Boltzman distribution as was done in [12], [8], we will now use the so-called needle polynomials from [19] (constructed from the Chebyshev polynomials, see SectionĀ 4.1). In TableĀ 1 we provide an overview of both known and new results.
Finally, we illustrate some of the results in Sections 3 and 4 with numerical examples in Section 5.
2 Preliminaries
In this section, we first introduce some notation that we will use throughout the rest of the paper and recall some basic terminology and results about convex bodies. We then show that the error is invariant under nonsingular affine transformations of . Finally, we introduce the notion of upper estimators for . Roughly speaking, this tool will allow us to replace in the analysis of by a simpler function (usually a quadratic, separable polynomial). We will make use of this extensively in both SectionĀ 3 and SectionĀ 4.
2.1 Notation
For , denotes the standard inner product and the corresponding norm. We write for the -dimensional ball of radius centered at . When and , we also write .
Throughout, is always a compact set with non-empty interior, and is an -variate polynomial. We let (resp., ) denote the gradient (resp., the Hessian) of at , and introduce the parameters
[TABLE]
Whenever we write an expression of the form
[TABLE]
we mean that there exists a constant such that for all , where depends only on , and the parameters . Some of our results are obtained by embedding into a larger set . If this is the case, then may depend on as well. If there is an additional dependence of on the global minimizer of on , we will make this explicit by using the notation .
2.2 Convex bodies
Let be a convex body, i.e., a compact, convex set with non-empty interior. We say is an (inward) normal of at if holds for all . We refer to the set of all normals of at as the normal cone, and write
[TABLE]
We will make use of the following basic result.
Lemma 1** (e.g., [2, Prop. 2.1.1]).**
Let be a convex body and let be a continuously differentiable function with local minimizer . Then .
Proof.
Suppose not. Then, by definition of , there exists an element such that . Expanding the definition of the gradient this means that
[TABLE]
which implies for all small enough. But by convexity, contradicting the fact that is a local minimizer of on . ā
The set is smooth if it has a unique unit normal at each boundary point . In this case, we denote by the (unique) hyperplane tangent to at , defined by the equation .
For , we say is of class if there exists a convex function such that and . If is of class for some , it is automatically smooth in the above sense.
We refer, e.g., to [1] for a general reference on convex bodies.
2.3 Linear transformations
Suppose that is a nonsingular affine transformation, given by . If is a sum-of-squares density function w.r.t. the Lebesgue measure on , then we have
[TABLE]
[TABLE]
As a result, the polynomial is a sum of squares density function w.r.t. the Lebesgue measure on . It has the same degree as , and it satisfies
[TABLE]
We have just shown the following.
Lemma 2**.**
Let be a non-singular affine transformation. Write . Then we have
[TABLE]
2.4 Upper estimators
Given a point and two functions , we write if and for all ; we then say that is an upper estimator for on , which is exact at . The next lemma, whose easy proof is omitted, will be very useful.
Lemma 3**.**
Let be an upper estimator for , exact at one of its global minimizers on . Then we have for all .
Remark 1**.**
We make the following observations for future reference.
LemmaĀ 3 tells us that we may always replace in our analysis by an upper estimator which is exact at one of its global minimizers. This is useful if we can find an upper estimator that is significantly simpler to analyse.
- 2.
We may always assume that , in which case for all and . Indeed, if we consider the function given by , then , and for every density function on , we have
[TABLE]
showing that for all .
In the remainder of this section, we derive some general upper estimators based on the following variant of Taylorās theorem for multivariate functions.
Theorem 1** (Taylorās theorem).**
For and we have
[TABLE]
where is the constant from (12).
Lemma 4**.**
Let be a global minimizer of on . Then has an upper estimator on which is exact at and satisfies the following properties:
- (i)
* is a quadratic, separable polynomial.* 2. (ii)
* for all .* 3. (iii)
If , then for all .
Proof.
Consider the function defined by
[TABLE]
which is an upper estimator of exact at by TheoremĀ 1. As we have , is indeed a quadratic, separable polynomial.
As is a global minimizer of on , we know by LemmaĀ 1 that . This means that for all , which proves the second property.
If , we must have , and the third property follows. ā
In the special case that is a ball and has a global minimizer on the boundary of , we have an upper estimator for , exact at , which is a linear polynomial.
Lemma 5**.**
Assume that for some . Then there exists a linear polynomial with on .
Proof.
Write and for simplicity. In view of LemmaĀ 4, we have for all , where is the quadratic polynomial from relation (21). Since is a global minimizer of on , we have by LemmaĀ 1, and thus for some . Therefore we have
[TABLE]
On the other hand, for any we have
[TABLE]
Combining these facts we get
[TABLE]
So is a linear upper estimator of with , as desired. ā
Remark 2**.**
As can be seen from the above proof, the assumption in LemmaĀ 5 that is a global minimizer of on may be replaced by the weaker assumption that .
Finally, we give a very simple upper estimator, which will be used in SectionĀ 4.
Lemma 6**.**
Recall the constant from (12). Let be a global minimizer of on . Then we have
[TABLE]
3 Special convex bodies
In this section we extend the bound from [10] on , when is equipped with the Chebyshev measure , to a broader class of convex bodies and reference measures .
First, we show that, for the hypercube , we still have for all and all measures of the form with . Previously this was only known to be the case when is a linear polynomial. Note that, for , we obtain the Lebesgue measure on . Next, we use this result to show that for all measures on the unit ball of the form with . We apply this result to also obtain when is the Lebesgue measure and is a āball-likeā convex body, meaning it has inscribed and circumscribed tangent balls at all boundary points (see DefinitionĀ 3 below). The primary new tool we use to obtain these results is PropositionĀ 1, which tells us that the behaviour of essentially only depends on the local behaviour of and in a neighbourhood of a global minimizer of on .
3.1 Measures and weight functions
A function is a weight function on if it is continuous and satisfies . A weight function gives rise to a measure on defined by . We note that if , and is a weight function on , it can naturally be interpreted as a weight function on as well, by simply restricting its domain (assuming ). In what follows we will implicitly make use of this fact.
Definition 1**.**
Given two weight functions on and a point , we say that on if there exist constants such that
[TABLE]
If the constant can be chosen uniformly, i.e., if there exists a constant such that
[TABLE]
then we say that on .
Remark 3**.**
We note the following facts for future reference:
- (i)
As weight functions are continuous on the interior of by definition, we always have if . 2. (ii)
If is bounded from below, and is bounded from above on , then we automatically have .
3.2 Local similarity
Assuming that the global minimizer of on is unique, sum-of-squares density functions for which the integral is small should in some sense approximate the Dirac delta function centered at . With this in mind, it seems reasonable to expect that the quality of the bound depends in essence only on the local properties of and around . We formalize this intuition here.
Definition 2**.**
Suppose . Given , we say that and are locally similar at , which we denote by , if there exists such that
[TABLE]
Clearly, for any point .
FigureĀ 1 depicts some examples of locally similar sets.
Proposition 1**.**
Let , let be a global minimizer of on and assume . Let be two weight functions on , respectively. Assume that for all , and that . Then there exists an upper estimator of on which is exact at and satisfies
[TABLE]
for all large enough. Here is the constant defined by (23).
Recall that if is an upper estimator for which is exact at one of its global minimizers, we then have by LemmaĀ 3. PropositionĀ 1 then allows us to bound in terms of . For its proof, we first need the following lemma.
Lemma 7**.**
Let , and assume that . Then any normal vector of at is also a normal vector of . That is,
Proof.
Let . Suppose for contradiction that . Then, by definition of the normal cone, there exists such that . As , there exists for which . Now choose small enough such that . Then, by convexity, we have . Now, we have But, as , this contradicts the assumption that . ā
of PropositionĀ 1.
For simplicity, we assume here , which is without loss of generality by RemarkĀ 1. Consider the quadratic polynomial from (21):
[TABLE]
where is defined in (12). By Taylorās theorem (TheoremĀ 1), we have that for all , and clearly . That is, is an upper estimator for on , exact at (cf. Lemma 4). We proceed to show that
[TABLE]
We start by selecting a degree sum-of-squares polynomial satisfying
[TABLE]
We may then rescale to obtain a density function on w.r.t. by setting
[TABLE]
By assumption, for all . Moreover, for all . This implies that
[TABLE]
and thus it suffices to show that . The key to proving this bound is the following lemma, which tells us that optimum sum-of-squares densities should assign rather high weight to the ball around .
Lemma 8**.**
Let . Then, for any , we have
[TABLE]
Proof.
By LemmaĀ 1, we have and so by LemmaĀ 7. As a result, we have for all (cf. Lemma 4). In particular, this implies that for all and so
[TABLE]
The statement now follows from reordering terms. ā
As , there exists such that . As , there exist , such that for . Set . Choose large enough such that for all , which is possible since tends to [math] as . Then, LemmaĀ 8 yields
[TABLE]
for all . Putting things together yields the desired lower bound:
[TABLE]
for all . ā
Corollary 1**.**
Let , let be a global minimizer of on , and assume that . Let be two weight functions on , respectively. Assume that for all and that . Then there exists an upper estimator of on , exact at , such that
[TABLE]
for all large enough. Here is the constant defined by (24).
3.3 The unit cube
Here we consider optimization over the hypercube and we restrict to reference measures on having a weight function of the form
[TABLE]
with . The following result is shown in [10] on the convergence rate of the bound when using the measure on
Theorem 2** ([10]).**
Let and consider the weight function from (37).
- (i)
If , then we have:
[TABLE] 2. (ii)
If and has a global minimizer on the boundary of , then (38) holds for all .
The key ingredients for claim (ii) above are: (a) when the global minimizer is a boundary point of then has a linear upper estimator (recall LemmaĀ 5), and (b) the convergence rate of (38) holds for any linear function and any (see [10]).
In this section we show TheoremĀ 3 below, which extends the above result to all weight functions with . Following the approach in [10], we proceed in two steps: first we reduce to the univariate case, and then we deal with the univariate case. Then the new situation to be dealt with is when and the minimizer lies in the interior of , which we can settle by getting back to the case through applying PropositionĀ 1, the ālocal similarityā tool, with .
Reduction to the univariate case. Let be a global minimizer of in . Following [10] (recall RemarkĀ 1 and LemmaĀ 4), we consider the upper estimator . This is separable, i.e., we can write , where each is quadratic univariate with as global minimizer over . Let be an optimum solution to the problem (4) corresponding to the minimization of over w.r.t. the weight function . If we set , then is a sum of squares with degree at most , such that . Hence we have
[TABLE]
As a consequence, we need only to consider the case of a quadratic univariate polynomial on . We distinguish two cases, depending whether the global minimizer lies on the boundary or in the interior of . The case when the global minimizer lies on the boundary of is settled by TheoremĀ 2(ii) above, so we next assume the global minimizer lies in the interior of .
Case of a global minimizer in the interior of . To deal with this case we make use of PropositionĀ 1 with , weight function on , and weight function on . We check that the conditions of the proposition are met. As , clearly we have . Further, for any , we have
[TABLE]
for all . As , we also have (see RemarkĀ 3(i)). Hence we may apply PropositionĀ 1 to find that there exists a polynomial upper estimator of on , exact at , and having
[TABLE]
for all large enough. Now, (the univariate case of) TheoremĀ 2(i) allows us to claim , so that we obtain:
[TABLE]
In summary, in view of the above, we have shown the following extension of TheoremĀ 2.
Theorem 3**.**
Let and . Let be a global minimizer of on . Then we have
[TABLE]
The constant involved in the proof of TheoremĀ 3 depends on the global minimizer of on . It is introduced by the application of PropositionĀ 1 to cover the case where lies in the interior of . When (i.e., when corresponds to the Lebesgue measure), one can replace by a uniform constant , as we now explain.
Consider , equipped with the scaled Chebyshev weight . Of course, TheoremĀ 2 applies to this choice of as well. Further, we still have for all . However, we now have a uniform upper bound for on , which means that on (see RemarkĀ 3(ii)). Indeed, we have
[TABLE]
We may thus apply CorollaryĀ 1 (instead of PropositionĀ 1) to obtain the following.
Corollary 2**.**
If is equipped with the Lebesgue measure then
[TABLE]
3.4 The unit ball
We now consider optimization over the unit ball (); we restrict to reference measures on with weight function of the form
[TABLE]
where . For further reference we recall (see e.g. [16, §6.3.2]) or [3, §11]) that
[TABLE]
For the case , we can analyse the bounds and show the following result.
Theorem 4**.**
Let be the unit ball. Let be a global minimizer of on . Consider the weight function from (45) on .
- (i)
If , we have
[TABLE] 2. (ii)
If , we have
[TABLE]
For the proof, we distinguish the two cases when lies in the interior of or on its boundary.
Case of a global minimizer in the interior of . Our strategy is to reduce this to the case of the hypercube with the help of PropositionĀ 1. Set . As , we have . Consider the weight function on , and on the hypercube . Since , we have for all . Furthermore, as , we also have . Hence we may apply PropositionĀ 1 to find a polynomial upper estimator of on , exact at , satisfying
[TABLE]
for all large enough. Here is the constant from (23). Now, TheoremĀ 3 allows us to claim . Hence we obtain:
[TABLE]
As in the previous section, it is possible to replace the constant by a uniform constant in the case that , i.e., in the case that we have the Lebesgue measure on . Indeed, in this case we have , and so in particular . We may thus invoke CorollaryĀ 1 (instead of PropositionĀ 1) to obtain
[TABLE]
and so
[TABLE]
Note that in this case, we do not actually make use of the fact that . Rather, we only need that lies in the interior of and that . As we may freely apply affine transformations to (by LemmaĀ 2), the latter is no true restriction. We have thus shown the following result.
Theorem 5**.**
Let be a compact set, with non-empty interior, equipped with the Lebesgue measure. Assume that has a global minimizer on with . Then we have
[TABLE]
Case of a global minimizer on the boundary of . Our strategy is now to reduce to the univariate case of the interval . For this, we use LemmaĀ 5, which claims that has a linear upper estimator on , exact at . Up to applying an orthogonal transformation (and scaling) we may assume that is of the form . It therefore suffices now to analyze the behaviour of the bounds for the function minimized on the ball . Note that when minimizing on or on the interval the minimum is attained at the boundary in both cases. The following technical lemma will be useful for reducing to the case of the interval .
Lemma 9**.**
Let be a univariate polynomial and let . Then we have
[TABLE]
where is given in (46).
Proof.
Change variables and set for . Then we have
[TABLE]
and Putting things together we obtain the desired result. ā
Let be an optimal sum-of-squares density with degree at most for the problem of minimizing over the interval , equipped with the weight function . Then, its scaling provides a feasible solution for the problem of minimizing over the ball . Indeed, using LemmaĀ 9, we have , and so
[TABLE]
The proof is now concluded by applying TheoremĀ 2(ii).
3.5 Ball-like convex bodies
Here we show a convergence rate of in for a special class of smooth convex bodies with respect to the Lebesgue measure. The basis for this result is a reduction to the case of the unit ball.
We say has an inscribed tangent ball (of radius ) at if there exists and a closed ball of radius such that and . Similarly, we say has a circumscribed tangent ball (of radius ) at if there exists and a closed ball of radius such that and .
Definition 3**.**
We say that a (smooth) convex body is ball-like if there exist (uniform) such that has inscribed and circumscribed tangent balls of radii , respectively, at all points .
Theorem 6**.**
Assume that is a (smooth) ball-like convex body, equipped with the Lebesgue measure. Then we have
[TABLE]
Proof.
Let be a global minimizer of on . We again distinguish two cases depending on whether lies in the interior of or on its boundary.
Case of a global minimizer in the interior of . This case is covered directly by TheoremĀ 5.
Case of a global minimizer on the boundary of . By applying a suitable affine transformation, we can arrange that the following holds: , , is an inward normal of at , and the radius of the circumscribed tangent ball at is equal to 1, i.e., . See FigureĀ 2 for an illustration. Now, as is a global minimizer of on , we have by LemmaĀ 1. But , and so . As noted in RemarkĀ 2, we may thus use LemmaĀ 5 to find that on for some constant . In light of RemarkĀ 1(i), and after scaling, it therefore suffices to analyse the function .
Again, we will use a reduction to the univariate case, now on the interval . For any , let be an optimum sum-of-squares density of degree for the minimization of on with respect to the weight function
[TABLE]
That is, satisfies
[TABLE]
where the first equality relies on TheoremĀ 2(ii). As is a sum-of-squares density on with respect to the Lebesgue measure, we have
[TABLE]
We will now show that, on the one hand, the numerator in (59) has an upper bound in and that, on the other hand, the denominator in (59) is lower bounded by an absolute constant that does not depend on . Putting these two bounds together then yields , as desired.
The upper bound. We make use of the fact that to compute:
[TABLE]
The lower bound. Here, we consider an inscribed tangent ball of at . Say for some . See again FigureĀ 2. We may then compute:
[TABLE]
where the last inequality follows using the fact that for . It remains to show that
[TABLE]
The argument is similar to the one used for the proof of LemmaĀ 8. By (58), there is a constant such that for all . So we have
[TABLE]
which implies for large enough.
This concludes the proof of TheoremĀ 6. ā
Classification of ball-like sets. With TheoremĀ 6 in mind, it is interesting to understand under which conditions a convex body is ball-like. Under the assumption that has a -boundary, the well-known Rolling Ball Theorem (cf., e.g., [17]) guarantees the existence of inscribed tangent balls.
Theorem 7** (Rolling Ball Theorem).**
Let be a convex body with - boundary. Then there exists such that has an inscribed tangent ball of radius for each .
Classifying the existence of circumscribed tangent balls is somewhat more involved. Certainly, we should assume that is strictly convex, which means that its boundary should not contain any line segments. This assumption, however, is not sufficient. Instead we need the following stronger notion of 2-strict convexity introduced in [4].
Definition 4**.**
Let be a convex body with -boundary and let such that and . Assume for all . The set is said to be -strictly convex if the following holds:
[TABLE]
In other words, the Hessian of at any boundary point should be positive definite, when restricted to the tangent space.
Example 1**.**
Consider the unit ball for the -norm:
[TABLE]
Then, is strictly convex, but not 2-strictly convex. Indeed, at any of the points and , the Hessian of is not positive definite on the tangent space. For instance, for , we have and , which vanishes at . In fact, one can verify that does not have a circumscribed tangent ball at any of the points , .
It is shown in [4] that the set of 2-strictly convex bodies lies dense in the set of all convex bodies. For with -boundary, it turns out that -strict convexity is equivalent to the existence of circumscribed tangent balls at all boundary points.
Theorem 8** ([5, Corollary 3.3]).**
Let be a convex body with -boundary. Then is -strictly convex if and only if there exists such that has a circumscribed tangent ball of radius at all boundary points .
Combining Theorems 7 and 8 then gives a full classification of the ball-like convex bodies with -boundary.
Corollary 3**.**
Let be a convex body with -boundary. Then is ball-like if and only if it is -strictly convex.
A convex body without inscribed tangent balls. We now give an example of a convex body which does not have inscribed tangent balls, going back to de Rham [14]. The idea is to construct a curve by starting with a polygon, and then successively ācutting cornersā. Let be the polygon in with vertices and , i.e., a square. For , we obtain by subdividing each edge of into three equal parts and taking the convex hull of the resulting subdivision points (see FigureĀ 3). We then let be the limiting curve obtained by letting tend to . Then, is a continuously differentiable, convex curve (see [6] for details). It is not, however, everywhere. We indicate below some point where no inscribed tangent ball exists for the convex body with boundary .
Consider the point , which is an element of for all . Fix . If we walk anti-clockwise along starting at , the first corner point encountered is , the slope of the edge starting at is and its end point is
[TABLE]
Now suppose that there exists an inscribed tangent ball at the point . Then, , and any point lies outside of the ball , so that
[TABLE]
As is contained in the polygonal region delimited by any , also and thus \big{(}{2k+1\over 3^{k}}\big{)}^{2}+\big{(}{2\over 3^{k}}\big{)}^{2}-{4\epsilon\over 3^{k}}\geq 0. Letting , we get , a contradiction.
3.6 The simplex
We now consider a full-dimensional simplex , equipped with the Lebesgue measure. We show the following.
Theorem 9**.**
Let be a simplex, equipped with the Lebesgue measure. Then
[TABLE]
Proof.
Let be a global minimizer of on . The idea is to apply an affine transformation to whose image is locally similar to at the global minimizer of , after which we may ātransportā the rate from the hypercube to the simplex.
Let be the facet of which does not contain . By reindexing, we may assume w.l.o.g. that . Consider the map determined by and for all , where is the -th standard basis vector of . See FigureĀ 4. Clearly, is nonsingular, and .
Lemma 10**.**
We have for all .
Proof.
By definition of , we have
[TABLE]
and so
[TABLE]
which is an open subset of . But this means that for each there exists such that
[TABLE]
which concludes the proof of the lemma. ā
The above lemma tells us in particular that . We now apply CorollaryĀ 1 with , and weight functions on , respectively. This yields a polynomial upper estimator of on having
[TABLE]
for large enough, using TheoremĀ 3 for the right most equality. It remains to apply LemmaĀ 2 to obtain:
[TABLE]
which concludes the proof of TheoremĀ 9. ā
4 General sets
In this section we analyze the error for a general compact set equipped with the Lebesgue measure. We will show the following two results: when satisfies a mild assumption (AssumptionĀ 1) we prove a convergence rate in (TheoremĀ 10), which improves on the previous rate in from [12], and when is a convex body we prove a convergence rate in (TheoremĀ 11), improving the previous rate in from [8]. As a byproduct of our analysis, we can show the stronger bound when all partial derivatives of of order at most vanish at a global minimizer (see TheoremĀ 14). We begin with introducing AssumptionĀ 1.
Assumption 1**.**
There exist constants such that
[TABLE]
In other words, AssumptionĀ 1 claims that contains a constant fraction of the full ball around for any radius small enough. This rather mild assumption is discussed in some detail in [12]. In particular, it is implied by the so-called interior cone condition used in approximation theory; it is satisfied by convex bodies and, more generally, by sets that are star-shaped with respect to a ball.
Theorem 10**.**
Let be a compact set satisfying AssumptionĀ 1. Then we have
[TABLE]
Theorem 11**.**
Let be a convex body. Then we have
[TABLE]
Outline of the proofs. First of all, if has a global minimizer which lies in the interior of , then we may apply TheoremĀ 5 to obtain a convergence rate in and so there is nothing to prove. Hence, in the rest of the section, we assume that has a global minimizer which lies on the boundary of .
The basic proof strategy for both theorems is to construct explicit sum-of-squares polynomials giving good feasible solutions to the program (4). The building blocks for these polynomials will be provided by the needle polynomials from [19]; these are degree univariate polynomials , parameterized by a constant , that approximate well the Dirac delta at [math] on and , respectively.
For TheoremĀ 10, we are able to use the needle polynomials directly after applying the transform and selecting the value carefully. We then make use of Lipschitz continuity of to bound the integral in the objective of (4).
For TheoremĀ 11, a more complicated analysis is needed. We then construct as a product of univariate well-selected needle polynomials, exploiting geometric properties of the boundary of in the neighbourhood of a global minimizer.
Simplifying assumptions. In order to simplify notation in the subsequent proofs we assume throughout this section that , and , so is a global minimizer of over . As is compact, and in light of LemmaĀ 2, this is without loss of generality.
We now introduce needle polynomials and their main properties in SectionĀ 4.1, and then give the proofs of Theorems 10 and 11 in Sections 4.2 and 4.3, respectively.
4.1 Needle polynomials
We begin by recalling some of the basic properties of the Chebyshev polynomials. The Chebyshev polynomials can be defined by the recurrence relation (8), and also by the following explicit expression:
[TABLE]
From this definition, it can be seen that on the interval , and that is nonnegative and monotone nondecreasing on . The Chebyshev polynomials form an orthogonal basis of with respect to the Chebyshev measure (with weight ) on and they are used extensively in approximation theory. For instance, they are the polynomials attaining equality in the Markov brotherās inequality on , recalled below.
Lemma 11** (Markov Brothersā Inequality; see, e.g., [27]).**
Let be a univariate polynomial of degree at most . Then, for any scalars , we have
[TABLE]
Kroó and Swetits [19] use the Chebyshev polynomials to construct the so-called (univariate) needle polynomials.
Definition 5**.**
For , we define the needle polynomial by
[TABLE]
Additionally, we define the -needle polynomial by
[TABLE]
By construction, the needle polynomials and are squares and have degree . They approximate well the Dirac delta function at [math] on and , respectively. In [26], a construction similar to the needles presented here is used to obtain the best polynomial approximation of the Dirac delta in terms of the Hausdorff distance.
The needle polynomials satisfy the following bounds (see FigureĀ 5 for an illustration).
Theorem 12** (cf. [19, 20, 18]).**
For any and , the following properties hold for the polynomials and :
[TABLE]
As this result plays a central role in our treatment we give a short proof, following the argument given in [22]. We need the following lemma.
Lemma 12**.**
For any , we have
Proof.
Using the explicit expression (76) for , we have
[TABLE]
By concavity of the logarithm, we have
[TABLE]
and so, using the above lower bound on , we obtain
[TABLE]
ā
of TheoremĀ 12.
Properties (80), (84) are clear. We first check (81)-(82). If then , giving by monotonicity of on . Assume now . Then as , and (again by monotonicity), which implies . In addition, since by LemmaĀ 12, we obtain .
We now check (85)-(86). If then follows by monotonicity of on Assume now . Then, and thus T_{2r}^{2}\big{(}{2+h-2t\over 2-h}\big{)}\leq 1. On the other hand, we have T_{2r}^{2}\big{(}{2+h\over 2-h}\big{)}\geq 1, which gives . In addition, as , using again monotonicity of and LemmaĀ 12, we get T_{2r}^{2}\big{(}{2+h\over 2-h}\big{)}\geq T_{2r}^{2}(1+h)\geq{1\over 4}e^{{1\over 2}r\sqrt{h}}, which implies (86). ā
We now give a simple lower estimator for a nonnegative polynomial with . This lower estimator will be useful later to lower bound the integral of the needle and -needle polynomials on small intervals and , respectively.
Lemma 13**.**
Let be a polynomial, which is nonnegative over and satisfies , for all . Let be defined by
[TABLE]
Then for all .
Proof.
Suppose not. Then there exists such that . As on , and for , we have . We find that Now, by the mean value theorem, there exists an element such that . But this is in contradiction with LemmaĀ 11, which implies that ā
Corollary 4**.**
Let , and let as above. Then and for all .
4.2 Compact sets satisfying Assumption 1
In this section we prove TheoremĀ 10. Recall we assume that satisfies AssumptionĀ 1 with constants and . We also assume that is a global minimizer of over , , and , so that . By LemmaĀ 6, we have on . Hence, in view of LemmaĀ 3, it suffices to find a polynomial for each such that and
[TABLE]
The idea is to set and then select carefully the constant . The main technical component of the proof is the following lemma, which bounds the normalized integral in terms of and . For TheoremĀ 10 we only need the case , but allowing permits to show a sharper convergence rate when the polynomial has special properties at the minimizer (see TheoremĀ 14).
Lemma 14**.**
Let and with . Let . Then
[TABLE]
where is a constant depending only on .
Proof.
Set , so that . We define the sets
[TABLE]
Note that by AssumptionĀ 1. For , we have the bounds (by (81), since as ) and . On the other hand, for , we have the bound , but now is exponentially small (by (82)). We exploit this for bounding the integral in (89):
[TABLE]
Combining with the following lower bound on the denominator:
[TABLE]
we get
[TABLE]
It remains to upper bound the last term in the above expression. By (82) we have for any and so
[TABLE]
Furthermore, by LemmaĀ 13, we have for all . Using AssumptionĀ 1 we obtain
[TABLE]
Putting things together yields
[TABLE]
This shows the lemma with the constant . ā
It remains to choose to obtain the polynomials . Our choice here is essentially the same as the one used in [18, 26]. With the next result (applied with ) the proof of TheoremĀ 10 is now complete.
Proposition 2**.**
For and , set and define the polynomial . Then is a sum-of-squares polynomial of degree with and
[TABLE]
Proof.
For sufficiently large, we have and and so we may use LemmaĀ 14 to obtain
[TABLE]
ā
4.3 Convex bodies
We now prove TheoremĀ 11. Here, is assumed to be a convex body, hence it still satisfies AssumptionĀ 1 for certain constants . As before we also assume that is a global minimizer of in , and .
If , then in view of Taylorās theorem (TheoremĀ 1) we know that on . Hence we may apply PropositionĀ 2 (with ) to this quadratic upper estimator of to obtain (recall LemmaĀ 3).
In the rest of this section, we will therefore assume that . In this case, we cannot get a better upper estimator than on , and so the choice of in PropositionĀ 2 is not sufficient. Instead we will need to make use of the sharper -needles . We will show how to do this in the univariate case first.
The univariate case. If is convex with [math] on its boundary, we may assume w.l.o.g. that for some (in which case we may choose ). By using the -needle instead of the regular needle , we immediately get the following analog of LemmaĀ 14.
Lemma 15**.**
Let and . Let and with . Then we have
[TABLE]
where is a universal constant.
Proof.
Same proof as for LemmaĀ 14, using now the fact that on and on from (85) and (86). ā
Since the exponent in (97) now contains the term āā instead of āā, we may square our previous choice of in PropositionĀ 2 to obtain the following result.
Proposition 3**.**
Assume . Set h(r)=\big{(}2{\log(r^{4})\over r}\big{)}^{2}=\big{(}8{\log r\over r}\big{)}^{2} and define the polynomial . Then is a sum-of-squares polynomial of degree satisfying and
[TABLE]
Proof.
For sufficiently large, we have and and so we may use LemmaĀ 15 to obtain
[TABLE]
ā
Since on we obtain , the desired result.
The multivariate case. Let and let be an orthonormal basis of . Then
[TABLE]
is an orthonormal basis, which we will use as basis of .
The basic idea of the proof is as follows. For any , if we minimize in the direction of then we minimize the univariate polynomial , which satisfies: . Hence, by Taylorās theorem, there is a quadratic upper estimator when minimizing in the direction , so that using a regular needle polynomial will suffice for the analysis. On the other hand, if we minimize in the direction , then , since and . As explained above this univariate minimization problem can be dealt with using -needle polynomials to get the desired convergence rate. This motivates defining the following sum-of-squares polynomials.
Definition 6**.**
For we define the polynomial by
[TABLE]
This construction is similar to the one used by Kroó in [18] to obtain sharp multivariate needle polynomials at boundary points of .
Proposition 4**.**
We have and
[TABLE]
Proof.
Note that for any we have and for . The required properties then follow immediately from those of the needle and -needle polynomials discussed in TheoremĀ 12. ā
It remains to formulate and prove an analog of LemmaĀ 14 for the polynomial . Before we are able to do so, we first need a few technical statements. For we define the polytope
[TABLE]
Note that for , the inequalities (103) and (104) can be summarized as
[TABLE]
which means is exponentially small for outside of . When instead , the following two lemmas show that the function value is small.
Lemma 16**.**
Let . Then for all .
Proof.
Let . By expressing in the orthonormal basis from (100), we obtain
[TABLE]
using the definition of for the second inequality. ā
Lemma 17**.**
*Let . Then f(x)\leq\big{(}\beta_{{K},{f}}+n\gamma_{{K},{f}}\big{)}h^{2} Ā for all *
Proof.
Using Taylorās TheoremĀ 1, LemmaĀ 16 and for , we obtain
[TABLE]
ā
We now give a lower bound on (compare to (92)). First we need the following bound on .
Lemma 18**.**
Let . If then we have: .
Proof.
Consider the halfspace . As , we have the inclusion . We show that , implying that . Let . By expressing in the orthonormal basis from (100), we get . Since and , we get and , thus showing . See FigureĀ 6 for an illustration. We may now apply AssumptionĀ 1 to find ā
Lemma 19**.**
Let . Assume that . Then
[TABLE]
Proof.
The integral is equal to
[TABLE]
ā
We are now able to prove an analog of LemmaĀ 14.
Lemma 20**.**
Let and . If then we have
[TABLE]
where is a constant depending only on .
Proof.
Set . By LemmaĀ 17, for all . Moreover, by PropositionĀ 4, we have for all . Hence,
[TABLE]
where . Combining with
[TABLE]
where we use LemmaĀ 19 for the last inequality, we obtain
[TABLE]
This shows the lemma, with the constant . ā
From the preceding lemma we get the following corollary, which immediately implies TheoremĀ 11.
Corollary 5**.**
For any , set and consider the polynomial . Then is a sum-of-squares polynomial of degree , which satisfies and
[TABLE]
Proof.
For sufficiently large, we have and so we may apply LemmaĀ 20, which implies directly
[TABLE]
ā
5 Numerical Experiments
In this section, we illustrate some of the results in this paper with numerical examples. We consider the test functions listed below in Table 2, the latter four of which are well-known in global optimization and also used for this purpose in [12].
We compare the behaviour of the error for these functions on different sets , namely the hypercube, the unit ball, and a regular octagon in . On the unit ball and the regular octagon, we consider the Lebesgue measure. On the hypercube, we consider both the Lebesgue measure and the Chebyshev measure. In each case, we compute the Lasserre bounds of order in the range , corresponding to sos-densities of degree up to .
Computing the bounds. As explained in Section 1, it is possible to compute the degree Lasserre bound by finding the smallest eigenvalue of the truncated moment matrix of , defined by
[TABLE]
assuming that one has an orthonormal basis of w.r.t. the inner product induced by the measure , i.e., such that .
More generally, if we use an arbitrary linear basis of then the bound is equal to the smallest generalized eigenvalue of the system:
[TABLE]
where is the matrix with entries . Note that if the are orthonormal, then is the identity matrix and one recovers the eigenvalue formulation of Section 1. For details, see, e.g., [21].
This formulation in terms of generalized eigenvalues allows us to work with the standard monomial basis of . To compute the entries of the matrices and , we therefore only require knowledge of the moments:
[TABLE]
For the hypercube, simplex and unit ball, closed form expressions for these moments are known (see, e.g., Table 1 in [9]). For the octagon, they can then be computed by triangulation. We solve the generalized eigenvalue problem (121) using the eig function of the SciPy software package.
The linear case. We consider first the linear case and equipped with the Lebesgue measure. Figure 7 shows the values of the parameters and . In accordance with Theorem 3 (and 2.(ii)), it appears indeed that , as suggested by the fact that the parameter approaches a constant value as grows.
The unit ball. Next, we consider the unit ball , again equipped with the Lebesgue measure. Figure 8 shows the values of the ratio
[TABLE]
for . In each case, the ratio (122) appears to tend to a constant value, suggesting that the error has similar asymptotic behaviour for and . This matches the result of Theorem 4 both in the case of a minimizer on the boundary () and in the case of a minimizer in the interior ().
The regular octagon. Consider now the regular octagon (with the Lebesgue measure)
[TABLE]
which is an example of a convex body that is not ball-like (see Definition 3). Note that as a result, the strongest theoretical guarantee we have shown for the convergence rate of the Lasserre bounds on is in (see Theorem 11). Figure 9 shows the values of the ratio
[TABLE]
for . As for the unit ball, the ratio (124) seemingly tends to a constant value for each of the test polynomials. This indicates a similar asymptotic behaviour of the error for and and suggests that the convergence rate guaranteed by Theorem 11 might not be tight in this instance.
The Chebyshev measure Finally, we consider the Chebyshev measure on , which we compare to the Lebesgue measure. Figure 10 shows the values of the fraction
[TABLE]
for . Again, we observe that the fraction (125) appears to tend to a constant value in each case, matching the result of Theorem 3.
6 Concluding remarks
Extension to non-polynomial functions. Throughout, we have assumed that the function is a polynomial. Strictly speaking, this assumption is not necessary to obtain our results. For the results in SectionĀ 3 and in TheoremĀ 11, it suffices that has an upper estimator, exact at one of its global minimizers on , and satisfying the properties given in LemmaĀ 4. In light of Taylorās Theorem, such an upper estimator exists for all . For TheoremĀ 10, it is even sufficient that satisfies for all , where is a constant. That is, it suffices that is Lipschitz continuous on . Finally, as shown in [11, Theorem 10], results on the convergence rate of the bounds for polynomials extend directly to the case of rational functions .
Accelerated convergence results. For the minimization of linear polynomials the convergence rate of the bounds is shown to be in the order for the hypercube [10] and the unit sphere [11]. Hence, for arbitrary polynomials, a quadratic rate is the best we can hope for. On the other hand, if we restrict to a class of functions with additional properties, then a better convergence rate can be shown. Indeed, a faster convergence rate can be achieved when the function has many vanishing derivatives at a global minimizer. We will make use of the following consequence of Taylorās theorem.
Theorem 13** (Taylorās theorem).**
Assume with . Then we have
[TABLE]
for some constant .
Theorem 14**.**
Let () and let be a global minimizer of on . Assume that all partial derivatives vanish for . Then, given any we have
[TABLE]
Proof.
This follows as a direct application of PropositionĀ 2. ā
This applies, e.g., for the univariate polynomial on the interval .
As an application we can answer in the negative a question posed in [10], where the authors asked about the existence of a āsaturation resultā for the convergence rate of the Lasserre upper bounds, namely whether
[TABLE]
Application to the generalized problem of moments (GPM) and cubature rules. As shown in [7] results on the convergence analysis of the bounds have direct implications for the following generalized moment problem (GMP):
[TABLE]
where and are given, and the variable is a Borel measure on . Bounds can be obtained by searching for measures of the form with a given Borel measure on and . Their quality can be analyzed via the parameter
[TABLE]
setting . It is shown in [7] (see also [9]) that, if for all polynomials , then . Hence, our results in this paper imply directly that for general convex bodies and for hypercubes, balls and simplices (recall TableĀ 1 for exact details). An important instance of (GMP) is finding cubature schemes for numerical integration on (see, e.g., [9] and references therein). If form a cubature scheme with positive weights that permits to integrate any polynomial of degree at most on w.r.t. measure , then, as shown in [23], we have
[TABLE]
Hence any upper bound on directly gives an upper bound on the parameter . Conversely, any lower bound on implies a lower bound on , which is the fact used in [10, 11] to show the lower bound for the hypercube and the sphere.
Finally, let us mention that the needle polynomials are used already in [18] to study cubature rules. There, the author considers degree cubature rules for which the sum is polynomially bounded in . For all , define the parameters
[TABLE]
which indicate how densely is distributed at or in , respectively. Kroó [18] shows that if is the set of nodes of a degree cubature rule on a convex body , we then have
[TABLE]
and that, if is a vertex of , we even have
[TABLE]
Although the asymptotic rates here are the same as the ones we find in TheoremĀ 10 and TheoremĀ 11, we are not aware of any direct link between the density of cubature points and the convergence rate of the Lasserre upper bounds.
Some open questions. There are several natural questions left open by this work. The first natural question is whether the convergence rate in can be proved for all convex bodies. So far we can only prove a rate in , but we suspect that the term is just a consequence of the analysis technique used here. The computational results for the octagon in Section 5 seem to support this. Another question is whether this also applies to general compact sets under AssumptionĀ 1, since we know of no example showing this is not possible.
In particular, it is interesting to determine the exact rate of convergence for polytopes. We could so far only deal with hypercubes and simplices. The main tool we used was the ālocal similarityā of the simplex with the hypercube. For a general polytope , if the minimum is attained at a point lying in the interior of or of one of its facets, then we can still apply the ālocal similarityā tool (and deduce the rate). However, at other points (like its vertices) is in general not locally similar to the hypercube, so another proof technique seems needed. A possible strategy could be splitting into simplices and using the known convergence rate for the simplex containing a global minimizer; however, a difficulty there is keeping track of the distribution of mass of an optimal sum-of-squares on the other simplices.
Acknowledgments
This work is supported by the Europeans Unionās EU Framework Programme for Research and Innovation Horizon 2020 under the Marie SkÅodowska-Curie Actions Grant Agreement No 764759 (MINOA).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bonnesen, T., Fenchel, W.: Theory of Convex Bodies. BCS Associates (1987)
- 2[2] Borwein, J., Lewis, A.: Convex Analysis and Nonlinear Optimization. Springer (2006)
- 3[3] Dai, F., Xu, Y.: Approximation Theory and Harmonic Analysis on Spheres and Balls. Springer (2013)
- 4[4] Dalla, L., Hatziafratis, T.: Strict convexity of sets in analytic terms. J. Aust. Math. Soc. 81 (1), 49ā61 (2006)
- 5[5] Dalla, L., Samiou, E.: Curvature and q-strict convexity. BeitrƤge zur Algebra und Geometrie 48 , 83ā93 (2007)
- 6[6] de Boor, C.: Cutting corners always works. Comput. Aided Geom. Design 4 (1-2), 125ā131 (1987)
- 7[7] de Klerk, E., Kuhn, D., Postek, K.: Distributionally robust optimization with polynomial densities: theory, models and algorithms. Mathematical Programming (2019). DOI 10.1007/s 10107-019-01429-5
- 8[8] de Klerk, E., Laurent, M.: Comparison of Lasserreās measure-based bounds for polynomial optimization to bounds obtained by simulated annealing. Mathematics of Operations Research 43 , 1317ā1325 (2017)
