On the exactness of Lasserre relaxations for compact convex basic closed semialgebraic sets
Markus Schweighofer, Tom-Lukas Kriel

TL;DR
This paper proves that for certain convex semialgebraic sets satisfying specific polynomial conditions, the Lasserre relaxation is exact, simplifying the process of obtaining semidefinite representations.
Contribution
The authors demonstrate that under natural convexity and second order quasiconcavity conditions, a single Lasserre relaxation suffices for exact representation, improving upon previous non-constructive methods.
Findings
Lasserre relaxation is exact for the specified convex sets.
The approach simplifies previous semidefinite representation methods.
Conditions include second order strict quasiconcavity and the Archimedean property.
Abstract
Consider a finite system of non-strict real polynomial inequalities and suppose its solution set is convex, has nonempty interior and is compact. Suppose that the system satisfies the Archimedean condition, which is slightly stronger than the compactness of . Suppose that each defining polynomial satisfies a second order strict quasiconcavity condition where it vanishes on (which is very natural because of the convexity of ) or its Hessian has a certain matrix sums of squares certificate for negative-semidefiniteness on (fulfilled trivially by linear polynomials). Then we show that the system possesses an exact Lasserre relaxation. In their seminal work of 2009, Helton and Nie showed under the same conditions that is the projection of a spectrahedron, i.e., it has a semidefinite representation. The semidefinite representation used by Helton…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Matrix Theory and Algorithms · Polynomial and algebraic computation
On the exactness of Lasserre relaxations for
compact convex basic closed semialgebraic sets
Tom-Lukas Kriel
Fachbereich Mathematik und Statistik, Universität Konstanz, 78457 Konstanz, Germany
and
Markus Schweighofer
Fachbereich Mathematik und Statistik, Universität Konstanz, 78457 Konstanz, Germany
(Date: January 29, 2018)
Abstract.
Consider a finite system of non-strict real polynomial inequalities and suppose its solution set is convex, has nonempty interior and is compact. Suppose that the system satisfies the Archimedean condition, which is slightly stronger than the compactness of . Suppose that each defining polynomial satisfies a second order strict quasiconcavity condition where it vanishes on (which is very natural because of the convexity of ) or its Hessian has a certain matrix sums of squares certificate for negative-semidefiniteness on (fulfilled trivially by linear polynomials). Then we show that the system possesses an exact Lasserre relaxation.
In their seminal work of 2009, Helton and Nie showed under the same conditions that is the projection of a spectrahedron, i.e., it has a semidefinite representation. The semidefinite representation used by Helton and Nie arises from glueing together Lasserre relaxations of many small pieces obtained in a non-constructive way. By refining and varying their approach, we show that we can simply take a Lasserre relaxation of the original system itself. Such a result was provided by Helton and Nie with much more machinery only under very technical conditions and after changing the description of .
Key words and phrases:
moment relaxation, Lasserre relaxation, basic closed semialgebraic set, sum of squares, polynomial optimization, semidefinite programming, linear matrix inequality, spectrahedron, semidefinitely representable set
2010 Mathematics Subject Classification:
Primary 14P10, 52A20; Secondary 13J30, 52A41, 90C22, 90C26
1. Introduction
Throughout the article, and denote the set of positive and nonnegative integers, respectively. We fix and denote by a tuple of variables. We denote by the polynomial ring in these variables over . For , we denote and . For with all , the degree of is defined as if and if . For each , we consider the real vector space
[TABLE]
of all polynomials of degree at most . We admit here real numbers for technical reasons but note that for all and for all . Occasionally, we will need the real polynomial ring in one variable as an auxiliary tool, and we will denote it by . We will denote the identity matrix by .
For a tuple of polynomials, the set
[TABLE]
is called a basic closed semialgebraic set [PD, Def. 2.1.1]. Boolean combinations of such sets are called semialgebraic sets [PD, Def. 2.1.4]. The finiteness theorem from real algebraic geometry says that every closed semialgebraic set is a finite union of basic closed ones [PD, Thm. 2.4.1]. In general, it is hard to answer questions about the geometry from its description . This is of course due to the nonlinear monomials with that might appear in . An extremely naive idea would be to replace each such nonlinear monomial in by a new variable . This would lead to a system of linear inequalities whose solution set is a (closed convex) polyhedron in a higher-dimensional space. The projection of this polyhedron to the -space contains but will very often just be the whole of and thus be of no help.
This idea becomes however less naive if we add a bunch of redundant inequalities before the linearization. For example, we could add certain inequalities of the form or with . If we choose finitely many such inequalities in a clever way and then linearize as above, we will get a polyhedron in a higher-dimensional space whose projection to -space might enclose more tightly. Unless happens to be a polyhedron, this projection can however still not equal since projections of polyhedra are again polyhedra (see [Scr, Subsection 12.2] for a textbook reference).
The idea of Lasserre was therefore to add the whole (infinite) family of all redundant inequalities of the form or with before the linearization [L1, L2]. To get something that is useful in practice (for example, one would like to avoid using infinitely many of the new variables ), he restricted the degree of the polynomials of the added redundant inequalities.
Therefore fix a degree bound and set . For each with , fix a (column) vector whose entries are the different monomials of degree at most
[TABLE]
and set . Note that in the case , is negative, and consequently and is the empty vector. This case is usually avoided in practice and in the literature by assuming large enough but we think it is more convenient to admit it. In the pathological case , we set , and let again be the empty vector. Then
[TABLE]
and
[TABLE]
The key observation is that instead of linearizing each with individually, we can just linearize the symmetric matrix polynomial . In this way, we get for each a linear symmetric matrix polynomial . Instead of an infinite family of linear inequalities, we thus get finitely many linear matrix inequalities [BEFB] (whose size depends on ) saying that
[TABLE]
where and “” means positive semidefiniteness. By defining with as the block diagonal matrix with blocks , we could even combine this into a single linear matrix inequality
[TABLE]
Its solution set is a spectrahedron [Vin] (in particular a semialgebraic closed convex subset of ) that projects down to the convex set
[TABLE]
The description of is called the degree Lasserre relaxation of (or of the system of polynomial inequalities given by ). By abuse of language, we call sometimes itself the degree Lasserre relaxation of . By construction, it is clear that each is convex and
[TABLE]
If happens to be convex, there is a certain hope that equals for all large enough. In this case, we say that (or the system of polynomial inequalities given by ) has an exact Lasserre relaxation.
In this article, we provide a new sufficient criterium for to have an exact Lasserre relaxation. To the best of our knowledge this is the strongest result currently available for convex .
If is not convex, one can still ask whether equals eventually the convex hull of . This seems to require very different techniques and will be studied in our forthcoming work [KS], see also Example 4.10 below.
Here we will also not address the important question asking from what on equals in case has an exact Lasserre relaxation. In principle, a corresponding complexity analysis of our proof would probably be possible but would, at least for general , be extremely tedious, and in the end yield a bound that is only of theoretical interest.
The Lasserre relaxation is a special case of the more general semidefinite representation of a subset
[TABLE]
where is a symmetric linear matrix polynomial for some . Sets having such a representation are called semidefinitely representable. Other commonly used terms are projections of spectrahedra, spectrahedral shadows, spectrahedrops, lifted LMI sets and SDP-representable sets. If the number of additional variables is not too large, one can optimize efficiently linear functions on such sets by the use of semidefinite programming, an important generalization of linear programming [NN]. Semidefinitely representable sets are obviously convex and they are semialgebraic by Tarski’s real quantifier elimination [PD, Thm. 2.1.6]. The class of semidefinitely representable sets is closed under many operations like for example taking the interior [Net]. It was asked by Nemirovski in his plenary address at the 2006 International Congress of Mathematicians in Madrid whether each convex semialgebraic set is semidefinitely representable [Nem, Subsection 4.3.1]. Helton and Nie conjectured the answer to be positive [HN2, Section 6]. In two seminal works, Scheiderer proved this conjecture for [S1, Theorem 6.8] and very recently disproved it for each [S2, Remark 4.21].
In [NPS, Theorem 3.5], it has been shown that cannot have an exact Lasserre relaxation if is convex, has nonempty interior and has at least one non-exposed face. Other obstructions to exactness have been given by Gouveia and Netzer [GN], see Theorem 4.9 below.
On the positive side, the breakthrough was the seminal work of Helton and Nie [HN2] from 2009 preceded by their earlier work [HN1], which curiously appeared later. We will the summarize the strategy behind their approach, which builds on ideas of Lasserre [L2], and indicate where this paper introduces advantageous modifications:
Let and suppose is convex and has nonempty interior. We will introduce in Definition 2.10 below the -truncated quadratic module associated to . It consist of the sums of polynomials with (or equivalently , see Equation (1) above). As explained above, these were the polynomials that we add before the linearization when we build the degree Lasserre relaxation. The following fact is good to know although we will need from it only the trivial “if” part in order to prove our Main Theorem 4.8: We have if and only if all (i.e., all linear polynomials) that are nonnegative on lie in , see Proposition 2.13 below.
Denoting by the quadratic module generated by introduced in Definition 2.10 below, one deduces from this (due to the compactness of ) a trivial necessary condition for having an exact Lasserre relaxation: For each , there is an such that . If satisfies this condition, one says that is Archimedean, see Proposition 2.7(d) below. This condition is unfortunately stronger than compactness of . In practice, this is however not too important, since a small change of the description of always makes Archimedean if is compact, see Remark 2.9 below.
Therefore suppose for the rest of the introduction that is Archimedean.
We saw that it suffices to look at those nonnegative on whose real zero set is a supporting hyperplane of the convex set . By Putinar’s Positivstellensatz from 1993 (see [Put, Lemma 4.1], [PD, Thm. 5.3.8], [Mar, Cor. 5.6.1], [Lau]), we know that each positive on lies in . However, this is not really what we need here. The advantage we have is that we need to consider only , i.e., only linear polynomials. The problem we have to fight is however that we have only nonnegative on and, most importantly, we need a uniform degree bound for which all such are in one and the same . Such degree bounds are known for polynomials positive on but depend on a measure of how close comes to have a zero on [NS’, Theorem 6].
Lasserre [L2] made a first key observation to deal with this problem: He considered without loss of generality only such nonnegative on that vanish in at least one point (and whose real zero set therefore defines a supporting hyperplane at the point of the convex set unless ). Under a very restrictive condition, namely that the Hessians of the defining polynomials have a certain matrix sums-of-squares (sos for short) representation (and in particular, are globally concave, which is still very restrictive), he showed that he can produce from this finitely many matrix sos representations by the use of Karush–Kuhn–Tucker (KKT) multipliers (the Lagrange multiplier technique for inequalities instead of equations [FH, Section 2.2]).
In the aforementioned articles [HN1, HN2], Helton and Nie pushed the idea of Lasserre much further and made it fruitful in many situations. There are several important ideas in their work. For those Hessians of the for which the matrix sos certificate that Lasserre assumed (and which is trivial for those that happen to be linear) does not exist, they show that in many situations, one can with a lot of new ideas still pursue the basic strategy of Lasserre. These ideas include:
- •
One might exchange in a very subtle way the at certain places by suitable having stronger concavity properties.
- •
Instead of looking for matrix sos representations of the Hessians themselves, they look for matrix representations of certain matrix polynomials arising from double integrals of the Hessians and depending on a parameter that runs over part of the boundary of . The matrix polynomial belonging to this parameter serves to produce the bounded degree polynomial sos certificates for those linear polynomials defining a supporting hyperplane containing the point .
- •
Instead of assuming the sos certificates as Lasserre did, Helton and Nie had the idea to prove the existence using a matrix version of Putinar’s Positivstellensatz that was already available [SH, Thm. 2]. Because of the dependence of the tangent point of the supporting hyperplane, they had to prove a version of Putinar’s theorem for matrix polynomials with degree bounds similar to the one existing already for polynomials that was mentioned above (see [HN1, Thm. 29] and Theorem 2.11 below).
We modify the approach of Helton and Nie at several places, but the most important change is a new analysis of the properties of the modified polynomials which are at the same time chosen slightly more carefully (see Lemma 4.5 below). This new analysis shows that the double integral mentioned above (actually already a related single integral) is negative definite even if the term under the integral is not negative semidefinite on the whole domain of integration, see Lemma 4.6 below. Helton and Nie seem to be compelled to work with negative semidefinite terms under the integral whereas the new method enables us to be more liberal about this issue.
In this way, we will be able to show our Main Theorem 4.8: If each satisfies a certain second order strict quasiconcavity condition (see Definition 3.1 below) where it vanishes on (which is very natural because of the convexity of , see Proposition 3.4(b) below) or its Hessian has a matrix sos certificate for negative-semidefiniteness on (see Definition 2.10 below), then has an exact Lasserre relaxation.
Helton and Nie showed under the same conditions only that is semidefinitely representable [HN2, Thm. 3.3]. They obtained the semidefinite representation by glueing together Lasserre relaxations of many small pieces obtained in a non-constructive way [HN2, Prop. 4.3] (see also [NS]). With a very tedious proof (using smoothening techniques similar to those from [Gho]) they show in addition under very technical assumptions not easy to state [HN2, Section 5] that there exists and such that and has an exact Lasserre relaxation [HN2, Theorem 5.1]. In his diploma thesis, Sinn thoroughly analyzed and improved this proof and showed under the same technical assumptions that one can take [Sin, Theorem 3.3.2].
2. Reminder on sums of squares
In this section, we collect all the tools from the interplay between positive polynomials and sums of squares that we need from the area of real algebraic geometry.
Definition 2.1**.**
We call a sums-of-squares (sos) polynomial if there exist and polynomials such that
[TABLE]
We say that a polynomial is nonnegative (or positive) on a set if (or ) for all . In this case, we write “ on ” (or “ on ”).
It is obvious that each sos polynomial is nonnegative on . In Lemma 4.5 below, we will need the well-known fact that each polynomial in one variable nonnegative on is sos.
Proposition 2.2**.**
Let with on . Then is sos.
Proof.
Using the fundamental theorem of algebra, one shows easily that there are such that where is the imaginary unit. ∎
A matrix is called positive semidefinite (psd) (or positive definite (pd)) if it is symmetric and (or ) for all . Equivalently, is symmetric and the eigenvalues of (which are all real) are all nonnegative (or positive). In this case, we write (or ). By , , etc., we mean , , and so on.
The appropriate generalization of Definition 2.1 to matrix polynomials is the following.
Definition 2.3**.**
We call a sums-of-squares (sos) matrix polynomial if there exist and such that
[TABLE]
The following is an easy exercise that is good to know when dealing with sos matrix polynomials.
Proposition 2.4**.**
For , the following are equivalent:
- (a)
is an sos matrix. 2. (b)
There is an and a matrix polynomial such that . 3. (c)
There are and such that .
We say that a matrix polynomial is psd (or pd) on a set if (or ) for all . In this case, we write “ on ” (or “ on ”).
Definition 2.5**.**
A subset of is called a quadratic module of if
- •
,
- •
for all and
- •
for all and .
For a tuple , the smallest quadratic module containing is obviously
[TABLE]
where we set . We call it the quadratic module generated by .
Definition 2.6**.**
A quadratic module of is called Archimedean if for all there is some such that .
The following is well-known (see for example [PD, Lemma 5.1.13] and [Mar, Cor. 5.2.4]) but for convenience of the reader we include a compact easy proof.
Proposition 2.7**.**
Let be a quadratic module of . Then the following are equivalent:
- (a)
is Archimedean. 2. (b)
There is some such that . 3. (c)
There are and such that the polyhedron is non-empty and compact. 4. (d)
For each , there is some such that .
Proof.
Consider the vector subspace
[TABLE]
of . If with , then we can choose such that and thus
[TABLE]
and thus . Conversely, if , then one can choose such that and thus
[TABLE]
showing that since anyway . Thus, we have
[TABLE]
for all . This implies that is a subring of . Indeed, for with we have
[TABLE]
This shows that , which is the equivalence (d)(a). Condition (b) is easily seen to be equivalent to , which in turn is by equivalent to . Again by using that is a subring of , this shows the equivalence (a)(b). It remains to show (c)(d). If (d) holds, then one trivially finds like in (c), e.g., with being a hypercube. Conversely, suppose that we have like in (c) and let . Then there is such that on the polytope . By the affine form of Farkas’ lemma [Scr, Cor. 7.1h, p. 93], we have that is a nonnegative linear combination of the and thus lies in . ∎
We mention the following important theorem although we will need it only for Example 4.10 below.
Theorem 2.8** (Schmüdgen).**
Let be a quadratic module of . The following are equivalent:
- (a)
There are and such that is compact and for all . 2. (b)
There is some with compact . 3. (c)
is Archimedean.
Proof.
(a)(c) is the deep part of Schmüdgen’s Positivstellensatz [Scm, Cor. 3], namely his characterization of Archimedean preorders (see [PD, Thm. 5.1.17] and [Mar, Thm. 6.1.1]). The implications (c)(b)(a) are trivial. ∎
Remark 2.9**.**
For , there are examples of with compact (even empty) such that is not Archimedean (see [Mar, Ex. 7.3.1] or [PD, Ex. 6.3.1]). However if is compact, then Proposition 2.7 and Theorem 2.8 provide several ways of changing the description of such that becomes Archimedean. For example, if one knows a big ball containing , it suffices to add its defining quadratic polynomial to by Proposition 2.7(b). That is why for many practical purposes, the Archimedean property of is not much stronger than the compactness of .
We use the symbols and to denote the gradient and the Hessian of a real-valued function of variables, respectively. For a polynomial , we understand its gradient as a column vector from , i.e., as a vector of polynomials. Similarly, its Hessian is a symmetric matrix polynomial of size , i.e., a symmetric matrix from .
Definition 2.10**.**
Let and set again . For , set if and if . Then we define the -truncated quadratic module associated to by
[TABLE]
More generally, we define the -truncated matricial quadratic module associated to by
[TABLE]
We say that is -sos-concave if
[TABLE]
If , this means that the negated Hessian of is an sos matrix polynomial and we say that is sos-concave.
Any is sos-concave since . The Hessian of a -sos-concave polynomial is negative semidefinite on .
The following is Putinar’s Positivstellensatz [Put, Lemma 4.1] for matrix polynomials with degree bounds. It has been first proven by Helton and Nie [HN1, Thm. 29] following the technical approach of Nie and the second author [NS’] for the case of polynomials. This technical approach yields explicit degree bounds. The first author found a short topological proof for the mere existence of such bounds [Kri, Thm. 3.2] that is based on knowing already the result without the degree bounds that stems from [SH, Thm. 2].
Theorem 2.11** (Helton and Nie).**
Fix and fix any norm on the vector space . Let such that is Archimedean. Then there exists such that every symmetric satisfying and on satisfies .
The following is a slight generalization of [HN1, Lemma 7] that will be needed in the proof of Theorem 4.7.
Lemma 2.12**.**
Let , and . If , then the matrix polynomial defined by
[TABLE]
for lies again in .
Proof.
The proof [HN1, Lemma 7] can be easily adapted. Another more conceptual proof is the following: is a convex cone in a finite-dimensional vector space. Then
[TABLE]
is an existing Bochner integral of a vector valued function with values in this convex cone and thus lies again in this convex cone [RW] (regardless of whether the cone is closed or not). ∎
The “if” direction of the following proposition is trivial since a closed convex set in a finite-dimensional vector space is the intersection over all half spaces containing it. We will use it to prove our Main Theorem 4.8. The “only if” direction will be needed only in Example 4.10 below.
Proposition 2.13** (Netzer, Plaumann and Schweighofer).**
Suppose , , is compact and convex and has nonempty interior. Then if and only if every with on lies in .
Proof.
This is a special case of [NPS, Proposition 3.1]. ∎
3. Reminder on strict quasiconcavity
We denote the real zero set of by
[TABLE]
We adopt the following notion from [HN1, p. 25], which is a local second order quasiconcavity condition.
Definition 3.1**.**
Let . We say that is strictly quasiconcave at if for all with , we have that . We say that is strictly quasiconcave on if is strictly quasiconcave at each point of .
Remark 3.2**.**
Let and such that .
- (a)
is strictly quasiconcave at if and only if . 2. (b)
If is strictly quasiconcave at and , then there is a neighborhood of such that .
If satisfies and , then is locally around a smooth hypersurface. Differential geometers will recognize that strict quasiconcavity of at then means that the second fundamental form of this hypersurface at is positive definite when one chooses the “outward normal” (pointing away from ). Thus this means that is locally convex in a strong second order sense. For a detailed discussion we refer to [HN1, HN2] and the references therein. As Helton and Nie in [HN1, Subsection 3.1], we want however to help those readers who are not familiar with the basics of differential geometry by discussing strict quasiconcavity in an elementary manner. The reason why we include this is that Helton and Nie presuppose already that the reader is familiar with the geometric notion of tangent hyperplanes and knows that the gradient is a normal vector for it [HN2, p. 786]. Conversely we fit this into their arguments, see Part (a) of the following lemma and Proposition 3.4(b) below.
Formally, we will use the following lemma and the next proposition only in Example 4.10 below and even there it can be avoided by some calculations. Some readers might therefore decide to skip them.
Lemma 3.3**.**
Let , and such that and . Suppose form a basis of , is an open neighborhood of [math] in , is smooth and satisfies as well as
[TABLE]
for all . Then the following hold:
- (a)
2. (b)
If and , then
[TABLE]
Proof.
Taking the derivative of with respect to , we get
[TABLE]
for all . Setting here to [math], we get
[TABLE]
for each . From this, (a) follows easily (for “” use that since is a basis). Taking the derivative of with respect to , we get
[TABLE]
for all . To prove (b), suppose now that and . Then the preceding equation implies
[TABLE]
Since now form a basis of the orthogonal complement of by (a), the matrix is negative definite if and only if is strictly quasiconcave at (see Definition 3.1). ∎
The following proposition is important for understanding the notion of quasiconcavity. It is trivial that quasiconcavity of a polynomial at depends only on the function where is an arbitrarily small neighborhood of . But if and , then it actually depends only on the function
[TABLE]
as the equivalence of Conditions (a) and (b) of the following proposition show.
Proposition 3.4**.**
Let , and such that
[TABLE]
Suppose that is a neighborhood of . Then the following are equivalent:
- (a)
is strictly quasiconcave at . 2. (b)
There is a basis of , an open neighborhood of [math] in and a smooth function such that , , ,
[TABLE]
for all and
[TABLE]
for all small enough . 3. (c)
Condition (b) holds with “basis” replaced by “orthogonal basis”.
For any basis of like in (b), one has
[TABLE]
Proof.
Using Lemma 3.3(a), it is easy to show that any like in (b) satisfy using that would contradict the hypothesis since is a basis. Now Part (b) of the same lemma shows that (b) implies (a). Since it is trivial that (c) implies (b), it only remains to show that (a) implies (c).
To this end, let (a) be satisfied. In order to show (c), choose an orthogonal basis of satisfying . The implicit function theorem yields an open neighborhood of the origin in such that for each there is a unique satisfying , in particular . Moreover, one can choose such that the resulting function is smooth. From , we get . From Part (a) of Lemma 3.3, we get . From Part (b) of the same lemma and from (a), we obtain . ∎
Another more algebraic way of understanding strict quasiconcavity is given by the following easy exercise [HN1, Lemma 11(a)].
Lemma 3.5**.**
Let be a compact set and consider a polynomial that is strictly quasiconcave on . Then one can find such that
[TABLE]
is positive definite on .
We will need the following lemma only in the case where is linear. In that case, one can use for its proof a slightly weaker version of the Karush-Kuhn-Tucker theorem [Pla, Theorem 5.1].
Lemma 3.6**.**
Suppose , is convex and has nonempty interior. Suppose and let . Suppose and is a neighborhood of such that is a minimizer of on and on for all . Then there exist a family of nonnegative Lagrange multipliers such that .
Proof.
By the Karush-Kuhn-Tucker theorem [FH, Theorem 2.2.5], it suffices to show that the () satisfy the Mangasarian-Fromowitz constraint qualification, i.e., there is some such that for all [FH, Chapter 2.2.5]. By discarding those that are the zero polynomial, we may assume for all . Since has nonempty interior, there is then some such that for all . Set and consider for fixed the function . We have and . Therefore there is such that . Because of for all , this implies as desired. ∎
4. The main result
In this section, we will prove our main result about the exactness of the Lasserre relaxation. The first step is to get an alternate description of the compact basic closed semialgebraic set with nonempty interior. Both descriptions, the original one and the alternate one will be used in the proof of Theorem 4.7. The new description will arise by replacing polynomials that are strictly quasiconcave on by polynomials of the form with a univariate polynomial such that on . It will be of outmost importance that which follows from the fact that and therefore is an sos-polynomial by Lemma 2.2 above. Roughly speaking, the basic idea is that will be, up to positive factor, approximately for a big constant when lies in or lies sufficiently close to . The effect of this is that will be a polynomial (unfortunately of large degree) that is very close to being a positive constant on the “safe part” of consisting of the points in that are in “safe distance” to the boundary of . On the “safe part” of one can hope (and it will turn out from our actual choice of ) that the Hessian of the does not vary too quickly. This will be crucial in the proof of Lemma 4.6 (the interval appearing there corresponds to this “safe part”).
In the proof of Lemma 4.5 below, the auxiliary polynomial will be chosen as for a big real constant and a large nonnegative even integer where is defined in Notation 4.1 below. In [HN1, Lemma 13], Helton and Nie use exactly the same polynomial except that they do not care about the parity of the degree . Lemma 4.4 below is an important observation that was probably not known to Helton and Nie. If Helton and Nie had exploited this, they could have sharpened some of their results in [HN1]. However, they would not have come close to our main result Theorem 4.8 which ultimately relies on our new refined and subtle analysis in the proofs of Lemma 4.6 and Theorem 4.7 that focuses on integrals of the Hessian of the instead of the Hessians themselves.
Notation 4.1**.**
For and , we denote by
[TABLE]
the -th Taylor polynomial of the function
[TABLE]
at the origin and we set
[TABLE]
For any , we denote by its (formal) derivative (with respect to ) and by its second derivative.
Proposition 4.2**.**
For , we have
[TABLE]
Proof.
Use the chain rule, the product rule and the quotient rule for derivation. ∎
The following lemma has been given an easy short proof by Speyer [Spe], which we reproduce here for convenience of the reader.
Lemma 4.3** (Speyer).**
For and , we have:
- (a)
If is even, then for all . 2. (b)
If is odd, then is strictly increasing on .
Proof.
We fix and proceed by induction on . The case is trivial since . Suppose the lemma is already proven for instead of where is fixed. First consider the case where is even. Then by induction hypothesis the odd degree polynomial must have exactly one real root . By Lemma 4.2(a) the even degree polynomial takes therefore its (unique) minimum in . To prove the statement, it suffices to observe that
[TABLE]
In the case where is odd, the statement follows immediately from the induction hypothesis and Lemma 4.2(a). ∎
Lemma 4.4**.**
Let and suppose is even. Then for all .
Proof.
The leading coefficient of is . Therefore it suffices to show that has no real roots. One easily checks that has no root at the origin. Assume we have a root different from the origin. Then . Observing that , it follows from Lemma 4.3(b) that , a contradiction. ∎
The following lemma is an improved version of [HN1, Lemma 13]. Most importantly, we manage to get that (defined in this lemma) is an sos polynomial (and in particular is positive on ) instead of just positivity of on the interval . This will come out of Lemmata 4.4 and 2.2 together with the approach we take in the proof that uses simply Taylor approximations of the exponential function instead of the nonconstructive approximation theory used in [HN1]. The second crucial improvement is the new property (c). A surprising improvement coming out of Lemma 4.3 is that we get in Condition (a) positivity on instead of just the positivity on that Helton and Nie get. At the moment however, we do not have any application for this. Finally, an insignificant improvement again not used by us is the validity of Condition (b) on the interval instead of the interval used by Helton and Nie.
Lemma 4.5**.**
Let such that and . Then there exists a univariate polynomial such that
[TABLE]
satisfying the following conditions:
[TABLE]
Proof.
By a scaling argument, we can relax the condition that is sos to the condition that is sos for some . By the Lemmata 4.4 and 2.2, it suffices to find and even such that (a)–(c) are satisfied for . Noting that
[TABLE]
by Proposition 4.2, this means that we are trying to find and even with
[TABLE]
Condition (a’) is always satisfied by Lemma 4.3(a) if is even. Since the functions induced by the polynomials on the interval converge uniformly to the function as tends to infinity, it suffices to find satisfying
[TABLE]
These conditions can be rewritten as
[TABLE]
Thus it suffices to choose and even and sufficiently large. ∎
The previous result is now used to prove the following key lemma. This key lemma is our “luxury version” of [HN1, Proposition 10] in the work of Helton and Nie. It will be used in this article only with (when is compact) but for potential future applications we formulate it in greater generality. It has several advantages over [HN1, Proposition 10]. The most important one is that we only require the to be strictly quasiconcave on a set that will be very slim in general whereas Helton and Nie assume them to be strictly quasiconcave on the whole of . Another important advantage is that the new polynomials lie in . The only price that we have to pay is that not the Hessian itself but only an integrated version of it satisfies the negative definiteness condition. This will however be enough for the proof of Theorem 4.7 and the Main Theorem 4.8.
Lemma 4.6**.**
Let and let be a compact subset of such that is strictly *quasi-*concave on for each . Then there exists a polynomial with an sos polynomial such that satisfies
[TABLE]
for all , and with .
Proof.
By Lemma 3.5 and the compactness of , we find such that
[TABLE]
satisfies
[TABLE]
for all and all . The polynomial will come out of Lemma 4.5 applied to certain values of , , and , which we will now adjust. First of all, we choose such that
[TABLE]
for all and . To get , we observe that the compact set is contained in the union of the chain consisting of the open sets
[TABLE]
and therefore is contained in those of these sets that belong to a sufficiently small , i.e., there is with such that
[TABLE]
By compactness, there exists such that
[TABLE]
We choose with arbitrary and such that
[TABLE]
for all . The compact subset of is contained in the union of the chain consisting of the open sets
[TABLE]
and therefore is contained in those of these sets that belong to a sufficiently small , i.e., there is with such that
[TABLE]
Because is compact, we can choose such that
[TABLE]
for all and . Finally, set
[TABLE]
Choose such that is an sos polynomial in according to Lemma 4.5 and the chosen values of , , and . Fix and set . Using the product and chain rule, we calculate
[TABLE]
and therefore
[TABLE]
Using
[TABLE]
it follows that
[TABLE]
One now recognizes that conditions (a) and (b) from Lemma 4.5 guarantee that
[TABLE]
for all since . Now let and with
[TABLE]
It suffices to show
[TABLE]
To this end, we split up the unit interval into three disjoint parts
[TABLE]
In particular, each is a union of intervals such that . We now analyze the integral in question on each of these parts separately: The integral over will contribute a guaranteed amount of positive definiteness, the integral over an unknown amount of positive semidefiniteness and the integral over will be very small in norm so that it cannot destroy the positive definiteness accumulated over . For further use, we set
[TABLE]
Analysis on . The subinterval of (note that ) is contained in since for and therefore
[TABLE]
for all by the choice of (see Property (3) above). By choice of , we have that
[TABLE]
for all (in fact also for ). By Parts (a) and (c) of Lemma 4.5, we have for all . Hence we get with Property (2) above that
[TABLE]
Analysis on . We have of course
[TABLE]
for all (in fact also for ) and, by Part (a) of Lemma 4.5,
[TABLE]
for all . Hence
[TABLE]
Analysis on . We have of course for all and therefore
[TABLE]
Total analysis. Finally, we get
[TABLE]
∎
Theorem 4.7**.**
Let such that is convex with nonempty interior and is Archimedean. Suppose that each is strictly quasiconcave on or -sos-concave. Then there is such that for all with on we have .
Proof.
Choose and such that , is strictly quasiconcave on for and is -sos-concave for . Applying Lemma 4.6 with instead of and the compact subset of , we get for each a polynomial
[TABLE]
satisfying , and
[TABLE]
for all and . Setting here , we obtain in particular
[TABLE]
for each . Set
[TABLE]
for all . Then
[TABLE]
Choose such that
[TABLE]
for all . Define for all and a symmetric matrix polynomial by
[TABLE]
for all . Applying compactness of , and the unit sphere in together with continuity, we find such that
[TABLE]
for all and . For each , we apply this to instead of to get
[TABLE]
for all and . Thus
[TABLE]
for all , and . Again using the compactness of and continuity, we find some such that
[TABLE]
for all and . Theorem 2.11 yields such that
[TABLE]
for all and . Lemma 2.12 yields such that
[TABLE]
for all and . For later use, set
[TABLE]
Now let with on . Since is nonempty and compact, we can define as the minimum of on . Exchanging by , we can suppose without loss of generality that . Then there is some with
[TABLE]
Consider
[TABLE]
Because of (see Property (4)) and continuity, we get a neighborhood of such that
[TABLE]
for all . Since each with is -sos-concave, we have on the other hand
[TABLE]
for all . Combining both, we have in particular that
[TABLE]
for all . Applying Lemma 3.6, we get a family of nonnegative Lagrange multipliers such that (recall that is linear) and thus
[TABLE]
Fix now . For the map
[TABLE]
we have , and
[TABLE]
for . Hence
[TABLE]
Since was arbitrary, we thus have
[TABLE]
and thus . ∎
Note that it is essential in the previous theorem to require to be linear. It is even not enough to require to be globally convex of small bounded degree [KL].
Main Theorem 4.8**.**
Let such that is convex with nonempty interior and is Archimedean. Suppose that each is strictly quasiconcave on or -sos-concave. Then has an exact Lasserre relaxation.
Proof.
Directly from 4.7 by the trivial direction of Proposition 2.13. ∎
In the situation of this theorem, now drop the convexity assumption and consequently ask whether the convex hull of (instead of itself) equals for large . Helton and Nie proved that in this situation the convex hull of is semidefinitely representable [HN2, Theorem 4.4]. The question arises if it even equals for large . This will be proven in our forthcoming paper [KS] if all are strictly quasiconcave on . However, Example 4.10 below shows that in this case, one cannot allow that some of the are linear (or even sos-concave) instead. To prove this, we need the following important criterion from [GN, Proposition 4.1].
Theorem 4.9** (Gouveia and Netzer).**
Suppose , is a straight line in , has nonempty interior in and is an element of the boundary of in . Suppose that for each with , is orthogonal to . Then strictly contains the closure of the convex hull of for all .
Example 4.10**.**
Let , write for and consider with
[TABLE]
We see that is the disjoint union of two closed disks of different radii. The affine half plane cuts out a piece from the bigger disk and its boundary line is tangent to the smaller disk. Since is compact, is Archimedean by Theorem 2.8(b). By Proposition 3.4(b), is strictly quasiconcave on . The line is tangent to the smaller disk in the point and passes through the interior of the larger disk. By the criterion 4.9 of Gouveia and Netzer applied with , strictly contains the convex hull of for all . By inspection of the proof of Gouveia and Netzer, we see more precisely that each contains a left neighbourhood of inside .
Acknowledgments
The authors would like to thank all three anonymous referees for their thorough reading that helped to improve the presentation of the material.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[BEFB] S. Boyd, L. El Ghaoui, E. Feron, V. Balakrishnan: Linear matrix inequalities in system and control theory, SIAM Studies in Applied Mathematics 15, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 1994
- 2[FH] W. Forst, D. Hoffmann: Optimization – theory and practice, Springer Undergraduate Texts in Mathematics and Technology, Springer, New York, 2010
- 3[Gho] M. Ghomi: Optimal smoothing for convex polytopes, Bull. London Math. Soc. 36 (2004), no. 4, 483–492
- 4[GN] J. Gouveia, T. Netzer: Positive polynomials and projections of spectrahedra, SIAM J. Optim. 21 (2011), no. 3, 960–976
- 5[HN 1] J.W. Helton, J. Nie: Semidefinite representation of convex sets, Math. Program. 122 (2010), no. 1, Ser. A, 21–64
- 6[HN 2] J.W. Helton, J. Nie: Sufficient and necessary conditions for semidefinite representability of convex hulls and sets, SIAM J. Optim. 20 (2009), no. 2, 759–791 [this article is a continuation of [ HN 1 ] although it appeared earlier]
- 7[KL] E. de Klerk, M. Laurent: On the Lasserre hierarchy of semidefinite programming relaxations of convex polynomial optimization problems, SIAM J. Optim. 21 (2011), no. 3, 824–832
- 8[Kri] T. Kriel: A new proof for the existence of degree bounds for Putinar’s Positivstellensatz, Ordered algebraic structures and related topics 203–209, Contemp. Math., 697, Amer. Math. Soc., Providence, RI, 2017
