On the exactness of Lasserre relaxations for compact convex basic closed semialgebraic sets

Markus Schweighofer; Tom-Lukas Kriel

arXiv:1704.07231·math.AG·March 1, 2018·SIAM J. Optim.

TL;DR

This paper proves that for certain convex semialgebraic sets satisfying specific polynomial conditions, the Lasserre relaxation is exact, simplifying the process of obtaining semidefinite representations.

Contribution

The authors demonstrate that under natural convexity and second order quasiconcavity conditions, a single Lasserre relaxation suffices for exact representation, improving upon previous non-constructive methods.

Findings

1. Lasserre relaxation is exact for the specified convex sets.

2. The approach simplifies previous semidefinite representation methods.

3. Conditions include second order strict quasiconcavity and the Archimedean property.

Equations

$$\mathbb{R}[\ushort X]_{d}:=\{p\in\mathbb{R}[\ushort X]\mid\deg p\leq d\}$$

$$S(\ushort g):=\{x\in\mathbb{R}^{n}\mid g_{1}(x)\geq 0,\dots,g_{m}(x)\geq 0\}$$

$$r_{i}:=\frac{d-\deg g_{i}}{2}$$

$$\mathbb{R}[\ushort X]_{r_{i}}=\{a^{T}v_{i}\mid a\in\mathbb{R}^{\ell_{i}}\}$$

$$\{p^{2}g_{i}\mid p\in\mathbb{R}[\ushort X]_{r_{i}}\}=\{(a^{T}v_{i})^{2}g_{i}\mid a\in\mathbb{R}^{\ell_{i}}\}=\{a^{T}(g_{i}v_{i}v_{i}^{T})a\mid a\in\mathbb{R}^{\ell_{i}}\}.$$

$$M_{0}(x,y)\succeq 0,\dots,M_{m}(x,y)\succeq 0\qquad(x\in\mathbb{R}^{n},y\in\mathbb{R}^{I})$$

$$M(x,y)\succeq 0\qquad(x\in\mathbb{R}^{n},y\in\mathbb{R}^{I}).$$

$$S_{d}(\ushort g):=\{x\in\mathbb{R}^{n}\mid\exists y\in\mathbb{R}^{I}:M(x,y)\succeq 0\}.\tag{$*$}$$

$$S(\ushort g)\subseteq\dots\subseteq S_{d+2}(\ushort g)\subseteq S_{d+1}(\ushort g)\subseteq S_{d}(\ushort g).$$

$$S=\{x\in\mathbb{R}^{n}\mid\exists y\in\mathbb{R}^{h}:H(x,y)\succeq 0\}\tag{$**$}$$

$$p=p_{1}^{2}+\dots+p_{\ell}^{2}.$$

$$P=P_{1}^{T}P_{1}+\dots+P_{\ell}^{T}P_{\ell}.$$

$$M(\ushort g):=\Bigl\{\sum_{i=0}^{m}s_{i}g_{i}\mid s_{0},\dots,s_{m}\in\mathbb{R}[\ushort X]\text{ are sos}\Bigr\}$$

$$B:=\{p\in\mathbb{R}[\ushort X]\mid\exists N\in\mathbb{N}:N\pm p\in M\}\supseteq\mathbb{R}$$

$$N\pm p=(N-1)-p^{2}+\Bigl(\frac{1}{2}\pm p\Bigr)^{2}+\frac{3}{4}\in M$$

$$N^{2}(2N-1)-p^{2}=\frac{1}{2}\bigl((N-p)^{2}(2N-1+p)+(N+p)^{2}(2N-1-p)\bigr)\in M,$$

$$p^{2}\in B\iff p\in B\tag{$*$}$$

$$pq=\frac{1}{2}\bigl(\underbrace{(p+q)^{2}}_{\in B}-\underbrace{p^{2}}_{\in B}-\underbrace{q^{2}}_{\in B}\bigr)\in B.$$

$$M_{d}(\ushort g):=\Bigl\{\sum_{i=0}^{m}\sum_{j}p_{ij}^{2}g_{i}\mid p_{ij}\in\mathbb{R}[\ushort X]_{r_{i}}\Bigr\}\subseteq M(\ushort g)\cap\mathbb{R}[\ushort X]_{d}.$$

$$M_{d}^{k\times k}(\ushort g):=\Bigl\{\sum_{i=0}^{m}\sum_{j}P_{ij}^{T}P_{ij}g_{i}\mid P_{ij}\in\mathbb{R}[\ushort X]_{r_{i}}^{k\times k}\Bigr\}\subseteq\mathbb{R}[\ushort X]_{d}^{k\times k}.$$

$$-\operatorname{Hess}f\in M^{n\times n}(\ushort g):=\bigcup_{d\in\mathbb{N}_{0}}M_{d}^{n\times n}(\ushort g).$$

$$H(x)=\int_{0}^{1}\int_{0}^{t}P(u+s(x-u))\,ds\,dt$$

$$H=\int_{0}^{1}\int_{0}^{t}P(u+s(\ushort X-u))\,ds\,dt$$

$$Z(g):=\{x\in\mathbb{R}^{n}\mid g(x)=0\}.$$

$$g(x+\xi_{1}v_{1}+\dots+\xi_{n-1}v_{n-1}+\varphi(\xi)v_{n})=0\tag{$*$}$$

$$g\text{ is strictly quasiconcave at }x\iff\operatorname{Hess}\varphi(0)\succ 0.$$

$$(\nabla g(x+\xi_{1}v_{1}+\dots+\xi_{n-1}v_{n-1}+\varphi(\xi)v_{n}))^{T}\Bigl(v_{i}+\frac{\partial\varphi(\xi)}{\partial\xi_{i}}v_{n}\Bigr)=0\tag{$**$}$$

$$(\nabla g(x))^{T}\Bigl(v_{i}+\frac{\partial\varphi(\xi)}{\partial\xi_{i}}\Big|_{\xi=0}v_{n}\Bigr)=0$$

$$\Bigl(v_{j}+\frac{\partial\varphi(\xi)}{\partial\xi_{j}}v_{n}\Bigr)^{T}(\operatorname{Hess}g(x+\xi_{1}v_{1}+\dots+\xi_{n-1}v_{n-1}+\varphi(\xi)v_{n}))\Bigl(v_{i}+\frac{\partial\varphi(\xi)}{\partial\xi_{i}}v_{n}\Bigr)+(\nabla g(x+\xi_{1}v_{1}+\dots+\xi_{n-1}v_{n-1}+\varphi(\xi)v_{n}))^{T}\Bigl(\frac{\partial^{2}\varphi(\xi)}{\partial\xi_{i}\partial\xi_{j}}v_{n}\Bigr)=0$$

$$\operatorname{Hess}\varphi(0)=-\frac{1}{(\nabla g(x))^{T}v_{n}}\bigl(v_{i}^{T}(\operatorname{Hess}g(x))v_{j}\bigr)_{i,j\in\{1,\dots,n-1\}}.$$

Taxonomy

Topics: Advanced Optimization Algorithms Research · Matrix Theory and Algorithms · Polynomial and algebraic computation

Full text

On the exactness of Lasserre relaxations for compact convex basic closed semialgebraic sets

Tom-Lukas Kriel

Fachbereich Mathematik und Statistik, Universität Konstanz, 78457 Konstanz, Germany


and

Markus Schweighofer

Fachbereich Mathematik und Statistik, Universität Konstanz, 78457 Konstanz, Germany


(Date: January 29, 2018)

Abstract.

Consider a finite system of non-strict real polynomial inequalities and suppose its solution set $S\subseteq\mathbb{R}^{n}$ is convex, has nonempty interior and is compact. Suppose that the system satisfies the Archimedean condition, which is slightly stronger than the compactness of $S$ . Suppose that each defining polynomial satisfies a second order strict quasiconcavity condition where it vanishes on $S$ (which is very natural because of the convexity of $S$ ) or its Hessian has a certain matrix sums of squares certificate for negative-semidefiniteness on $S$ (fulfilled trivially by linear polynomials). Then we show that the system possesses an exact Lasserre relaxation.

In their seminal work of 2009, Helton and Nie showed under the same conditions that $S$ is the projection of a spectrahedron, i.e., it has a semidefinite representation. The semidefinite representation used by Helton and Nie arises from glueing together Lasserre relaxations of many small pieces obtained in a non-constructive way. By refining and varying their approach, we show that we can simply take a Lasserre relaxation of the original system itself. Such a result was provided by Helton and Nie with much more machinery only under very technical conditions and after changing the description of $S$ .

Key words and phrases:

moment relaxation, Lasserre relaxation, basic closed semialgebraic set, sum of squares, polynomial optimization, semidefinite programming, linear matrix inequality, spectrahedron, semidefinitely representable set

2010 Mathematics Subject Classification:

Primary 14P10, 52A20; Secondary 13J30, 52A41, 90C22, 90C26

1. Introduction

Throughout the article, $\mathbb{N}$ and $\mathbb{N}_{0}$ denote the set of positive and nonnegative integers, respectively. We fix $n\in\mathbb{N}_{0}$ and denote by $\ushort X:=(X_{1},\dots,X_{n})$ a tuple of $n$ variables. We denote by $\mathbb{R}[\ushort X]:=\mathbb{R}[X_{1},\dots,X_{n}]$ the polynomial ring in these variables over $\mathbb{R}$ . For $\alpha\in\mathbb{N}_{0}^{n}$ , we denote $|\alpha|:=\alpha_{1}+\ldots+\alpha_{n}$ and $\ushort X^{\alpha}:=X_{1}^{\alpha_{1}}\dotsm X_{n}^{\alpha_{n}}$ . For $p=\sum_{\alpha}a_{\alpha}\ushort X^{\alpha}\in\mathbb{R}[\ushort X]$ with all $a_{\alpha}\in\mathbb{R}$ , the degree of $p$ is defined as $\deg p:=\max\{|\alpha|\mid a_{\alpha}\neq 0\}$ if $p\neq 0$ and $\deg p:=-\infty$ if $p=0$ . For each $d\in\mathbb{R}$ , we consider the real vector space

$$\mathbb{R}[\ushort X]_{d}:=\{p\in\mathbb{R}[\ushort X]\mid\deg p\leq d\}$$

of all polynomials of degree at most $d$ . We admit here real numbers $d$ for technical reasons but note that $\mathbb{R}[\ushort X]_{d}=\mathbb{R}[\ushort X]_{\lfloor d\rfloor}$ for all $d\in\mathbb{R}$ and $\mathbb{R}[\ushort X]_{d}=\{0\}$ for all $d<0$ . Occasionally, we will need the real polynomial ring in one variable as an auxiliary tool, and we will denote it by $\mathbb{R}[T]$ . We will denote the $n\times n$ identity matrix by $I_{n}$ .
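As a small illustration of these conventions, the following Python sketch computes the degree of a polynomial and the dimension of $\mathbb{R}[\ushort X]_{d}$. The dictionary encoding (exponent tuple mapped to coefficient) and the function names are our own assumptions for illustration; the dimension formula $\binom{n+\lfloor d\rfloor}{n}$ is the standard count of monomials of degree at most $\lfloor d\rfloor$ in $n$ variables.

```python
from math import comb, floor, inf

def degree(p):
    """Degree of a polynomial given as {exponent tuple: coefficient}; -inf for p = 0."""
    totals = [sum(alpha) for alpha, a in p.items() if a != 0]
    return max(totals) if totals else -inf

def dim_up_to_degree(n, d):
    """Dimension of R[X]_d in n variables; d may be any real number."""
    if d < 0:
        return 0  # R[X]_d = {0} for d < 0
    return comb(n + floor(d), n)

# Hypothetical example: p = 3*X1^2*X2 - X2 + 5 in n = 2 variables
p = {(2, 1): 3, (0, 1): -1, (0, 0): 5}
print(degree(p))                  # 3
print(degree({}))                 # -inf
print(dim_up_to_degree(2, 2.5))   # 6, the same as for d = 2
```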

For a tuple $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ of $m$ polynomials, the set

$$S(\ushort g):=\{x\in\mathbb{R}^{n}\mid g_{1}(x)\geq 0,\dots,g_{m}(x)\geq 0\}$$

is called a basic closed semialgebraic set [PD, Def. 2.1.1]. Boolean combinations of such sets are called semialgebraic sets [PD, Def. 2.1.4]. The finiteness theorem from real algebraic geometry says that every closed semialgebraic set is a finite union of basic closed ones [PD, Thm. 2.4.1]. In general, it is hard to answer questions about the geometry of $S(\ushort g)$ from its description $\ushort g$ . This is of course due to the nonlinear monomials $\ushort X^{\alpha}$ with $|\alpha|\geq 2$ that might appear in $\ushort g$ . An extremely naive idea would be to replace each such nonlinear monomial $\ushort X^{\alpha}$ in $\ushort g$ by a new variable $Y_{\alpha}$ . This would lead to a system of $m$ linear inequalities whose solution set is a (closed convex) polyhedron in a higher-dimensional space. The projection of this polyhedron to the $\ushort X$ -space $\mathbb{R}^{n}$ contains $S(\ushort g)$ but will very often just be the whole of $\mathbb{R}^{n}$ and thus be of no help.

This idea becomes however less naive if we add a bunch of redundant inequalities before the linearization. For example, we could add certain inequalities of the form $p^{2}(x)\geq 0$ or $(p^{2}g_{i})(x)\geq 0$ with $p\in\mathbb{R}[\ushort X]$ . If we choose finitely many such inequalities in a clever way and then linearize as above, we will get a polyhedron in a higher-dimensional space whose projection to $\ushort X$ -space $\mathbb{R}^{n}$ might enclose $S(\ushort g)$ more tightly. Unless $S(\ushort g)$ happens to be a polyhedron, this projection can however still not equal $S(\ushort g)$ since projections of polyhedra are again polyhedra (see [Scr, Subsection 12.2] for a textbook reference).

The idea of Lasserre was therefore to add the whole (infinite) family of all redundant inequalities of the form $p^{2}(x)\geq 0$ or $(p^{2}g_{i})(x)\geq 0$ with $p\in\mathbb{R}[\ushort X]$ before the linearization [L1, L2]. To get something that is useful in practice (for example, one would like to avoid using infinitely many of the new variables $Y_{\alpha}$ ), he restricted the degree of the polynomials of the added redundant inequalities.

Therefore fix a degree bound $d\in\mathbb{N}_{0}$ and set $g_{0}:=1\in\mathbb{R}\subseteq\mathbb{R}[\ushort X]$ . For each $i\in\{0,\dots,m\}$ with $g_{i}\neq 0$ , fix a (column) vector $v_{i}$ whose entries are the different monomials of degree at most

$$r_{i}:=\frac{d-\deg g_{i}}{2}\tag{1}$$

and set $\ell_{i}:=\dim\mathbb{R}[\ushort X]_{r_{i}}$ . Note that in the case $g_{i}\notin\mathbb{R}[\ushort X]_{d}$ , $r_{i}$ is negative, and consequently $\ell_{i}=0$ and $v_{i}=()\in\mathbb{R}[\ushort X]^{0}=\{0\}$ is the empty vector. This case is usually avoided in practice and in the literature by assuming $d$ large enough but we think it is more convenient to admit it. In the pathological case $g_{i}=0$ , we set $r_{i}:=-\infty$ , $\ell_{i}:=0$ and let $v_{i}$ again be the empty vector. Then

$$\mathbb{R}[\ushort X]_{r_{i}}=\{a^{T}v_{i}\mid a\in\mathbb{R}^{\ell_{i}}\}$$

and

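$$\{p^{2}g_{i}\mid p\in\mathbb{R}[\ushort X]_{r_{i}}\}=\{(a^{T}v_{i})^{2}g_{i}\mid a\in\mathbb{R}^{\ell_{i}}\}=\{a^{T}(g_{i}v_{i}v_{i}^{T})a\mid a\in\mathbb{R}^{\ell_{i}}\}.$$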
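To make the bookkeeping concrete, here is a minimal Python sketch that lists the monomials forming $v_{i}$ (as exponent tuples) and computes $\ell_{i}$ from $n$, $d$ and $\deg g_{i}$. The function names and the sample degrees are hypothetical choices for illustration, and the sketch assumes $g_{i}\neq 0$.

```python
from itertools import product
from math import floor

def monomial_exponents(n, r):
    """Exponent tuples alpha in N_0^n with |alpha| <= r (empty list if r < 0)."""
    if r < 0:
        return []
    k = floor(r)
    return [alpha for alpha in product(range(k + 1), repeat=n) if sum(alpha) <= k]

def v_and_ell(n, d, deg_g):
    """Return (v_i, l_i) for r_i = (d - deg g_i) / 2, assuming g_i != 0."""
    r = (d - deg_g) / 2
    v = monomial_exponents(n, r)
    return v, len(v)

# n = 2 and d = 4 with g_0 = 1 and hypothetical g_1, g_2 of degrees 2 and 6
v0, l0 = v_and_ell(2, 4, 0)   # r_0 = 2
v1, l1 = v_and_ell(2, 4, 2)   # r_1 = 1
v2, l2 = v_and_ell(2, 4, 6)   # r_2 = -1, so v_2 is the empty vector
print(l0, l1, l2)             # 6 3 0
```

The last case corresponds to $g_{i}\notin\mathbb{R}[\ushort X]_{d}$, where the block of size $\ell_{i}=0$ imposes no condition.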

The key observation is that instead of linearizing each $p^{2}g_{i}$ with $p\in\mathbb{R}[\ushort X]_{r_{i}}$ individually, we can just linearize the symmetric matrix polynomial $g_{i}v_{i}v_{i}^{T}\in\mathbb{R}[\ushort X]^{\ell_{i}\times\ell_{i}}$ . In this way, we get for each $i\in\{0,\dots,m\}$ a linear symmetric matrix polynomial $M_{i}\in\mathbb{R}[\ushort X,(Y_{\alpha})_{2\leq|\alpha|\leq d}]_{1}^{\ell_{i}\times\ell_{i}}$ . Instead of an infinite family of linear inequalities, we thus get finitely many linear matrix inequalities [BEFB] (whose size depends on $d$ ) saying that

$$M_{0}(x,y)\succeq 0,\dots,M_{m}(x,y)\succeq 0\qquad(x\in\mathbb{R}^{n},y\in\mathbb{R}^{I})$$

where $I:=\{\alpha\in\mathbb{N}_{0}^{n}\mid 2\leq|\alpha|\leq d\}$ and “ $\succeq 0$ ” means positive semidefiniteness. By defining $M\in\mathbb{R}[\ushort X,(Y_{\alpha})_{2\leq|\alpha|\leq d}]_{1}^{\ell\times\ell}$ with $\ell:=\ell_{0}+\dots+\ell_{m}$ as the block diagonal matrix with blocks $M_{0},\dots,M_{m}$ , we could even combine this into a single linear matrix inequality

$$M(x,y)\succeq 0\qquad(x\in\mathbb{R}^{n},y\in\mathbb{R}^{I}).$$

Its solution set is a spectrahedron [Vin] (in particular a semialgebraic closed convex subset of $\mathbb{R}^{n}$ ) that projects down to the convex set

$$S_{d}(\ushort g):=\{x\in\mathbb{R}^{n}\mid\exists y\in\mathbb{R}^{I}:M(x,y)\succeq 0\}.\tag{$*$}$$

The description $(*)$ of $S_{d}(\ushort g)$ is called the degree $d$ Lasserre relaxation of $\ushort g$ (or of the system of polynomial inequalities given by $\ushort g$ ). By abuse of language, we sometimes call $S_{d}(\ushort g)$ itself the degree $d$ Lasserre relaxation of $\ushort g$ . By construction, it is clear that each $S_{d}(\ushort g)$ is convex and

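$$S(\ushort g)\subseteq\dots\subseteq S_{d+2}(\ushort g)\subseteq S_{d+1}(\ushort g)\subseteq S_{d}(\ushort g).$$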

If $S(\ushort g)$ happens to be convex, there is a certain hope that $S_{k}(\ushort g)$ equals $S(\ushort g)$ for all $k$ large enough. In this case, we say that $\ushort g$ (or the system of polynomial inequalities given by $\ushort g$ ) has an exact Lasserre relaxation.
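A toy instance may help to see the construction at work. The Python sketch below builds the degree $2$ Lasserre relaxation for the single univariate polynomial $g_{1}=1-X^{2}$, which is our own illustrative choice. Here $v_{0}=(1,X)^{T}$ and $v_{1}=(1)$, so the linearized blocks are $M_{0}=\begin{pmatrix}1&x\\x&y_{2}\end{pmatrix}$ and $M_{1}=(1-y_{2})$, and the conditions read $x^{2}\leq y_{2}\leq 1$. The search for $y_{2}$ over a finite grid is only a heuristic stand-in for a semidefinite solver, and the positive semidefiniteness test is written for matrices of size at most $2$.

```python
from fractions import Fraction as F

# Univariate polynomials are dicts {exponent: coefficient}.
def mul(p, q):
    out = {}
    for a, ca in p.items():
        for b, cb in q.items():
            out[a + b] = out.get(a + b, 0) + ca * cb
    return out

def linearize(p, x, y):
    """Evaluate p with X replaced by x and each X^k, k >= 2, replaced by y[k]."""
    return sum(c * (1 if k == 0 else x if k == 1 else y[k]) for k, c in p.items())

def block(g, r, x, y):
    """Linearization M_i(x, y) of g * v v^T with v = (1, X, ..., X^r)."""
    return [[linearize(mul(g, {a + b: 1}), x, y) for b in range(r + 1)]
            for a in range(r + 1)]

def is_psd(m):
    """Positive semidefiniteness for symmetric matrices of size 1 or 2 only."""
    if len(m) == 1:
        return m[0][0] >= 0
    (a, b), (_, c) = m
    return a >= 0 and c >= 0 and a * c - b * b >= 0

def in_relaxation(x, grid=100):
    """Look for y_2 on a finite grid in [0, 2] with M_0 and M_1 both psd."""
    g0, g1 = {0: 1}, {0: 1, 2: -1}   # g_0 = 1, g_1 = 1 - X^2, degree bound d = 2
    for j in range(2 * grid + 1):
        y = {2: F(j, grid)}
        if is_psd(block(g0, 1, x, y)) and is_psd(block(g1, 0, x, y)):
            return True
    return False

print(in_relaxation(F(1, 2)))   # True:  y_2 = 1/4 works
print(in_relaxation(F(3, 2)))   # False: would need 9/4 <= y_2 <= 1
```

In this toy case $S_{2}(\ushort g)=[-1,1]=S(\ushort g)$, so the relaxation is already exact at degree $2$.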

In this article, we provide a new sufficient criterion for $\ushort g$ to have an exact Lasserre relaxation. To the best of our knowledge this is the strongest result currently available for convex $S(\ushort g)$ .

If $S(\ushort g)$ is not convex, one can still ask whether $S_{k}(\ushort g)$ eventually equals the convex hull of $S(\ushort g)$ . This seems to require very different techniques and will be studied in our forthcoming work [KS], see also Example 4.10 below.

Here we will also not address the important question asking from what $k$ on $S(\ushort g)$ equals $S_{k}(\ushort g)$ in case $\ushort g$ has an exact Lasserre relaxation. In principle, a corresponding complexity analysis of our proof would probably be possible but would, at least for general $\ushort g$ , be extremely tedious, and in the end yield a bound that is only of theoretical interest.

The Lasserre relaxation $(*)$ is a special case of the more general semidefinite representation of a subset $S\subseteq\mathbb{R}^{n}$

$$S=\{x\in\mathbb{R}^{n}\mid\exists y\in\mathbb{R}^{h}:H(x,y)\succeq 0\}\tag{$**$}$$

where $H\in\mathbb{R}[\ushort X,Y_{1},\dots,Y_{h}]_{1}^{\ell\times\ell}$ is a symmetric linear matrix polynomial for some $h,\ell\in\mathbb{N}_{0}$ . Sets $S$ having such a representation $(**)$ are called semidefinitely representable. Other commonly used terms are projections of spectrahedra, spectrahedral shadows, spectrahedrops, lifted LMI sets and SDP-representable sets. If the number $h$ of additional variables is not too large, one can optimize efficiently linear functions on such sets by the use of semidefinite programming, an important generalization of linear programming [NN]. Semidefinitely representable sets are obviously convex and they are semialgebraic by Tarski’s real quantifier elimination [PD, Thm. 2.1.6]. The class of semidefinitely representable sets is closed under many operations like for example taking the interior [Net]. It was asked by Nemirovski in his plenary address at the 2006 International Congress of Mathematicians in Madrid whether each convex semialgebraic set is semidefinitely representable [Nem, Subsection 4.3.1]. Helton and Nie conjectured the answer to be positive [HN2, Section 6]. In two seminal works, Scheiderer proved this conjecture for $n=2$ [S1, Theorem 6.8] and very recently disproved it for each $n\geq 14$ [S2, Remark 4.21].

In [NPS, Theorem 3.5], it has been shown that $\ushort g$ cannot have an exact Lasserre relaxation if $S(\ushort g)\subseteq\mathbb{R}^{n}$ is convex, has nonempty interior and has at least one non-exposed face. Other obstructions to exactness have been given by Gouveia and Netzer [GN], see Theorem 4.9 below.

On the positive side, the breakthrough was the seminal work of Helton and Nie [HN2] from 2009 preceded by their earlier work [HN1], which curiously appeared later. We will now summarize the strategy behind their approach, which builds on ideas of Lasserre [L2], and indicate where this paper introduces advantageous modifications:

Let $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ and suppose $S(\ushort g)$ is convex and has nonempty interior. We will introduce in Definition 2.10 below the $d$ -truncated quadratic module $M_{d}(\ushort g)$ associated to $\ushort g$ . It consists of the sums of polynomials $p^{2}g_{i}$ with $\deg(p^{2}g_{i})\leq d$ (or equivalently $\deg(p)\leq r_{i}$ , see Equation (1) above). As explained above, these are the polynomials that we add before the linearization when we build the degree $d$ Lasserre relaxation. The following fact is good to know although we will need from it only the trivial “if” part in order to prove our Main Theorem 4.8: We have $S(\ushort g)=S_{d}(\ushort g)$ if and only if all $f\in\mathbb{R}[\ushort X]_{1}$ (i.e., all linear polynomials) that are nonnegative on $S(\ushort g)$ lie in $M_{d}(\ushort g)$ , see Proposition 2.13 below.
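For a hands-on instance of this criterion, take again the illustrative choice $n=1$, $g_{1}=1-X^{2}$ and $d=2$ (our assumption, not an example from the text). The linear polynomials $1\pm X$ are nonnegative on $S(\ushort g)=[-1,1]$ and satisfy $1\pm X=\frac{1}{2}(1\pm X)^{2}g_{0}+\frac{1}{2}g_{1}$, which exhibits them as elements of $M_{2}(\ushort g)$. The following Python sketch checks this identity with exact rational arithmetic; the helper names are hypothetical.

```python
from fractions import Fraction as F

# Univariate polynomials as coefficient lists [c0, c1, c2, ...].
def add(p, q):
    n = max(len(p), len(q))
    return [(p[k] if k < len(p) else 0) + (q[k] if k < len(q) else 0) for k in range(n)]

def mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def scale(c, p):
    return [c * a for a in p]

def trim(p):
    """Drop trailing zero coefficients."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

g1 = [1, 0, -1]  # g_1 = 1 - X^2

def certificate(sign):
    """Expand 1/2 * (1 + sign*X)^2 * g_0 + 1/2 * 1^2 * g_1 with g_0 = 1."""
    lin = [1, sign]
    return trim(add(scale(F(1, 2), mul(lin, lin)), scale(F(1, 2), g1)))

print(certificate(-1) == [1, -1])   # True: the sum equals 1 - X
print(certificate(+1) == [1, 1])    # True: the sum equals 1 + X
```

Every $a+bX$ nonnegative on $[-1,1]$ satisfies $a\geq|b|$ and can be written as $(a-|b|)+|b|(1\pm X)$ with the sign of $b$, so in this toy case all such linear polynomials lie in $M_{2}(\ushort g)$.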

Denoting by $M(\ushort g)=\bigcup_{d\in\mathbb{N}}M_{d}(\ushort g)$ the quadratic module generated by $\ushort g$ introduced in Definition 2.10 below, one deduces from this (due to the compactness of $S$ ) a trivial necessary condition for $\ushort g$ having an exact Lasserre relaxation: For each $f\in\mathbb{R}[\ushort X]_{1}$ , there is an $N\in\mathbb{N}$ such that $f+N\in M(\ushort g)$ . If $\ushort g$ satisfies this condition, one says that $M(\ushort g)$ is Archimedean, see Proposition 2.7(d) below. This condition is unfortunately stronger than compactness of $S(\ushort g)$ . In practice, this is however not too important, since a small change of the description $\ushort g$ of $S(\ushort g)$ always makes $M(\ushort g)$ Archimedean if $S(\ushort g)$ is compact, see Remark 2.9 below.

Therefore suppose for the rest of the introduction that $M(\ushort g)$ is Archimedean.

We saw that it suffices to look at those $f\in\mathbb{R}[\ushort X]_{1}$ nonnegative on $S(\ushort g)$ whose real zero set is a supporting hyperplane of the convex set $S(\ushort g)$ . By Putinar’s Positivstellensatz from 1993 (see [Put, Lemma 4.1], [PD, Thm. 5.3.8], [Mar, Cor. 5.6.1], [Lau]), we know that each $f\in\mathbb{R}[\ushort X]$ positive on $S(\ushort g)$ lies in $M(\ushort g)$ . However, this is not really what we need here. The advantage we have is that we need to consider only $f\in\mathbb{R}[\ushort X]_{1}$ , i.e., only linear polynomials. The problem we have to fight is however that we have $f$ only nonnegative on $S(\ushort g)$ and, most importantly, we need a uniform degree bound $d$ for which all such $f$ are in one and the same $M_{d}(\ushort g)$ . Such degree bounds are known for polynomials positive on $S(\ushort g)$ but depend on a measure of how close $f$ comes to have a zero on $S(\ushort g)$ [NS’, Theorem 6].

Lasserre [L2] made a first key observation to deal with this problem: He considered without loss of generality only such $f\in\mathbb{R}[\ushort X]_{1}$ nonnegative on $S(\ushort g)$ that vanish in at least one point $u\in S(\ushort g)$ (and whose real zero set therefore defines a supporting hyperplane at the point $u$ of the convex set $S(\ushort g)$ unless $f=0$ ). Under a very restrictive condition, namely that the Hessians of the defining polynomials $g_{i}$ have a certain matrix sums-of-squares (sos for short) representation (and in particular, are globally concave, which is still very restrictive), he showed that he can produce from this finitely many matrix sos representations by the use of Karush–Kuhn–Tucker (KKT) multipliers (the Lagrange multiplier technique for inequalities instead of equations [FH, Section 2.2]).

In the aforementioned articles [HN1, HN2], Helton and Nie pushed the idea of Lasserre much further and made it fruitful in many situations. There are several important ideas in their work. For those Hessians of the $g_{i}$ for which the matrix sos certificate that Lasserre assumed (and which is trivial for those $g_{i}$ that happen to be linear) does not exist, they show that in many situations, one can with a lot of new ideas still pursue the basic strategy of Lasserre. These ideas include:

• One might exchange in a very subtle way the $g_{i}$ at certain places by suitable $h_{i}$ having stronger concavity properties.

• Instead of looking for matrix sos representations of the Hessians themselves, they look for matrix representations of certain matrix polynomials arising from double integrals of the Hessians and depending on a parameter $u$ that runs over part of the boundary of $S(\ushort g)$ . The matrix polynomial belonging to this parameter $u$ serves to produce the bounded degree polynomial sos certificates for those linear polynomials $f$ defining a supporting hyperplane containing the point $u$ .

• Instead of assuming the sos certificates as Lasserre did, Helton and Nie had the idea to prove the existence using a matrix version of Putinar’s Positivstellensatz that was already available [SH, Thm. 2]. Because of the dependence on the tangent point $u$ of the supporting hyperplane, they had to prove a version of Putinar’s theorem for matrix polynomials with degree bounds similar to the one existing already for polynomials that was mentioned above (see [HN1, Thm. 29] and Theorem 2.11 below).

We modify the approach of Helton and Nie at several places, but the most important change is a new analysis of the properties of the modified polynomials $h_{i}$ which are at the same time chosen slightly more carefully (see Lemma 4.5 below). This new analysis shows that the double integral mentioned above (actually already a related single integral) is negative definite even if the term under the integral is not negative semidefinite on the whole domain of integration, see Lemma 4.6 below. Helton and Nie seem to be compelled to work with negative semidefinite terms under the integral whereas the new method enables us to be more liberal about this issue.

In this way, we will be able to show our Main Theorem 4.8: If each $g_{i}$ satisfies a certain second order strict quasiconcavity condition (see Definition 3.1 below) where it vanishes on $S(\ushort g)$ (which is very natural because of the convexity of $S$ , see Proposition 3.4(b) below) or its Hessian has a matrix sos certificate for negative-semidefiniteness on $S$ (see Definition 2.10 below), then $\ushort g$ has an exact Lasserre relaxation.

Helton and Nie showed under the same conditions only that $S(\ushort g)$ is semidefinitely representable [HN2, Thm. 3.3]. They obtained the semidefinite representation by glueing together Lasserre relaxations of many small pieces obtained in a non-constructive way [HN2, Prop. 4.3] (see also [NS]). With a very tedious proof (using smoothening techniques similar to those from [Gho]) they show in addition under very technical assumptions not easy to state [HN2, Section 5] that there exists $s\in\mathbb{N}_{0}$ and $\ushort h\in\mathbb{R}[\ushort X]^{s}$ such that $S(\ushort g)=S(\ushort h)$ and $\ushort h$ has an exact Lasserre relaxation [HN2, Theorem 5.1]. In his diploma thesis, Sinn thoroughly analyzed and improved this proof and showed under the same technical assumptions that one can take $\ushort h:=(g_{1},\dots,g_{m},g_{1}g_{2},g_{1}g_{3},\dots,g_{m-1}g_{m})$ [Sin, Theorem 3.3.2].

2. Reminder on sums of squares

In this section, we collect all the tools from the interplay between positive polynomials and sums of squares that we need from the area of real algebraic geometry.

Definition 2.1.

We call $p\in\mathbb{R}[\ushort X]$ a sums-of-squares (sos) polynomial if there exist $\ell\in\mathbb{N}_{0}$ and polynomials $p_{1},\dots,p_{\ell}\in\mathbb{R}[\ushort X]$ such that

$$p=p_{1}^{2}+\dots+p_{\ell}^{2}.$$

We say that a polynomial $p\in\mathbb{R}[\ushort X]$ is nonnegative (or positive) on a set $S\subseteq\mathbb{R}^{n}$ if $p(x)\geq 0$ (or $p(x)>0$ ) for all $x\in S$ . In this case, we write “ $p\geq 0$ on $S$ ” (or “ $p>0$ on $S$ ”).

It is obvious that each sos polynomial is nonnegative on $\mathbb{R}^{n}$ . In Lemma 4.5 below, we will need the well-known fact that each polynomial in one variable nonnegative on $\mathbb{R}$ is sos.

Proposition 2.2.

Let $f\in\mathbb{R}[T]$ with $f\geq 0$ on $\mathbb{R}$ . Then $f$ is sos.

Proof.

Using the fundamental theorem of algebra, one shows easily that there are $p,q\in\mathbb{R}[T]$ such that $f=(p-\mathbbm{i}q)(p+\mathbbm{i}q)=p^{2}+q^{2}$ where $\mathbbm{i}:=\sqrt{-1}\in\mathbb{C}$ is the imaginary unit. ∎
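As a quick sanity check of the factorization used in this proof, consider the hypothetical example $f=T^{2}-2T+2$, whose complex roots are $1\pm\mathbbm{i}$. With $p=T-1$ and $q=1$ one gets $f=(p-\mathbbm{i}q)(p+\mathbbm{i}q)=p^{2}+q^{2}$. The Python sketch below verifies both equalities on coefficient lists; it only illustrates the identity for this one example and does not compute $p$ and $q$ for a general $f$.

```python
# Univariate polynomials as coefficient lists [c0, c1, ...]; complex entries allowed.
def mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def add(p, q):
    n = max(len(p), len(q))
    return [(p[k] if k < len(p) else 0) + (q[k] if k < len(q) else 0) for k in range(n)]

f = [2, -2, 1]   # f = T^2 - 2T + 2, complex roots 1 + i and 1 - i
p = [-1, 1]      # p = T - 1
q = [1]          # q = 1

p_minus_iq = add(p, [-1j * c for c in q])   # T - (1 + i)
p_plus_iq = add(p, [1j * c for c in q])     # T - (1 - i)
factored = mul(p_minus_iq, p_plus_iq)
sum_of_squares = add(mul(p, p), mul(q, q))

print(factored == f)         # True
print(sum_of_squares == f)   # True
```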

A matrix $A\in\mathbb{R}^{k\times k}$ is called positive semidefinite (psd) (or positive definite (pd)) if it is symmetric and $x^{T}Ax\geq 0$ (or $x^{T}Ax>0$ ) for all $x\in\mathbb{R}^{k}\setminus\{0\}$ . Equivalently, $A$ is symmetric and the eigenvalues of $A$ (which are all real) are all nonnegative (or positive). In this case, we write $A\succeq 0$ (or $A\succ 0$ ). By $A\succeq B$ , $A\succ B$ , $A\preceq 0$ etc., we mean $A-B\succeq 0$ , $A-B\succ 0$ , $-A\succeq 0$ and so on.

The appropriate generalization of Definition 2.1 to matrix polynomials is the following.

Definition 2.3.

We call $P\in\mathbb{R}[\ushort X]^{k\times k}$ a sums-of-squares (sos) matrix polynomial if there exist $\ell\in\mathbb{N}_{0}$ and $P_{1},\dots,P_{\ell}\in\mathbb{R}[\ushort X]^{k\times k}$ such that

$$P=P_{1}^{T}P_{1}+\dots+P_{\ell}^{T}P_{\ell}.$$

The following is an easy exercise that is good to know when dealing with sos matrix polynomials.

Proposition 2.4.

For $P\in\mathbb{R}[\ushort X]^{k\times k}$ , the following are equivalent:

(a) $P$ is an sos matrix.

(b) There is an $\ell\in\mathbb{N}_{0}$ and a matrix polynomial $Q\in\mathbb{R}[\ushort X]^{\ell\times k}$ such that $P=Q^{T}Q$ .

(c) There are $\ell\in\mathbb{N}_{0}$ and $v_{1},\dots,v_{\ell}\in\mathbb{R}[\ushort X]^{k}$ such that $P=v_{1}v_{1}^{T}+\ldots+v_{\ell}v_{\ell}^{T}$ .

We say that a matrix polynomial $P\in\mathbb{R}[\ushort X]^{k\times k}$ is psd (or pd) on a set $S\subseteq\mathbb{R}^{n}$ if $P(x)\succeq 0$ (or $P(x)\succ 0$ ) for all $x\in S$ . In this case, we write “ $P\succeq 0$ on $S$ ” (or “ $P\succ 0$ on $S$ ”).

Definition 2.5.

A subset $M$ of $\mathbb{R}[\ushort X]$ is called a quadratic module of $\mathbb{R}[\ushort X]$ if

• $1\in M$ ,

• $p+q\in M$ for all $p,q\in M$ and

• $p^{2}q\in M$ for all $p\in\mathbb{R}[\ushort X]$ and $q\in M$ .

For a tuple $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ , the smallest quadratic module containing $g_{1},\dots,g_{m}$ is obviously

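$$M(\ushort g):=\Bigl\{\sum_{i=0}^{m}s_{i}g_{i}\mid s_{0},\dots,s_{m}\in\mathbb{R}[\ushort X]\text{ are sos}\Bigr\}$$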

where we set $g_{0}:=1$ . We call it the quadratic module generated by $\ushort g$ .

Definition 2.6.

A quadratic module $M$ of $\mathbb{R}[\ushort X]$ is called Archimedean if for all $p\in\mathbb{R}[\ushort X]$ there is some $N\in\mathbb{N}$ such that $N+p\in M$ .

The following is well-known (see for example [PD, Lemma 5.1.13] and [Mar, Cor. 5.2.4]) but for convenience of the reader we include a compact easy proof.

Proposition 2.7.

Let $M$ be a quadratic module of $\mathbb{R}[\ushort X]$ . Then the following are equivalent:

(a) $M$ is Archimedean.

(b) There is some $N\in\mathbb{N}$ such that $N-(X_{1}^{2}+\ldots+X_{n}^{2})\in M$ .

(c) There are $m\in\mathbb{N}$ and $\ushort g\in(\mathbb{R}[\ushort X]_{1}\cap M)^{m}$ such that the polyhedron $S(\ushort g)$ is non-empty and compact.

(d) For each $f\in\mathbb{R}[\ushort X]_{1}$ , there is some $N\in\mathbb{N}$ such that $N+f\in M$ .

Proof.

Consider the vector subspace

$$B:=\{p\in\mathbb{R}[\ushort X]\mid\exists N\in\mathbb{N}:N\pm p\in M\}\supseteq\mathbb{R}$$

of $\mathbb{R}[\ushort X]$ . If $p\in\mathbb{R}[\ushort X]$ with $p^{2}\in B$ , then we can choose $N\in\mathbb{N}$ such that $(N-1)-p^{2}\in M$ and thus

$$N\pm p=(N-1)-p^{2}+\Bigl(\frac{1}{2}\pm p\Bigr)^{2}+\frac{3}{4}\in M$$

and thus $p\in B$ . Conversely, if $p\in B$ , then one can choose $N\in\mathbb{N}$ such that $2N-1\pm p\in M$ and thus

$$N^{2}(2N-1)-p^{2}=\frac{1}{2}\bigl((N-p)^{2}(2N-1+p)+(N+p)^{2}(2N-1-p)\bigr)\in M,$$

showing that $p^{2}\in B$ since anyway $N^{2}(2N-1)+p^{2}\in M$ . Thus, we have

$$p^{2}\in B\iff p\in B\tag{$*$}$$

for all $p\in\mathbb{R}[\ushort X]$ . This implies that $B$ is a subring of $\mathbb{R}[\ushort X]$ . Indeed, for $p,q\in\mathbb{R}[\ushort X]$ with $p,q\in B$ we have

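$$pq=\frac{1}{2}\bigl(\underbrace{(p+q)^{2}}_{\in B}-\underbrace{p^{2}}_{\in B}-\underbrace{q^{2}}_{\in B}\bigr)\in B.$$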

This shows that $\mathbb{R}[\ushort X]_{1}\subseteq B\iff\mathbb{R}[\ushort X]=B$ , which is the equivalence (d) $\iff$ (a). Condition (b) is easily seen to be equivalent to $X_{1}^{2},\dots,X_{n}^{2}\in B$ , which in turn is by $(*)$ equivalent to $X_{1},\dots,X_{n}\in B$ . Again by using that $B$ is a subring of $\mathbb{R}[\ushort X]$ , this shows the equivalence (a) $\iff$ (b). It remains to show (c) $\iff$ (d). If (d) holds, then one trivially finds $\ushort g$ like in (c), e.g., with $S(\ushort g)$ being a hypercube. Conversely, suppose that we have $\ushort g$ like in (c) and let $f\in\mathbb{R}[\ushort X]_{1}$ . Then there is $N\in\mathbb{N}$ such that $N+f\geq 0$ on the polytope $S(\ushort g)$ . By the affine form of Farkas’ lemma [Scr, Cor. 7.1h, p. 93], we have that $N+f$ is a nonnegative linear combination of the $1,g_{1},\dots,g_{m}$ and thus lies in $M$ . ∎
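The two algebraic identities behind the equivalence $p^{2}\in B\iff p\in B$ in this proof, namely $N\pm p=((N-1)-p^{2})+(\frac{1}{2}\pm p)^{2}+\frac{3}{4}$ and $N^{2}(2N-1)-p^{2}=\frac{1}{2}((N-p)^{2}(2N-1+p)+(N+p)^{2}(2N-1-p))$, are easy to get wrong by a sign. The following Python sketch checks them with exact rational arithmetic for a sample univariate $p$ and a sample $N$, both of which are hypothetical test values. It confirms only the polynomial identities, not the membership statements in $M$.

```python
from fractions import Fraction as F

# Univariate polynomials as coefficient lists [c0, c1, c2, ...].
def add(*polys):
    n = max(len(p) for p in polys)
    return [sum(p[k] for p in polys if k < len(p)) for k in range(n)]

def scale(c, p):
    return [c * a for a in p]

def mul(p, q):
    out = [F(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def trim(p):
    """Drop trailing zero coefficients."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def first_identity_holds(p, N, sign):
    """Check N + sign*p == ((N-1) - p^2) + (1/2 + sign*p)^2 + 3/4."""
    lhs = add([F(N)], scale(sign, p))
    half = add([F(1, 2)], scale(sign, p))
    rhs = add([F(N - 1)], scale(-1, mul(p, p)), mul(half, half), [F(3, 4)])
    return trim(lhs) == trim(rhs)

def second_identity_holds(p, N):
    """Check N^2(2N-1) - p^2 == ((N-p)^2 (2N-1+p) + (N+p)^2 (2N-1-p)) / 2."""
    lhs = add([F(N * N * (2 * N - 1))], scale(-1, mul(p, p)))
    n_minus_p = add([F(N)], scale(-1, p))
    n_plus_p = add([F(N)], p)
    a = mul(mul(n_minus_p, n_minus_p), add([F(2 * N - 1)], p))
    b = mul(mul(n_plus_p, n_plus_p), add([F(2 * N - 1)], scale(-1, p)))
    return trim(lhs) == trim(scale(F(1, 2), add(a, b)))

p = [F(0), F(1), F(2)]   # hypothetical test polynomial p = X + 2X^2
print(first_identity_holds(p, 3, +1), first_identity_holds(p, 3, -1))   # True True
print(second_identity_holds(p, 3))                                      # True
```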

We mention the following important theorem although we will need it only for Example 4.10 below.

Theorem 2.8 (Schmüdgen).

Let $M$ be a quadratic module of $\mathbb{R}[\ushort X]$ . The following are equivalent:

(a) There are $m\in\mathbb{N}$ and $\ushort g=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ such that $S(\ushort g)$ is compact and $\prod_{i\in I}g_{i}\in M$ for all $I\subseteq\{1,\dots,m\}$ .

(b) There is some $g\in M$ with compact $S(g)$ .

(c) $M$ is Archimedean.

Proof.

(a) $\implies$ (c) is the deep part of Schmüdgen’s Positivstellensatz [Scm, Cor. 3], namely his characterization of Archimedean preorders (see [PD, Thm. 5.1.17] and [Mar, Thm. 6.1.1]). The implications (c) $\implies$ (b) $\implies$ (a) are trivial. ∎

Remark 2.9.

For $n\geq 2$ , there are examples of $\ushort g=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ with compact (even empty) $S(\ushort g)$ such that $M(\ushort g)$ is not Archimedean (see [Mar, Ex. 7.3.1] or [PD, Ex. 6.3.1]). However if $S(\ushort g)$ is compact, then Proposition 2.7 and Theorem 2.8 provide several ways of changing the description $\ushort g$ of $S(\ushort g)$ such that $M(\ushort g)$ becomes Archimedean. For example, if one knows a big ball containing $S(\ushort g)$ , it suffices to add its defining quadratic polynomial to $\ushort g$ by Proposition 2.7(b). That is why for many practical purposes, the Archimedean property of $M(\ushort g)$ is not much stronger than the compactness of $S(\ushort g)$ .
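The last remedy in Remark 2.9 can be sketched as follows. This is a minimal illustration, assuming polynomials are stored as plain Python callables and that a radius `R` with $S(\ushort g)$ inside the ball of radius `R` is already known; the helper names `add_ball_constraint` and `in_S` and the sample description of the unit square are hypothetical and not taken from the paper. Appending $R^{2}-\sum_{i}X_{i}^{2}$ leaves the set unchanged while putting the ball polynomial into the quadratic module.

```python
def add_ball_constraint(gs, R):
    """Return the description gs extended by the ball polynomial R^2 - |x|^2."""
    def ball(x):
        return R * R - sum(xi * xi for xi in x)
    return list(gs) + [ball]

def in_S(gs, x):
    """Membership test for S(g) = {x : g_i(x) >= 0 for all i}."""
    return all(g(x) >= 0 for g in gs)

# Illustrative description of the unit square [0, 1]^2 (an assumption for this sketch).
square = [
    lambda x: x[0] * (1 - x[0]),
    lambda x: x[1] * (1 - x[1]),
]
extended = add_ball_constraint(square, R=2.0)

# The square lies inside the ball of radius 2, so membership must not change.
samples = [(0.5, 0.5), (1.0, 1.0), (1.5, 0.5), (-0.1, 0.2), (3.0, 3.0)]
unchanged = all(in_S(square, p) == in_S(extended, p) for p in samples)
```

The check on sample points is only a sanity test of the set equality; it says nothing about the Archimedean property itself, which is an algebraic statement about $M(\ushort g)$ .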

We use the symbols $\nabla$ and $\operatorname{Hess}$ to denote the gradient and the Hessian of a real-valued function of $n$ variables, respectively. For a polynomial $g\in\mathbb{R}[\ushort X]$ , we understand its gradient $\nabla g$ as a column vector from $\mathbb{R}[\ushort X]^{n}$ , i.e., as a vector of polynomials. Similarly, its Hessian $\operatorname{Hess}g$ is a symmetric matrix polynomial of size $n$ , i.e., a symmetric matrix from $\mathbb{R}[\ushort X]^{n\times n}$ .

Definition 2.10.

Let $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ and set again $g_{0}:=1$ . For $i\in\{0,\dots,m\}$ , set $r_{i}:=\frac{d-\deg g_{i}}{2}$ if $g_{i}\neq 0$ and $r_{i}:=-\infty$ if $g_{i}=0$ . Then we define the $d$ -truncated quadratic module $M_{d}(\ushort g)$ associated to $\ushort g$ by

[TABLE]

More generally, we define the $d$ -truncated $k\times k$ matricial quadratic module associated to $\ushort g$ by

[TABLE]

We say that $f\in\mathbb{R}[\ushort X]$ is $\ushort g$ -sos-concave if

[TABLE]

If $m=0$ , this means that the negated Hessian of $f$ is an sos matrix polynomial and we say that $f$ is sos-concave.

Any $f\in\mathbb{R}[\ushort X]_{1}$ is sos-concave since $\operatorname{Hess}f=0$ . The Hessian of a $\ushort g$ -sos-concave polynomial is negative semidefinite on $S(\ushort g)$ .
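The bookkeeping of the half-degrees $r_{i}$ in Definition 2.10 can be sketched in a few lines. The function names are hypothetical, and encoding the zero polynomial by `None` is a choice made only for this sketch. Under the usual convention, which is an assumption here, $r_{i}$ bounds the degree of the polynomials whose squares form the multiplier of $g_{i}$ .

```python
import math

def truncation_halfdegree(d, deg_g):
    """r_i = (d - deg g_i) / 2, or -inf when g_i is the zero polynomial (deg_g is None)."""
    if deg_g is None:
        return -math.inf
    return (d - deg_g) / 2

def halfdegrees(d, degrees):
    """Return [r_0, r_1, ..., r_m], where r_0 belongs to g_0 = 1 (degree 0)."""
    return [truncation_halfdegree(d, 0)] + [truncation_halfdegree(d, k) for k in degrees]

# d = 4 with deg g_1 = 2, deg g_2 = 1 and g_3 = 0
rs = halfdegrees(4, [2, 1, None])
```

A non-integer value such as `1.5` simply means that the squared polynomials have degree at most 1.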

The following is Putinar’s Positivstellensatz [Put, Lemma 4.1] for matrix polynomials with degree bounds. It was first proven by Helton and Nie [HN1, Thm. 29] following the technical approach of Nie and the second author [NS’] for the case of polynomials. This technical approach yields explicit degree bounds. The first author found a short topological proof for the mere existence of such bounds [Kri, Thm. 3.2], based on the result without degree bounds, which stems from [SH, Thm. 2].

Theorem 2.11 (Helton and Nie).

Fix $C,d,k,m,n\in\mathbb{N}$ and fix any norm on the vector space $\mathbb{R}[\ushort X]_{d}^{k\times k}$ . Let $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ such that $M(\ushort g)$ is Archimedean. Then there exists $e\in\mathbb{N}_{0}$ such that every symmetric $H\in\mathbb{R}[\ushort X]_{d}^{k\times k}$ satisfying $\|H\|\leq C$ and $H\succeq\frac{1}{C}$ on $S(\ushort g)$ satisfies $H\in M_{e}^{k\times k}(\ushort g)$ .

The following is a slight generalization of [HN1, Lemma 7] that will be needed in the proof of Theorem 4.7.

Lemma 2.12.

Let $d\in\mathbb{N}_{0}$ , $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ and $u\in\mathbb{R}^{n}$ . If $P\in M_{d}^{k\times k}(\ushort g)$ , then the matrix polynomial $H\in\mathbb{R}[\ushort X]^{k\times k}$ defined by

[TABLE]

for $x\in\mathbb{R}^{n}$ lies again in $M_{d}^{k\times k}(\ushort g)$ .

Proof.

The proof of [HN1, Lemma 7] can be easily adapted. Another more conceptual proof is the following: $M_{d}^{k\times k}(\ushort g)$ is a convex cone in a finite-dimensional vector space. Then

[TABLE]

is an existing Bochner integral of a vector valued function with values in this convex cone and thus lies again in this convex cone [RW] (regardless of whether the cone is closed or not). ∎

The “if” direction of the following proposition is trivial since a closed convex set in a finite-dimensional vector space is the intersection over all half spaces containing it. We will use it to prove our Main Theorem 4.8. The “only if” direction will be needed only in Example 4.10 below.

Proposition 2.13 (Netzer, Plaumann and Schweighofer).

Suppose $d\in\mathbb{N}_{0}$ , $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]_{d}^{m}$ , $S(\ushort g)$ is compact and convex and has nonempty interior. Then $S_{d}(\ushort g)=S(\ushort g)$ if and only if every $f\in\mathbb{R}[\ushort X]_{1}$ with $f\geq 0$ on $S(\ushort g)$ lies in $M_{d}(\ushort g)$ .

Proof.

This is a special case of [NPS, Proposition 3.1]. ∎

3. Reminder on strict quasiconcavity

We denote the real zero set of $g$ by

$$Z(g):=\{x\in\mathbb{R}^{n}\mid g(x)=0\}.$$

We adopt the following notion from [HN1, p. 25], which is a local second order quasiconcavity condition.

Definition 3.1.

Let $g\in\mathbb{R}[\ushort X]$ . We say that $g$ is strictly quasiconcave at $x\in\mathbb{R}^{n}$ if for all $v\in\mathbb{R}^{n}\setminus\{0\}$ with $(\nabla g(x))^{T}v=0$ , we have that $v^{T}(\operatorname{Hess}g(x))v<0$ . We say that $g$ is strictly quasiconcave on $A\subseteq\mathbb{R}^{n}$ if $g$ is strictly quasiconcave at each point of $A$ .
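Definition 3.1 can be tested pointwise. The following is a minimal sketch for the special case $n=2$ , where the vectors orthogonal to a nonzero gradient form a single line; the function name and the three sample polynomials are illustrative choices, not taken from the paper.

```python
def strictly_quasiconcave_2d(grad, hess, tol=1e-12):
    """Check Definition 3.1 at one point for n = 2.

    grad: (g_x, g_y) at the point; hess: ((g_xx, g_xy), (g_xy, g_yy)) at the point.
    """
    gx, gy = grad
    (a, b), (_, c) = hess
    if abs(gx) <= tol and abs(gy) <= tol:
        # Every v != 0 is admissible, so the Hessian must be negative definite:
        # for a symmetric 2x2 matrix this means a < 0 and det > 0.
        return a < 0 and a * c - b * b > 0
    # Otherwise the admissible v span the line orthogonal to the gradient.
    vx, vy = -gy, gx
    quad = a * vx * vx + 2 * b * vx * vy + c * vy * vy
    return quad < 0

# g = 1 - x^2 - y^2 at the boundary point (1, 0)
disk = strictly_quasiconcave_2d((-2.0, 0.0), ((-2.0, 0.0), (0.0, -2.0)))
# g = x (linear) at the origin: the Hessian is zero, so the strict inequality fails
linear = strictly_quasiconcave_2d((1.0, 0.0), ((0.0, 0.0), (0.0, 0.0)))
# g = x - y^2 at the origin
parabola = strictly_quasiconcave_2d((1.0, 0.0), ((0.0, 0.0), (0.0, -2.0)))
```

The linear case shows why, for $n\geq 2$ , linear polynomials are never strictly quasiconcave at any point and need a separate hypothesis such as sos-concavity.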

Remark 3.2.

Let $g\in\mathbb{R}[\ushort X]$ and $x\in\mathbb{R}^{n}$ such that $\nabla g(x)=0$ .

(a) $g$ is strictly quasiconcave at $x$ if and only if $\operatorname{Hess}g(x)\prec 0$ .

(b) If $g$ is strictly quasiconcave at $x$ and $g(x)=0$ , then there is a neighborhood $U$ of $x$ such that $U\cap S(g)=\{x\}$ .

If $g\in\mathbb{R}[\ushort X]$ satisfies $g(x)=0$ and $\nabla g(x)\neq 0$ , then $Z(g)$ is locally around $x$ a smooth hypersurface. Differential geometers will recognize that strict quasiconcavity of $g$ at $x$ then means that the second fundamental form of this hypersurface at $x$ is positive definite when one chooses the “outward normal” (pointing away from $S(g)$ ). Thus this means that $S(g)$ is locally convex in a strong second order sense. For a detailed discussion we refer to [HN1, HN2] and the references therein. As Helton and Nie in [HN1, Subsection 3.1], we want however to help those readers who are not familiar with the basics of differential geometry by discussing strict quasiconcavity in an elementary manner. The reason why we include this is that Helton and Nie presuppose already that the reader is familiar with the geometric notion of tangent hyperplanes and knows that the gradient is a normal vector for it [HN2, p. 786]. Conversely we fit this into their arguments, see Part (a) of the following lemma and Proposition 3.4(b) below.

Formally, we will use the following lemma and the next proposition only in Example 4.10 below and even there it can be avoided by some calculations. Some readers might therefore decide to skip them.

Lemma 3.3.

Let $n\in\mathbb{N}$ , $g\in\mathbb{R}[\ushort X]$ and $x\in\mathbb{R}^{n}$ such that $g(x)=0$ and $\nabla g(x)\neq 0$ . Suppose $v_{1},\dots,v_{n}$ form a basis of $\mathbb{R}^{n}$ , $U$ is an open neighborhood of $0$ in $\mathbb{R}^{n-1}$ , $\varphi\colon U\to\mathbb{R}$ is smooth and satisfies $\varphi(0)=0$ as well as

[TABLE]

for all $\xi=(\xi_{1},\dots,\xi_{n-1})\in U$ . Then the following hold:

(a) $(\nabla g(x))^{T}v_{1}=\ldots=(\nabla g(x))^{T}v_{n-1}=0\iff\nabla\varphi(0)=0$

(b) If $\nabla\varphi(0)=0$ and $(\nabla g(x))^{T}v_{n}>0$ , then

[TABLE]

Proof.

Taking the derivative of $(*)$ with respect to $\xi_{i}$ , we get

[TABLE]

for all $i\in\{1,\dots,n-1\}$ . Setting here $\xi$ to $0$ , we get

[TABLE]

for each $i\in\{1,\dots,n-1\}$ . From this, (a) follows easily (for “ ${\implies}$ ” use that $(\nabla g(x))^{T}v_{n}\neq 0$ since $v_{1},\dots,v_{n}$ is a basis). Taking the derivative of $(**)$ with respect to $\xi_{j}$ , we get

[TABLE]

for all $i,j\in\{1,\dots,n-1\}$ . To prove (b), suppose now that $\nabla\varphi(0)=0$ and $(\nabla g(x))^{T}v_{n}>0$ . Then the preceding equation implies

[TABLE]

Since $v_{1},\ldots,v_{n-1}$ now form a basis of the orthogonal complement of $\nabla g(x)$ by (a), the matrix $(v_{i}^{T}(\operatorname{Hess}g(x))v_{j})_{i,j\in\{1,\dots,n-1\}}$ is negative definite if and only if $g$ is strictly quasiconcave at $x$ (see Definition 3.1). ∎

The following proposition is important for understanding the notion of quasiconcavity. It is trivial that quasiconcavity of a polynomial $g$ at $x$ depends only on the function $V\to\mathbb{R},\ x\mapsto g(x)$ where $V$ is an arbitrarily small neighborhood of $x$ . But if $g(x)=0$ and $\nabla g(x)\neq 0$ , then it actually depends only on the function

[TABLE]

as the equivalence of Conditions (a) and (b) of the following proposition shows.

Proposition 3.4.

Let $n\in\mathbb{N}$ , $g\in\mathbb{R}[\ushort X]$ and $x\in\mathbb{R}^{n}$ such that

[TABLE]

Suppose that $V$ is a neighborhood of $x$ . Then the following are equivalent:

(a) $g$ is strictly quasiconcave at $x$ .

(b) There is a basis $v_{1},\dots,v_{n}$ of $\mathbb{R}^{n}$ , an open neighborhood $U$ of $0$ in $\mathbb{R}^{n-1}$ and a smooth function $\varphi\colon U\to\mathbb{R}$ such that $\varphi(0)=0$ , $\nabla\varphi(0)=0$ , $\operatorname{Hess}\varphi(0)\succ 0$ ,

[TABLE]

for all $\xi\in U$ and

[TABLE]

for all small enough $\lambda\in\mathbb{R}_{>0}$ .

(c) Condition (b) holds with “basis” replaced by “orthogonal basis”.

For any basis $v_{1},\dots,v_{n}$ of $\mathbb{R}^{n}$ like in (b), one has

[TABLE]

Proof.

Using Lemma 3.3(a), it is easy to show that any $v_{1},\dots,v_{n}$ like in (b) satisfy $(***)$ using that $(\nabla g(x))^{T}v_{n}=0$ would contradict the hypothesis $\nabla g(x)\neq 0$ since $v_{1},\dots,v_{n}$ is a basis. Now Part (b) of the same lemma shows that (b) implies (a). Since it is trivial that (c) implies (b), it only remains to show that (a) implies (c).

To this end, let (a) be satisfied. In order to show (c), choose an orthogonal basis $v_{1},\dots,v_{n}$ of $\mathbb{R}^{n}$ satisfying $(***)$ . The implicit function theorem yields an open neighborhood $U$ of the origin in $\mathbb{R}^{n-1}$ such that for each $\xi=(\xi_{1},\dots,\xi_{n-1})\in U$ there is a unique $\varphi(\xi)\in\mathbb{R}$ satisfying $(*)$ , in particular $\varphi(0)=0$ . Moreover, one can choose $U$ such that the resulting function $\varphi\colon U\to\mathbb{R}$ is smooth. From $(\nabla g(x))^{T}v_{n}>0$ , we get $(**)$ . From Part (a) of Lemma 3.3, we get $\nabla\varphi(0)=0$ . From Part (b) of the same lemma and from (a), we obtain $\operatorname{Hess}\varphi(0)\succ 0$ . ∎

Another more algebraic way of understanding strict quasiconcavity is given by the following easy exercise [HN1, Lemma 11(a)].

Lemma 3.5.

Let $S\subseteq\mathbb{R}^{n}$ be a compact set and consider a polynomial $g\in\mathbb{R}[\ushort X]$ that is strictly quasiconcave on $S$ . Then one can find $\lambda>0$ such that

[TABLE]

is positive definite on $S$ .

We will need the following lemma only in the case where $f$ is linear. In that case, one can use for its proof a slightly weaker version of the Karush-Kuhn-Tucker theorem [Pla, Theorem 5.1].

Lemma 3.6.

Suppose $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ , $S(\ushort g)$ is convex and has nonempty interior. Suppose $u\in S(\ushort g)$ and let $I:=\{i\in\{1,\dots,m\}\mid g_{i}(u)=0\}$ . Suppose $f\in\mathbb{R}[\ushort X]$ and $U$ is a neighborhood of $u$ such that $u$ is a minimizer of $f$ on $S(\ushort g)\cap U$ and $\operatorname{Hess}g_{i}\preceq 0$ on $S(\ushort g)\cap U$ for all $i\in I$ . Then there exists a family $(\lambda_{i})_{i\in I}$ of nonnegative Lagrange multipliers $\lambda_{i}\in\mathbb{R}_{\geq 0}$ such that $\nabla f(u)=\sum_{i\in I}\lambda_{i}\nabla g_{i}(u)$ .

Proof.

By the Karush-Kuhn-Tucker theorem [FH, Theorem 2.2.5], it suffices to show that the $g_{i}$ ( $i\in I$ ) satisfy the Mangasarian-Fromowitz constraint qualification, i.e., there is some $v\in\mathbb{R}^{n}$ such that $(\nabla g_{i}(u))^{T}v>0$ for all $i\in I$ [FH, Chapter 2.2.5]. By discarding those $g_{i}$ that are the zero polynomial, we may assume $g_{i}\neq 0$ for all $i\in I$ . Since $S(\ushort g)$ has nonempty interior, there is then some $x\in S(\ushort g)$ such that $g_{i}(x)>0$ for all $i\in I$ . Set $v:=x-u$ and consider for fixed $i\in I$ the function $h\colon\mathbb{R}\to\mathbb{R},\ t\mapsto g_{i}(u+tv)$ . We have $0=h(0)$ and $h(1)=g_{i}(x)>0$ . Therefore there is $t\in[0,1]$ such that $h^{\prime}(t)>0$ . Because of $h^{\prime\prime}(t)=v^{T}(\operatorname{Hess}g_{i}(u+tv))v\leq 0$ for all $t\in[0,1]$ , this implies $(\nabla g_{i}(u))^{T}v=h^{\prime}(0)>0$ as desired. ∎
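The conclusion of Lemma 3.6 can be checked by hand in the simplest curved case. The following is a minimal sketch, assuming the unit disk $S(g)$ with $g=1-X_{1}^{2}-X_{2}^{2}$ (so $\operatorname{Hess}g=-2\operatorname{I}_{2}\preceq 0$ everywhere) and a linear objective $f=a^{T}x$ with $a\neq 0$ ; the function name and the closed-form minimizer are specific to this example and not taken from the paper.

```python
import math

def kkt_on_unit_disk(a):
    """Minimize f(x) = a . x over the unit disk S(g), g = 1 - |x|^2, for a != 0.

    Returns the minimizer u and the multiplier lam with grad f(u) = lam * grad g(u).
    """
    norm = math.hypot(*a)
    u = tuple(-ai / norm for ai in a)      # minimizer: boundary point opposite to a
    lam = norm / 2                         # solves a = lam * (-2 u)
    return u, lam

a = (3.0, 4.0)
u, lam = kkt_on_unit_disk(a)
grad_g = tuple(-2 * ui for ui in u)        # gradient of g at u
residual = max(abs(ai - lam * gi) for ai, gi in zip(a, grad_g))
```

Here the single active constraint yields the nonnegative multiplier $\lambda=\|a\|/2$ , in line with the lemma.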

4. The main result

In this section, we will prove our main result about the exactness of the Lasserre relaxation. The first step is to get an alternate description of the compact basic closed semialgebraic set $S(\ushort g)$ with nonempty interior. Both descriptions, the original one $\ushort g$ and the alternate one, will be used in the proof of Theorem 4.7. The new description will arise by replacing polynomials $g_{i}$ that are strictly quasiconcave on $S(\ushort g)\cap Z(g_{i})$ by polynomials of the form $h_{i}:=g_{i}h(g_{i})$ with a univariate polynomial $h\in\mathbb{R}[T]$ such that $h\geq 1$ on $\mathbb{R}$ . It will be of utmost importance that $h_{i}\in M(\ushort g)$ , which follows from the fact that $h-1$ and therefore $h$ is an sos polynomial by Proposition 2.2 above. Roughly speaking, the basic idea is that $h_{i}(x)$ will be, up to a positive factor, approximately $1-e^{-cg_{i}(x)}$ for a big constant $c$ when $x$ lies in $S(\ushort g)$ or $x$ lies sufficiently close to $S(\ushort g)$ . The effect of this is that $h_{i}$ will be a polynomial (unfortunately of large degree) that is very close to being a positive constant on the “safe part” of $S(\ushort g)$ consisting of the points in $S(\ushort g)$ that are in “safe distance” to the boundary of $S(\ushort g)$ . On the “safe part” of $S(\ushort g)$ one can hope (and it will turn out from our actual choice of $h$ ) that the Hessian of the $h_{i}$ does not vary too quickly. This will be crucial in the proof of Lemma 4.6 (the interval $J_{3}$ appearing there corresponds to this “safe part”).

In the proof of Lemma 4.5 below, the auxiliary polynomial $h$ will be chosen as $h:=f_{c,d}\in\mathbb{Q}[T]$ for a big real constant $c$ and a large nonnegative even integer $d$ where $f_{c,d}$ is defined in Notation 4.1 below. In [HN1, Lemma 13], Helton and Nie use exactly the same polynomial $f_{c,d}$ except that they do not care about the parity of the degree $d$ . Lemma 4.4 below is an important observation that was probably not known to Helton and Nie. If Helton and Nie had exploited this, they could have sharpened some of their results in [HN1]. However, they would not have come close to our main result Theorem 4.8 which ultimately relies on our new refined and subtle analysis in the proofs of Lemma 4.6 and Theorem 4.7 that focuses on integrals of the Hessian of the $h_{i}$ instead of the Hessians themselves.

Notation 4.1.

For $c>0$ and $d\in\mathbb{N}_{0}$ , we denote by

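$$e_{c,d}:=\sum_{k=0}^{d}\frac{c^{k}}{k!}T^{k}$$

the $d$ -th Taylor polynomial of the function

$$\mathbb{R}\to\mathbb{R},\ t\mapsto\exp(ct)$$

at the origin and we set

$$f_{c,d}:=\sum_{k=0}^{d}\frac{(-c)^{k}}{(k+1)!}T^{k},\qquad\text{so that}\qquad cTf_{c,d}=1-e_{c,d+1}(-T).$$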

For any $p\in\mathbb{R}[T]$ , we denote by $p^{\prime}$ its (formal) derivative (with respect to $T$ ) and by $p^{\prime\prime}=(p^{\prime})^{\prime}$ its second derivative.

Proposition 4.2.

For $c>0$ , we have

[TABLE]

Proof.

Use the chain rule, the product rule and the quotient rule for differentiation. ∎

The following lemma has been given an easy short proof by Speyer [Spe], which we reproduce here for convenience of the reader.

Lemma 4.3 (Speyer).

For $c\in\mathbb{R}_{>0}$ and $d\in\mathbb{N}_{0}$ , we have:

(a) If $d$ is even, then $e_{c,d}(t)>0$ for all $t\in\mathbb{R}$ .

(b) If $d$ is odd, then $e_{c,d}$ is strictly increasing on $\mathbb{R}$ .

Proof.

We fix $c\in\mathbb{R}_{>0}$ and proceed by induction on $d$ . The case $d=0$ is trivial since $e_{c,0}=1>0$ . Suppose the lemma is already proven for $d-1$ instead of $d$ where $d\in\mathbb{N}$ is fixed. First consider the case where $d$ is even. Then by induction hypothesis the odd degree polynomial $e_{c,d-1}$ must have exactly one real root $t_{0}$ . By Proposition 4.2(a) the even degree polynomial $e_{c,d}$ therefore takes its (unique) minimum at $t_{0}$ . To prove the statement, it suffices to observe that

[TABLE]

In the case where $d$ is odd, the statement follows immediately from the induction hypothesis and Proposition 4.2(a). ∎
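Both parts of Lemma 4.3 are easy to probe numerically. The following is a minimal sketch using exact rational arithmetic; it evaluates the Taylor polynomial $e_{c,d}(t)=\sum_{k=0}^{d}(ct)^{k}/k!$ of $t\mapsto\exp(ct)$ on a finite grid, which is of course only a sanity check and not a proof. The function names and the grid are illustrative choices.

```python
from fractions import Fraction
from math import factorial

def taylor_exp(c, d, t):
    """Evaluate e_{c,d}(t) = sum_{k=0}^{d} (c t)^k / k! exactly for rational c, t."""
    c, t = Fraction(c), Fraction(t)
    return sum((c * t) ** k / factorial(k) for k in range(d + 1))

grid = [Fraction(k, 4) for k in range(-80, 81)]   # t in [-20, 20], step 1/4

# (a) even degree: strictly positive on the grid
even_positive = all(taylor_exp(1, d, t) > 0 for d in (0, 2, 4, 6) for t in grid)

# (b) odd degree: strictly increasing along the grid
def increasing(d):
    values = [taylor_exp(1, d, t) for t in grid]
    return all(x < y for x, y in zip(values, values[1:]))

odd_increasing = all(increasing(d) for d in (1, 3, 5))
```

Exact fractions avoid the cancellation errors that floating point arithmetic would introduce for large negative $t$ , where the terms of the sum alternate in sign.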

Lemma 4.4.

Let $c\in\mathbb{R}_{>0}$ and suppose $d\in\mathbb{N}_{0}$ is even. Then $f_{c,d}(t)>0$ for all $t\in\mathbb{R}$ .

Proof.

The leading coefficient of $f_{c,d}$ is $\frac{c^{d}(-1)^{d}}{(d+1)!}>0$ . Therefore it suffices to show that $f_{c,d}$ has no real roots. One easily checks that $f_{c,d}$ has no root at the origin. Assume we have a root $t\in\mathbb{R}$ different from the origin. Then $e_{c,d+1}(-t)=1$ . Observing that $e_{c,d+1}(0)=1$ , it follows from Lemma 4.3(b) that $t=0$ , a contradiction. ∎
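Lemma 4.4 can be probed in the same way. The sketch below assumes the explicit form $f_{c,d}(t)=\sum_{k=0}^{d}(-c)^{k}t^{k}/(k+1)!$ , which is the form consistent with the leading coefficient and the root condition $e_{c,d+1}(-t)=1$ used in the proof above; the function names and the grid are illustrative. It checks positivity for even $d$ on a grid, the identity $ct\,f_{c,d}(t)=1-e_{c,d+1}(-t)$ , and that positivity can fail for odd $d$ .

```python
from fractions import Fraction
from math import factorial

def f_poly(c, d, t):
    """Assumed form f_{c,d}(t) = sum_{k=0}^{d} (-c t)^k / (k+1)!, exact for rationals."""
    c, t = Fraction(c), Fraction(t)
    return sum((-c * t) ** k / factorial(k + 1) for k in range(d + 1))

def taylor_exp(c, d, t):
    """Taylor polynomial e_{c,d}(t) of exp(c t) at the origin."""
    c, t = Fraction(c), Fraction(t)
    return sum((c * t) ** k / factorial(k) for k in range(d + 1))

grid = [Fraction(k, 4) for k in range(-80, 81)]   # t in [-20, 20], step 1/4

# Even degree: positive on the whole grid
positive = all(f_poly(1, d, t) > 0 for d in (0, 2, 4, 6) for t in grid)

# Identity behind the root argument, here with c = 2 and d = 4
identity = all(2 * t * f_poly(2, 4, t) == 1 - taylor_exp(2, 5, -t) for t in grid)

# Odd degree: f_{1,1}(t) = 1 - t/2 is negative at t = 3
odd_fails = f_poly(1, 1, 3) < 0
```

The odd-degree counterexample illustrates why the parity of $d$ matters for obtaining an sos polynomial.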

The following lemma is an improved version of [HN1, Lemma 13]. Most importantly, we manage to get that $h-1$ (defined in this lemma) is an sos polynomial (and in particular $h$ is positive on $\mathbb{R}$ ) instead of just positivity of $h$ on the interval $[0,R]$ . This will come out of Lemma 4.4 and Proposition 2.2 together with the approach we take in the proof, which simply uses Taylor approximations of the exponential function instead of the nonconstructive approximation theory used in [HN1]. The second crucial improvement is the new property (c). A surprising improvement coming out of Lemma 4.3 is that we get in Condition (a) positivity on $\mathbb{R}$ instead of just the positivity on $[0,R]$ that Helton and Nie get. At the moment however, we do not have any application for this. Finally, an insignificant improvement again not used by us is the validity of Condition (b) on the interval $[-R,R]$ instead of the interval $[0,R]$ used by Helton and Nie.

Lemma 4.5.

Let $H,\delta,\varepsilon,R\in\mathbb{R}$ such that $H>0$ and $0<\delta<\varepsilon<R$ . Then there exists a univariate polynomial $h\in\mathbb{R}[T]$ such that

[TABLE]

satisfying the following conditions:

[TABLE]

Proof.

By a scaling argument, we can relax the condition that $h-1$ is sos to the condition that $h-\gamma$ is sos for some $\gamma\in\mathbb{R}_{>0}$ . By Lemma 4.4 and Proposition 2.2, it suffices to find $c\in\mathbb{R}_{>0}$ and $d\in\mathbb{N}_{0}$ even such that (a)–(c) are satisfied for $h:=f_{c,d}\in\mathbb{Q}[T]$ . Noting that

[TABLE]

by Proposition 4.2, this means that we are trying to find $c\in\mathbb{R}_{>0}$ and $d\in\mathbb{N}_{0}$ even with

[TABLE]

Condition (a’) is always satisfied by Lemma 4.3(a) if $d$ is even. Since the functions induced by the polynomials $e_{c,d}$ on the interval $[-R,R]$ converge uniformly to the function $[-R,R]\to\mathbb{R},\ t\mapsto\exp(ct)$ as $d\in\mathbb{N}$ tends to infinity, it suffices to find $c>0$ satisfying

[TABLE]

These conditions can be rewritten as

[TABLE]

Thus it suffices to choose $c>\max\left\{H,\frac{\log H}{\varepsilon-\delta}\right\}$ and $d\in\mathbb{N}_{0}$ even and sufficiently large. ∎
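The final choice of $c$ is elementary and can be sketched as follows. The function name `choose_c` and the `margin` parameter are hypothetical. The comment in the code records an interpretation of what the two lower bounds buy in the exponential limit; this reading is an assumption and is not copied from the conditions of the lemma.

```python
import math

def choose_c(H, delta, eps, margin=1.0):
    """Pick c > max{H, log(H) / (eps - delta)} for H > 0 and 0 < delta < eps."""
    if not (H > 0 and 0 < delta < eps):
        raise ValueError("need H > 0 and 0 < delta < eps")
    return max(H, math.log(H) / (eps - delta)) + margin

H, delta, eps = 50.0, 0.1, 0.2
c = choose_c(H, delta, eps)

# c > H gives c * exp(-c t) > H * exp(-c t) for every t, and
# c * (eps - delta) > log H is the same as exp(-c delta) > H * exp(-c eps).
gap_ok = math.exp(-c * delta) > H * math.exp(-c * eps)
```

For the sample values the first bound dominates, since $\log(50)/0.1\approx 39.1<50$ ; for a thin gap $\varepsilon-\delta$ the second bound takes over.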

The previous result is now used to prove the following key lemma. This key lemma is our “luxury version” of [HN1, Proposition 10] in the work of Helton and Nie. It will be used in this article only with $C:=S(\ushort g)$ (when $S(\ushort g)$ is compact) but for potential future applications we formulate it in greater generality. It has several advantages over [HN1, Proposition 10]. The most important one is that we only require the $g_{i}$ to be strictly quasiconcave on a set that will be very slim in general whereas Helton and Nie assume them to be strictly quasiconcave on the whole of $S(\ushort g)$ . Another important advantage is that the new polynomials $h_{i}$ lie in $M(\ushort g)$ . The only price that we have to pay is that not the Hessian itself but only an integrated version of it satisfies the negative definiteness condition. This will however be enough for the proof of Theorem 4.7 and the Main Theorem 4.8.

Lemma 4.6.

Let $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ and let $C$ be a compact subset of $S(\ushort g)$ such that $g_{i}$ is strictly quasiconcave on $C\cap Z(g_{i})$ for each $i\in\{1,\dots,m\}$ . Then there exists a polynomial $h\in\mathbb{R}[T]$ with $h-1$ an sos polynomial such that $h_{i}:=g_{i}h(g_{i})$ satisfies

[TABLE]

for all $i\in\{1,\dots,m\}$ , $u\in Z(g_{i})$ and $x\in\mathbb{R}^{n}$ with $\{u+s(x-u)\mid 0\leq s\leq 1\}\subseteq C$ .

Proof.

By Lemma 3.5 and the compactness of $C\cap Z(g_{i})$ , we find $\lambda>0$ such that

[TABLE]

satisfies

[TABLE]

for all $i\in\{1,\dots,m\}$ and all $x\in C\cap Z(g_{i})$ . The polynomial $h$ will come out of Lemma 4.5 applied to certain values of $R$ , $H$ , $\varepsilon$ and $\delta$ , which we will now adjust. First of all, we choose $R>0$ such that

[TABLE]

for all $i\in\{1,\dots,m\}$ and $x\in C$ . To get $\varepsilon$ , we observe that the compact set $C$ is contained in the union of the chain consisting of the open sets

[TABLE]

and therefore is contained in those of these sets that belong to a sufficiently small $\varepsilon$ , i.e., there is $\varepsilon$ with $0<\varepsilon<R$ such that

[TABLE]

By compactness, there exists $\xi>0$ such that

[TABLE]

We choose $\delta$ with $0<\delta<\varepsilon$ arbitrary and $d>0$ such that

[TABLE]

for all $x,y\in C$ . The compact subset $C\times C$ of $\mathbb{R}^{2n}$ is contained in the union of the chain consisting of the open sets

[TABLE]

and therefore is contained in those of these sets that belong to a sufficiently small $\sigma$ , i.e., there is $\sigma$ with $0<\sigma\leq d$ such that

[TABLE]

Because $C$ is compact, we can choose $\tau>0$ such that

[TABLE]

for all $x\in C$ and $i\in\{1,\dots,m\}$ . Finally, set

[TABLE]

Choose $h\in\mathbb{R}[T]$ such that $h-1$ is an sos polynomial in $\mathbb{R}[T]$ according to Lemma 4.5 and the chosen values of $H$ , $R$ , $\varepsilon$ and $\delta$ . Fix $i\in\{1,\dots,m\}$ and set $h_{i}:=g_{i}h(g_{i})$ . Using the product and chain rule, we calculate

[TABLE]

and therefore

[TABLE]

Using

[TABLE]

it follows that

[TABLE]

One now recognizes that conditions (a) and (b) from Lemma 4.5 guarantee that

[TABLE]

for all $x\in C$ since $H\geq\lambda$ . Now let $u\in Z(g_{i})$ and $x\in\mathbb{R}^{n}$ with

[TABLE]

It suffices to show

[TABLE]

To this end, we split up the unit interval $[0,1]$ into three disjoint parts

[TABLE]

In particular, each $J_{k}$ is a union of intervals such that $[0,1]=J_{1}\mathbin{\dot{\cup}}J_{2}\mathbin{\dot{\cup}}J_{3}$ . We now analyze the integral in question on each of these parts separately: The integral over $J_{1}$ will contribute a guaranteed amount of positive definiteness, the integral over $J_{2}$ an unknown amount of positive semidefiniteness and the integral over $J_{3}$ will be very small in norm so that it cannot destroy the positive definiteness accumulated over $J_{1}$ . For further use, we set

[TABLE]

Analysis on $J_{1}$ . The subinterval $[0,\frac{\sigma}{d}]$ of $[0,1]$ (note that $\frac{\sigma}{d}\leq 1$ ) is contained in $J_{1}$ since $\|u-(u+s(x-u))\|=s\|x-u\|\leq\frac{\sigma}{d}d=\sigma$ for $s\in[0,\frac{\sigma}{d}]$ and therefore

[TABLE]

for all $s\in[0,\frac{\sigma}{d}]$ by the choice of $\sigma$ (see Property (3) above). By choice of $\xi$ , we have that

[TABLE]

for all $s\in J_{1}$ (in fact also for $s\in J_{2}$ ). By Parts (a) and (c) of Lemma 4.5, we have $(h(g_{i})+g_{i}h^{\prime}(g_{i}))(u+s(x-u))>HM$ for all $s\in J_{1}$ . Hence we get with Property (2) above that

[TABLE]

Analysis on $J_{2}$ . We have of course

[TABLE]

for all $s\in J_{2}$ (in fact also for $s\in J_{1}$ ) and, by Part (a) of Lemma 4.5,

[TABLE]

for all $s\in[0,1]$ . Hence

[TABLE]

Analysis on $J_{3}$ . We have of course $F_{i}(u+s(x-u))\succeq-\|F_{i}(u+s(x-u))\|\operatorname{I}_{n}\succeq-\tau\operatorname{I}_{n}$ for all $s\in[0,1]$ and therefore

[TABLE]

Total analysis. Finally, we get

[TABLE]

∎

Theorem 4.7.

Let $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ such that $S(\ushort g)$ is convex with nonempty interior and $M(\ushort g)$ is Archimedean. Suppose that each $g_{i}$ is strictly quasiconcave on $S(\ushort g)\cap Z(g_{i})$ or $\ushort g$ -sos-concave. Then there is $d\in\mathbb{N}_{0}$ such that for all $f\in\mathbb{R}[\ushort X]_{1}$ with $f\geq 0$ on $S(\ushort g)$ we have $f\in M_{d}(\ushort g)$ .

Proof.

Choose $I$ and $J$ such that $\{1,\dots,m\}=I\mathbin{\dot{\cup}}J$ , $g_{i}$ is strictly quasiconcave on $S(\ushort g)\cap Z(g_{i})$ for $i\in I$ and $g_{j}$ is $\ushort g$ -sos-concave for $j\in J$ . Applying Lemma 4.6 with $(g_{i})_{i\in I}$ instead of $\ushort g$ and the compact subset $C:=S(\ushort g)$ of $S((g_{i})_{i\in I})$ , we get for each $i\in I$ a polynomial

[TABLE]

satisfying $S(h_{i})=S(g_{i})$ , $Z(h_{i})=Z(g_{i})$ and

[TABLE]

for all $u\in S(\ushort g)\cap Z(g_{i})$ and $x\in S(\ushort g)$ . Setting here $x=u$ , we obtain in particular

[TABLE]

for each $i\in I$ . Set

[TABLE]

for all $j\in J$ . Then

[TABLE]

Choose $d_{1}\in\mathbb{N}_{0}$ such that

[TABLE]

for all $i\in I\cup J$ . Define for all $i\in I\cup J$ and $u\in\mathbb{R}^{n}$ a symmetric matrix polynomial $H_{i,u}\in\mathbb{R}[\ushort X]^{n\times n}$ by

[TABLE]

for all $x\in\mathbb{R}^{n}$ . Applying compactness of $S(\ushort g)\cap Z(g_{i})$ , $S(\ushort g)$ and the unit sphere in $\mathbb{R}^{n}$ together with continuity, we find $\delta>0$ such that

[TABLE]

for all $i\in I,u\in S(\ushort g)\cap Z(g_{i})$ and $x\in S(\ushort g)$ . For each $t\in[0,1]$ , we apply this to $u+t(x-u)\in S(\ushort g)$ instead of $x$ to get

[TABLE]

for all $i\in I,u\in S(\ushort g)\cap Z(g_{i})$ and $x\in S(\ushort g)$ . Thus

[TABLE]

for all $i\in I$ , $u\in S(\ushort g)\cap Z(g_{i})$ and $x\in S(\ushort g)$ . Again using the compactness of $S(\ushort g)\cap Z(g_{i})$ and continuity, we find some $E>0$ such that

[TABLE]

for all $i\in I$ and $u\in S(\ushort g)\cap Z(g_{i})$ . Theorem 2.11 yields $d_{2}\in\mathbb{N}$ such that

[TABLE]

for all $i\in I$ and $u\in S(\ushort g)\cap Z(g_{i})$ . Lemma 2.12 yields $d_{3}\in\mathbb{N}$ such that

[TABLE]

for all $j\in J$ and $u\in\mathbb{R}^{n}$ . For later use, set

[TABLE]

Now let $f\in\mathbb{R}[\ushort X]_{1}$ with $f\geq 0$ on $S(\ushort g)$ . Since $S(\ushort g)$ is nonempty and compact, we can define $c$ as the minimum of $f$ on $S(\ushort g)$ . Replacing $f$ by $f-c$ , we can suppose without loss of generality that $c=0$ . Then there is some $u\in S(\ushort g)$ with

[TABLE]

Consider

[TABLE]

Because of $\operatorname{Hess}h_{i}(u)\prec 0$ (see Property (4)) and continuity, we get a neighborhood $U$ of $u$ such that

[TABLE]

for all $i\in I\cap K$ . Since each $h_{j}=g_{j}$ with $j\in J$ is $\ushort g$ -sos-concave, we have on the other hand

[TABLE]

for all $j\in J$ . Combining both, we have in particular that

[TABLE]

for all $k\in K$ . Applying Lemma 3.6, we get a family $(\lambda_{k})_{k\in K}$ of nonnegative Lagrange multipliers such that $\nabla f=\sum_{k\in K}\lambda_{k}\nabla h_{k}(u)$ (recall that $f$ is linear) and thus

[TABLE]

Fix now $x\in\mathbb{R}^{n}$ . For the map

[TABLE]

we have $h(0)=0$ , $h^{\prime}(0)=0$ and

[TABLE]

for $s\in\mathbb{R}$ . Hence

[TABLE]

Since $x\in\mathbb{R}^{n}$ was arbitrary, we thus have

[TABLE]

and thus $f\in M_{d}(\ushort g)$ . ∎

Note that it is essential in the previous theorem to require $f$ to be linear. It is not even enough to require $f$ to be globally convex of small bounded degree [KL].

Main Theorem 4.8.

Let $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ such that $S(\ushort g)$ is convex with nonempty interior and $M(\ushort g)$ is Archimedean. Suppose that each $g_{i}$ is strictly quasiconcave on $S(\ushort g)\cap Z(g_{i})$ or $\ushort g$ -sos-concave. Then $\ushort g$ has an exact Lasserre relaxation.

Proof.

Directly from Theorem 4.7 by the trivial direction of Proposition 2.13. ∎
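For the simplest instance, the unit disk with $g=1-X^{2}-Y^{2}$ , the kind of certificate asserted by Theorem 4.7 can be written down explicitly. The following minimal sketch checks numerically that, for a unit vector $a$ , the tangent functional $f=1-a_{1}X-a_{2}Y$ equals $\sigma_{0}+\sigma_{1}g$ with $\sigma_{0}=\frac{1}{2}\left((a_{1}-X)^{2}+(a_{2}-Y)^{2}\right)$ and $\sigma_{1}=\frac{1}{2}$ . On the usual reading of Definition 2.10, which is an assumption here, this places $f$ in $M_{2}(g)$ . The example and all names in it are illustrative and not taken from the paper.

```python
import random

def g(x, y):
    """Defining polynomial of the unit disk S(g)."""
    return 1 - x * x - y * y

def certificate(a, x, y):
    """Evaluate sigma_0 + sigma_1 * g with
    sigma_0 = ((a1 - x)^2 + (a2 - y)^2) / 2, a sum of squares of degree 2,
    and sigma_1 = 1/2, a nonnegative constant."""
    a1, a2 = a
    sigma0 = ((a1 - x) ** 2 + (a2 - y) ** 2) / 2
    return sigma0 + 0.5 * g(x, y)

def f(a, x, y):
    """Linear polynomial 1 - a . (x, y), nonnegative on the disk when |a| = 1."""
    return 1 - a[0] * x - a[1] * y

random.seed(0)
a = (0.6, 0.8)                      # unit vector
points = [(random.uniform(-3, 3), random.uniform(-3, 3)) for _ in range(1000)]
max_error = max(abs(f(a, x, y) - certificate(a, x, y)) for x, y in points)
```

The identity holds on all of $\mathbb{R}^{2}$ , not just on the disk, which is why the sample points are drawn from a larger box.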

In the situation of this theorem, now drop the convexity assumption and consequently ask whether the convex hull of $S(\ushort g)$ (instead of $S(\ushort g)$ itself) equals $S_{d}(\ushort g)$ for large $d$ . Helton and Nie proved that in this situation the convex hull of $S(\ushort g)$ is semidefinitely representable [HN2, Theorem 4.4]. The question arises if it even equals $S_{d}(\ushort g)$ for large $d$ . This will be proven in our forthcoming paper [KS] if all $g_{i}$ are strictly quasiconcave on $S(\ushort g)\cap Z(g_{i})$ . However, Example 4.10 below shows that in this case, one cannot allow that some of the $g_{i}$ are linear (or even sos-concave) instead. To prove this, we need the following important criterion from [GN, Proposition 4.1].

Theorem 4.9 (Gouveia and Netzer).

Suppose $\ushort g:=(g_{1},\dots,g_{m})\in\mathbb{R}[\ushort X]^{m}$ , $L\subseteq\mathbb{R}^{n}$ is a straight line in $\mathbb{R}^{n}$ , $S(\ushort g)\cap L$ has nonempty interior in $L$ and $u\in S(\ushort g)$ is an element of the boundary of $\overline{\operatorname{conv}(S(\ushort g))}\cap L$ in $L$ . Suppose that for each $i$ with $g_{i}(u)=0$ , $\nabla g_{i}(u)$ is orthogonal to $L$ . Then $S_{d}(\ushort g)$ strictly contains the closure of the convex hull of $S(\ushort g)$ for all $d$ .

Example 4.10.

Let $n:=2$ , write $X,Y$ for $X_{1},X_{2}$ and consider $\ushort g:=(g_{1},g_{2})$ with

[TABLE]

We see that $S(g_{1})$ is the disjoint union of two closed disks of different radii. The affine half plane $S(g_{2})$ cuts out a piece from the bigger disk and its boundary line $L:=\left(\begin{smallmatrix}0\\ 1\end{smallmatrix}\right)+\mathbb{R}\left(\begin{smallmatrix}1\\ 0\end{smallmatrix}\right)$ is tangent to the smaller disk. Since $S(g_{1})$ is compact, $M(\ushort g)$ is Archimedean by Theorem 2.8(b). By Proposition 3.4(b), $g_{1}$ is strictly quasiconcave on $S(\ushort g)\cap Z(g_{1})$ . The line $L$ is tangent to the smaller disk at the point $\left(\begin{smallmatrix}0\\ 1\end{smallmatrix}\right)$ and passes through the interior of the larger disk. By Theorem 4.9 of Gouveia and Netzer applied with $u:=\left(\begin{smallmatrix}0\\ 1\end{smallmatrix}\right)$ , $S_{d}(\ushort g)$ strictly contains the convex hull of $S(\ushort g)$ for all $d$ . By inspection of the proof of Gouveia and Netzer, we see more precisely that each $S_{d}(\ushort g)$ contains a left neighborhood of $u$ inside $L$ .

Acknowledgments

The authors would like to thank all three anonymous referees for their thorough reading that helped to improve the presentation of the material.

Bibliography


[BEFB] S. Boyd, L. El Ghaoui, E. Feron, V. Balakrishnan: Linear matrix inequalities in system and control theory, SIAM Studies in Applied Mathematics 15, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 1994
[FH] W. Forst, D. Hoffmann: Optimization – theory and practice, Springer Undergraduate Texts in Mathematics and Technology, Springer, New York, 2010
[Gho] M. Ghomi: Optimal smoothing for convex polytopes, Bull. London Math. Soc. 36 (2004), no. 4, 483–492
[GN] J. Gouveia, T. Netzer: Positive polynomials and projections of spectrahedra, SIAM J. Optim. 21 (2011), no. 3, 960–976
[HN1] J.W. Helton, J. Nie: Semidefinite representation of convex sets, Math. Program. 122 (2010), no. 1, Ser. A, 21–64
[HN2] J.W. Helton, J. Nie: Sufficient and necessary conditions for semidefinite representability of convex hulls and sets, SIAM J. Optim. 20 (2009), no. 2, 759–791 [this article is a continuation of [HN1] although it appeared earlier]
[KL] E. de Klerk, M. Laurent: On the Lasserre hierarchy of semidefinite programming relaxations of convex polynomial optimization problems, SIAM J. Optim. 21 (2011), no. 3, 824–832
[Kri] T. Kriel: A new proof for the existence of degree bounds for Putinar’s Positivstellensatz, Ordered algebraic structures and related topics, 203–209, Contemp. Math., 697, Amer. Math. Soc., Providence, RI, 2017
