The convex hull of a quadratic constraint over a polytope

Asteroide Santana; Santanu S. Dey

arXiv:1812.10160·math.OC·December 27, 2018·SIAM J. Optim.

The convex hull of a quadratic constraint over a polytope

Asteroide Santana, Santanu S. Dey

PDF

TL;DR

This paper proves that the convex hull of a quadratic constraint intersected with a bounded polyhedron can be represented using second-order cones, aiding in solving non-convex QCQPs more effectively.

Contribution

It establishes that the convex hull of a quadratic constraint over a polytope is second-order cone representable, providing a constructive proof for this key convexification result.

Findings

01

Convex hull is second-order cone representable.

02

Constructive proof of the convex hull characterization.

03

Facilitates convex relaxation of non-convex QCQPs.

Abstract

A quadratically constrained quadratic program (QCQP) is an optimization problem in which the objective function is a quadratic function and the feasible region is defined by quadratic constraints. Solving non-convex QCQP to global optimality is a well-known NP-hard problem and a traditional approach is to use convex relaxations and branch-and-bound algorithms. This paper makes a contribution in this direction by showing that the exact convex hull of a general quadratic equation intersected with any bounded polyhedron is second-order cone representable. We present a simple constructive proof of this result.

Equations64

S := {x \in R^{n} ∣ x^{⊤} Q x + α^{⊤} x = g, x \in P},

S := {x \in R^{n} ∣ x^{⊤} Q x + α^{⊤} x = g, x \in P},

S \supseteq i = 1 ⋃ k B^{i} \supseteq extr (S) .

S \supseteq i = 1 ⋃ k B^{i} \supseteq extr (S) .

conv (S) \supseteq conv (i = 1 ⋃ k B^{i}) \supseteq conv (extr (S)) = conv (S),

conv (S) \supseteq conv (i = 1 ⋃ k B^{i}) \supseteq conv (extr (S)) = conv (S),

conv (S) = conv (i = 1 ⋃ k B^{i}) = conv (i = 1 ⋃ k conv (B^{i})) .

conv (S) = conv (i = 1 ⋃ k B^{i}) = conv (i = 1 ⋃ k conv (B^{i})) .

conv (i = 1 ⋃ k conv (B^{i})),

conv (i = 1 ⋃ k conv (B^{i})),

conv (F (S)) = F (conv (S)),

conv (F (S)) = F (conv (S)),

S := V^{- 1} ({w ∣ w^{⊤} Σ w + α^{⊤} V^{- 1} w = d, w \in \tilde{P}}),

S := V^{- 1} ({w ∣ w^{⊤} Σ w + α^{⊤} V^{- 1} w = d, w \in \tilde{P}}),

S := {(x, y, z) \in R^{n} ∣ i = 1 \sum n_{q} a_{i} x_{i}^{2} + i = 1 \sum n_{q} α_{i} x_{i} + j = 1 \sum n_{l} β_{j} y_{j} = g, (x, y, z) \in P},

S := {(x, y, z) \in R^{n} ∣ i = 1 \sum n_{q} a_{i} x_{i}^{2} + i = 1 \sum n_{q} α_{i} x_{i} + j = 1 \sum n_{l} β_{j} y_{j} = g, (x, y, z) \in P},

S := {(x, y, z) \in R^{n} ∣ i = 1 \sum n_{q} σ (a_{i}) (∣ a_{i} ∣ x_{i} + σ (a_{i}) \frac{α _{i}}{2 ∣ a _{i} ∣})^{2} + i = 1 \sum n_{l} β_{i} y_{i} = g + i = 1 \sum n_{q} \frac{α _{i}^{2}}{4 a _{i}}, (x, y, z) \in P},

S := {(x, y, z) \in R^{n} ∣ i = 1 \sum n_{q} σ (a_{i}) (∣ a_{i} ∣ x_{i} + σ (a_{i}) \frac{α _{i}}{2 ∣ a _{i} ∣})^{2} + i = 1 \sum n_{l} β_{i} y_{i} = g + i = 1 \sum n_{q} \frac{α _{i}^{2}}{4 a _{i}}, (x, y, z) \in P},

S := {(w, x, y, z) \in R^{n_{q +}} \times R^{n_{q -}} \times R^{n_{l}} \times R^{n_{o}} ∣ i = 1 \sum n_{q +} w_{i}^{2} - j = 1 \sum n_{q -} x_{j}^{2} + k = 1 \sum n_{l} y_{k} = g, (w, x, y, z) \in P},

S := {(w, x, y, z) \in R^{n_{q +}} \times R^{n_{q -}} \times R^{n_{l}} \times R^{n_{o}} ∣ i = 1 \sum n_{q +} w_{i}^{2} - j = 1 \sum n_{q -} x_{j}^{2} + k = 1 \sum n_{l} y_{k} = g, (w, x, y, z) \in P},

conv (G) = {(x, w) \in R^{n_{1}} \times R^{n_{2}} ∣ x \in conv (G_{0}), w = C^{⊤} x + h} .

conv (G) = {(x, w) \in R^{n_{1}} \times R^{n_{2}} ∣ x \in conv (G_{0}), w = C^{⊤} x + h} .

x_{B} = C x_{N} + h .

x_{B} = C x_{N} + h .

[x_{B}^{⊤} x_{N}^{⊤}] [Q_{B B} Q_{N B} Q_{B N} Q_{N N}] [x_{B} x_{N}] + α^{⊤} [x_{B} x_{N}] = g .

[x_{B}^{⊤} x_{N}^{⊤}] [Q_{B B} Q_{N B} Q_{B N} Q_{N N}] [x_{B} x_{N}] + α^{⊤} [x_{B} x_{N}] = g .

x_{N}^{⊤} \tilde{Q} x_{N} + \tilde{α}^{⊤} x_{N} = \tilde{g},

x_{N}^{⊤} \tilde{Q} x_{N} + \tilde{α}^{⊤} x_{N} = \tilde{g},

\tilde{Q}

\tilde{Q}

\tilde{α}

\tilde{g}

S := {(x_{B}, x_{N}) \in R^{n} ∣ x_{N}^{⊤} \tilde{Q} x_{N} + \tilde{α}^{⊤} x_{N} = \tilde{g}, x_{N} \in \tilde{P}, x_{B} = C x_{N} + h},

S := {(x_{B}, x_{N}) \in R^{n} ∣ x_{N}^{⊤} \tilde{Q} x_{N} + \tilde{α}^{⊤} x_{N} = \tilde{g}, x_{N} \in \tilde{P}, x_{B} = C x_{N} + h},

i = 1 \sum n_{q +} a_{i}^{2} = g + j = 1 \sum n_{q -} b_{j}^{2} \geq b_{1}^{2} \Leftrightarrow \frac{∣ b _{1} ∣}{∥ a ∥ _{2}} \leq 1.

i = 1 \sum n_{q +} a_{i}^{2} = g + j = 1 \sum n_{q -} b_{j}^{2} \geq b_{1}^{2} \Leftrightarrow \frac{∣ b _{1} ∣}{∥ a ∥ _{2}} \leq 1.

g

g

\Leftrightarrow g

\Leftrightarrow 0

\Leftrightarrow

i = 1 \sum n_{q +} u_{i}^{2} = 1, i = 1 \sum n_{q +} a_{i} u_{i} = b_{1} .

i = 1 \sum n_{q +} u_{i}^{2} = 1, i = 1 \sum n_{q +} a_{i} u_{i} = b_{1} .

(first n_{q +} components \pm λ, 0, \dots, 0, second n_{q -} components \pm λ, 0, \dots, 0)

(first n_{q +} components \pm λ, 0, \dots, 0, second n_{q -} components \pm λ, 0, \dots, 0)

conv ({G \cap {x ∣ f (x) = 0}}) = conv ({G \cap {x ∣ f (x) \leq 0}}) \cap conv ({G \cap {x ∣ f (x) \geq 0}}) .

conv ({G \cap {x ∣ f (x) = 0}}) = conv ({G \cap {x ∣ f (x) \leq 0}}) \cap conv ({G \cap {x ∣ f (x) \geq 0}}) .

S^{'} := {(w, y) \in P ∣ ∥2 w_{1}, \dots, 2 w_{n_{q +}}, (g - y - 1) ∥ \leq (g - y + 1)},

S^{'} := {(w, y) \in P ∣ ∥2 w_{1}, \dots, 2 w_{n_{q +}}, (g - y - 1) ∥ \leq (g - y + 1)},

S^{''} := {(w, y) \in P ∣ ∥2 w_{1}, \dots, 2 w_{n_{q +}}, (g - y - 1) ∥ \geq (g - y + 1)} .

S^{'} := {(x, y) \in P ∣ ∥2 x_{1}, \dots, 2 x_{n_{q -}}, (y - g - 1) ∥ \leq (y - g + 1)},

S^{'} := {(x, y) \in P ∣ ∥2 x_{1}, \dots, 2 x_{n_{q -}}, (y - g - 1) ∥ \leq (y - g + 1)},

S^{''} := {(x, y) \in P ∣ ∥2 x_{1}, \dots, 2 x_{n_{q -}}, (y - g - 1) ∥ \geq (y - g + 1) .}

S^{'} := {(w, x) \in R^{1} \times R^{n_{q -}} ∣ w^{2} \geq g + j = 1 \sum n_{q -} x_{j}^{2}, (w, x) \in P},

S^{'} := {(w, x) \in R^{1} \times R^{n_{q -}} ∣ w^{2} \geq g + j = 1 \sum n_{q -} x_{j}^{2}, (w, x) \in P},

S^{''} := {(w, x) \in R^{1} \times R^{n_{q -}} ∣ w^{2} \leq g + j = 1 \sum n_{q -} x_{j}^{2}, (w, x) \in P} .

S_{+}^{'}

S_{+}^{'}

= Proj_{w, x} ⎩ ⎨ ⎧ (w, x, t) \in R^{1} \times R^{n_{q -}} \times R ∣ w \geq ((g t)^{2} + j = 1 \sum n_{q -} x_{j}^{2})^{\frac{1}{2}}, x \geq 0, t = 1, (w, x) \in P ⎭ ⎬ ⎫,

S_{-}^{'}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\useunder

\ul

The convex hull of a quadratic constraint over a polytope

Asteroide Santana [email protected] School of Industrial and Systems Engineering, Georgia Institute of Technology

Santanu S. Dey [email protected] School of Industrial and Systems Engineering, Georgia Institute of Technology

Abstract

A quadratically constrained quadratic program (QCQP) is an optimization problem in which the objective function is a quadratic function and the feasible region is defined by quadratic constraints. Solving non-convex QCQP to global optimality is a well-known NP-hard problem and a traditional approach is to use convex relaxations and branch-and-bound algorithms. This paper makes a contribution in this direction by showing that the exact convex hull of a general quadratic equation intersected with any bounded polyhedron is second-order cone representable. We present a simple constructive proof of this result.

1 Introduction

A quadratically constrained quadratic program (QCQP) is an optimization problem in which the objective function is a quadratic function and the feasible region is defined by quadratic constraints. A variety of complex systems can be cast as an instance of a QCQP. Combinatorial problems like MAXCUT [24], engineering problems such as signal processing [23, 30], chemical process [28, 40, 4, 19, 26, 55] and power engineering problems such as the optimal power flow [11, 34, 15, 31] are just a few examples.

Solving non-convex QCQP to global optimality is a well-know NP-hard problem and a traditional approach is to use spacial branch-and-bound tree based algorithm. The computational success of any branch-and-bound tree based algorithm depends on the convexification scheme used at each node of the tree. Not surprisingly, there has been a lot of research on deriving strong convex relaxations for general-purpose QCQPs. The most common relaxations found in the literature are based on Linear programming (LP), second order cone programing (SOCP) or semi-definite programming (SDP). Reformulation-linearization technique (RLT) [48, 50] is a LP-based hierarchy, Lasserre hierarchy or the sum-of-square hierarchy [33] is a SDP-based hierarchy which exactly solves QCQPs under some minor technical conditions and, recently, new LP and SOCP-based alternatives to sum of squares optimization have also been proposed [2]. While SDP relaxations are know to be strong, they don’t always scale very well computationally. SOCP relaxations tend to be more computationally attractive, although they are often derived by further relaxing SDP relaxations [14].

Another direction of research focuses on convexification of functions, with the McCormick relaxation [37] being perhaps the most classic example. In this case, a constraint of the form $f(x)=b$ is replaced with $\breve{f}(x)\leq b$ and $\hat{f}(x)\geq b$ , where $\breve{f}$ is a convex lower approximation and $\hat{f}$ is a concave upper approximation of $f$ . While there have been a lot of work in function convexification (see for instance [3, 49, 5, 46, 35, 10, 38, 6, 8, 7, 41, 18, 47, 45, 39, 55, 56, 36, 12, 16, 1, 27, 51]) it is well-known that it does not necessarily yield the convex hull of the set $\{x\,|\,f(x)=b\}$ . To the best of our knowledge, there have been much less work on explicit convexification of sets: [54, 42, 43, 53, 25, 32, 44, 17, 34, 13].

A related question when studying convex relaxations is that of representability of the exact convex hull of the feasible set: Is it LP, SOCP or SDP representable? In [20], we prove that the convex hull of the so-called bipartite bilinear constraint (which is a special case of a quadratic constraint) intersected with a box constraint is SOCP representable (SOCr). The proof yields a procedure to compute this convex hull exactly. Encouraging computational results are also reported in [20] in terms of obtaining dual bounds using this construction, which significantly outperform SDP and McCormick relaxations and also bounds produced by commercial solvers.

2 Our result

For an integer $t\geq 1$ , we use $[t]$ to describe the set $\{1,\dots,t\}$ . For a set $G\subseteq\mathbb{R}^{n}$ , we use $\textup{conv}(G)$ , $\textup{extr}(G)$ to denote the convex hull of $G$ and the set of extreme points of $G$ respectively.

In this paper, we generalize one of the main result in [20]. Specifically, we show that the convex hull of a general quadratic equation intersected with any bounded polyhedron is SOCr. Moreover the proof is constructive, therefore adding to the literature on explicit convexification in the context of QCQPs. The formal result is as following:

Theorem 1.

Let

[TABLE]

where $Q\in\mathbb{R}^{n\times n}$ is a symmetric matrix, $\alpha\in\mathbb{R}^{n}$ , $g\in\mathbb{R}$ and $P:=\{x\,|\,Ax\leq b\}$ is a polytope. Then $\textup{conv}(S)$ is SOCr.

Notice that we make no assumption regarding the structure or coefficients of the quadratic equation defining $S$ . We require $P$ to be a bounded polyhedron, which is not very restrictive given that in global optimization the variables are often assumed to be bounded to use branch-and-bound algorithms.

The result presented in Theorem 1 is somewhat unexpected since the sum-of-squares approach would build a sequence of SDP relaxations for (1) in order to optimize (exactly) a linear function over $S$ , while even the SDP cone of thre-by-three dimensional matrices is not SOCr [22]. Note that optimizing a linear function over $S$ is NP-hard, therefore, while the convex hull is SOCr, the construction involves the introduction of an exponential number of variables.

Surprisingly, the proof of Theorem 1 is fairly straightforward and it introduces a technique (new, to the best of our knowledge) to compute convex hull of certain surfaces over a compact set. In the case of Theorem 1, the key observation is that the surface defined by the quadratic equation either:

is defined as the union of two convex surfaces (see Figure 2); or 2. 2.

it has the property that, through every point of the surface, there exists a straight line that is entirely contained in the surface (see Figure 2).

In Case 1, we can easily obtain that the convex hull of $S$ is SOCr as we show in Section 3.3. In Case 2, no point in the interior of the polytope can be an extreme point of $S$ . Observing that the convex hull of a compact set is also the convex hull of its extreme points, we intersect the surface with each facet of the polytope which will contain all the extreme points of $S$ . Now, each such intersection leads to new sets with the same form as $S$ but in one dimension lower. The argument then goes by recursion. The details of the proof are presented in Section 3.

After we had proved Theorem 1, we learned that the property described in Case 2 is known as “ruled surfaces” and it has been extensively studied from both algebraic and geometric perspectives [21]. To the best of our knowledge, however, no one from the global optimization community has ever exploited such results for convexification.

3 Proof of Theorem 1

3.1 Convex hulls via disjunctions

In this section, we describe a simple procedure to obtain the convex hull of a compact set $S$ using a disjunctive argument. We use this procedure to prove Theorem 1 in Section 3.3. Let $S$ be a compact set and let $\textup{extr}(S)$ be the set of extreme points of $S$ . First, we partition the extreme points of $S$ . Specifically, suppose there exist $B^{1},\dots,B^{k}\subseteq S$ such that:

[TABLE]

We observe that (2) implies that

[TABLE]

where the last equality holds due to $S$ being compact. Finally, we obtain that

[TABLE]

Observation 1.

If $\textup{conv}(B^{i})$ is SOCr for all $i\in[k]$ , then the set

[TABLE]

is SOCr [9]. Thus, we obtain from (4) that $\textup{conv}(S)$ is SOCr. In addition, we obtain a constructive procedure to compute $\textup{conv}(S)$ .

3.2 Reduction

In this section, we discuss how we can apply some transformations to the set $S$ defined in (1) so as to re-write it in a “canonical” form where all the quadratic terms are squared terms. This will allows us to easily classify $S$ into Case 1 and 2 as discussed in Section 2. We start with the following observation.

Observation 2.

Let $S\subseteq\mathbb{R}^{n}$ and let $F:\mathbb{R}^{n}\rightarrow\mathbb{R}^{n}$ be an affine map. Then

[TABLE]

where $F(S):=\{Fx\,|\,x\in S\}$ . Furthermore if $\textup{conv}(S)$ is SOCr, then $\textup{conv}(F(S))$ is also SOCr.

Let $S$ be the set defined in (1). Suppose, without loss of generality, that $Q$ is a symmetric matrix. By the spectral theorem $Q=V^{\top}\Sigma V$ , where $\Sigma$ is a diagonal matrix and the columns of $V$ are a set of orthogonal vectors. Letting $w=Vx,$ we have that

[TABLE]

where $\tilde{P}:=\{w\,|\,AV^{-1}w\leq b\}$ .

Therefore, by Observation 2, it is sufficient to study the convex hull of a set of the form:

[TABLE]

where $z\in\mathbb{R}^{n_{o}}$ does not appear in the quadratic constraints, $n_{q}+n_{l}+n_{o}=n$ , $a_{i}\neq 0$ for $i\in[n_{q}]$ (i.e., the rank of $Q$ is $n_{q}$ ) and $\beta_{j}\neq 0$ for $j\in[n_{l}]$ . By completing squares, we may further write $S$ as:

[TABLE]

where $\sigma(a)$ denotes the sign of $a$ . Now, since $u_{i}=\left(\sqrt{|a_{i}|}x_{i}+\sigma(a_{i})\frac{\alpha_{i}}{2\sqrt{|a_{i}|}}\right)$ for $i\in[n_{q}]$ and $v_{i}=\beta_{i}y_{i}$ for $i\in[n_{l}]$ define linear bijections, it follows from Observation 2 that it is sufficient to study the convex hull of the following set:

[TABLE]

where we may further assume that $g\geq 0$ , since otherwise we may multiply the equation by $-1$ and apply suitable affine transformations to bring it back to the form of (5).

3.3 Recursive argument to prove Theorem 1

We begin by stating a variant of Observation 2 that we will use twice along the proof.

Lemma 1.

Let $G=\{(x,w)\in\mathbb{R}^{n_{1}}\times\mathbb{R}^{n_{2}}\,|\,x\in G_{0},\ w=C^{\top}x+h\}$ , where $G_{0}\subseteq\operatorname*{\mathbb{R}}^{n_{1}}$ is bounded, and $C^{\top}x+h$ is an affine function of $x$ . Then,

[TABLE]

Proof.

See Lemma 4 in [20]. ∎

3.3.1 Dealing with low dimensional polytope

Let $S$ and $P$ be defined as in (1). Next, we show that we may assume without loss of generality that $P$ is full dimension. In fact, if $P$ is not full dimensional, then $P$ is contained in a non-trivial affine subspace defined by a system of linear equations $Mx=f$ . Without loss of generality, we may assume that $M$ has full row-rank $k$ , $1\leq k<n$ . Let $M=\begin{bmatrix}M_{B}&M_{N}\end{bmatrix}$ where $M_{B}$ is invertible. Then, we may write this system as $x_{B}=-M^{-1}_{B}M_{N}x_{N}+M^{-1}_{B}f$ , where $x_{B}\in\mathbb{R}^{k},\ x_{N}\in\mathbb{R}^{n-k}$ and, for simplicity, we assume that $x_{B}$ (resp. $x_{N}$ ) correspond to the first $k$ (resp. last $n-k$ ) components of $x$ . By defining $C=-M^{-1}_{B}M_{N}$ and $h=M^{-1}_{B}f$ to simplify notation, we obtain

[TABLE]

By partitioning $Q$ in sub-matrices of appropriate sizes, we may explicitly write the quadratic equation defining $S$ in terms of $x_{B}$ and $x_{N}$ as follows:

[TABLE]

Using (6), we replace $x_{B}$ in (7) to obtain

[TABLE]

where

[TABLE]

Therefore, we may write $S$ as

[TABLE]

where $\tilde{P}$ is now a full dimensional polytope. Therefore, by Lemma 1, we may assume from now on that $P$ is full dimensional.

3.3.2 Case 2: Sufficient conditions for points to not be extreme

Consider the set $S$ as defined in (5).

Lemma 2.

Suppose $n_{o}\geq 1$ . If $(a,b,c,d)\in S\cap(\mathbb{R}^{n_{q+}}\times\mathbb{R}^{n_{q-}}\times\mathbb{R}^{n_{l}}\times\mathbb{R}^{n_{o}})$ where $(a,b,c,d)\in\textup{int}(P)$ , then $(a,b,c,d)$ is not an extreme point of $S$ .

Proof.

Since $(a,b,c,d)\in\textup{int}(P)$ , there exists a vector $\delta\in\mathbb{R}^{n_{o}}\setminus\{0\}$ such that $(a,b,c,d+\delta),(a,b,c,d-\delta)\in P$ . Clearly these points are in $S$ as well and, therefore, $(a,b,c,d)$ is not an extreme point of $S$ ∎

Lemma 3.

Suppose $n_{o}=0$ and $n_{l}\geq 2$ . If $(a,b,c)\in S\cap(\mathbb{R}^{n_{q+}}\times\mathbb{R}^{n_{q-}}\times\mathbb{R}^{n_{l}})$ where $(a,b,c)\in\textup{int}(P)$ , then $(a,b,c)$ is not an extreme point of $S$ .

Proof.

Since $n_{l}\geq 2$ , $(a,b,c_{1}\pm\lambda,c_{2}\mp\lambda,\dots,c_{n_{3}})$ are feasible for sufficiently small positive values of $\lambda$ . Therefore, $(a,b,c)$ is not an extreme point. ∎

Lemma 4.

Suppose $n_{o}=0$ , $n_{q+},n_{q-}\geq 1$ and $n_{l}=1$ . If $(a,b,c)\in S\cap(\mathbb{R}^{n_{q+}}\times\mathbb{R}^{n_{q-}}\times\mathbb{R}^{n_{l}})$ where $(a,b,c)\in\textup{int}(P)$ , then $(a,b,c)$ is not an extreme point of $S$ .

Proof.

Since $n_{q+},n_{q-}\geq 1$ , and $n_{l}=1$ , $(a_{1}+\lambda,a_{2},\dots,a_{n_{q+}},b_{1}+\lambda,b_{2},\dots,b_{n_{q-}},c+2\lambda(-a_{1}+b_{1})$ are feasible for sufficiently small positive and negative values of $\lambda$ . Therefore, $(a,b,c)$ is not an extreme point. ∎

Lemma 5.

Suppose $n_{o}=0$ , $n_{q+}\geq 2$ , $n_{q-}\geq 1$ and $n_{l}=0$ . If $(a,b)\in S\cap(\mathbb{R}^{n_{q+}}\times\mathbb{R}^{n_{q-}})$ where $(a,b)\in\textup{int}(P)$ , then $(a,b)$ is not an extreme point of $S$ .

Proof.

We show that there exists a straight line through $(a,b)$ that is entirely contained in the surface defined by the quadratic equation. More specifically, we prove that there exists a vector $(u,v)\in(\mathbb{R}^{n_{q+}}\times\mathbb{R}^{n_{q-}})\setminus\{0\}$ such that the line $\{(a,b)+\lambda(u,v)\,|\,\lambda\in\mathbb{R}\}$ satisfies the quadratic equation and therefore, $(a,b)$ being in the interior of $P$ cannot be an extreme point of $S$ . We consider two cases:

$(a,b)\neq\textbf{0}$ : Then observe that $a\neq\textbf{0}$ , since otherwise we would have $a=\textbf{0}$ and $b=\textbf{0}$ , because $g\geq 0$ . Observe that

[TABLE]

Next, observe that:

[TABLE]

Suppose we set $v_{1}=1$ and $v_{j}=0$ for all $j\in\{2,\dots,n_{q-}\}$ . Then satisfying (10) is equivalent to finding real values of $u$ satisfying:

[TABLE]

This is the intersection of a circle of radius $1$ in dimension two or higher (since $n_{q+}\geq 2$ in this case) and a hyperplane whose distance from the orgin is $\frac{|b_{1}|}{\|a\|_{2}}$ . Since, by (9), we have that this distance is at most $1$ , the hyperplane intersects the circle and therefore we know that a real solution exists. 2. 2.

$(a,b)=\textbf{0}$ : In this case, observe that $g=0$ and then 0 is a convex combination of

[TABLE]

for sufficiently small $\lambda>0$ .

∎

3.3.3 Case 1: Sufficient conditions for convex hull to be SOCr

In this section, we repeatedly use the following result from [52].

Theorem 2.

Let $G\subseteq\mathbb{R}^{n}$ be a convex set and let $f:\mathbb{R}^{n}\rightarrow\mathbb{R}$ be a continuous function. Then

[TABLE]

For the two lemmas that follows, consider the notation of $S$ defined in (5).

Lemma 6.

Suppose $n_{o}=0$ , $n_{l}\leq 1$ . If $n_{q+}=0$ or $n_{q-}=0$ , then $\textup{conv}(S)$ is SOCr.

Proof.

We consider two cases.

$n_{q-}=0$ : Let $(w,y)\in S\cap(\mathbb{R}^{n_{q+}}\times\mathbb{R}^{n_{l}})$ . Let $y=y_{1}$ if $n_{l}=1$ and $y=0$ if $n_{l}=0$ . In this case, $g-y$ is non-negative for all feasible values of $y$ and we can use the identity $t=\frac{(t+1)^{2}-(t-1)^{2}}{4}$ to write $S=S^{\prime}\cap S^{\prime\prime}$ , where:

[TABLE]

Notice that $S^{\prime}$ is a SOCr convex set. Also notice that $S^{\prime\prime}$ is a reverse convex set intersected with a polytope and hence $\textup{conv}(S^{\prime\prime}\cap P)$ is polyhedral and contained in $P$ (see [29],Theorem 1). Therefore, by Theorem 2, we have that $\operatorname*{conv}{(S)}=\operatorname*{conv}{(S^{\prime})}\cap\operatorname*{conv}{(S^{\prime\prime})}$ is SOCr. 2. 2.

$n_{q+}=0$ : Let $(x,y)\in S\cap(\mathbb{R}^{n_{q+}}\times\mathbb{R}^{n_{l}})$ . Let $y=y_{1}$ if $n_{l}=1$ and $y=0$ if $n_{l}=0$ . In this case, $g-y$ is non-positive for all feasible values of $y$ and may write $S=S^{\prime}\cap S^{\prime\prime}$ , where:

[TABLE]

Therefore, as in the previous case, $\operatorname*{conv}{(S)}$ is SOCr.

∎

Lemma 7.

Suppose $n_{q+}\leq 1$ and $n_{l}=n_{o}=0$ . Then $\textup{conv}(S)$ is SOCr.

Proof.

If $n_{q+}=0$ , then $S$ is empty set or contains a single point, the origin.

Therefore, consider the case where $n_{q+}=1$ , thus $w=w_{1}$ . Notice that $S=S^{\prime}\cap S^{\prime\prime}$ , where

[TABLE]

By Theorem 2, $\textup{conv}(S)=\textup{conv}(S^{\prime})\cap\textup{conv}(S^{\prime\prime})$ . Next, we show that both $\textup{conv}(S^{\prime})$ and $\textup{conv}(S^{\prime\prime})$ are SOCr. Notice that $S^{\prime}$ is the union of the following two SOCr sets:

[TABLE]

Thus, $\textup{conv}(S^{\prime})=\textup{conv}(S^{\prime}_{+}\cup S^{\prime}_{-})$ is SOCr.

Notice that $S^{\prime\prime}=\{(w,x)\in\mathbb{R}^{1}\times\mathbb{R}^{n_{q-}}\,|\,\ |w|\leq(g+\sum_{j=1}^{n_{q-}}x_{j}^{2})^{\frac{1}{2}},\ (w,x)\in P\}$ and is therefore the union of two sets:

[TABLE]

each of them being a reverse convex set intersected with a polyhedron. Therefore, $\textup{conv}(S^{\prime\prime}_{+})$ and $\textup{conv}(S^{\prime\prime}_{-})$ are polyhedral and therefore $\textup{conv}(S^{\prime\prime})=\textup{conv}(\textup{conv}(S^{\prime\prime}_{+})\cup\textup{conv}(S^{\prime\prime}_{-}))$ is a polyhedral set. ∎

3.3.4 Proof of Theorem 1

Finally, we bring the pieces together to prove Theorem 1.

Proof.

(of Theorem 1) Let $S(n)$ be defined as in (5), where $n=n_{q+}+n_{q-}+n_{l}+n_{o}$ is the dimension of the space in which $S$ is defined and without loss of generality $P$ is full-dimensional (Section 3.3.1). The proof goes by induction on $n$ . Notice that $S(1)$ is a polytope and hence $\textup{conv}(S(1))$ is SOCr. Suppose $S(n)$ is SOCr. We show that $S(n+1)$ is SOCr as well. If $n_{o}=0$ , $n_{l}\leq 1$ , and $n_{q+}=0$ or $n_{q-}=0$ , then the result follows from Lemma 6. Similarly, if $n_{o}=0$ , $n_{q+}\leq 1$ and $n_{l}=0$ , then the result follows from Lemma 7. Otherwise, it follows from Lemma 2, 3, 4 and 5 that no point in the interior of $P$ can be an extreme point of $S(n+1)$ . Let $N$ be the number of facets of $P$ , each of which given by one equation of the linear system $Fx=f$ . Let $B^{i}=S(n+1)\cap\{x\in\mathbb{R}^{n+1}\,|\,F_{i.}x=f_{i}\}$ be the intersection of $S(n+1)$ with the $i$ th facet of $P$ . By the discussion in Section 3.1, it is enough to show that the convex hull of each $B^{i}$ is SOCr. Let $i\in\{1,\dots,N\}$ . Choose $j_{0}$ such that $F_{ij_{0}}\neq 0$ . For simplicity, suppose $j_{0}=1$ . Then, we may write $B^{i}=\{x\in\mathbb{R}^{n+1}\,|\,(x_{2},\dots,n_{n+1})\in B^{i}_{0},\ x_{1}=b_{i}-\sum_{j=2}^{n+1}F_{ij}x_{j}\}$ , where $B_{0}^{i}$ is obtained from $B^{i}$ by replacing $x_{1}=f_{i}-\sum_{j=2}^{n+1}F_{ij}x_{j}$ in all the constraints defining $S(n+1)$ . Now $\textup{conv}(B_{0}^{i})\subseteq\mathbb{R}^{n}$ is SOCr by induction hyptothesis. Therefore, $\textup{conv}(B^{i})$ is SOCr by Lemma 1. ∎

Acknowledgments

Funding: This work was supported by the NSF CMMI [grant number 1562578] and the CNPq-Brazil [grant number 248941/2013-5].

Bibliography56

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Warren Adams, Akshay Gupte, and Yibo Xu. Error bounds for monomial convexification in polynomial optimization. Mathematical Programming , Mar 2018.
2[2] Amir Ali Ahmadi and Anirudha Majumdar. Dsos and sdsos optimization: Lp and socp-based alternatives to sum of squares optimization. In Information Sciences and Systems (CISS), 2014 48th Annual Conference on , pages 1–5. IEEE, 2014.
3[3] Faiz A. Al-Khayyal and James E. Falk. Jointly constrained biconvex programming. Mathematics of Operations Research , 8(2):273–286, 1983.
4[4] Mohammed Alfaki and Dag Haugland. Strong formulations for the pooling problem. Journal of Global Optimization , 56(3):897–916, 2013.
5[5] Ioannis P. Androulakis, Costas D. Maranas, and Christodoulos A Floudas. α 𝛼 \alpha bb: A global optimization method for general constrained nonconvex problems. Journal of Global Optimization , 7(4):337–363, 1995.
6[6] Kurt M. Anstreicher and Samuel Burer. Computable representations for convex hulls of low-dimensional quadratic forms. Mathematical programming , 124(1-2):33–43, 2010.
7[7] Xiaowei Bao, Aida Khajavirad, Nikolaos V. Sahinidis, and Mohit Tawarmalani. Global optimization of nonconvex problems with multilinear intermediates. Mathematical Programming Computation , 7(1):1–37, 2015.
8[8] Pietro Belotti, Andrew J. Miller, and Mahdi Namazifar. Valid inequalities and convex hulls for multilinear functions. Electronic Notes in Discrete Mathematics , 36:805–812, 2010.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

The convex hull of a quadratic constraint over a polytope

Abstract

1 Introduction

2 Our result

Theorem 1**.**

3 Proof of Theorem 1

3.1 Convex hulls via disjunctions

Observation 1**.**

3.2 Reduction

Observation 2**.**

3.3 Recursive argument to prove Theorem 1

Lemma 1**.**

Proof.

3.3.1 Dealing with low dimensional polytope

3.3.2 Case 2: Sufficient conditions for points to not be extreme

Lemma 2**.**

Proof.

Lemma 3**.**

Proof.

Lemma 4**.**

Proof.

Lemma 5**.**

Proof.

3.3.3 Case 1: Sufficient conditions for convex hull to be SOCr

Theorem 2**.**

Lemma 6**.**

Proof.

Lemma 7**.**

Proof.

3.3.4 Proof of Theorem 1

Proof.

Acknowledgments

Theorem 1.

Observation 1.

Observation 2.

Lemma 1.

Lemma 2.

Lemma 3.

Lemma 4.

Lemma 5.

Theorem 2.

Lemma 6.

Lemma 7.