The convex hull of a quadratic constraint over a polytope
Asteroide Santana, Santanu S. Dey

TL;DR
This paper proves that the convex hull of a quadratic constraint intersected with a bounded polyhedron can be represented using second-order cones, aiding in solving non-convex QCQPs more effectively.
Contribution
It establishes that the convex hull of a quadratic constraint over a polytope is second-order cone representable, providing a constructive proof for this key convexification result.
Findings
Convex hull is second-order cone representable.
Constructive proof of the convex hull characterization.
Facilitates convex relaxation of non-convex QCQPs.
Abstract
A quadratically constrained quadratic program (QCQP) is an optimization problem in which the objective function is a quadratic function and the feasible region is defined by quadratic constraints. Solving non-convex QCQP to global optimality is a well-known NP-hard problem and a traditional approach is to use convex relaxations and branch-and-bound algorithms. This paper makes a contribution in this direction by showing that the exact convex hull of a general quadratic equation intersected with any bounded polyhedron is second-order cone representable. We present a simple constructive proof of this result.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\useunder
\ul
The convex hull of a quadratic constraint over a polytope
Asteroide Santana [email protected] School of Industrial and Systems Engineering, Georgia Institute of Technology
Santanu S. Dey [email protected] School of Industrial and Systems Engineering, Georgia Institute of Technology
Abstract
A quadratically constrained quadratic program (QCQP) is an optimization problem in which the objective function is a quadratic function and the feasible region is defined by quadratic constraints. Solving non-convex QCQP to global optimality is a well-known NP-hard problem and a traditional approach is to use convex relaxations and branch-and-bound algorithms. This paper makes a contribution in this direction by showing that the exact convex hull of a general quadratic equation intersected with any bounded polyhedron is second-order cone representable. We present a simple constructive proof of this result.
1 Introduction
A quadratically constrained quadratic program (QCQP) is an optimization problem in which the objective function is a quadratic function and the feasible region is defined by quadratic constraints. A variety of complex systems can be cast as an instance of a QCQP. Combinatorial problems like MAXCUT [24], engineering problems such as signal processing [23, 30], chemical process [28, 40, 4, 19, 26, 55] and power engineering problems such as the optimal power flow [11, 34, 15, 31] are just a few examples.
Solving non-convex QCQP to global optimality is a well-know NP-hard problem and a traditional approach is to use spacial branch-and-bound tree based algorithm. The computational success of any branch-and-bound tree based algorithm depends on the convexification scheme used at each node of the tree. Not surprisingly, there has been a lot of research on deriving strong convex relaxations for general-purpose QCQPs. The most common relaxations found in the literature are based on Linear programming (LP), second order cone programing (SOCP) or semi-definite programming (SDP). Reformulation-linearization technique (RLT) [48, 50] is a LP-based hierarchy, Lasserre hierarchy or the sum-of-square hierarchy [33] is a SDP-based hierarchy which exactly solves QCQPs under some minor technical conditions and, recently, new LP and SOCP-based alternatives to sum of squares optimization have also been proposed [2]. While SDP relaxations are know to be strong, they don’t always scale very well computationally. SOCP relaxations tend to be more computationally attractive, although they are often derived by further relaxing SDP relaxations [14].
Another direction of research focuses on convexification of functions, with the McCormick relaxation [37] being perhaps the most classic example. In this case, a constraint of the form is replaced with and , where is a convex lower approximation and is a concave upper approximation of . While there have been a lot of work in function convexification (see for instance [3, 49, 5, 46, 35, 10, 38, 6, 8, 7, 41, 18, 47, 45, 39, 55, 56, 36, 12, 16, 1, 27, 51]) it is well-known that it does not necessarily yield the convex hull of the set . To the best of our knowledge, there have been much less work on explicit convexification of sets: [54, 42, 43, 53, 25, 32, 44, 17, 34, 13].
A related question when studying convex relaxations is that of representability of the exact convex hull of the feasible set: Is it LP, SOCP or SDP representable? In [20], we prove that the convex hull of the so-called bipartite bilinear constraint (which is a special case of a quadratic constraint) intersected with a box constraint is SOCP representable (SOCr). The proof yields a procedure to compute this convex hull exactly. Encouraging computational results are also reported in [20] in terms of obtaining dual bounds using this construction, which significantly outperform SDP and McCormick relaxations and also bounds produced by commercial solvers.
2 Our result
For an integer , we use to describe the set . For a set , we use , to denote the convex hull of and the set of extreme points of respectively.
In this paper, we generalize one of the main result in [20]. Specifically, we show that the convex hull of a general quadratic equation intersected with any bounded polyhedron is SOCr. Moreover the proof is constructive, therefore adding to the literature on explicit convexification in the context of QCQPs. The formal result is as following:
Theorem 1**.**
Let
[TABLE]
where is a symmetric matrix, , and is a polytope. Then is SOCr.
Notice that we make no assumption regarding the structure or coefficients of the quadratic equation defining . We require to be a bounded polyhedron, which is not very restrictive given that in global optimization the variables are often assumed to be bounded to use branch-and-bound algorithms.
The result presented in Theorem 1 is somewhat unexpected since the sum-of-squares approach would build a sequence of SDP relaxations for (1) in order to optimize (exactly) a linear function over , while even the SDP cone of thre-by-three dimensional matrices is not SOCr [22]. Note that optimizing a linear function over is NP-hard, therefore, while the convex hull is SOCr, the construction involves the introduction of an exponential number of variables.
Surprisingly, the proof of Theorem 1 is fairly straightforward and it introduces a technique (new, to the best of our knowledge) to compute convex hull of certain surfaces over a compact set. In the case of Theorem 1, the key observation is that the surface defined by the quadratic equation either:
is defined as the union of two convex surfaces (see Figure 2); or 2. 2.
it has the property that, through every point of the surface, there exists a straight line that is entirely contained in the surface (see Figure 2).
In Case 1, we can easily obtain that the convex hull of is SOCr as we show in Section 3.3. In Case 2, no point in the interior of the polytope can be an extreme point of . Observing that the convex hull of a compact set is also the convex hull of its extreme points, we intersect the surface with each facet of the polytope which will contain all the extreme points of . Now, each such intersection leads to new sets with the same form as but in one dimension lower. The argument then goes by recursion. The details of the proof are presented in Section 3.
After we had proved Theorem 1, we learned that the property described in Case 2 is known as “ruled surfaces” and it has been extensively studied from both algebraic and geometric perspectives [21]. To the best of our knowledge, however, no one from the global optimization community has ever exploited such results for convexification.
3 Proof of Theorem 1
3.1 Convex hulls via disjunctions
In this section, we describe a simple procedure to obtain the convex hull of a compact set using a disjunctive argument. We use this procedure to prove Theorem 1 in Section 3.3. Let be a compact set and let be the set of extreme points of . First, we partition the extreme points of . Specifically, suppose there exist such that:
[TABLE]
We observe that (2) implies that
[TABLE]
where the last equality holds due to being compact. Finally, we obtain that
[TABLE]
Observation 1**.**
If is SOCr for all , then the set
[TABLE]
is SOCr [9]. Thus, we obtain from (4) that is SOCr. In addition, we obtain a constructive procedure to compute .
3.2 Reduction
In this section, we discuss how we can apply some transformations to the set defined in (1) so as to re-write it in a “canonical” form where all the quadratic terms are squared terms. This will allows us to easily classify into Case 1 and 2 as discussed in Section 2. We start with the following observation.
Observation 2**.**
Let and let be an affine map. Then
[TABLE]
where . Furthermore if is SOCr, then is also SOCr.
Let be the set defined in (1). Suppose, without loss of generality, that is a symmetric matrix. By the spectral theorem , where is a diagonal matrix and the columns of are a set of orthogonal vectors. Letting we have that
[TABLE]
where .
Therefore, by Observation 2, it is sufficient to study the convex hull of a set of the form:
[TABLE]
where does not appear in the quadratic constraints, , for (i.e., the rank of is ) and for . By completing squares, we may further write as:
[TABLE]
where denotes the sign of . Now, since for and for define linear bijections, it follows from Observation 2 that it is sufficient to study the convex hull of the following set:
[TABLE]
where we may further assume that , since otherwise we may multiply the equation by and apply suitable affine transformations to bring it back to the form of (5).
3.3 Recursive argument to prove Theorem 1
We begin by stating a variant of Observation 2 that we will use twice along the proof.
Lemma 1**.**
Let , where is bounded, and is an affine function of . Then,
[TABLE]
Proof.
See Lemma 4 in [20]. ∎
3.3.1 Dealing with low dimensional polytope
Let and be defined as in (1). Next, we show that we may assume without loss of generality that is full dimension. In fact, if is not full dimensional, then is contained in a non-trivial affine subspace defined by a system of linear equations . Without loss of generality, we may assume that has full row-rank , . Let where is invertible. Then, we may write this system as , where and, for simplicity, we assume that (resp. ) correspond to the first (resp. last ) components of . By defining and to simplify notation, we obtain
[TABLE]
By partitioning in sub-matrices of appropriate sizes, we may explicitly write the quadratic equation defining in terms of and as follows:
[TABLE]
Using (6), we replace in (7) to obtain
[TABLE]
where
[TABLE]
Therefore, we may write as
[TABLE]
where is now a full dimensional polytope. Therefore, by Lemma 1, we may assume from now on that is full dimensional.
3.3.2 Case 2: Sufficient conditions for points to not be extreme
Consider the set as defined in (5).
Lemma 2**.**
Suppose . If where , then is not an extreme point of .
Proof.
Since , there exists a vector such that . Clearly these points are in as well and, therefore, is not an extreme point of ∎
Lemma 3**.**
Suppose and . If where , then is not an extreme point of .
Proof.
Since , are feasible for sufficiently small positive values of . Therefore, is not an extreme point. ∎
Lemma 4**.**
Suppose , and . If where , then is not an extreme point of .
Proof.
Since , and , are feasible for sufficiently small positive and negative values of . Therefore, is not an extreme point. ∎
Lemma 5**.**
Suppose , , and . If where , then is not an extreme point of .
Proof.
We show that there exists a straight line through that is entirely contained in the surface defined by the quadratic equation. More specifically, we prove that there exists a vector such that the line satisfies the quadratic equation and therefore, being in the interior of cannot be an extreme point of . We consider two cases:
: Then observe that , since otherwise we would have and , because . Observe that
[TABLE]
Next, observe that:
[TABLE]
Suppose we set and for all . Then satisfying (10) is equivalent to finding real values of satisfying:
[TABLE]
This is the intersection of a circle of radius in dimension two or higher (since in this case) and a hyperplane whose distance from the orgin is . Since, by (9), we have that this distance is at most , the hyperplane intersects the circle and therefore we know that a real solution exists. 2. 2.
: In this case, observe that and then 0 is a convex combination of
[TABLE]
for sufficiently small .
∎
3.3.3 Case 1: Sufficient conditions for convex hull to be SOCr
In this section, we repeatedly use the following result from [52].
Theorem 2**.**
Let be a convex set and let be a continuous function. Then
[TABLE]
For the two lemmas that follows, consider the notation of defined in (5).
Lemma 6**.**
Suppose , . If or , then is SOCr.
Proof.
We consider two cases.
: Let . Let if and if . In this case, is non-negative for all feasible values of and we can use the identity to write , where:
[TABLE]
Notice that is a SOCr convex set. Also notice that is a reverse convex set intersected with a polytope and hence is polyhedral and contained in (see [29],Theorem 1). Therefore, by Theorem 2, we have that is SOCr. 2. 2.
: Let . Let if and if . In this case, is non-positive for all feasible values of and may write , where:
[TABLE]
Therefore, as in the previous case, is SOCr.
∎
Lemma 7**.**
Suppose and . Then is SOCr.
Proof.
If , then is empty set or contains a single point, the origin.
Therefore, consider the case where , thus . Notice that , where
[TABLE]
By Theorem 2, . Next, we show that both and are SOCr. Notice that is the union of the following two SOCr sets:
[TABLE]
Thus, is SOCr.
Notice that and is therefore the union of two sets:
[TABLE]
each of them being a reverse convex set intersected with a polyhedron. Therefore, and are polyhedral and therefore is a polyhedral set. ∎
3.3.4 Proof of Theorem 1
Finally, we bring the pieces together to prove Theorem 1.
Proof.
(of Theorem 1) Let be defined as in (5), where is the dimension of the space in which is defined and without loss of generality is full-dimensional (Section 3.3.1). The proof goes by induction on . Notice that is a polytope and hence is SOCr. Suppose is SOCr. We show that is SOCr as well. If , , and or , then the result follows from Lemma 6. Similarly, if , and , then the result follows from Lemma 7. Otherwise, it follows from Lemma 2, 3, 4 and 5 that no point in the interior of can be an extreme point of . Let be the number of facets of , each of which given by one equation of the linear system . Let be the intersection of with the th facet of . By the discussion in Section 3.1, it is enough to show that the convex hull of each is SOCr. Let . Choose such that . For simplicity, suppose . Then, we may write , where is obtained from by replacing in all the constraints defining . Now is SOCr by induction hyptothesis. Therefore, is SOCr by Lemma 1. ∎
Acknowledgments
Funding: This work was supported by the NSF CMMI [grant number 1562578] and the CNPq-Brazil [grant number 248941/2013-5].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Warren Adams, Akshay Gupte, and Yibo Xu. Error bounds for monomial convexification in polynomial optimization. Mathematical Programming , Mar 2018.
- 2[2] Amir Ali Ahmadi and Anirudha Majumdar. Dsos and sdsos optimization: Lp and socp-based alternatives to sum of squares optimization. In Information Sciences and Systems (CISS), 2014 48th Annual Conference on , pages 1–5. IEEE, 2014.
- 3[3] Faiz A. Al-Khayyal and James E. Falk. Jointly constrained biconvex programming. Mathematics of Operations Research , 8(2):273–286, 1983.
- 4[4] Mohammed Alfaki and Dag Haugland. Strong formulations for the pooling problem. Journal of Global Optimization , 56(3):897–916, 2013.
- 5[5] Ioannis P. Androulakis, Costas D. Maranas, and Christodoulos A Floudas. α 𝛼 \alpha bb: A global optimization method for general constrained nonconvex problems. Journal of Global Optimization , 7(4):337–363, 1995.
- 6[6] Kurt M. Anstreicher and Samuel Burer. Computable representations for convex hulls of low-dimensional quadratic forms. Mathematical programming , 124(1-2):33–43, 2010.
- 7[7] Xiaowei Bao, Aida Khajavirad, Nikolaos V. Sahinidis, and Mohit Tawarmalani. Global optimization of nonconvex problems with multilinear intermediates. Mathematical Programming Computation , 7(1):1–37, 2015.
- 8[8] Pietro Belotti, Andrew J. Miller, and Mahdi Namazifar. Valid inequalities and convex hulls for multilinear functions. Electronic Notes in Discrete Mathematics , 36:805–812, 2010.
