New Dependencies of Hierarchies in Polynomial Optimization
Adam Kurpisz, Timo de Wolff

TL;DR
This paper compares four hierarchies for solving constrained polynomial optimization problems, revealing their relationships and limitations, especially regarding SOS, SDSOS, SONC, and Sherali Adams hierarchies, with implications for Positivstellensatz results.
Contribution
It establishes new dependencies and incomparabilities among these hierarchies for general and Boolean hypercube cases, including polynomial equivalences and containment results.
Findings
SONC and SOS hierarchies are polynomially incomparable.
SDSOS is contained within SONC hierarchy.
Schm"udgen-like versions of SDSOS*, SONC*, and SA* are polynomially equivalent.
Abstract
We compare four key hierarchies for solving Constrained Polynomial Optimization Problems (CPOP): Sum of Squares (SOS), Sum of Diagonally Dominant Polynomials (SDSOS), Sum of Nonnegative Circuits (SONC), and the Sherali Adams (SA) hierarchies. We prove a collection of dependencies among these hierarchies both for general CPOPs and for optimization problems on the Boolean hypercube. Key results include for the general case that the SONC and SOS hierarchy are polynomially incomparable, while SDSOS is contained in SONC. A direct consequence is the non-existence of a Putinar-like Positivstellensatz for SDSOS. On the Boolean hypercube, we show as a main result that Schm\"udgen-like versions of the hierarchies SDSOS*, SONC*, and SA* are polynomially equivalent. Moreover, we show that SA* is contained in any Schm\"udgen-like hierarchy that provides a O(n) degree bound.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
New Dependencies of Hierarchies in Polynomial Optimization
Adam Kurpisz
Adam Kurpisz, ETH Zürich, Department of Mathematics, Rämistrasse 101, 8092 Zürich, Switzerland
and
Timo de Wolff
Timo de Wolff, Technische Universität Berlin, Institut für Mathematik, Sekr. MA 6-2, Straße des 17. Juni 136, 10623 Berlin, Germany
Abstract.
We compare four key hierarchies for solving Constrained Polynomial Optimization Problems (CPOP): Sum of Squares (SOS), Sum of Diagonally Dominant Polynomials (SDSOS), Sum of Nonnegative Circuits (SONC), and the Sherali Adams (SA) hierarchies. We prove a collection of dependencies among these hierarchies both for general CPOPs and for optimization problems on the Boolean hypercube. Key results include for the general case that the (SONC) and (SOS) hierarchy are polynomially incomparable, while (SDSOS) is contained in (SONC). A direct consequence is the non-existence of a Putinar-like Positivstellensatz for SDSOS. On the Boolean hypercube, we show as a main result that Schmüdgen-like versions of the hierarchies SDSOS∗, SONC∗, and SA∗ are polynomially equivalent. Moreover, we show that SA∗ is contained in any Schmüdgen-like hierarchy that provides a degree bound.
Key words and phrases:
Hierarchy, nonnegativity, polynomial comparable, polynomial optimization, Sherali Adams, sum of diagonally dominant polynomials, sum of nonnegative circuit polynomials, sum of squares
2010 Mathematics Subject Classification:
Primary: 14P10, 68Q25, 90C60; Secondary: 14Q20; ACM Subject Classification: Theory of computation Proof complexity Theory of computation Linear programming, Semidefinite programming, Convex optimization
1. Introduction
A Constrained Polynomial Optimization Problem (CPOP) is of the form
[TABLE]
where and are -variate real polynomials. Solving CPOP is a crucial nonconvex optimization problem, which lies at the core of both theoretical and applied computer science. A special case of CPOP is a Binary Constrained Polynomial Optimization Problem (BCPOP) where the polynomials are among the polynomials defining the feasibility set. Many important optimization problems belong to the BCPOP class. However, solving these is NP-hard in general.
A CPOP can be equivalently seen as the problem of maximizing a real such that is nonnegative over the semialgebraic set defined by the polynomials . This is an interesting perspective since various techniques form real algebraic geometry provide methods for certifying nonnegativity of a real polynomial over semialgebraic sets. The class of such theorems is called Positivstellensätze. These theorems state that, under some assumptions, a polynomial , which is positive (or nonnegative) over the feasibility set, can be expressed in a particular algebraic way. Typically, this algebraic expression is a sum of nonnegative polynomials from a chosen ground set of nonnegative polynomials multiplied by the polynomials defining the feasibility set. Choosing a proper ground set of nonnegative polynomials is crucial from the perspective of optimization. Ideally, both testing membership in the ground set and deciding nonnegativity of a polynomial in the ground set should be efficiently doable. Moreover, fixing the maximum degree of polynomials in the ground sets, used for a representation of , provides a family of algorithms parameterized by an integer , which gives a sequence of lower bounds for the value of CPOP. If the ground set of polynomials is chosen properly, then the sequence of lower bounds converges in to the optimal value of CPOP.
One of the most successful approaches for constructing theoretically efficient algorithms is the Sum of Squares (SOS) method [GV01, Nes00, Par00, Sho87], known as Lasserre relaxation [Las01]. The method relies on Putinar’s Positivstellensatz [Put93] using sum of squares of polynomials as the ground set. Finding a degree SOS certificate for nonnegativity of can be performed by solving a semidefinite programming (SDP) formulation of size . Finally, for every (feasible) -variate hypercube optimization problem, with constraints of degree at most , there exists a degree SOS certificate, see e.g., [BS16].
The SOS algorithm is a frontier method in algorithm design. It was used to provide the best available algorithms for a variety of combinatorial optimization problems. The Lovász -function [Lov79] for the Independent Set problem is implied by the SOS algorithm of degree 2. Moreover, the Goemans-Williamson relaxation [GW95] for the Max Cut problem and the Goemans-Linial relaxation for the Sparsest Cut problem (analyzed in [ARV09]) can be obtained by the SOS algorithm of degree 2 and 6, respectively. SOS was also proven to be a successful method for Maximum Constraint Satisfaction problems (Max CSP). For Max CSP, the SOS algorithm is as powerful as any SDP relaxation of comparable size [LRS15]. Furthermore, SOS was applied to problems in dictionary learning [BKS15, SS17], tensor completion and decomposition [BM16, HSSS16, PS17], and robust estimation [KSS18]. For other applications of the SOS method see e.g., [BRS11, BCG09, Chl07, CS08, CGM13, dlVKM07, GS11, MM09, Mas17, RT12], and the surveys [CT12, Lau03, Lau09].
From a practical perspective however, solving SDP problems is known to be very time consuming. Moreover, from a theoretical point of view, it is an open problem whether an SDP of size can be solved in time [O’D16, RW17]. Hence, various methods have been proposed to choose different ground sets of polynomials to make a resulting problem easier to solve, but still effective.
In [AM14] Ahmadi and Majumdar propose an algorithmic framework by choosing the ground set of polynomials to be scaled diagonally-dominant polynomials (SDSOS). SDSOS polynomials can be seen as the binomial squares. Thus, the SDSOS algorithm is not stronger than the SOS algorithm. However, searching for a degree- SDSOS certificate can be performed using Second Order Conic Program (SOCP) of size ; see [AM14]. Since, in practice, an SOCP can be solved much faster than an SDP, the algorithm attracted a lot of attention and has been used to solve problems in Robotics and Control [AMT14, Leo18, PP15, SA16, ZFP18], Option Pricing [AM14], Power Flow [KGNSZ18, SSTL18], and Discrete Geometry [DL16].
An alternative approach, that is a more tractable method than the SOS, was initiated by Sherali and Adams in [SA90]. The technique was first introduced as a method to tighten the Linear Program (LP) relaxations for BCPOP problems and for such settings finding the degree certificate can be done by solving an LP of size . The Sherali Adams (SA) algorithm arises from using the set of polynomials depending on at most variables, which are nonnegative on the Boolean hypercube. These polynomials are called -juntas. The SA algorithm was used to construct some of the most prominent algorithms with good asymptotic running time in combinatorial optimization [CLRS16, LR16, TZ17], logic [AM13], and other fields of computer science.
Finally, a method independent from SOS was introduced in [IdW16] using Sum of Nonnegative Circuit Polynomials (SONC) as a ground set. These polynomials form a full dimensional cone in the cone of nonnegative polynomials, which is not contained in the SOS cone. For example, the well-known Motzkin polynomial is a nonnegative circuit polynomial, but not an SOS. Moreover, SONCs generalize polynomials which are certified to be nonnegative via the arithmetic-geometric mean inequality [Rez89]. SONC certificates of degree can be computed via a convex optimization program called * Relative Entropy Programming (REP)* of size [DIdW17, Theorem 5.3]; see also [CS16, CMW18]. Recently, an experimental comparison of SONC with the SOS method for unconstrained optimization was presented in [SdW18].
For all presented algorithms, one can define a potentially stronger algorithm without changing the corresponding ground set of polynomials, by using a more general construction for the certificate of nonnegativity. Such a certificate expresses a polynomial, which is nonnegative over a given semialgebraic set, as a sum of polynomials from the ground set multiplied by the product of polynomials defining the semialgebraic set; see section 2 for further details. We call the resulting systems {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}SOS^{*}}, {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}SDSOS^{*}}, {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}SA^{*}} and {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}SONC^{*}}. Some of these extensions were intensively studied in the literature, see e.g., [GHP02, Wor15].
Our Results
In this paper, we provide an extensive comparison of the presented semialgebraic proof systems. More precisely, following the definitions in e.g., [BFI*+*18], we analyze their polynomial comparability:
Definition 1.1**.**
Let and be semialgebraic proof systems. contains if for every semialgebraic set and a polynomial admitting a degree certificate of nonnegativity over in , admits also a degree certificate in . System strictly contains if contains but does not contain . Systems and are polynomially equivalent if contains and contains . Finally, systems and are polynomially incomparable if neither contains nor contains .
∎
For a more detailed definition of proof systems and their comparability, see subsection 2.5.
In this article, we show the dependencies between the proof systems presented in figure 1.
In particular, in section 3, we show that for general CPOP problems the SOS proof system is polynomially incomparable with the SONC proof system. We also proved that the same relation holds for SOS∗ and SONC∗ proof systems; see corollary 3.7. So far, it was only known that the cones of SOS and SONC polynomials are not contained in each other [IdW16, Proposition 7.2] however, it has no direct implication on the relation between the SOS and the SONC methods for the CPOP optimization. Similarly, in a very recent result [CMW18], the authors point out that the SONC cone contains SDSOS cone. In this paper, in section 4, we extend this result for CPOP problems by proving, that SONC certificate strictly contains the SDSOS certificate and the same relation holds for SONC∗ and SDSOS∗ certificates; see corollary 4.3. As a consequence, we conclude that there exists no Putinar-like Positivstellensatz for SDSOS; see corollary 4.4.
For the BCPOP we provide a general, sufficient condition for the proof system to contain proof system, see theorem 5.1. This combined with the results from subsection 5.2, and subsection 5.3 proves the polynomial equivalence of , , and on the Boolean hypercube. Moreover, by proving some properties of SONC, SDSOS, and SA polynomials in lemma 4.1, and lemma 5.6, we prove additional dependencies between the hierarchies in subsection 5.2, subsection 5.3, and subsection 5.4.
We remark that all results in this article concern the minimal degrees for certificates in a particular proof system as these are the standard way to measure the complexity of algorithms in theoretical computer science. Our results do not directly imply a particular behaviour of actual runtimes in an experimental setting, as these depend on various further factors other than the degree.
Acknowledgements
AK is supported by SNSF project PZ00P2174117, TdW is supported by the DFG grant WO 2206/1-1.
2. Preliminaries
In this section, we introduce the proof systems used in this article. Moreover, for the sake of clarity, we provide dual formulations for some of the presented proof systems for the BCPOP case. We begin with introducing basic notation. For any we denote {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}[n]}=\{1,\ldots,n\} and {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\binom{n}{\leq d}}\leavevmode\nobreak\ :=\leavevmode\nobreak\ \sum_{i=0}^{d}\binom{n}{i}. Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{N}^{*}}=\mathbb{N}\setminus\{\mathbf{0}\} and {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{R}_{\geq 0}} ({\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{R}_{>0}}) be the set of nonnegative (positive) real numbers. Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{R}[\boldsymbol{x}]}=\mathbb{R}[x_{1},\ldots,x_{n}] be the ring of * -variate real polynomials* and for every we define the * real zero set* as {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathcal{V}(f)}=\{\boldsymbol{x}\in\mathbb{R}^{n}\ |\ f(\boldsymbol{x})=0\}. We denote the * Newton polytope* of by {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\operatorname{New}(f)} and the vertices of by {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\operatorname{Vert}\left(\operatorname{New}(f)\right)}. A lattice point is called even if it is in , and a term is called a monomial square if and is even.
In what follows we introduce different proof systems and their notation. Next to the specific sources that we provide later in the section, we refer the reader to introductory literature like [BPT13, Lau09, Las15, Mar08] on the mathematical side, and [Raz16, Rot13] on the computer science side. Moreover, we fix the notation
[TABLE]
for a set of polynomials. Throughout the paper we assume that the cardinality {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}m} of the set is polynomial in the size of . For a given , we define the corresponding * semi-algebraic set*
[TABLE]
Furthermore, for any given semialgebraic set , we consider the set of nonnegative polynomials with respect to
[TABLE]
For a given and a set of constraints , we define the corresponding constrained polynomial optimization problem (CPOP) as (see e.g., [BV04])
[TABLE]
Hence, corresponds to the feasibility region of the program (CPOP).
The problem (CPOP) is NP-hard in general. Thus, one chooses proper subsets such that, on the one hand, the corresponding polynomial optimization problem provides a lower bound on the value of (CPOP) and on the other hand, is computationally tractable. Such subsets are called * certificates of nonnegativity*. The choice of a suitable certificate of nonnegativity is crucial for obtaining a good lower bound for the problem (CPOP).
Let us be more specific. For a given the induced * preprime* is given by
[TABLE]
Note that . Throughout the paper we assume that for a given {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}m_{2d}} is the cardinality of the set restricted to polynomials of degree at most . In order to relax (CPOP) to a finite size optimization problem we introduce polynomial hierarchies.
Definition 2.1**.**
Let be a collection of polynomials and let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{GEN}} be a subset of . We define the following degree depending hierarchy of certificates of nonnegativity:
[TABLE]
In several contexts it is more useful to consider the preprime of the constraints, i.e., {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\operatorname{Hier}_{\operatorname{Prep}(\mathcal{G})}^{2d}({\mathbb{GEN}})}. Every such hierarchy of polynomials yields a sequence of lower bounds given by the following optimization program:
[TABLE]
[TABLE]
∎
Throughout this paper we assume that the set is chosen such that is Archimedean, a property which is e.g., implied by the compactness of . In what follows we occasionally enforce compactness of by adding box constraints {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}l_{i}}:=x_{i}\pm N\geq 0 to with sufficiently large for .
Under this assumption we obtain from Krivine’s general Positivstellensatz [Kri64a, Kri64b], see also [Mar08, Theorem 5.4.4], the following Schmüdgen-type Positivstellensatz; see [Sch02, Theorem 5.1]:
Theorem 2.2**.**
Let be Archimedean and let such that is closed under addition. Let for all . Then there exists a such that .
For the SOS hierarchy this theorem was first shown by Schmüdgen in [Sch91].
In the following subsections we introduce some of the most prominent inner approximations of the cone .
2.1. Sum of Squares
The * SOS method* approximates the cone by using the set of * sum of square polynomials* instead of the entire set of nonnegative polynomials. Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{SOS}}\ :=\ \{s\leavevmode\nobreak\ |\leavevmode\nobreak\ s=\sum_{i=1}^{k}f_{i}^{2},\leavevmode\nobreak\ f\in\mathbb{R}[\boldsymbol{x}],\leavevmode\nobreak\ k\in\mathbb{N}^{*}\} be the set of (finite) sum of square polynomials (SOS). The SOS program of degree takes the following form:
[TABLE]
analogously the SOS∗ program of degree takes the form {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{\mathbb{SOS}}_{\operatorname{Prep}(\mathcal{G})}^{2d}}. For the SOS-hierarchy Putinar proved the following Positivstellensatz, which is an improvement of Schmüdgen’s Positivstellensatz.
Theorem 2.3** (Putinar’s Positivstellensatz; [Put93]).**
Let be a set of polynomial constraints with being Archimedean, and let with for all . Then there exists a such that .
theorem 2.3 provides a sequence of cones that approximate from the inside, such that the values of give a sequence of lower bounds that converges in to the optimal value of (CPOP).
The program () can be solved using a semidefinite program (SDP) of size ; see e.g., [Las01, Nes00, Par00, Sho87]. This is implied by the following fact; see e.g., [Par00].
Theorem 2.4**.**
A polynomial is a SOS of degree if and only if there exists a positive semidefinite matrix , called the Gram matrix, such that , for being the vector of -variate monomials of total degree at most .
The size of the SDP program is . Moreover, for BCPOP problems, when hypercube constraints are incorporated in , it is known for {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}d_{\mathcal{G}}}:=\max\{\deg(g)\leavevmode\nobreak\ |\leavevmode\nobreak\ g\in\mathcal{G}\} that solves the problem exactly, i.e., ; see e.g., [BS14].
2.1.1. SOS - The dual perspective: Lasserre hierarchy
Consider a BCPOP. Let be such that , for some . By the hyperplane separation theorem for convex cones, there exists a hyperplane that separates from . Note that for BCPOP we can restrict to polynomials defined on the hypercube , i.e., to the vector space of multi-linear polynomials. The hyperplane is represented by the polynomial {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mu}:\{0,1\}^{n}\to\mathbb{R}, which is a normal vector to the hyperplane, such that for every polynomial we have and . By scaling we can assume that . To every function we can associate a linear operator {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\widetilde{\mathbb{E}}_{\mu}}:\{0,1\}^{n}\to\mathbb{R} mapping polynomials to real numbers, defined by
[TABLE]
which is called the pseudoexpectation. The dual problem to () is the program of degree . It takes the form
[TABLE]
and is known as the Lasserre relaxation (of degree ) . It can be solved using an SDP of size [Las01]. Analogously, the program of degree takes the form {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{\overline{\mathbb{SOS}}}_{\operatorname{Prep}(\mathcal{G})}^{2d}}.
Problem () can be reformulated in terms of moments / localizing matrices. Consider , for and . Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}d^{{}^{\prime}}}=\lfloor\frac{2d-\deg(g)}{2}\rfloor and {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\boldsymbol{v}} be the * vector of coefficients* of , such that . We can write
[TABLE]
where {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}M_{g}^{2d}}\in\mathbb{R}^{\binom{n}{\leq d^{{}^{\prime}}}\times\binom{n}{\leq d^{{}^{\prime}}}} is a real, symmetric matrix whose rows and columns are indexed by sets of size at most such that . For the matrix is called the moment matrix, and for all other it is called the localizing matrix for the constraint . Since for every real valued vector the requirement is equivalent to being positive semidefinite (PSD), denoted by {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}M\succeq 0}, we can reformulate () as:
[TABLE]
2.2. Scaled Diagonally Dominant Sum of Squares
In [AM14] Ahmadi and Majumdar proposed an approximation of the cone based on scaled diagonally-dominant polynomials (SDSOS), defined below in subsection 2.2.1. Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{SDSOS}} be the set of finite sums of scaled diagonally-dominant polynomials. We obtain the following program:
[TABLE]
analogously, the SDSOS program takes the form . Since for every , we have . Moreover, can be solved using Second Order Conic Programming (SOCP) of size ; see [AM14].
2.2.1. Scaled diagonally-dominant polynomials
We introduce the formal details for SDSOS certificates.
Definition 2.5**.**
A real symmetric matrix is called diagonally-dominant (dd) if for every we have . Moreover, is called scaled diagonally-dominant (sdd) if there exist a positive real diagonal matrix such that is dd. A polynomial of total degree is scaled diagonally-dominant, denoted , if there exist an sdd matrix such that , for being the vector of -variate monomials of total degree at most . ∎
Every SDSOS polynomials is an SOS polynomial: By definition 2.5, consider an sdd matrix , for being a dd matrix. By the Gershgorin circle theorem, the matrix is PSD. Moreover, . Since is a congruent transformation of , that does not change the sign of the eigenvalues, the matrix is also PSD.
Next, we provide a further characterization of SDSOS polynomials. We start with recalling the known characterization of diagonally dominant (dd) matrices.
Lemma 2.6** ([BC75]).**
A symmetric matrix is dd if and only if
[TABLE]
for and being a set of vectors, each with at most two nonzero entries at positions and which equal .
By definition 2.5 and lemma 2.6, every -variate sdd polynomial of degree at most is of the form
[TABLE]
where {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\boldsymbol{z}} is the vector of -variate monomials of maximal degree , {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}Q} is a dd matrix, and {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}D} is a positive diagonal matrix. Since every vector has at most two nonzero entries, both equal to , the SDSOS polynomial is always of the form , where are monomials and .
2.2.2. SDSOS - Dual perspective
For the BCPOP the dual of the problem () is a relaxation of the problem (). Indeed, similar as for formulation (), a conic duality theory can be used to transform program () into its dual of the form
[TABLE]
for being a linear map, defined as in subsection 2.1.1. Analogously the program of degree takes the form {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{\overline{\mathbb{SDSOS}}}_{\operatorname{Prep}(\mathcal{G})}^{2d}}.
Similar as in (2.1), Formulation () can be transformed into matrix form. In this case we obtain a set of matrices that are required to be PSD. More formally, let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}K}\subseteq\{I\leavevmode\nobreak\ |\leavevmode\nobreak\ I\subseteq[n],\leavevmode\nobreak\ |I|\leq d\}. For being a real, symmetric matrix whose rows/columns are indexed with sets of size at most , let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}M_{\big{|}K}} be the principal submatrix of of entries that lie in the rows and columns indexed by the sets in .
We obtain that is equivalent to:
[TABLE]
For BCPOP, both () and (2.3) are solvable via an SOCP of size . For more details we refer the reader to [AM17].
2.3. Sherali Adams
An alternative method to approximate the sum of squares cone is based on nonnegative polynomials that depend on a limited number of variables, called * -juntas*. The resulting program is called the Sherali Adams algorithm (SA) and was first introduced in [SA90] as a method to tighten the linear programming relaxations for 0/1 hypercube optimization problems. Thus, we assume throughout the section and whenever we consider (SA) that the hypercube constraints are contained in , meaning that .
For we denote {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\boldsymbol{x}_{I}}=\prod_{i\in I}x_{i} and {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\overline{\boldsymbol{x}}}_{I}=\prod_{i\in I}(1-x_{i}). Let
[TABLE]
A nonnegative -junta is a function which depends only on at most input coordinates. It is easy to check that the set is precisely the set of nonnegative -juntas over the Boolean hypercube . The * degree- Sherali Adams* is the following problem:
[TABLE]
analogously SA∗ takes the form . Note that the superscript in is (not ), because of the way SA was defined historically, providing that . However, this does not affect the polynomial equivalence between the proof systems; see definition 1.1.
The program can be solved using the linear program (LP) of size .
2.3.1. SA - Dual perspective
Similarly as in subsection 2.1.1 and subsection 2.2.2 one can use a conic duality theory to transform the program () into its dual of the form:
[TABLE]
The program () is a linear system of size . Analogously, the program of degree takes the form {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{\overline{\mathbb{SA}}}_{\operatorname{Prep}(\mathcal{G})}^{2d}}.
2.4. Sum of Nonnegative Circuit
A method for approximating the cone , which is independent of SOS, is based on sums of nonnegative circuit polynomials (SONC), defined below in subsection 2.4.1. The technique was introduced by Iliman and the second author in [IdW16]. Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{SONC}} be the set of finite sums of nonnegative circuit polynomials. We consider the following program:
[TABLE]
analogously takes the form . As shown in [DIdW17, Theorem 4.8], for an arbitrary real polynomial that is strictly positive on a compact, basic closed semialgebraic set there exists a certificate of nonnegativity, i.e., the Schmüdgen-type Positivstellensatz theorem 2.2 applies to SONC. Moreover, searching through the space of degree certificates can be done via a relative entropy program (REP) [DIdW17] of size ; see also [CS17, CS16, CMW18]. REPs are convex optimization programs and are efficiently solvable with interior point methods; see e.g., [CS17, NN94] for more details.
2.4.1. Nonnegative Circuit Polynomials
We recall the most relevant statements about SONCs.
Definition 2.7**.**
A polynomial is called a circuit polynomial if it is of the form
[TABLE]
with {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}r}\leq n, exponents {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\boldsymbol{\alpha}(j)}, {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\boldsymbol{\beta}}\in A, and coefficients {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}f_{\boldsymbol{\alpha}(j)}}\in\mathbb{R}_{>0}, {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}f_{\boldsymbol{\beta}}}\in\mathbb{R}, such that is a simplex with even vertices and the exponent is in the strict interior of .
For every circuit polynomial we define the corresponding circuit number as
[TABLE]
∎
One determines nonnegativity of circuit polynomials via its circuit number as follows:
Theorem 2.8** ([IdW16], Theorem 3.8).**
Let be a circuit polynomial then is nonnegative if and only if and or and .
Let
[TABLE]
Following Reznick, we define maximal mediated sets; note that these objects are well-defined due to [Rez89, Theorem 2.2]
Definition 2.9**.**
Let such that . We call a set (-)mediated if every element of is the midpoint of two distinct points in .
We define the maximal mediated set as the unique -mediated set which contains every other mediated set.
Let be a simplex. If , then we call an -simplex. If consist only of and the midpoints of the vertices, then we call an -simplex.
∎
Generalizing a result by Reznick in [Rez89], Iliman and the second author proved that maximal mediated sets are exactly the correct object for determining whether a nonnegative circuit polynomial is a sum of squares.
Theorem 2.10** ([IdW16], Theorem 5.2).**
Let be a nonnegative circuit polynomial with inner term . Then is a sum of squares if and only if is a sum of monomial squares or if .
Especially is always an SOS if is an -simplex, and is never an SOS if is an -simplex.
For further details about SONCs see e.g., [dW15, DIdW17, DKdW18, IdW16, SdW18]. A description of the dual of the SONC cone was recently provided in [DNT18], which we, however, do not need for the purpose of this article.
2.5. Comparing proof systems
In this section we introduce the notation used for comparing the proof systems presented in Section 2, from the proof complexity perspective. For a gentle introduction to proof complexity we refer the reader to e.g., [Raz16].
Following the notation in definition 2.1: Let be a set of polynomials which we axiomatically assume to be nonnegative and be a set of polynomials, which form the semialgebraic set . The GEN proof system is the set of all algebraic derivations such that deducing nonnegativity of polynomials over . Analogously, the GEN proof system is the set of all algebraic derivations such that deducing nonnegativity of polynomials over . The proof systems SOS, SOS, SDSOS, SDSOS, SA, SA, SONC and SONC are defined analogously.
The complexity of the certificate depends on the needed to certify the nonnegativity. Revising definition 1.1 we say that a proof system * contains* a proof system if for every set of polynomials and a polynomial admitting a degree certificate of nonnegativity over in , admits also a degree certificate in . A system * strictly contains* if contains but does not contain . I.e., there exist at least one set and a polynomial nonnegative over such that admits a degree certificate in but for every does not admit a degree certificate in . Systems and are * polynomially equivalent* if contains and contains . Finally, systems and are * polynomially incomparable* if neither contains , nor contains . I.e., there exist sets , and polynomials , nonnegative over , , respectively, such that admits a degree certificate in but for every does not admit a degree certificate in and admits a degree certificate in but for every does not admit a degree certificate in .
3. SOS vs. SONC
It is well-known that the cone and cone are not contained in each other [IdW16, Proposition 7.2] This statement, however, gives no prediction whether or not for CPOPs these systems are polynomially equivalent or not. In this section we show that for every there exist CPOPs such that the difference between the minimal degrees of a SOS and a SONC certificate is arbitrarily large and vice versa.
3.1. SONC does not contain SOS
We consider the following family of polynomials:
Definition 3.1**.**
We define the family of signed quadrics by {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}N_{n}}:=\left(1-\sum_{j=1}^{n}x_{j}\right)^{2}. ∎
It is obvious that every is SOS and that its zero set is the unit ball of the 1-norm, i.e., for all we have
[TABLE]
The support of and is depicted together with their Newton polytopes in figure 2.
It is known that for every the function cannot be written as a combination of -juntas [Lee15, Theorem 1.12]. It is, however, also straightforward to conclude that for every the polynomial is not a SONC polynomial.
Lemma 3.2**.**
For all it holds that .
Proof.
By equation 3.1 the real zero set is equal to the boundary of the -dimensional cross-polytope; see e.g., [Zie07]. In particular, it is an dimensional piecewise-linear set. A SONC, however, has at most many distinct real zeros by [IdW16, Corollary 3.9]. ∎
In [SdW18, Example 3.7] it is shown that is not a SONC due to a term by term inspection. We point out that one could build over that argument and reprove inductively lemma 3.2 using the fact that the support set of equals the restriction of the support set of restricted to a specific -face of .
Corollary 3.3**.**
For every with and every there exist infinitely many systems such that .
Proof.
Let be fixed. Consider a system
[TABLE]
where is the signed quadric and is a system of polynomials such that , , and compact.
On the one hand, there exists an SOS certificate of degree for the system equation 3.2 given by alone, as is already an SOS and moreover .
On the other hand, due to the Positivstellensatz result for SONCs [DIdW17, Theorem 4.8], there exists a SONC certificate of the form , for some value of . But since is not a SONC due to lemma 3.2 the certificate necessarily has to incorporate at least one of the constraints defining the set . Hence, . ∎
3.2. SOS does not contain SONC
In this section we show the inverse of the result from subsection 3.1, namely that SONC is not contained in SOS.
Definition 3.4** (Generalized Motzkin Polynomial).**
Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbf{e}}:=\sum_{j=1}^{n}\mathbf{e}_{j}. For every with we define the Generalized Motzkin Polynomial as
[TABLE]
∎
Note the is the usual * Motzkin polynomial* . The support of and is depicted together with their Newton polytopes in figure 3.
Proposition 3.5**.**
For every we have , but , and moreover .
Note that not being for was in fact already shown by Motzkin (not only ); see [Mot67] and see also [Rez89, Section 6]. Furthermore, being an -simplex, which implies not being was shown by Reznick in [Rez89, Theorem 6.9]. We provide an own proof of all these facts here for convenience of the reader.
Proof.
Let . First, we show that is a nonnegative circuit polynomial which vanishes exactly on the Boolean hypercube .
According to definition 2.7 is a circuit polynomial with inner term . A straightforward computation yields for the barycentric coordinates for every and the circuit number satisfies . Thus, is nonnegative by theorem 2.8. As mentioned before, every nonnegative circuit polynomial has at most one zero on every orthant [IdW16, Corollary 3.9]. Moreover, an evaluation shows that for every . Thus, .
Second, we show that is not a sum of squares. Since is a nonnegative circuit polynomial, which is not a sum of monomial squares, this is equivalent to the fact that the lattice point does not belong to the maximal mediated set by theorem 2.10. Here, we show more generally that is even an -simplex, which implies . We observe:
[TABLE]
This follows from
[TABLE]
and the fact that every lattice point with satisfies either , or due to , for some with .
We have . By definition, every point in is the midpoint of two distinct points in . This is impossible due to (3.3), (3.4), and the fact that the convex combination in (3.4) is unique since is a simplex. Thus, and is an -simplex by [Rez89, Theorem 2.5]. ∎
Corollary 3.6**.**
For every with and every there exist infinitely many systems such that .
Proof.
Let be fixed. Consider a system
[TABLE]
where is the generalized Motzkin polynomial and is a system of polynomials such that , and compact.
On the one hand, there exists a SONC certificate of degree for the system equation 3.5 given by alone as is already a SONC by Proposition 3.5 and moreover .
On the other hand, due to theorem 2.3, there exists an SOS certificate of the form for some . But since is not a SOS due to Proposition 3.5, the certificate has to necessarily involve at least one constraint from . Thus . ∎
Corollary 3.7**.**
The pairs of systems and ; and ; and ; and are polynomially incomparable.
Proof.
Follows immediately from corollary 3.3 and corollary 3.6. ∎
4. SDSOS vs SONC
In this section, we show that for constrained polynomial optimization problems CPOPs, SONC strictly contains SDSOS. The same relations holds for the SONC∗ and for SDSOS∗ algorithms.
4.1. SDSOS is SONC
We start with proving that every SDSOS polynomial of degree is also a SONC polynomial of degree .
Lemma 4.1**.**
Every scaled diagonally dominant polynomial is a circuit polynomial.
Proof.
By [AM14] we know that every scaled diagonally dominant polynomial can be written as a sum of binomial squares, i.e., of the form , for , where are monomials and . Thus, , where for every index in the summation. Moreover, by definition 2.7, for every , is a one dimensional simplex with two even vertices , given by the exponents of the squared monomials, and the exponent of the term is in the strict interior of since . Finally, the circuit number is equal to , thus by theorem 2.8, is a nonnegative circuit polynomial. ∎
We note that a similar statement to lemma 4.1 was very recently, independently observed in [CMW18].
Corollary 4.2**.**
For every we have and .
Proof.
Assume that for some polynomial and there exists a such that . We show that necessarily it has to hold . Since there necessarily exist SDSOS polynomials such that for and for every . Moreover, by lemma 4.1 every SDSOS polynomial is a SONC polynomial, thus we have . The proof works analogously for the second inequality. ∎
It remains to show that the for every there exist CPOPs such that the ratio of the minimal degrees of a SDSOS and a SONC is not bounded by a constant.
Corollary 4.3**.**
SONC proof system strictly contains SDSOS proof system and the same relation holds for SONC and SDSOS.
Proof.
The corollary follows from corollary 4.2 and the fact that () together with Proposition 3.5. ∎
As a consequence we show that there exists no equivalent of Putinar’s Positivstellensatz for SDSOS algorithm.
Corollary 4.4**.**
There exist and infinitely many systems of polynomials such that is Archimedean, for all but for all , .
Proof.
By constructing an explicit example, it was shown in [DKdW18, Theorem 5.1] that the equivalent statement holds for the SONC case. Thus, the statement follows immediately from corollary 4.2. ∎
4.2. Closures under Changes of Bases and Relations to SOCP
In the rest of this section we provide two observations regarding the behaviour of and under a change of bases and their relation to second order cone programming.
In [DIdW17, Lemma 4.1] the authors showed that the SONC cone is not closed under multiplication, i.e., if , then this does not imply in general. Moreover, is not closed under affine transformations or more generally a change of bases; see [DKdW18, Corollary 3.2]. These results are in sharp contrast to the SOS cone, which is closed both under multiplication and under a change of bases. Similarly as for SONC, it is well-known that is not closed under multiplication, and a change of bases; see e.g., [AH17].
More precisely, we define
[TABLE]
analogously for {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\operatorname{cls}(\mathbb{SONC})}.
Currently, it is an open problem in the community to decide whether the closure of with respect to a change of bases equals .
Problem 4.5**.**
Is ?
From the results of this section we obtain the following corollary.
Corollary 4.6**.**
If , then .
Proof.
Follows directly from lemma 4.1, which does not depend on the chosen basis and ; see [IdW16, Proposition 7.2]. ∎
Moreover we obtain the following consequence about the relation of and second order cone programming . Since we use ’s only in the following corollary, we omit a full definition of SOCP and refer the reader to the standard literature like [BV04]. The bounds {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}{f}_{\mathbb{SOCP},\mathcal{G}}^{2d}} and {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}{f}_{\mathbb{SOCP},\operatorname{Prep}(\mathcal{G})}^{2d}} are defined analogously to the other hierarchies.
Corollary 4.7**.**
For every we have and .
Proof.
It was shown by Ahmadi and Majumdar that every certificate is in [AM17, Theorem 10], and, very recently, that every certificate is by Ding and Lim [DL18, Theorem 3.3]. Thus, the statement follows immediately from corollary 4.2. ∎
We remark that, however, Averkov [Ave18, Theorem 17] recently showed that the semidefinite extension degree of SONC equals two, and thus SONC is an SOCP-lift; see also [GPT15] for further details on lifts.
5. Hierarchies on the Boolean Hypercube
In this section we prove the dependencies between various hierarchies on the Boolean hypercube .
Let, for this section, be a collection of polynomials such that for all we have and for , such that . Let be an arbitrary class of polynomials, which are nonnegative on the Boolean hypercube. We consider the corresponding optimization problem (2.1). We start with proving a general statement saying that every proof certificate that can certify nonnegativity of an -variate polynomial over the unconstrained Boolean hypercube with an degree certificate is at least as strong as the hierarchy.
Theorem 5.1**.**
Let . Assume that there exists a such that for every , and such that
[TABLE]
Then for every finite set of polynomial constraints with and for every with it holds that .
Proof.
Consider a polynomial , a set and a real number such that . We show that . Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}m_{2d}^{{}^{\prime}}} be the cardinality of the set restricted to polynomials of degree at most . By definition, implies that there exists a certificate , for such that every polynomial is of the form such that every is a nonnegative -junta with {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}k_{i}}:=\lfloor\frac{2d-\deg(G_{i})}{2}\rfloor.
For every we consider a set of polynomials such that and . Let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}m_{i,2d}^{{}^{\prime\prime}}} be the cardinality of the set restricted to polynomials of degree at most . By the assumption (5.1) there exists a such that every -variate polynomial, which is nonnegative over the Boolean hypercube , has a degree certificate using polynomials in . Thus, we have in particular for every and . Hence, we can write such that , , and for every we have . In summary, we obtain:
[TABLE]
where for every , , and the degree is at most , and hence
[TABLE]
By the containment , for every , we get that
[TABLE]
and the statement follows. ∎
5.1. Properties of the proof system
In theorem 5.1 we gave a sufficient condition for the proof system to be at least as strong as SA∗. As a consequence every proof system satisfying that condition attains the properties of the SA proof system. In particular, this applies to the conditioning property, which has been widely used to construct algorithmic results for various BCPOP problem, see e.g., [LR16]. In what follows we provide a formal description of the property.
Lemma 5.2** (Conditioning).**
For every , let {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{L}} be the linear operator feasible for . Let be an index such that . We define {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{L}_{i,(0)}},{\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathbb{L}_{i,(1)}}\leavevmode\nobreak\ :\leavevmode\nobreak\ \mathbb{R}[\boldsymbol{x}]\to\mathbb{R} such that:
[TABLE]
Then it holds that . Moreover, both operators are feasible for .
The proof can be found in e.g., [Rot13, Lemma 2]. Note that for every and satisfying the requirements in lemma 5.2 we have . lemma 5.2, applied iteratively implies for every set with that every linear operator feasible for can be written as a convex combination of linear operators, feasible for , that maps variables with indices in to 0 or 1. In other words, such that for every and we have .
Example 5.3**.**
Consider a set of polynomials . The feasibility set of is the convex hull of its integral solutions.The feasibility set for is shown in figure 4 a. The feasibility set for a standard relaxation (replacing the integrality constraints with constraints ) that corresponds to the feasibility region of can be seen in figure 4 b. The feasibility set of linear operators feasible for that additionally satisfies the property of being expressed as a convex combinations of operators integral on variable () can be seen in figure 4 c (d), respectively. Finally, the set of operators feasible for can be seen in figure 4 e. Note that the feasibility region in figure 4 e is an intersection of regions in figure 4 c and figure 4. ∎
In the construction of the algorithm the conditioning property allows to take the solution of and choose a crucial variable that was assigned a fractional value and ask this value to be either 0 or 1, depending of the users preferences. The resulting solution is conditioned to be integral on this variable and stays feasible for degree . Clearly, the more variables are conditioned to be integral, the higher is the required degree of the solution.
In general the conditioning property might not be easy to prove for a dual formulation of a given proof system. Assume that is such that the corresponding conic program admits no duality gap (the minimum value of the primal problem equals the maximum value of the dual program). Note that this assumption is necessary since theorem 5.1 provides arguments on the primal side, namely the semialgebraic proof system. If the corresponding conic program admits a duality gap, then the implications of theorem 5.1 on the dual side might not be correct.
In what follows we provide a corollary giving sufficient conditions for the proof system to admit the conditioning property.
Corollary 5.4**.**
Every proof system satisfying the requirements of theorem 5.1 admits the conditioning property stated in lemma 5.2.
5.2. SONC vs SA
In this section we show two results. First, we show that is at least as strong as , meaning that SONC contains SA. Second, by showing that every circuit polynomial is a -junta, we show that SA is at least as strong as SONC and the same holds for systems strengthened with a Schmüdgen-like Positivstellensatz i.e., SA contains SONC and SA contains SONC. As a result we get that SA and SONC are polynomially equivalent.
Lemma 5.5**.**
There exists a such that for every we get , meaning that SONC contains SA.
Proof.
By [DKdW18, Theorem 4.7] the satisfies the condition of theorem 5.1 and thus the proof follows. ∎
Next we prove that the inverse of the inequality in lemma 5.5 also holds over the Boolean hypercube. We start with a technical lemma.
Lemma 5.6**.**
Let be a circuit polynomial of total degree . Then is a -junta.
Proof.
Consider a nonnegative circuit polynomial of the form
[TABLE]
Since, by the definition 2.7, with , and we have for at least one . Thus, the total degree of satisfies .
This means on the contrary, if is of degree , then the homogenization of contains at most many variables, which implies that is a -junta by definition, see subsection 2.3. ∎
We obtain the following corollary:
Corollary 5.7**.**
Let . For every we have and . In other words SA contains SONC and SA contains SONC.
Proof.
Assume that there exists a degree- SONC certificate for of the form such that the are SONCs and . For every we write where and every is a nonnegative circuit polynomial satisfying . By lemma 5.6, every is a -junta, which yields the first inequality. For the second inequality the proof works analogously with . ∎
5.3. SONC vs. SDSOS
In section 4, we saw that in general every SDSOS certificate is a SONC certificate, but not vice versa. Here, we show that the situation is more special on the Boolean hypercube: It turns out that the relation of the two certificates depends on the type of hierarchy, which we allow for SDSOS. We show the following result:
Theorem 5.8**.**
Let be as before with . Then for all we have the following dependencies ; ; . This implies that SONC is polynomially equivalent with SDSOS, SONC strictly contains SDSOS and SONC contains SDSOS.
Proof.
Following [DKdW18][Definition 10], for every the function
[TABLE]
is called the Kronecker delta (function) of the vector . By [DKdW18, Theorem 12] has a certificate of the form
[TABLE]
where are SONCs of degree at most , , , and .
Part (1): By [DKdW18, Lemma 14], . Thus, by corollary 4.2, it only remains to show that every SONC involved in the certificate is a binomial square. But this was shown in Case 2 of the proof of [DKdW18, Theorem 16].
Part (2): Follows immediately from the fact that the example from [DKdW18, Theorem 19] to prove corollary 4.4 is an example defined over the Boolean hypercube.
Part (3): Follows from the fact that, by lemma 5.6, every SDSOS certificate can be rewritten as the SONC certificate. ∎
5.4. SDSOS vs SA
In this section we show that for BCPOPs, SA contains SDSOS.
Theorem 5.9**.**
Let be as before with . Let . For every we have .
The theorem follows immediately from corollary 5.7 and theorem 5.8. We provide, however, an independent proof here, which works on the dual side, and which we consider to be individually interesting.
For every set we denote its power set by {\color[rgb]{0.2,0.2,0.75}\definecolor[named]{pgfstrokecolor}{rgb}{0.2,0.2,0.75}\mathcal{P}(K)}.
Proof.
Consider the problem
[TABLE]
This problem is a relaxation of the problem , see (2.2), since, by Sylvester’s Theorem, a symmetric matrix is PSD if and only if all the principal submatrices are PSD.
First, we show that the degree Sherali Adams is equivalent to (5.4), a similar reasoning can be found also e.g., in [Lau03, Section 3.2, Equation (19)]. Consider a constraint and a set , such that . A Möbius matrix , , is a square matrix indexed by subsets of such that if and otherwise. Compute a matrix:
[TABLE]
One can check that {D_{g}^{4d}}_{\big{|}\mathcal{P}(K)} is a diagonal matrix with entries
[TABLE]
see. e.g. [Lau03, Lemma 2].
Finally, since Z_{K}^{-1}{M_{g}^{4d}}_{\big{|}\mathcal{P}(K)}\left(Z_{K}^{-1}\right)^{\top} is a congruent transformation of {M_{g}^{4d}}_{\big{|}\mathcal{P}(K)} that preserves eigenvalues, we get that {M_{g}^{4d}}_{\big{|}\mathcal{P}(K)}\succeq 0\Leftrightarrow{D_{g}^{4d}}_{\big{|}\mathcal{P}(K)}\succeq 0\Leftrightarrow{D_{g}^{4d}}_{\big{|}\mathcal{P}(K)}(I,I)\geq 0, for all . Setting we get a one-to-one mapping to functions in .
Next, we show that every linear operator that is feasible for is also feasible for , that is, for every and every , such that the matrix {M_{g}^{2d}}_{\big{|}\{K,L\}} is PSD.
Consider the set . Since , for we have . Now consider a submatrix {M_{g}^{4d}}_{\big{|}\mathcal{P}(H)}. Every linear operator that is feasible for has to satisfy {M_{g}^{4d}}_{\big{|}\mathcal{P}(H)}\succeq 0. Finally, since {M_{g}^{2d}}_{\big{|}\{K,L\}} is a principal submatrix of {M_{g}^{4d}}_{\big{|}\mathcal{P}(H)}, by the Sylvester’s criterion the linear operator that is feasible for has to satisfy also {M_{g}^{2d}}_{\big{|}\{K,L\}}\succeq 0. This finishes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AH 17] A. A. Ahmadi and G. Hall, Sum of squares basis pursuit with linear and second order cone programming. , Algebraic and geometric methods in discrete mathematics. AMS special session on algebraic and geometric methods in applied discrete mathematics, San Antonio, TX, USA, January 11, 2015. Proceedings, Providence, RI: American Mathematical Society (AMS), 2017, pp. 27–53 (English).
- 2[AM 13] A. Atserias and E. Maneva, Sherali–adams relaxations and indistinguishability in counting logics , SIAM Journal on Computing 42 (2013), no. 1, 112–137.
- 3[AM 14] A. A. Ahmadi and A. Majumdar, DSOS and SDSOS optimization: LP and socp-based alternatives to sum of squares optimization , 48th Annual Conference on Information Sciences and Systems, CISS 2014, Princeton, NJ, USA, March 19-21, 2014, 2014, pp. 1–5.
- 4[AM 17] by same author, DSOS and SDSOS optimization: More tractable alternatives to sum of squares and semidefinite optimization , Co RR abs/1706.02586 (2017).
- 5[AMT 14] A. A. Ahmadi, A. Majumdar, and R. Tedrake, Control and verification of high-dimensional systems with DSOS and SDSOS programming , CDC 2014, Los Angeles, CA, USA, December 15-17, 2014, 2014, pp. 394–401.
- 6[ARV 09] S. Arora, S. Rao, and U. V. Vazirani, Expander flows, geometric embeddings and graph partitioning , J. ACM 56 (2009), no. 2, 5:1–5:37.
- 7[Ave 18] G. Averkov, Optimal size of linear matrix inequalities in semidefinite approaches to polynomial optimization , 2018, Preprint; see ar Xiv:1806.08656.
- 8[BC 75] G. P. Barker and D. Carlson, Cones of diagonally dominant matrices. , Pacific J. Math. 57 (1975), no. 1, 15–32.
