Certifying polynomial nonnegativity via hyperbolic optimization
James Saunderson

TL;DR
This paper introduces a novel method for certifying polynomial nonnegativity using hyperbolic optimization, expanding the toolkit beyond sum of squares and revealing new properties of hyperbolic polynomials.
Contribution
It develops hyperbolic certificates of nonnegativity from hyperbolic polynomials and analyzes their differences from sums of squares, including explicit examples and complexity results.
Findings
Hyperbolic certificates can certify nonnegativity beyond sums of squares.
Existence of hyperbolic polynomials with certificates not representable as sums of squares for certain degrees and variables.
Deciding hyperbolicity of a cubic polynomial is co-NP hard.
Abstract
We describe a new approach to certifying the global nonnegativity of multivariate polynomials by solving hyperbolic optimization problems---a class of convex optimization problems that generalize semidefinite programs. We show how to produce families of nonnegative polynomials (which we call hyperbolic certificates of nonnegativity) from any hyperbolic polynomial. We investigate the pairs for which there is a hyperbolic polynomial of degree in variables such that an associated hyperbolic certificate of nonnegativity is not a sum of squares. If we show that this occurs whenever . In the degree three case, we find an explicit hyperbolic cubic in variables that gives hyperbolic certificates that are not sums of squares. As a corollary, we obtain the first known hyperbolic cubic no power of which has a definite determinantal representation. OurâŠ
| ? | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Certifying polynomial non-negativity
via hyperbolic optimization
James Saunderson Department of Electrical and Computer Systems Engineering, Monash University, VIC 3800, Australia. [email protected]
Abstract
We describe a new approach to certifying the global nonnegativity of multivariate polynomials by solving hyperbolic optimization problemsâa class of convex optimization problems that generalize semidefinite programs. We show how to produce families of nonnegative polynomials (which we call hyperbolic certificates of nonnegativity) from any hyperbolic polynomial. We investigate the pairs for which there is a hyperbolic polynomial of degree in variables such that an associated hyperbolic certificate of nonnegativity is not a sum of squares. If we show that this occurs whenever . In the degree three case, we find an explicit hyperbolic cubic in variables that gives hyperbolic certificates that are not sums of squares. As a corollary, we obtain the first known hyperbolic cubic no power of which has a definite determinantal representation. Our approach also allows us to show that, given a cubic , and a direction , the decision problem âIs hyperbolic with respect to ?â is co-NP hard.
1 Introduction
The problem of deciding nonnegativity of a multivariate polynomial is central to solving optimization and feasibility problems expressed in terms of polynomials. These, in turn, arise naturally in a wide range of applications including control systems and robotics, combinatorial optimization, game theory, and quantum information (see Section 1.2 for further discussion, and the edited volume [BPT12] for an introduction to these ideas).
One of the benefits of a polynomial formulation of an optimization problem is that one can then construct a hierarchy of more tractable ârelaxationsâ of the problem, based on the fact that being a sum of squares of polynomials is a sufficient condition for a polynomial to be nonnegative. Deciding whether a polynomial is a sum of squares can be formulated as a semidefinite programming feasibility problem [Sho87, Nes00, Par03, Las01]. While such semidefinite programming-based relaxations of semialgebraic problems have proven very useful for a range of problems, there has been notable recent progress (both qualitative [Sch18] and quantitative [LRS15]) demonstrating the limitations of this approach.
Hyperbolic optimization problems (or hyperbolic programs) are a family of convex optimization problems that generalize semidefinite optimization problems [GĂŒl97]. These involve maximizing a linear functional over the intersection of an affine subspace and a hyperbolicity cone (a convex cone constructed from a hyperbolic polynomial, which is a multivariate polynomial with certain real-rootedness properties that we define precisely in Section 2). Hyperbolic programs enjoy many of the good algorithmic properties of semidefinite programs [GĂŒl97]. Despite this, algorithms for hyperbolic programming are less mature than those for semidefinite programming. Indeed, much of the recent research on hyperbolic programming has focused on trying to reformulate various classes of hyperbolic programs as semidefinite programs, or on related geometric questions associated with the âgeneralized Lax conjectureâ and its variants [Ami19, BrĂ€14, KPV15, SP15, Sau18]. This lack of algorithmic development may be the result of not having generic ways to produce hyperbolic programming-based formulations and/or relaxations of polynomial optimization problems, beyond those already captured by sums of squares.
This paper introduces ways to construct families of nonnegative polynomials that can be searched over via hyperbolic optimization. We call these hyperbolic certificates of nonnegativity. By appropriately choosing the data that specify such a family of nonnegative polynomials, we can recover sums of squares certificates of nonnegativity. As such, using the ideas in this paper, we could construct hyperbolic programming-based relaxations of polynomial feasibility and optimization problems by replacing sums of squares relaxations with relaxations based on hyperbolic certificates of nonnegativity. One significant challenge, not addressed in this paper, is that many choices need to be made to specify a family of hyperbolic certificates of nonnegativity. Given a specific structured class of polynomial optimization problems, it is currently unclear which choices might be appropriate to obtain strong hyperbolic programming-based relaxations using the framework presented in this paper.
A main focus of the paper is the construction of hyperbolic polynomials for which the associated hyperbolic certificates of nonnegativity are not sums of squares. These are of interest because they have the potential to form the basis of hyperbolic programming-based relaxations of polynomial optimization problems that are stronger than semidefinite programming-based relaxations of comparable complexity.
1.1 Contributions
We now discuss the contributions of the paper in more technical detail, and indicate where the main results appear in the paper.
Hyperbolic certificates of nonnegativity
Given a hyperbolic polynomial of degree in variables and a direction of hyperbolicity (see Section 2.2 for the definition of these terms), we construct a polynomial map
[TABLE]
from to linear functionals on such that for all if, and only if, is in the hyperbolicity cone associated with and (Theorem 3.7). This slightly extends a related construction due to Kummer, Plaumann, and Vinzant [KPV15].
If and are polynomial maps, then is a nonnegative polynomial in whenever is in the hyperbolicity cone associated with and . If a polynomial can be written this way for a choice of hyperbolic polynomial , direction of hyperbolicity , and polynomial maps and , we say it has a hyperbolic certificate of nonnegativity (see Definition 3.12). We can search for such a description of a polynomial (for fixed , , , and ) by solving a hyperbolic optimization problem. Moreover, by appropriately specifying these data, we can recover sums of squares certificates of nonnegativity (Proposition 3.13).
Hyperbolic and SOS-hyperbolic polynomials
If all nonnegative polynomials of the form are sums of squares, we say that is SOS-hyperbolic with respect to (see Definition 4.2). From the point of view of this paper, we are most interested in hyperbolic polynomials that are not SOS-hyperbolic, since these give rise to tractable families of nonnegative polynomials that go beyond sums of squares.
In Sections 4 and 5 we investigate the degrees , and numbers of variables , for which there is a hyperbolic polynomial that is not SOS-hyperbolic. In Section 4 we characterize when this happens for by showing that the specialized Våmos polynomial,
[TABLE]
is hyperbolic but not SOS-hyperbolic (Proposition 4.9), and showing how to take a hyperbolic but not SOS-hyperbolic polynomial and increase its number of variables, or its degree, and maintain this property (Propositions 4.10 and 4.13). Variations of the specialized Våmos polynomial have been studied by BrÀndén [BrÀ11], Kummer [Kum16], Kummer, Plaumann, and Vinzant [KPV15], Burton, Vinzant, and Youm [BVY14], and Amini and BrÀndén [AB18]. Notably, [KPV15, Example 5.11] shows that a closely related quartic in five variables is hyperbolic but not SOS-hyperbolic.
In Section 5 we study the case , and obtain a partial classification of when hyperbolicity and SOS-hyperbolicity coincide. We show how to construct, from a simple graph, a one-parameter family of cubic polynomials that are hyperbolic if, and only if, the parameter is bounded by the clique number of the graph. This allows us to establish co-NP-hardness of deciding hyperbolicity of cubic polynomials (Theorem 5.4). By choosing to be the icosahedral graph, (with vertices and edges), we obtain a hyperbolic cubic
[TABLE]
in variables that we show is not SOS-hyperbolic.
The known cases where all hyperbolic polynomials are SOS-hyperbolic occur because, if a power of a polynomial has a definite determinantal representation, then it is SOS-hyperbolic (Proposition 4.7). This is a common generalization of results due to Kummer, Plaumann, and Vinzant [KPV15], and Netzer, Plaumann, and Thom [NPT13]. As such, any hyperbolic polynomial that is not SOS-hyperbolic is a hyperbolic polynomial for which no power has a definite determinantal representation. In particular, the cubic polynomial (1) associated with the icosahedral graph, appears to be the first reported hyperbolic cubic with this property.
Parameterizing the dual cone
The dual cones of hyperbolicity cones are, in general, not well understood. In Section 7 we show that the polynomial map almost (i.e., up to closure) parameterizes the dual of the associated hyperbolicity cone (Theorem 7.2). We then discuss specific situations in which the map exactly parameterizes the closed dual cone, giving new âhidden convexityâ results, where the image of a polynomial map is convex.
1.2 Applications via polynomial optimization
We now discuss, in a little more detail, the connection between certifying nonnegativity of polynomials and polynomial optimization, and touch on some of the myriad applications of polynomial optimization.
Polynomial optimization problems involve the minimization of a multivariate polynomial over a set defined by polynomial inequality and equality constraints. By introducing additional variables, one can reduce to the case of minimization over the real points of an algebraic variety defined by polynomial equations, i.e.,
[TABLE]
For instance, if , for a linear map , then (2) includes the problem of computing the nearest point to a given variety [DHO*+*16], or more generally, the distance from an affine space to a variety. Such problems can often be interpreted as structured linear inverse problems, in which we seek to estimate a structured object (such as a rank one tensor) from noisy linear measurements, represented by . Problems as diverse as noisy low-rank tensor completion [BM16] and simultaneous localization and mapping (SLAM) in robotics [RDL15] can be expressed in this form. Other cases of (2), in which , capture combinatorial optimization problems. For instance, in error correction coding, the problem of decoding binary linear codes can be formulated as the optimization of a linear functional over the real variety defined by the codewords [FWK05, GLPT12].
The polynomial optimization problem (2) can be reformulated as the maximization of a lower bound on over the real points of the variety, i.e.,
[TABLE]
This leads us to the connection with certificates of nonnegativity. If we can find a globally nonnegative polynomial such that agrees with on , it follows that is a lower bound on the objective value of (2). We can make this computationally tractable by searching over families of nonnegative polynomials with tractable certificates of nonnegativity, such as sums of squares of degree at most . This approach gives rise to semidefinite programming relaxations of polynomial optimization problems.
Hyperbolic certificates of nonnegativity, introduced in this paper, expand the scope of tractable relaxations possible for polynomial optimization problems. If we fix a hyperbolic polynomial , a direction of hyperbolicity , and polynomial maps and , we can obtain a hyperbolic programming relaxation of (2) as
[TABLE]
Depending how how is represented, there are different approaches to making the equality constraints, which are linear in and , explicit. If it is straightforward to sample from the irreducible components of , the sampling-based approach of Cifuentes and Parrilo [CP17] could be used. Otherwise one can use methods based on Gröbner bases [PP12]. We show, in Proposition 3.13, that there are ways to choose , , , and so that (3) recovers a sum-of-squares relaxation. However, it may be possible to make other choices that are more tailored to the structure of the original polynomial optimization problem of interest. Such targeted choices may have advantages, such as giving relaxations that can be solved more efficiently than a semidefinite programming-based relaxation, and yet give comparable bounds on the optimal value.
2 Preliminaries
2.1 Basic notation
Let denote polynomials with real coefficients, homogeneous of degree in the indeterminates . Let denote real symmetric matrices and let denote symmetric matrices with entries in . If and , then is the directional derivative of in the direction . For brevity we write .
If is a symmetric matrix, we write to mean that is positive semidefinite. We use to denote the -dimensional real vector space and to denote the dual space of linear functionals on . If and , we use the notation for the image of under . Throughout, we let denote the standard basis vectors, so that is zero except for the th entry, which is one. Occasionally it is convenient to use as the standard basis for . If , we use to denote the Euclidean norm.
2.2 Hyperbolic polynomials, hyperbolic eigenvalues, and hyperbolicity cones
A homogeneous polynomial is hyperbolic with respect to if and, for all , the univariate polynomial has only real zeros. Throughout we let denote the set of polynomials homogeneous in variables of degree that are hyperbolic with respect to . If and , we denote by the zeros of , and often refer to these as the hyperbolic eigenvalues of . These depend on the choice of , but we will usually suppress this in our notation. Associated with a hyperbolic polynomial is the closed hyperbolicity cone
[TABLE]
This is a convex cone, a result due to GÄrding [GÄr59]. A hyperbolic polynomial is complete if . The hyperbolicity cones of complete hyperbolic polynomials are pointed, in the sense that .
The following result describes how the eigenvalue functions change, and appears in a number of slightly different formulations in the literature [GĂŒl97, HLJ13, ABG70]. Note that the functions appearing below are just a particular choice of ordering of the eigenvalues of .
Theorem 2.1** ([ABG70, Lemma 3.27]).**
If and , then
[TABLE]
where the functions are real analytic functions of with the property that if , then for all .
The fact that whenever is in the hyperbolicity cone is the key property from which essentially all nonnegativity statements in this paper can be derived.
2.3 Hyperbolic programming
If is hyperbolic with respect to , and is a linear functional, a convex optimization problem of the form
[TABLE]
is known as a hyperbolic optimization problem. GĂŒler [GĂŒl97] showed that is a self-concordant barrier function for the cone . As such, hyperbolic optimization problems can be solved using interior point methods as long as the polynomial can be evaluated efficiently. More recently, other algorithmic approaches to solving hyperbolic optimization problems have been developed, including primal-dual interior point methods [MT14], affine scaling methods [RS14], first-order methods based on applying a subgradient method to a transformation of the problem [Ren16] and accelerated modifications tailored for hyperbolic programs [Ren19].
2.4 Definite determinantal representations
If are symmetric matrices and satisfies , then the polynomial
[TABLE]
is homogeneous of degree and is hyperbolic with respect to . We say that a polynomial has a definite determinantal representation if it can be expressed in the form (4) for some symmetric matrices such that . In this case its hyperbolicity cone is a spectrahedron, and has the form .
2.5 Bézoutians and Hankel matrices
The nonnegative polynomials we construct from hyperbolic polynomials will come from the positive semidefiniteness of Bézoutians of certain pairs of polynomials, or of Hankel matrices associated with certain rational functions. In this section we summarize some basic facts about these objects and the relationships between them. These can be found, for instance, in [BMO*+*11] or [KN81, Section 2.1]. We give proofs of some of these results in Appendix A to make the paper more self-contained.
If and are univariate polynomials with , define the Hankel matrix
[TABLE]
If and are univariate polynomials with , the Bézoutian is the matrix defined via the identity
[TABLE]
If , then the Bézoutian is zero except in the upper left block.
Under appropriate assumptions on and , the Bézoutian and the Hankel matrix are related by a unimodular congruence transformation.
Proposition 2.2**.**
Suppose is a monic polynomial of degree and is a polynomial of degree at most . If , there exists an unimodular matrix with entries that are linear in the coefficients of , such that .
Proof.
See Appendix A. â
Certain linear transformations on polynomials give rise to particularly nice congruence transformations on Bézoutians.
Lemma 2.3**.**
Let and be univariate polynomials of degree at most . Let and be shifted versions of those polynomials. Then
[TABLE]
with the convention that if .
Proof.
Combine Theorem 2.7 of [BMO*+*11] with the discussion on page 34 of [BMO*+*11]. â
3 Hyperbolicity cones as sections of nonnegative polynomials
In this section we show how to construct, from a hyperbolic polynomial , a subspace of polynomials for which the cone of nonnegative polynomials in the subspace is linearly isomorphic to the hyperbolicity cone . Consequently, we can optimize over nonnegative polynomials from this subspace by solving hyperbolic optimization problems.
Our approach is closely related to the following result of Kummer, Plaumann, and Vinzant.
Theorem 3.1** (Kummer, Plaumann, Vinzant [KPV15]).**
If is square-free, then if, and only if, for all .
This shows that the hyperbolicity cone is linearly isomorphic to the intersection of the cone of nonnegative polynomials in variables of degree with the subspace spanned by the polynomials for .
Our variation on Theorem 3.1 is expressed in terms of the Bézoutian, or alternatively the corresponding Hankel matrix, associated with a polynomial and its directional derivative.
Definition 3.2**.**
If and , let and let . The parameterized Bézoutian is the symmetric matrix with polynomial entries given by
[TABLE]
The parameterized Hermite matrix is the symmetric Hankel matrix with polynomial entries given by
[TABLE]
Note that and are both linear in . Moreover,
[TABLE]
This is (up to a choice of sign) exactly the parameterized Hermite matrix from [NPT13].
Example 3.3** (Parameterized Hermite matrix for the determinant).**
If is the determinant restricted to symmetric matrices, , and is a symmetric matrix, then
[TABLE]
This follows from the fact that
[TABLE]
The following relationship between the parameterized Bézoutian and Hermite matrices allows positivity statements about parameterized Bézoutians to be transferred to corresponding positivity statements about parameterized Hermite matrices, and vice versa.
Proposition 3.4**.**
If and , then there are matrices and , both with polynomial entries, such that .
Proof.
First assume that so that is monic. The result then follows from Proposition 2.2 and the fact that the coefficients of are polynomials in . Moreover, in this case has determinant one. For the general case, write where is monic. Then and so we have from which we see that which has polynomial entries. â
The following result (the essence of which goes back to Hermite), gives a characterization of hyperbolic polynomials in terms of the parameterized Hermite matrix. A statement in this form can be found, for instance, in [NPT13].
Theorem 3.5**.**
Given and , we have that if, and only if, for all .
By using Proposition 3.4, this characterization of hyperbolicity can also be expressed in terms of the parameterized Bézoutian.
Corollary 3.6**.**
Given and , we have that if, and only if, for all .
Our main result for this section shows that these tests for hyperbolicity can be extended to give a description of the full hyperbolicity cone. We defer the proof until Section 6.
Theorem 3.7**.**
If , then
[TABLE]
Remark 3.8**.**
The entry of is . It follows from Theorem 3.7 that, if , and , then . This is one direction of Kummer, Plaumann, and Vinzantâs result (Theorem 3.1).
In what follows, it is sometimes convenient to use the following variations on Theorems 3.5 and 3.7, respectively. In some arguments they allow us to reduce the number of variables in certain polynomials by one.
Corollary 3.9**.**
Let , , and let be a codimension one subspace such that . Then if, and only if, for all , which holds if, and only if, for all .
Corollary 3.10**.**
If and is a subspace such that and , then
[TABLE]
Corollaries 3.9 and 3.10 follow from the following observation about parameterized Bézoutians.
Proposition 3.11**.**
Suppose that and . Then there exists a unimodular polynomial matrix such that for all and all .
Proof.
This follows directly from Lemma 2.3 since
[TABLE]
for all and all . Furthermore, one can directly check from Lemma 2.3 that is upper triangular and . â
It follows immediately that for all if, and only if, for all whenever is a codimension one subspace of and . Corollary 3.9 then follows from Theorem 3.5 and Corollary 3.6. Similarly Corollary 3.10 then follows from Theorem 3.7.
3.1 Hyperbolic certificates of nonnegativity
One consequence of Theorem 3.7 is that if and , then both of the following polynomials
[TABLE]
are globally nonnegative in and . By composing the polynomials or with other polynomial maps we obtain further nonnegative polynomials.
Definition 3.12**.**
We say that a polynomial in variables has a hyperbolic certificate of nonnegativity with respect to if there exists and polynomial maps and such that
[TABLE]
Since the parameterized Bézoutian and Hermite matrix are the same up to a unimodular congruence transformation (see Proposition 3.4), there is no difference between using or in Definition 3.12. We can transform from one representation to another by changing appropriately. From now on, we will often write instead of unless we specifically want to work with the Bézoutian formulation.
Any polynomial that has a hyperbolic certificate of nonnegativity is nonnegative due to Theorem 3.7. Moreover, given a polynomial , the problem of searching for a hyperbolic certificate of nonnegativity of can be cast as a hyperbolic feasibility problem:
[TABLE]
which aims to find a point in the intersection of the hyperbolicity cone and the affine subspace defined by, for instance, equating coefficients in the polynomial identity (8).
Recovering sums of squares certificates
A homogeneous polynomial is a sum of squares if there is a positive integer and homogeneous polynomials such that for all . Clearly any sum of squares is nonnegative. Furthermore, it is well known that if is the vector of all monomials that are homogeneous of degree in variables then is a sum of squares if, and only if,
[TABLE]
This allows one to search for a sum of squares certificate of the nonnegativity of via solving a semidefinite feasibility problem.
If is a sum of squares, we can choose the data (, , , and ) in Definition 3.12 to give a hyperbolic certificate of nonnegativity for . This shows that our notion of hyperbolic certificates of nonnegativity captures sums of squares as a special case.
Proposition 3.13**.**
Let be a sum of squares, and let be such that . Then has a hyperbolic certificate of nonnegativity as
[TABLE]
where
[TABLE]
Proof.
From our choice of and , we see that . From (3.3),
[TABLE]
â
We have now seen that every sum of squares has a hyperbolic certificate of nonnegativity. In Section 4 we will show that there are polynomials that have hyperbolic certificates of nonnegativity, but that are not sums of squares.
4 Hyperbolic certificates and sums of squares
In this section we study conditions under which polynomials with hyperbolic certificates of nonnegativity are, or are not, sums of squares. We will often phrase this in terms of sums of squares certificates of the positive semidefiniteness of the parameterized Bézoutian (and Hermite matrix), which are matrices with polynomial entries.
Definition 4.1**.**
A symmetric matrix with polynomial entries is a matrix sum of squares if there exists a positive integer and a matrix with polynomial entries such that .
It is well known that is a matrix sum of squares if and only if the polynomial is a sum of squares in and . We will freely pass between these two equivalent definitions.
The following is the central definition of this section and Section 5. It specifies a class of hyperbolic polynomials for which we have not just a sum of squares certificate of their hyperbolicity, but also a sum of squares description of the hyperbolicity cone.
Definition 4.2**.**
If , we say that is SOS-hyperbolic with respect to if is a matrix sum of squares for all .
We use the shorthand notation for the collection of polynomials that are homogeneous of degree in variables and SOS-hyperbolic with respect to . If , and has a hyperbolic certificate of nonnegativity via an identity of the form , then is a sum of squares. As such, we are most interested in hyperbolic polynomials that are not SOS-hyperbolic, since these give new certificates of nonnegativity.
Table 1 describes our understanding of the values of and for which the sets and coincide. In this section we develop some preparatory results, establish the equality cases of the table, and the remainder of the first and third columns of the table (corresponding to and ). In Section 5 we focus on the case.
Before giving proofs establishing the entries of the table, we give a number of equivalent characterizations for polynomials that are SOS-hyperbolic with respect to some direction .
Proposition 4.3**.**
Suppose that and let and . Let be an matrix such that the matrix has full rank. Then the following are equivalent
** 2. 2.
* is a matrix sum of squares* 3. 3.
* is a matrix sum of squares* 4. 4.
* is a matrix sum of squares* 5. 5.
* is a matrix sum of squares*
Proof.
The equivalence of 1 and 2 is just the definition of being SOS-hyperbolic with respect to . The equivalence of 2 and 4 and of 3 and 5 both follow from Proposition 3.4. The equivalence of 2 and 3 follows from Proposition 3.11. â
Remark 4.4**.**
If , then can be expressed as the projection of a spectrahedron, i.e., the image of a spectrahedron under a linear map. This is because the cone of matrix sums of squares in is a linear image of the positive semidefinite cone.
To discuss the relationship between our work and that of Kummer, Plaumann, and Vinzant [KPV15], we introduce a slight variation on Definition 4.2.
Definition 4.5**.**
If , we say that is weakly SOS-hyperbolic with respect to if is a sum of squares for all .
Question 4.6**.**
Clearly, if is SOS-hyperbolic with respect to , then it is weakly SOS-hyperbolic with respect to . Under what additional assumptions on (if any) are these two notions actually equivalent?
In Proposition 4.7, to follow, we show that if a power of has a definite determinantal representation, then is SOS-hyperbolic. This is (at least formally) a slight strengthening of a result of Kummer, Plaumann, and Vinzant [KPV15, Corollary 4.3], which can be rephrased as saying that if a power of has a definite determinantal representation, then is weakly SOS-hyperbolic. Proposition 4.7 also generalizes result of Netzer, Plaumann, and Thom [NPT13, Theorem 1.6], which can be rephrased as saying that if a power of has a definite determinantal representation, then is a matrix sum of squares. The proof presented here generalizes the argument of Netzer, Plaumann, and Thom.
Proposition 4.7**.**
If and there is a positive integer such that has a definite determinantal representation, then .
Proof.
We will show that is a matrix sum of squares whenever . It then follows from Proposition 4.3 that is a matrix sum of squares whenever . Let , and note that
[TABLE]
In particular, for all . As such, it suffices to show that is a matrix sum of squares.
Since has a definite determinantal representation, we can write
[TABLE]
for real symmetric matrices such that . If , then and so has a positive semidefinite square root . In Example 3.3 we explicitly computed the matrix and obtained
[TABLE]
Here the inner product is . This is clearly a Gram matrix with factors that are polynomials in , and so is a matrix sum of squares whenever . â
All of the equality signs in Table 1 follow directly from known results about when powers of hyperbolic polynomials have definite determinantal representations. In what follows, for the cases and we give direct proofs, avoiding arguments about determinantal representations. It would be interesting to find a similar direct argument if .
Proposition 4.8**.**
If or or , then .
Proof.
The case follows from a result of Buckley and Koƥir [BK07, Theorem 6.4], which says that the square of any smooth hyperbolic cubic form in variables has a definite determinantal representation. Combining this with the fact that smooth hyperbolic polynomials are dense in all hyperbolic polynomials, it follows that the square of any hyperbolic cubic in four variables has a definite determinantal representation by a limiting argument [PV13, Proof of Corollary 4.10].
The case follows from the celebrated Helton-Vinnikov theorem [HV07] that (in its homogeneous form [LPR05]) says that has a definite determinantal representation. Here we give an alternative direct argument. If we choose a basis for , then, by Proposition 3.11,
[TABLE]
for some polynomial matrix . As such, it suffices to show that if , then is a matrix sum of squares. This is a positive semidefinite matrix-valued polynomial for which each entry is a homogeneous form in . From [BSV16, Remark 5.10] it is known that all such polynomial matrices are matrix sums of squares.
In the case , we again give a direct argument. First note that if , then for all . Moreover, since there is a positive constant , and polynomials and homogeneous of degree one and two respectively, such that
[TABLE]
As such, is a matrix sum of squares if, and only if, the nonnegative quadratic form is a sum of squares. Since any nonnegative quadratic form is a sum of squares, we are done. â
4.1 Hyperbolic certificates that are not sums of squares:
In this section we give an explicit example of a polynomial of degree four in four variables that is hyperbolic, but not SOS-hyperbolic, with respect to . The example is the specialized VĂĄmos polynomial
[TABLE]
which is hyperbolic with respect to and has hyperbolicity cone that contains the nonnegative orthant. This is one of a much larger class of hyperbolic polynomials constructed from -uniform hypergraphs by Amini and BrÀndén [AB18, Theorem 9.4]. The name arises because the basis generating polynomial of the Våmos matroid is
[TABLE]
where . The specialized Våmos polynomial is the restriction of to a four-dimensional subspace. It is known that no power of has a definite determinantal representation [BrÀ11] and that the same holds for  [Kum16].
Kummer, Plaumann, and Vinzant [KPV15, Example 5.11] showed that the hyperbolic polynomial of degree four in five variables obtained by restricting to the subspace is not SOS-hyperbolic (in our language). The following result shows that the polynomial is still not SOS-hyperbolic when further restricted to .
Proposition 4.9**.**
Let be the polynomial defined in (9). Then is hyperbolic with respect to , and , and yet is not a matrix sum of squares.
Proof.
From [AB18, Theorem 9.4] we know that is hyperbolic with respect to any point in the nonnegative orthant, so . Since it follows that is a direction of hyperbolicity for . We will show that is not a sum of squares when restricted to the subspace . Explicitly, consider the ternary sextic form
[TABLE]
We will show that is not a sum of squares by constructing a separating hyperplane with rational coefficients. Ordering the monomials of as above, we write where
[TABLE]
If is the Newton polytope of (the convex hull of the exponent vectors ), then the extreme points of are the integer points . As such, if is a sum of squares, then the are in the subspace spanned by the monomials with exponent vectors in , i.e., . Furthermore, since it follows that each must also vanish at these three points. As such, if , then the must be in the -dimensional subspace spanned by the entries
[TABLE]
and so, if were a sum of squares, then there would be such that
[TABLE]
To show that this is impossible, define a linear functional on the span of the by
[TABLE]
The linear functional was obtained by solving a semidefinite feasibility problem numerically and rounding the solution. It satisfies . If we apply to the entries of , we obtain
[TABLE]
Applying to both sides of (10) would give, , a contradiction. â
4.2 Increasing the degree and number of variables
We now describe a particular way to take any hyperbolic polynomial that is not SOS-hyperbolic and construct from it a hyperbolic polynomial that has larger degree and/or more variables, that is not SOS-hyperbolic. We begin by showing how to increase the degree alone.
Proposition 4.10**.**
Suppose that is such that is not a matrix sum of squares, and is a linear form such that and . Let . Then , , and is not a matrix sum of squares.
Proof.
Since is linear and it follows that is hyperbolic with respect to and . Now so . Furthermore, since ,
[TABLE]
Then . As such, contains as the upper-left submatrix. It follows that if is not a matrix sum of squares, then , and consequently and , are not matrix sums of squares. â
Next we discuss one way to take a hyperbolic polynomial that is not SOS-hyperbolic and construct a new hyperbolic polynomial of the same degree in more variables that is not SOS-hyperbolic. We begin with a simple preliminary result.
Proposition 4.11**.**
If and is a subspace of such that , then (the restriction of to ) is SOS-hyperbolic with respect to .
Proof.
If , then it is straightforward to check that is the restriction of to . As such, if is a matrix sum of squares for all , it follows that is a matrix sum of squares for all . â
The following simple construction of hyperbolic polynomials is a special case of, e.g., the additive convolution (or finite free convolution) of hyperbolic polynomials [MSS15]. It involves two disjoint sets of and variables, respectively denoted by and .
Lemma 4.12**.**
If and , then
[TABLE]
and .
Proof.
If then is hyperbolic with respect to and its hyperbolicy cone is the Cartesian product of and . Furthermore
[TABLE]
and so  [Ren06]. â
Proposition 4.13**.**
If is not SOS-hyperbolic with respect to and , then is not SOS-hyperbolic with respect to .
Proof.
First we note that because and . If is the subspace spanned by the first coordinate directions, then and . Hence, if , it follows that . â
We now construct, when and , an explicit hyperbolic polynomial and direction of hyperbolicity that is not SOS-hyperbolic with respect to that direction.
Theorem 4.14**.**
If and and , there exists a polynomial that is not SOS-hyperbolic with respect to .
Proof.
From Proposition 4.9, we know that if and , then the specialized Våmos polynomial is hyperbolic, but not SOS-hyperbolic, with respect to . Let and .
We can then use Propositions 4.13 and 4.10 to construct . Explicitly, we let and in Proposition 4.13, and let in Proposition 4.10 (which satisfies and ) to obtain
[TABLE]
This is hyperbolic with respect to but is not SOS-hyperbolic with respect to because is not a matrix sum of squares. â
We conclude the section with a natural question raised by one of the referees. We know that if is hyperbolic with respect to , then it is hyperbolic with respect to any in the associated open hyperbolicity cone.
Question 4.15**.**
Is there a polynomial , together with directions and , such that is SOS-hyperbolic with respect to and is in the associated open hyperbolicity cone, and yet is not SOS-hyperbolic with respect to ?
If such a polynomial were to exist, it would also be an example of an SOS-hyperbolic polynomial for which no power has a definite determinantal representation.
5 Cubic hyperbolic polynomials
In this section, we focus on hyperbolic cubics. The simple structure of the discriminant of a univariate cubic means that there is an explicit connection between hyperbolicity of cubic forms and the maximum value of general cubic forms on the unit sphere. Using this connection, in Section 5.2 we show that, given a cubic form and a direction , both with rational coefficients, it is co-NP hard to decide whether is hyperbolic with respect to . In Section 5.3 we construct an explicit cubic form in variables and a direction of hyperbolicity , so that is not SOS-hyperbolic with respect to . This gives a hyperbolic cubic for which no power has a definite determinantal representation.
5.1 Hyperbolicity and extreme values of cubics on the sphere
In this section we focus on cubic polynomials of the form
[TABLE]
for some , and fix the candidate direction of hyperbolicity to be .
The next result gives a characterization of when a cubic polynomial in the form (11) is hyperbolic with respect to , and a necessary condition for it to be SOS-hyperbolic with respect to .
Proposition 5.1**.**
If , where , then
[TABLE]
Moreover, if , then is a sum of squares.
Proof.
Let and note that . By Corollary 3.9, is hyperbolic with respect to if, and only if, for all . An explicit computation shows that
[TABLE]
By taking a Schur complement and dividing by ,
[TABLE]
Similarly (14) holds for all if, and only if,
[TABLE]
for all . If , then is a matrix sum of squares, and
[TABLE]
is a sum of squares. â
This connection between (SOS-)hyperbolicity and (sum of squares relaxations of) the extreme values of cubic forms on the unit sphere, is central to the remaining discussion in this section.
5.2 Hardness of testing hyperbolicity of cubics
We have seen that for cubics in the form (11), to test hyperbolicity we need to be able to compute the extreme values of a cubic form on the unit sphere. This connection will allow us to establish NP-hardness of deciding hyperbolicity of cubic forms. We begin with a result of Nesterov [Nes03, Theorem 4] stating that we can compute the size of the largest clique in a graph by maximizing an associated cubic form over the unit sphere.
Theorem 5.2** (Nesterov).**
Let be a simple graph (i.e., without loops and multiple edges) and let be the size of a largest clique in . Define
[TABLE]
Then
[TABLE]
Moreover, if is a clique with , then is maximized whenever
[TABLE]
and .
Nesterov obtained this by reformulating Motzkin and Strausâ celebrated formula for the maximum size of a clique in a graph in terms of the maximum value of a certain quadratic form over the unit simplex [MS65]. Combining Nesterovâs result and Proposition 5.1 allows us to construct a family of hyperbolic cubic forms from any simple graph.
Proposition 5.3**.**
Given a simple graph and an integer , define a cubic form in variables by
[TABLE]
Then is hyperbolic with respect to if, and only if, .
Proof.
We make the change of variables to obtain
[TABLE]
which is hyperbolic with respect to if, and only if, is hyperbolic with respect to . The result then follows immediately from Proposition 5.1 and Theorem 5.2. â
Given a positive integer and a simple graph , the problem of deciding whether is NP-hard [Kar72]. This immediately gives the following hardness result.
Theorem 5.4**.**
Given a homogeneous cubic polynomial and a direction , both with rational coefficients, the decision problem âIs hyperbolic with respect to ?â is co-NP hard.
This result is strong evidence that hyperbolic cubics that are not SOS-hyperbolic should exist. Indeed, if all hyperbolic cubics were SOS-hyperbolic, we could check hyperbolicity of cubics (and so maximize cubic forms over the unit sphere) by solving semidefinite programs of size polynomial in the input.
5.3 A cubic that is hyperbolic but not SOS-hyperbolic
In this section we construct an explicit hyperbolic cubic that is not SOS-hyperbolic. In light of Proposition 5.1, the basic approach will be to construct a cubic polynomial for which is nonnegative but not a sum of squares. Drawing on the results of the previous section, we consider graphs with maximum clique of size and polynomials of the form
[TABLE]
By Proposition 5.1, if is SOS-hyperbolic with respect to , then
[TABLE]
is a sum of squares.
The following result tells us that, if is a sum of squares, then there exists a correlation matrix (positive semidefinite matrix with unit diagonal) with nullspace containing a set of vectors corresponding to each of the maximum cliques of the graph. (A scaled version of one such vector, for a particular graph, is illustrated in Figure 1a.)
Proposition 5.5**.**
Let be a simple graph with maximum clique size . For each maximum clique of , let
[TABLE]
If is a sum of squares, then there exists a positive semidefinite matrix such that for all and is in the nullspace of for all maximum cliques of .
Proof.
Let and
[TABLE]
If were a sum of squares, then there would exist such that
[TABLE]
This holds because there is no term of the form in and so the monomial has zero coefficient in any polynomial appearing in a sum of squares decomposition. By comparing coefficients of and , we see that for .
The group acts on by
[TABLE]
where for . This induces an action on polynomials which leaves invariant. Averaging over this action, we can see that if is a sum of squares, then it must have a Gram matrix with . This is because every monomial in is fixed by the action, and no monomial in is fixed by the action. As such, there must be positive semidefinite matrices and such that
[TABLE]
and for all . From Nesterovâs result (Theorem 5.2), one can check that if is a maximum clique, then whenever and . It follows that for all maximum cliques . Since it follows that, in fact, for all maximum cliques . Taking completes the proof. â
We now focus on the case in which is the icosahedral graph. This graph has vertices, edges, and triangles, all of which are maximum cliques. As such, the polynomial
[TABLE]
has variables and is hyperbolic with respect to . In the proof of Theorem 5.6, we call a pair of vertices of the icosahedral graph antipodal if they are at distance three in the graph. The twelve vertices can be partitioned into six pairs of antipodal vertices.
Theorem 5.6**.**
If is the icosahedral graph, then , defined in (17), is hyperbolic, but not SOS-hyperbolic, with respect to .
Proof.
By Proposition 5.5 it suffices to show that there is no correlation matrix with all of the vectors , ranging over maximum cliques , in its nullspace.
The -dimensional orthogonal complement of the -dimensional span of the has a basis given by the twelve vectors (one for each vertex) shown in Figure 1b, any five of the six vectors (one for each pair of antipodal vertices), shown in Figure 1c, and any five of the six vectors (one for each pair of antipodal vertices), shown in Figure 1d. Let be the matrix with these vectors as columns. One can check (e.g., by computing a rational LDL decomposition) that
[TABLE]
If there were a correlation matrix with all of the in its nullspace, then it could be written in the form where and for all . But then
[TABLE]
and so cannot be positive semidefinite, and no such correlation matrix can exist. â
The following is an immediate corollary of Theorem 5.6 and Proposition 4.7. We state it explicitly because this appears to be the first known example of a cubic hyperbolic polynomial such that no power has a definite determinantal representation.
Corollary 5.7**.**
If is the icosahedral graph, then no power of , defined in (16), has a definite determinantal representation.
By using Proposition 4.13 together with our example of a hyperbolic, but not SOS-hyperbolic, cubic in variables (Theorem 5.6), we can construct such cubics whenever the number of variables is at least .
Corollary 5.8**.**
If and , then .
Although we have only been able to construct a hyperbolic cubic in variables that is not SOS-hyperbolic, it has been conjectured by Mario Kummer (personal communication), and seems likely, that such examples should already exist with five variables.
Conjecture 5.9**.**
If and , then .
One way to resolve this conjecture, would be to find an explicit cubic form in four variables such that is nonnegative but is not a sum of squares. Then would be hyperbolic, but not SOS-hyperbolic, with respect to .
6 Canonical linear functionals and the proof of Theorem 3.7
This section is devoted to developing tools used in the proof of Theorem 3.7 (and in Section 7 to follow), and then presenting the proof of Theorem 3.7 itself. We begin by showing how a hyperbolic polynomial , and a point , define a set of linear functionals that generalize the orthogonal projectors onto the eigenspaces of a symmetric matrix. This construction may be of independent interest.
6.1 Canonical linear functionals
Given and a fixed , we define a collection of linear functionals associated with the eigenvalues of . These are a generalization of the orthogonal projectors onto the eigenspaces of a symmetric matrix, as we shall see in Example 6.3. If has simple eigenvalues, these canonical linear functionals are the directional derivatives of the eigenvalues. If , recall that
[TABLE]
where the are defined in Theorem 2.1. Let have eigenvalues with multiplicities . Let be a partition of such that and for all . Define the canonical linear functionals to be
[TABLE]
The following result summarizes the main properties of the that we will need.
Lemma 6.1**.**
If and , then the maps satisfy:
; 2. 2.
* is linear;* 3. 3.
*, the multiplicity of the *th eigenvalue of ; 4. 4.
; 5. 5.
* if, and only if, for all and all .*
Proof.
The first statement follows from the following straightforward computation
[TABLE]
For the second statement, note that the numerator of is linear in , so the numerators, , of the partial fraction expansion in the first statement are also linear in .
The third statement follows from the fact that for all so that . The fourth statement follows from the fact that for all so that .
For the final statement, on the one hand, if , then (by Theorem 2.1), and so . On the other hand, if for all and all , then for all , and so . â
Remark 6.2**.**
We could have defined via the first property in Lemma 6.1, as the residue of corresponding to the simple pole at . Then we would need to show, directly, that these are nonnegative if, and only if, , rather than relying on Theorem 2.1.
Example 6.3** (Canonical linear functionals for the determinant).**
Let where is a symmetric matrix and . Suppose has distinct eigenvalues with corresponding multiplicities . Then where is the orthogonal projector onto the eigenspace of corresponding to . The associated canonical linear functionals are the maps . To see why, note that
[TABLE]
These satisfy and and for all and all if, and only if, .
6.2 Proof of Theorem 3.7
By Proposition 3.4, it is enough to show that if, and only if, for all . To achieve this, our main task is to express the parameterized Hermite matrix in terms of the canonical linear functionals.
Lemma 6.4**.**
Let and . With define a univariate polynomial . Then, using the notation of Section 6.1, we have that
[TABLE]
Proof.
We expand the rational function as a Laurent series via
[TABLE]
It then follows that . The result follows because, for any , . â
Specialized to the determinant, this agrees with the explicit computation from Example 3.3.
Example 6.5** (Hankel matrix for the determinant).**
Using the notation of Example 6.3, the quadratic form defined by the Hankel matrix is
[TABLE]
where .
We are now in a position to complete the proof of Theorem 3.7.
Corollary 6.6**.**
If , then if, and only if, for all .
Proof.
If , then for all and all . From Lemma 6.4 it follows that for all and all . Conversely, suppose that for some . Define so that the polynomial satisfies if and if . Such a polynomial can be constructed via Lagrange interpolation, for instance. Then
[TABLE]
â
7 View from the dual cone
In this section we consider the results obtained so far from the point of view of the dual cone
[TABLE]
the (closed convex) cone of linear functionals that take nonnegative values on . In particular, we study the image of the polynomial map , defined in (7). We will see that this image contains the interior of the dual cone and is contained in the dual cone. Moreover, we give examples in which the image of is precisely the dual cone .
The image of can be expressed using the canonical linear functionals of Section 6.1.
Proposition 7.1**.**
If and has distinct hyperbolic eigenvalues, then
[TABLE]
Proof.
From Lemma 6.4 we know that which is clearly a nonnegative combination of the linear functionals . Conversely, given any nonnegative scalars , we can choose so that the polynomial (of degreee ) satisfies (by interpolation). This shows that any nonnegative combination of the is in the set . â
7.1 Parameterization of the dual cone up to closure
The following result shows that the map almost parameterizes the dual of the hyperbolicity cone. One inclusion follows directly from Theorem 3.7. The other direction follows, for instance, from the (well-known) fact that the gradient of maps the interior of the hyperbolicity cone onto the relative interior of its dual cone. In the statement of the theorem we use the notation to denote the interior of the set and to denote the relative interior.
Theorem 7.2**.**
If , then .
Proof.
The right hand inclusion follows from Theorem 3.7. Indeed if , then
[TABLE]
which implies that . For the left-hand inclusion, let be an arbitrary linear functional in the relative interior of . Let be any optimal point (which exists because is in the relative interior of the dual cone) of the convex optimization problem
[TABLE]
Then (because the objective function goes to infinity on the boundary) and so for all . From the optimality conditions
[TABLE]
It follows that (by Proposition 7.1). â
Remark 7.3**.**
If is a complete hyperbolic polynomial, the gradient of gives a rational bijection between the interior of a hyperbolicity cone and the interior of its dual cone. A key difference between and the gradient of is that the former maps all of into the dual cone , where as the latter only maps the hyperbolicity cone itself into the dual cone. This difference allows us to construct polynomials that are globally nonnegative, rather than just nonnegative restricted to the open hyperbolicity cone.
We have seen that the image of is sandwiched between the interior of the dual cone and the dual cone itself. The following example shows that the image of may not contain all of the boundary of the dual cone.
Example 7.4** (Singular cubic curve).**
Consider the hyperbolic cubic
[TABLE]
taken from [Hen10, Section 4.1]. From the definite determinantal representation we can see that is hyperbolic with respect to . Its dual cone can be parameterized by
[TABLE]
as discussed in [Hen10]. The intersection of the hyperbolicity cone with the affine hyperplane and the intersection of with the affine hyperplane are shown in Figure 2.
The intersection of with the same affine hyperplane is the closure of the set shown on the right in Figure 2. The dual cone and the image of differ in one has a two-dimensional face given by the conic hull of and , and in the other this face is replaced by the single ray generated by .
7.2 Exact parameterizations of the dual cone
We now study certain situations in which the image of is precisely the closed dual cone. From Example 7.4, we know that this does not always occur. In this section we show that the image of is the closed dual cone when, for instance, is strictly hyperbolic (see below for a definition), is a product of linear forms, or is the determinant restricted to symmetric matrices.
If and , we define the multiplicity of to be the multiplicity of the root of . If , then we define the multiplicity of to be zero. A polynomial is strictly hyperbolic if every element that is not a multiple of has distinct hyperbolic eigenvalues. It follows that if is strictly hyperbolic and is nonzero and satisfies , then has multiplicity , the gradient of at is non-zero, and is a smooth point of the boundary of the hyperbolicity cone.
Proposition 7.5**.**
If is strictly hyperbolic, then .
Proof.
From Theorem 7.2 we know that . It suffices to show that for any there exists such that .
Any non-zero must vanish at some (since otherwise would be in the interior of the dual cone). Since is a non-zero element of the boundary of the hyperbolicity cone, it is a smooth point. As such, there is a unique supporting hyperplane to the cone at . Since is in the boundary of the hyperbolicity cone, and so . Both and define supporting hyperplanes at , so one must be a nonnegative multiple of the other. In particular is in the cone over the canonical linear functionals at and so is an element of the image of . â
In the case where our hyperbolic polynomial is a product of distinct linear forms, so that the resulting hyperbolicity cone is polyhedral, the image of is the closed dual cone.
Proposition 7.6**.**
If is a product of distinct linear forms, i.e., with for all and for all , then .
Proof.
Let be a generic point in , so that has distinct eigenvalues which are for . Since the eigenvalue functions are linear, the canonical linear functions are simply for . From Proposition 7.1 we know that
[TABLE]
This is the cone generated by the linear forms , which is just . â
Another case in which the image of is the dual cone is when is the determinant restricted to symmetric matrices.
Proposition 7.7**.**
Let be the determinant restricted to real symmetric matrices. Then .
Proof.
Since the positive semidefinite cone is self-dual it follows that an arbitrary element of has the form where . Let so that in the notation of Example 6.5. Let be the positive semidefinite square root of . If , from Example 6.5 we have that
[TABLE]
Since was an arbitrary positive semidefinite matrix, we are done. â
8 Connections with interlacers
To conclude, we discuss the connection between our work and interlacing polynomials. If and , we say that interlaces with respect to if the hyperbolic eigenvalues of any with respect to and satisfy
[TABLE]
For a fixed hyperbolic polynomial the cone of polynomials that interlace with respect to is a convex cone. In [KPV15, KNP18] it is shown that the cone of interlacers of can be described by
[TABLE]
Much of the development of Sections 3 and 6 could have been presented from the point of view of interlacers. This is because if, and only if, , so the hyperbolicity cone is a section of the cone of interlacers [KPV15].
The argument we used to show that every ternary hyperbolic polynomial is SOS-hyperbolic actually generalizes to give a projected spectrahedral description of the cone of interlacers of any ternary hyperbolic polynomial. This was first observed by Kummer, Naldi, and Plaumann [KNP18] via a seemingly quite different proof.
Proposition 8.1** ([KNP18, Page 17]).**
If , then
[TABLE]
Proof.
Recall that is a matrix sum of squares if, and only if, the restriction to a two-dimensional subspace not containing is a matrix sum of squares. But, after choosing appropriate coordinates, this is a matrix with entries that are each homogeneous forms in two variables, and so is a matrix sum of squares via [BSV16, Remark 5.10]. â
If is hyperbolic and has degree two, then the cone of interlacers of with respect to is simply the set of linear forms that are nonnegative on the cone . This is the dual cone, , which is again a quadratic cone, and so is a spectrahedron. Another case where we might expect the cone of interlacers to have a nice description is the case of hyperbolic polynomials of degree three in four variables.
Question 8.2**.**
If , is the cone of interlacers a projected spectrahedron?
Acknowledgements
I would like to thank Amir Ali Ahmadi, Petter BrÀndén, Hamza Fawzi, Mario Kummer, Simone Naldi, Pablo Parrilo, Levent Tunçel, and Cynthia Vinzant for helpful discussions and correspondence related to various aspects of this work, and the anonymous referees for their thoughtful suggestions that have improved the article.
Appendix A Relating Bézoutians and Hankel matrices
In this appendix we prove Proposition 2.2. Before doing so, we establish a slightly simpler statement in which the dimension of the Bézoutian and Hankel matrices is the same as .
Proposition A.1**.**
If is a monic polynomial of degree and is a polynomial of degree strictly less than , then
[TABLE]
is the (symmetric and unimodular) Bézoutian of and the constant polynomial .
Proof.
This result is established in [BMO*+*11, Proposition 7.15]. We reproduce the proof here because there are some confusing typographic errors in [BMO*+*11, page 75]. The argument presented there uses the fact that the Hankel matrix satisfies the identity
[TABLE]
Combining this with the defining identity of the Bézoutian (5) we obtain
[TABLE]
Changing variables via and , and defining for , gives
[TABLE]
from which the matrix identity follows.
The fact that is unimodular follows directly from the fact that, if we reverse the order of the rows of (which either preserves the determinant or changes its sign), we obtain a lower triangular matrix with unit diagonal. â
We now consider the case in which the dimension of the Bézoutian and Hankel matrices is possibly larger than the degree of the polynomial .
Proof of Proposition 2.2.
Let and . First note that
[TABLE]
By Proposition A.1, and the fact that , we see that
[TABLE]
Since is symmetric and unimodular and has entries that are linear in the coefficients of , we see that
[TABLE]
is unimodular and has entries that are linear in the coefficients of , completing the proof. â
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AB 18] N. Amini and P. BrĂ€ndĂ©n. Non-representable hyperbolic matroids. Adv. Math. , 334:417â449, 2018.
- 2[ABG 70] M. F. Atiyah, R. Bott, and L. GĂ„rding. Lacunas for hyperbolic differential operators with constant coefficients I. Acta Math. , 124(1):109â189, 1970.
- 3[Ami 19] N. Amini. Spectrahedrality of hyperbolicity cones of multivariate matching polynomials. J. Algebraic Combin. , 50(2):165â190, 2019.
- 4[BK 07] A. Buckley and T. KoĆĄir. Determinantal representations of smooth cubic surfaces. Geom. Dedicata , 125(1):115â140, 2007.
- 5[BM 16] B. Barak and A. Moitra. Noisy tensor completion via the sum-of-squares hierarchy. In Conference on Learning Theory , pages 417â445, 2016.
- 6[BMO + 11] D. A. Bini, V. Mehrmann, V. Olshevsky, E. Tyrtsyhnikov, and M. van Barel. Numerical Methods for Structured Matrices and Applications: The Georg Heinig Memorial Volume , volume 199. Springer Science & Business Media, 2011.
- 7[BPT 12] G. Blekherman, P. A. Parrilo, and R. R. Thomas. Semidefinite optimization and convex algebraic geometry . SIAM, 2012.
- 8[BrĂ€11] P. BrĂ€ndĂ©n. Obstructions to determinantal representability. Adv. Math. , 226(2):1202â1212, 2011.
