A unified framework of SAGE and SONC polynomials and its duality theory
Lukas Katth\"an, Helen Naumann, Thorsten Theobald

TL;DR
This paper introduces a unified framework called the $\\mathcal{S}$-cone that encompasses SAGE and SONC polynomials, providing new duality insights, characterizations, and limitations in polynomial non-negativity certificates.
Contribution
It unifies SAGE and SONC into a single framework, characterizes its dual cone, and explores implications for polynomial non-negativity and Positivstellensatz analogues.
Findings
Dual cone of the $\mathcal{S}$-cone characterized explicitly.
Extreme rays of the $\mathcal{S}$-cone identified.
A subclass where non-negativity equals membership in the $\mathcal{S}$-cone.
Abstract
We introduce and study a cone which consists of a class of generalized polynomial functions and which provides a common framework for recent non-negativity certificates of polynomials in sparse settings. Specifically, this -cone generalizes and unifies sums of arithmetic-geometric mean exponentials (SAGE) and sums of non-negative circuit polynomials (SONC). We provide a comprehensive characterization of the dual cone of the -cone, which even for its specializations provides novel and projection-free descriptions. As applications of this result, we give an exact characterization of the extreme rays of the -cone and thus also of its specializations, and we provide a subclass of functions for which non-negativity coincides with membership in the -cone. Moreover, we derive from the duality theory an approximation result of non-negative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A unified framework of SAGE and SONC polynomials and its duality theory
Lukas Katthän
,
Helen Naumann
and
Thorsten Theobald
Lukas Katthän, Helen Naumann, Thorsten Theobald: Goethe-Universität, FB 12 – Institut für Mathematik, Postfach 11 19 32, D–60054 Frankfurt am Main, Germany
{katthaen, naumann, theobald}@math.uni-frankfurt.de
Abstract.
We introduce and study a cone which consists of a class of generalized polynomial functions and which provides a common framework for recent non-negativity certificates of polynomials in sparse settings. Specifically, this -cone generalizes and unifies sums of arithmetic-geometric mean exponentials (SAGE) and sums of non-negative circuit polynomials (SONC). We provide a comprehensive characterization of the dual cone of the -cone, which even for its specializations provides novel and projection-free descriptions. As applications of this result, we give an exact characterization of the extreme rays of the -cone and thus also of its specializations, and we provide a subclass of functions for which non-negativity coincides with membership in the -cone.
Moreover, we derive from the duality theory an approximation result of non-negative univariate polynomials and show that a SONC analogue of Putinar’s Positivstellensatz does not exist even in the univariate case.
1. Introduction
In recent years, several interrelated approaches for non-negative polynomials and for non-negative exponential sums have been proposed, which are aimed at sparse settings. In [4], Chandrasekaran and Shah proposed (in the language of exponential sums/signomials) to consider sums of polynomial functions of the form for a given set such that at most one term has a negative coefficient. Non-negativity of signomials can be characterized in terms of the arithmetic-geometric mean inequality, and deciding membership in the resulting cone (SAGE cone) can be formulated as a relative entropy program. In [13], Iliman and de Wolff proposed to consider sums of non-negative circuit polynomials on (SONC polynomials). For a certain subclass of polynomials called ST-polynomials, deciding membership in the cone of SONC polynomials can be formulated in terms of the optimization subclass of geometric programs [7]. Murray, Chandrasekaran and Wierman [21] have shown that an adaption of the SAGE setting to gives exactly the same cone of polynomials as the SONC cone. This yields a computationally tractable method to decide membership of arbitrary polynomials in the SONC cone using a relative entropy program.
While many aspects of these classes of polynomials are connected with open questions and research efforts, they clearly exhibit some fundamental structural phenomena adapted to sparse settings. For example, it was shown by Murray et al. [21] for the SAGE cone and by Wang [26] for the SONC cone that every polynomial in those cones has a cancellation-free representation. Generally, SAGE and SONC approaches can be combined with semidefinite approaches to polynomial optimization, see Karaca, Darivianakis et al. [15] or Averkov [3]. Moreover, by [3, Theorem 2.16] and its proof, the SONC cone is second-order-cone representable (but the size of the second-order formulation from that work depends on the actual values of the support vectors). For a practical algorithm to compute SONC bounds via second-order cone representations, see Magron and Wang [27].
The goal of the present paper is to provide a uniform framework which covers all these classes as well as some more general settings. Since non-negativity of a polynomial function on is equivalent to non-negativity of on , we consider the more general functions of the form
[TABLE]
with sets of exponents , , which also capture the signomial functions. Based on a subset of these functions, we define the -cone which provides the common generalization of the cones mentioned above, see Definition 2.3. Its atomic functions are called AG functions, which are functions of the form (1) with strong support conditions. The AG functions can be seen as a (non-polynomial) generalization of polynomials coming from the arithmetic-geometric inequality. Building upon the earlier work of the second and the third author [9] on the dual SONC cone, a particular focus is the structure and the use of the dual viewpoint.
Non-negative polynomials and polynomial optimization are ubiquitous in applications, and sparsity is one of the central structural properties that provides potential for efficient computation. Besides classical application in control theory and robotics (see, e.g., [2, 12] and the references therein), let us list the more recent applications of non-negative polynomials and polynomial optimization in the optimal power flow problem [14], collision avoidance [1] or shape-constrained regression [10].
Contributions
-
We show that fundamental properties of the SAGE and/or the SONC cone also hold in the more general context of the -cone. In particular, every can be decomposed into a sum of non-negative AG functions whose supports are contained in the support of . See Proposition 2.7, which unifies and generalizes the results of [21] for the SAGE cone and of [26] for the SONC cone.
-
We provide a comprehensive characterization of the dual cone of the -cone, see Theorem 3.5. In particular, we provide projection-free characterizations in terms of AG functions supported on the particular class of reduced circuits. The characterizations of the dual cone go far beyond the characterizations of the dual SAGE cone from [4] and the dual SONC cone from [9], where the dual cones are described in terms of projections. Our proofs provide a uniform tool set for handling the various types of cones.
-
Based on the characterizations of the dual of the -cone, we provide several applications of the duality theory.
(a) We show that every sum of non-negative AG functions can be written as a sum of non-negative circuit functions whose supports are contained in the support of . This unifies and generalizes the results from [21] for the SAGE cone and of [26] for the SONC cone.
(b) We give an exact characterization of the extreme rays of the -cone. Even for the particular case of the SAGE cone, this characterization substantially sharpens the necessary conditions in [21].
(c) We show that not even in the univariate case, SONC polynomials do allow Putinar-type representations. This counterexample strengthens and simplifies the result of Dressler, Kurpisz and de Wolff [8], who have provided a multivariate counterexample.
(d) We give a characterization of a wide class of non-negative AG functions with simplex Newton polytopes. Using the dual -cone, this result unifies and generalizes the results from [13] and [21] and provides a simpler proof.
(e) As a final application of the dual -cone, we show that non-negative univariate polynomials can be approximated by SONC polynomials.
As further related work, let us mention the exploitation of sparsity and symmetries to derive specific SDP relaxations for polynomial optimization [16, 20, 24, 28, 29, 30].
2. The -Cone
In this section, we introduce AG functions and the -cone. We show that every non-negative function in the -cone has a cancellation-free representation (see Proposition 2.7) and characterize non-negativity of an AG function in terms of the relative entropy function (see Theorem 2.8).
Notation**.**
Throughout the article we use the notations and . Moreover, for a finite subset , denote by the set of -dimensional vectors whose components are indexed by the set .
Our main object of study are functions of the form
[TABLE]
where , are finite sets of exponents, . Here we use the notations
[TABLE]
and if one component of is zero and the corresponding exponent is negative, then we set .
For two finite sets , let
[TABLE]
denote the space of all functions of the form (2) with given sets of exponents. This is a vector space of dimension .
Remark 2.1**.**
(1) If , then is exactly the space of polynomials with exponent vectors in . For this reason, we sometimes refer to elements of as even exponents and to elements of as odd exponents.
- (2)
If , then can be identified with the space of signomials, i.e., functions of the form
[TABLE]
via the identification . 2. (3)
It is no restriction to exclude sets in from , since for exponents , we have . 3. (4)
and are not necessarily disjoint (cf. Example 2.9 below).
We study the non-negativity of functions in using the following building blocks:
Definition 2.2**.**
Let . We say that is
- (1)
an even AG function if at most one of the is negative and all the are zero; and 2. (2)
an odd AG function if all the are non-negative and at most one of the is nonzero.
is called an AG function (arithmetic-geometric mean function) if is an even AG function or an odd AG function.
Note that non-negative even AG functions correspond exactly to the AGE functions (arithmetic-geometric exponentials) studied in [4] and [21].
We arrive at the central definition of this section.
Definition 2.3** (-cone).**
Let , be finite sets. The -cone is defined as
[TABLE]
where denotes the conic hull.
Remark 2.4**.**
- (1)
If , then the -cone can be identified with the SAGE cone using the substitution in Remark 2.1(2). Formally, for finite , and , we set
[TABLE]
where for
[TABLE] 2. (2)
If , then is the cone of SONC polynomials supported on from [13, 3]. In those papers, the SONC cone is defined in terms of circuit polynomials (see Remark 3.3). The equivalence of the definitions was established in [21] and also follows from our more general result in Proposition 4.1. 3. (3)
An example where the cone is different from both the SAGE cone and the SONC cone is given by and .
For a non-empty finite set and let
[TABLE]
be the cone of non-negative odd AG functions supported on , and similarly for let
[TABLE]
be the cone of non-negative even AG functions supported on . Note that, by definition,
[TABLE]
As pointed out by a referee, (3) implies the following alternative representation of the -cone.
Proposition 2.5**.**
Let , be finite and denote the unit vector in indexed with . Then,
[TABLE]
In the case , we can shortly write , where denotes the component-wise absolute value. If there exists some , then the corresponding coefficient in the SAGE cone appears only once in the set . However, by slight abuse of notation, we also write shortly as .
Proof.
If , then (3) gives a decomposition
[TABLE]
with for all and for every . Defining the functions
[TABLE]
symmetry implies and hence, . Remark 2.4(1) then shows that the coefficient vector of is contained in .
The converse direction of the first equation follows immediately with the substitution in Remark 2.1(2).
The second equation, which exhibits the convexity of the -cone, is an immediate consequence of the first one. ∎
In our definition of the -cone, we exclude sums of non-negative AG functions with support for , where the corresponding AG functions have bigger support than . This could happen, for example, if two summands cancel in the sum. For a better understanding of the problem, we have a look at the following example.
Example 2.6**.**
Let . Consider the two non-negative AG functions
[TABLE]
whose support is not contained in . But the sum
[TABLE]
is itself a non-negative AG function, whose support is contained in .
In fact, this restriction is not really a restriction. The following proposition states that every sum of non-negative AG functions whose support is bigger than the support of the sum can be decomposed into a sum of non-negative AG functions whose supports are contained in the support of .
For the SAGE case, this was already proven in [21, Theorem 2] and for the SONC case this follows from the more detailed result of [26].
Proposition 2.7**.**
Let be finite sets and . If for some , , then as well. Equivalently, it holds that
[TABLE]
Proof.
By Proposition 2.5,
[TABLE]
with denoting the unit vector with respect to for , resp. . Let with coefficient vector and hence where the absolute value is component-wise. The already mentioned statement for the SAGE-case ([21], Theorem 2) states that . Hence, as well as
[TABLE]
again by Proposition 2.5. The other inclusion is obvious. ∎
Our next result characterizes non-negative AG functions. It is a slight generalization of [5, Lemma 2.2] to the setting of AG functions. The following notation is useful to state the theorem:
Notation**.**
For a non-empty finite set and , let be the polytope
[TABLE]
Note that if and only if is contained in the convex hull of . In the special case that is affinely independent, consists of a single element, which we denote by .
Let be a non-empty finite set. We denote by ,
[TABLE]
the relative entropy function. It can be extended to using the usual conventions for and for .
Theorem 2.8**.**
Let be a non-empty finite set and be an AG function of the form
[TABLE]
where for all and . Then the following statements are equivalent:
- (1)
* for all .* 2. (2)
There exists a such that and
[TABLE] 3. (3)
There exists a such that
[TABLE]
A vector as in this theorem is called an AG witness.
Proof of Theorem 2.8.
Before we prove this theorem, observe that for the AGE cone , defined in Remark 2.4, we have if and only if there exists such that and
[TABLE]
(compare [4], Section ). If is an even AG-function, this is exactly the equivalence due to Remark 2.4. If is an odd AG-function, the equivalence follows from Proposition 2.5 and the mentioned observation.
For the implication (2) (3), set . It is clear from the properties of that . The discussion in [4, p. 1151] shows that
[TABLE]
and thus this has the desired properties. The implication (3) (1) is a direct consequence of the weighted arithmetic-geometric mean inequality:
[TABLE]
Using , we obtain
[TABLE]
As we already know that , we obtain the desired statement. ∎
Example 2.9**.**
Let . A typical AG function with this support is
[TABLE]
Since the equality condition in statement of Theorem 2.8 is trivially satisfied, we have for all if and only if there exists a with
[TABLE]
If , the latter condition can be simplified to . For the case , this is clear from our setting , and to see it for , rewrite (5) as
[TABLE]
Since the function attains its minimum at (which means ), we obtain the claimed result. It is in particular the one of statement (3) in Theorem 2.8.
For later use, we note that our cones of interest are closed:
Proposition 2.10**.**
The cones and are closed pointed convex cones.
Proof.
It is clear that all three cones are pointed, since the only non-negative function where is non-negative as well is the zero function. The cones and are defined as (infinite) intersections of closed halfspaces, and thus they are closed. Finally, since finite sums of closed pointed convex cones are again closed, the cone is closed as well. ∎
3. Circuits and the dual of the -cone
In this section, we introduce circuit functions and provide several characterizations of the dual -cone (see Theorem 3.5).
We can identify the dual space of with . For with coefficients and an element , we consider the natural pairing
[TABLE]
Using this notation, the dual cone is defined as
[TABLE]
Now we consider the representation of AG functions in terms of circuit functions. Here, and denote the relative interior and the convex hull of a set.
Definition 3.1**.**
A circuit is a pair , where is affinely independent and . For finite sets , let
[TABLE]
denote the set of all circuits on . In particular, for we call the set of all even circuits and the set of all odd circuits.
Definition 3.2**.**
Let be a circuit.
- (1)
An even circuit function supported on is an AG function of the form
[TABLE]
with for all and . 2. (2)
For , an odd circuit function supported on is an AG function of the form
[TABLE]
with for all and .
We call the inner exponent of and the other exponents are the outer exponents.
Remark 3.3**.**
(1) In case of a circuit, the vector in Theorem 2.8 is unique, and thus the non-negativity of can be expressed in terms of the circuit number
[TABLE]
which was introduced in [13].
(2) Iliman and de Wolff also introduced the notion of circuit polynomials in [13]. Every circuit polynomial is an even circuit function if the inner exponent is even and an odd circuit function if the inner exponent is odd. With this, circuit polynomials form a special case of circuit functions.
Next, we introduce reduced circuits, which will be used in Section 4.2 to determine the extreme rays of the -cone.
Definition 3.4**.**
For a circuit let
[TABLE]
An even circuit is called reduced if and an odd circuit is called reduced if .
In other words, reduced circuits contain no elements of in their convex hull except those which are trivially there. Note that for , it is possible that a circuit is reduced as an even circuit, but not reduced as an odd circuit. See Example 4.5 below.
We can now provide the following characterization of the dual -cone . Here, recall the definition of from (4) and that denotes the single element of in the case of a circuit. We use the convention that and .
Theorem 3.5**.**
Let and be finite sets and let .
- (1)
If , then for all . 2. (2)
If the condition of part (1) is satisfied, then the following are equivalent:
- (a)
* lies in the dual cone .* 2. (b)
For all (respectively ) and all , it holds that
[TABLE] 3. (c)
For every even circuit (respectively odd circuit ) and , it holds that
[TABLE] 4. (d)
For every reduced even circuit (respectively reduced odd circuit ) and , it holds that
[TABLE]
Before we prove Theorem 3.5 we consider the duals of the sub-cones and of .
Lemma 3.6**.**
Let be a non-empty finite set.
- (1)
For , the dual cone of consists of those where
- (a)
* for all , and* 2. (b)
* for all .* 2. (2)
For , the dual cone of consists of those satisfying (a), (b) and in addition
- (c)
.
Proof.
We prove the even and odd case simultaneously. Let or . First we show that it satisfies the claimed conditions.
(a) and (c)::
For every , it holds that resp. and thus , as claimed. In the even case, we also have that and thus by the same argument (c) holds.
(b)::
Fix a . First assume that for all . Then
[TABLE]
is an (even or odd) AG function and a straightforward computation shows that satisfies the condition of Theorem 2.8 (with the given ), hence is non-negative. Thus,
[TABLE]
which is equivalent to property (b). Since the mapping (6) is continuous in , the statements also hold if for some .
For the converse implication, assume that satisfies conditions (a), (b), and in the even case also (c).
We need to show that every non-negative AG function resp. satisfies . Let be an AG witness for as in Theorem 2.8. Observe that
[TABLE]
which implies
[TABLE]
In the even case, we have by (c), and the right expression in (8) is non-negative, because is a non-negative AG function. In the odd case, observe that then the non-negativity of yields non-negativity of the right expression in (8) as well. ∎
Remark 3.7**.**
In the beginning of this section, we identified the dual space of with . Using the reverse identification and associating for every a function of form (1), we can identify the dual cones resp. with the cones of all functions of the form (1) with coefficients in resp. . If , then by Theorem 2.8 and Lemma 3.6, it is easy to see that and .
In particular, this means that every function of the form (1) with coefficients in resp. is non-negative. Hence, .
The reverse inclusion does not hold in general. With , , and , we obtain as well as . Setting it follows that .
In addition, we need the following lemma for the proof of Theorem 3.5. Here, for , denote by its support.
Lemma 3.8** (Essentially Lemma 8 of [21]).**
Let be a non-empty finite set and . Then every can be written as a sum
[TABLE]
with , and for all , such that the support of each is affinely independent.
Proof.
Since the polytope is the convex hull of its vertices, it suffices to show that the support of every vertex of is an affinely independent set.
Let be a vertex of and be its support. Assume to the contrary that is affinely dependent. Then there exists with , and for . Since for all , for sufficiently small both and are contained in . But this implies that is not a vertex of , a contradiction. ∎
Proof of Theorem 3.5.
- (1):
Since for every , every satisfies
[TABLE] 2. (2):
The implications (b) (c) (d) are trivial. For the equivalence of (a) and (b) note that
[TABLE]
because Minkowski sum and intersection are dual operations (see, e.g., [25], Theorem 1.6.3). Hence, the claim follows with Lemma 3.6. It remains to show (c) (b) and (d) (c).
- (c) (b):
Let and . By Lemma 3.8, we can decompose as with , and , such that the support of each is affinely independent. Now the claim follows from
[TABLE]
For , the proof is analogous by considering instead of . 2. (d) (c):
We start with the even case and proceed by induction on . Since the base case captures exactly the reduced circuits, there is nothing to prove in this case.
Now consider an even circuit with . Then there exists a with and . Set and .
Let be the maximal real number with . This number exists clearly, and we have because the coordinate sums of and are equal. Further, it holds that because all components of are positive.
Similarly, let be the maximal real number with . As above, it holds that . Moreover, note that implies . The construction gives
[TABLE]
Note that at least one of the entries of is zero, and moreover, or at least one of the entries of is zero. Define two new even circuits and with and
[TABLE]
We observe , and since is not counted towards , it follows that . Similarly, since and is not counted towards , we obtain . Hence, by induction,
[TABLE]
Note that and . Adding times (10) to (9) gives, due to , the uniform inequality
[TABLE]
Since , this proves the claim.
For the odd case, we proceed by induction on , and the base case consists again of the reduced circuits. Fix an odd circuit with . Again, there exists a with , but this time is possible.
We define and as above. This time, is possible. Further, we set and (thus) . We define the new circuits and as above, where this time is odd and is even. Since as above, we obtain
[TABLE]
where the second inequality follows since we have already shown (d) (c) for even circuits. As above, we add times (12) to (11) to obtain the desired inequality.∎
A description of the dual of the SONC cone was obtained in [9, Theorem 3.1], and a description of the dual of the SAGE cone in [4, Proposition 2.4]. Both descriptions are based on projections and differ from the one in Theorem 3.5. For completeness, we show here that they are in fact equivalent.
Proposition 3.9**.**
Let be a finite set and . For and , the following are equivalent:
- (1)
. 2. (2)
** 3. (3)
**
In this proposition, statement (1) is the one we used earlier, statement (2) is the description of the dual SAGE cone used in [4], and statement (3) in conjunction with Theorem 3.5(c) is the description of the dual SONC cone used in [9].
Proof.
If then all three conditions hold. Moreover, if for some , then it is easy to see that all three conditions hold if and only if . Thus we may assume that and for all . We will show the equivalence via the following variant of statement (2),
- (2’)
- (1) (2’):
Consider (2’) as the feasibility of a linear system of inequalities in . (2’) is satisfied if and only if its Farkas alternative system (in the version of Proposition 1.7 of [31])
[TABLE]
does not have a solution.
We can normalize so that all its components sum to 1. Hence, the alternative system simplifies to
[TABLE]
i.e., to . Since this is the opposite of (1), the equivalence of (1) and (2’) follows. 2. (2’) (2):
We obtain (2) from (2’) by multiplying with and replacing by . 3. (2) (3):
This is trivial. 4. (3) (2’):
We have that and thus we may divide the inequality in (3) by to obtain
[TABLE]
where . Note that the left-hand side of the inequality is monotonous in , and hence,
[TABLE]
We further replace by to obtain (2’).∎
4. Applications of the dual cone
4.1. Non-negative AG functions are sums of non-negative circuit functions
As a first application of our description of the dual cone, we prove the following generalization of [21, Theorem 4].
Proposition 4.1**.**
Let and be finite sets. For every , the following statements hold.
- (1)
* can be written as a sum of non-negative circuit functions whose supports are contained in .* 2. (2)
* can be written as a sum of non-negative circuit functions supported on reduced circuits in .*
Note that in statement (2), the support of the reduced circuits does not need to be contained in the support of . The following example shows a situation, in which this phenomenon happens.
Example 4.2**.**
Let and . Consider the non-negative circuit function . Its support is not reduced with respect to , and indeed, we can write as sum
[TABLE]
of non-negative circuit functions, whose supports and are reduced. Note that the coefficient of cancels in the sum.
Proof of Proposition 4.1.
By Lemma 3.6 and part (c) of Theorem 3.5, the dual of the -cone is
[TABLE]
Let and assume that the support of is given by and . By Proposition 2.7, . Apply (13) on the sub-cone and dualize that identity. Using that (because the cone is closed, Proposition 2.10) then yields
[TABLE]
This shows part (1).
Part (2) then follows from part (d) of Theorem 3.5. Note that in this case we cannot restrict the sets of exponents to and as it depends on the choice of and whether a circuit is reduced or not. ∎
Remark 4.3**.**
If we demand , we obtain the same statement about the support in (2) as in (1).
4.2. Extreme rays of the -cone
Our next application of our description of the dual cone is a precise characterization of the extreme rays of . Even for the specific case of the SAGE cone, this sharpens the result in [21, Theorem 4], where the necessary condition is that every extreme ray of the SAGE cone is supported on a single coordinate or on a circuit. The essential concept for this characterization is provided by the reduced circuits.
Let and be finite sets and write shortly . For let
[TABLE]
for let
[TABLE]
and for let
[TABLE]
and are the (even and odd) non-negative circuit functions, for which the inequality (7) on the circuit number holds with equality. provides the special case for circuits supported on a single element.
Proposition 4.4**.**
For finite sets and , the set of extreme rays of is
[TABLE]
Here, recall from Definition 3.4 that an even (respectively odd) circuit is reduced if and only if (respectively ). The following example shows that the case distinctions are indeed necessary.
Example 4.5**.**
For and , the sets of (even resp. odd) circuits are
[TABLE]
We have a closer look at those elements which are both even and odd circuits.
- (1)
The circuit is reduced as an even circuit and non-reduced as an odd circuit. In the context of extreme rays this is necessary. The even circuit function is an element of an extreme ray, but for the odd circuit function we have
[TABLE]
and hence this is not an extreme ray of .
- (2)
Further, it holds that
[TABLE]
so does not support an even extreme ray but in fact it does support an odd extreme ray.
Again, we obtain corollaries for the special cases of SONC polynomials and of the SAGE cone.
Corollary 4.6**.**
Let be a finite set and write shortly . The set of extreme rays of the cone of SONC-polynomials with support in is
[TABLE]
Corollary 4.7**.**
Let be a finite set, and write . The set of extreme rays of the cone of SAGE functions with support in is
[TABLE]
Example 4.8**.**
As an example corresponding to the SAGE setting, let and be a non-negative circuit function. With the substitution we obtain the arithmetic-geometric exponential , and its support is again not reduced. We write as a sum
[TABLE]
of circuit functions, whose supports and are reduced.
This is different from Example 4.2 in that the exponent is contained in rather than and thus is treated like an even number in the SAGE setting.
In [21], after Theorem 4, the authors remark that every circuit “supports a family of extreme rays in the SAGE cone.” This is not quite correct, as shown by the current example.
For the proof of Proposition 4.4, we will use a variant of Hölder’s inequality.
Theorem 4.9** (Theorem 11, p. 22, [11]).**
Let . Let be a matrix and let with . Then
[TABLE]
and equality holds if and only if either (1) for some , , or (2) the matrix has rank one.
Note that in case (1) both sides of the inequality are zero.
Proof of Proposition 4.4.
By Proposition 4.1, every non-negative function can be written as a sum of non-negative circuit functions supported on reduced circuits. Hence, it suffices to show the following two statements:
- (a)
Every non-negative circuit function supported on a circuit can be written as a sum of non-negative circuit functions with the same support whose circuit condition is satisfied with equality. 2. (b)
Every function in is indeed an extreme ray, i.e., it cannot be written as a sum of other non-negative AG functions.
- (a)
Let be a non-negative circuit function supported on the circuit , whose coefficients are denoted by and .
If is supported on an odd circuit , then the circuit number from (7) satisfies . Hence, is a convex combination of and , where have the same support and coefficients as , except for and .
If is supported on an even circuit , then can be written as the sum of a non-negative circuit function with the same support whose inner coefficient equals the negative of the circuit number and of some function for . If , then the latter is contained in . Otherwise, if , then , whose two summands are elements of . 2. (b)
Let with coefficients and . Assume that can be decomposed into with non-negative AG functions . Denote the coefficients of by and .
For the duration of this proof, we use the notation . Moreover, set . We claim that .
To show this, we consider a vertex of . Since must be an outer exponent of each with , we have for all . It follows that and thus . As this holds for every vertex of , we obtain that .
Next, we distinguish three cases depending on whether , or .
- Case , :
In this case, for each . Thus, if then each is a multiple of and thus a multiple of .
On the other hand, if , then w.l.o.g. we can assume that for some . Moreover, each is of the form with . Then
[TABLE]
and thus for each . Hence, all are multiples of .
- Case for with and :
In this case, our initial considerations imply that . Since is reduced we can also conclude that . Hence, each is of the form
[TABLE]
It follows that for all and , because otherwise the cannot be non-negative.
Next, we claim that
[TABLE]
for all , where again we write . To prove the claim, we distinguish two cases:
- (i)
If , then it trivially holds that . 2. (ii)
Consider the case that . Since is a non-negative AG function, it holds that the last sum in (14) vanishes and the claim follows from Theorem 2.8(3).
In the next step, we derive
[TABLE]
where in (a) we use (15), (b) follows from Hölder’s Inequality 4.9 and (c) uses that . Moreover, by Theorem 4.9 equality in (b) implies that either (1) there exists an such that vanishes for all , or (2) the matrix with entries has rank one. However, (1) would imply that which is impossible, thus we are in case (2). Hence, there exist scalars such that for all and all . Further, equality in (a) implies that
[TABLE]
By (14), it follows that every is of the form
[TABLE]
Now, if for some , then has only terms with exponents in and thus it is the zero function or it cannot be non-negative. It follows that for all . But this implies that . Hence, since the are AG functions, they cannot have any other terms in , and thus they are all multiples of .
- Case for with and :
In this case, the argument is similar, except that (16) becomes
[TABLE]
Since we have equality in (e), it follows that all terms on the left-hand side of that triangle inequality have the same sign. Since , this implies that each has the same sign as . Now we also obtain (17). Note that for the vanishing of the terms with exponents in , we can argue as above, or alternatively obtain this directly from the oddness of the . Altogether, this yields again that all the are multiples of . ∎
4.3. Univariate SONC polynomials do not satisfy Putinar’s Positivstellensatz
In [8, Section 5], a multivariate example was given to show that the analogue of Putinar’s Positivstellensatz does not hold for SONC polynomials. Using Theorem 3.5, we provide a simpler example of this phenomenon, which in addition shows that the analogue of Putinar’s result does not even hold for univariate SONC polynomials.
Recall Putinar’s Theorem from the theory of sums of squares polynomials ([23], see also, e.g., [17, Theorem 2.14]).
Theorem 4.10**.**
Let and assume that the quadratic module
[TABLE]
is Archimedean. If is strictly positive on the set , then can be written in the form with sum of squares polynomials .
Here, the Archimedean condition can be defined by the existence of some with , and it is well known that is Archimedean if are affine (see, e.g., [17]).
Theorem 4.11**.**
The univariate polynomial
[TABLE]
satisfies for , but it cannot be written in the form
[TABLE]
with SONC polynomials and .
Note that a SONC analogue of Putinar’s Positivstellensatz would even assert a representation of using only and .
Proof.
As a notation, for we set . This is the cone of SONC polynomials of degree up to .
The polynomial is clearly positive on (in fact, on ), so we only need to show that it does not have a representation as in the claim.
Assume to the contrary that there exist SONC polynomials , such that (19) holds. Let be the maximum of the degrees of the . Then is contained in . On the other hand, consider the vector
[TABLE]
A direct computation shows that
[TABLE]
Hence, once we show that lies in
[TABLE]
we obtain a contradiction.
For this, note that we only need to consider inequalities involving the first two components of , because apart from those equals the vector , which is clearly contained in the cone. Further, note that if and only if the shifted vector where we omitted the [math]-th coordinate lies in . Similarly, lies in if and only if lies in , and an analogous description holds for . Using these observations, it is a straightforward computation to verify that lies in the cone. ∎
4.4. Functions with simplex Newton polytopes
We provide a subclass of functions for which non-negativity coincides with containment in the -cone .
Proposition 4.12**.**
Let , be finite sets and . Assume that
- (1)
* is a simplex and ,* 2. (2)
* for every which is not a vertex of , and* 3. (3)
* for every .*
Then is non-negative if and only if . In this case, can be written as a sum of circuit functions using only vertices of as outer exponents.
This has been shown for SONC polynomials under a slightly stronger hypothesis in [13, Theorem 5.5]. Moreover, the analogous statement in the SAGE setting has been obtained in [21, Theorem 10]. We provide a simple proof using Theorem 3.5 as well as the following lemma.
Lemma 4.13**.**
Let be affinely independent and let . Then there exists a point and a scalar such that for all .
Proof.
Since is affinely independent, there exists an affine map such that for all . Explicitly, there exists a and with for all . We set and for . A straightforward computation shows that and satisfy our claim:
[TABLE]
Proof of Proposition 4.12.
For the nontrivial direction, let be non-negative and denote by the set of vertices of . We show that is contained in the sub-cone
[TABLE]
Let be an arbitrary element of . First, consider the case that for all . Since is affinely independent, Lemma 4.13 gives a and a with for all . For set and observe that by Lemma 3.6,
[TABLE]
respectively . Hence,
[TABLE]
Therefore, the hypotheses (2) and (3) imply that
[TABLE]
In the case for some , continuity of the mapping in (6) implies as well. Altogether, . ∎
4.5. Approximating non-negative polynomials by SONC polynomials
Unlike the situation with sum of squares polynomials, not every non-negative univariate polynomial is a SONC polynomial. However, in this section we show that non-negative univariate polynomials can at least be approximated by SONC polynomials.
For a univariate polynomial we set
[TABLE]
This is very similar to the SAGE-representative of a polynomial considered in [21, Section 5].
Theorem 4.14**.**
Let be a univariate polynomial of even degree with positive constant coefficient. Let .
- (1)
If (i.e., if for all ), then is a SONC polynomial. 2. (2)
Otherwise, there exists a sequence of SONC polynomials which converges to uniformly on every compact subset of the open interval .
Note that the have the property that for . Part (1) is essentially a special case of Proposition 4.12, which itself has been obtained before by de Wolff and Iliman [13, Theorem 5.5], see also [21, Theorem 10]. Hence, only part (2) is new. It can be seen as a univariate SONC analogon of the (even multivariate) approximation result in terms of sum of squares polynomials by Lasserre and Netzer (see [18, 19]), where our result also has a restriction to .
Proof.
For part (1), note that is non-negative on if and only if it is non-negative on . Moreover, it satisfies the hypothesis of Proposition 4.12, and thus implies that is a SONC polynomial. Moreover, Proposition 4.12 implies that can be written as a sum of circuit polynomials using only the constant term and the highest term as outer exponents. Since a non-negative circuit polynomial with negative inner coefficient remains non-negative under flipping the sign of the inner coefficient, is a SONC polynomial as well.
It remains to show part (2). As in the proof of Theorem 4.11, we use the shorthand notation for for . Since the two parameters of are disjoint sets, we shortly write elements in the dual cone as rather than , by slight abuse of notation. For define
[TABLE]
for some constant . It is immediately clear that if , then converges to zero for , and we even have uniform convergence on every compact subset of . It remains to find a suitable value of such that is a SONC polynomial.
We claim that for every , where is the minimum of the function on the set
[TABLE]
To prove the claim, consider a fixed and observe that because is even. The definitions of and imply and . For , Theorem 3.5(c) applied on the circuit with outer exponents [math], and inner exponent then gives
[TABLE]
Hence, the non-empty set is bounded and compact. It follows that the function attains the minimum value on . This implies the claim.
Since for every element of can be truncated to obtain an element of , it follows that the auxiliary claim also holds for every . We set and it remains to show that for this value of our candidate is a SONC polynomial. For this, it is sufficient to show that for we have for all , and we may assume that is even. We distinguish two cases.
Case .:
The inequalities defining imply that for . Using this, we derive
[TABLE]
If , then this expression is non-negative since both and are. Otherwise, we continue as follows:
[TABLE]
for any by the choice of .
Case .:
By Theorem 3.5(c) applied on the circuit with outer exponents [math], and inner exponent , we have , so that using the hypothesis of the current case twice gives
[TABLE]
Since , employing the auxiliary claim as well as (20) we can conclude
[TABLE]
As a corollary of the theorem, we see that we can also approximate in the -adic topology:
Corollary 4.15**.**
Let be a univariate polynomial with . Then for each there exists a SONC polynomial such that
[TABLE]
Proof.
If the degree of is odd, then we may consider as a polynomial of higher degree with leading coefficient [math], which has even degree. The hypothesis implies that the of Theorem 4.14 exists and is positive, hence we may consider the sequence from that theorem. From its construction in the proof of Theorem 4.14, it is clear that it satisfies our claim. ∎
5. Outlook and open problems
We have introduced the -cone as a unified framework for the classes of SAGE and SONC polynomials, provided characterizations of its dual cone and presented several new and several improved results associated with the dual viewpoint. The -cone exhibits a prominent computationally tractable class within the class of sparse non-negative polynomials. For further computational aspects building upon the projection-free descriptions of the dual cones from Section 3, we refer to the subsequent work of the second author together with Dressler, Heuer and de Wolff [6].
It remains a future task to further understand the relation of the -cone and its specializations to the underlying class of all non-negative functions (in some special cases polynomials), both from the primal and the dual point of view. Specifically, the relation of the SONC cone to the cone of sparse non-negative polynomials and the dual SONC cone to sparse moment cones (as studied by Nie [22]) deserve further study. It is an open question whether SONC polynomials are dense inside the non-negative ones.
Moreover, since by the results in Section 4.3, the analogue of Putinar’s Positivstellensatz already fails in the univariate case, it also remains a challenge to provide computationally attractive types of Positivstellensätze for the -cone and its specializations.
Acknowledgment. We thank the anonymous referees for their helpful suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. A. Ahmadi and A. Majumdar, Some applications of polynomial optimization in operations research and real-time decision making , Optimization Letters 10 (2016), no. 4, 709–729.
- 2[2] by same author, DSOS and SDSOS optimization: more tractable alternatives to sum of squares and semidefinite optimization , SIAM J. Appl. Algebra Geom. 3 (2019), no. 2, 193–230. MR 3939321
- 3[3] G. Averkov, Optimal size of linear matrix inequalities in semidefinite approaches to polynomial optimization , SIAM J. Appl. Algebra and Geometry 3 (2019), no. 1, 128–151.
- 4[4] V. Chandrasekaran and P. Shah, Relative entropy relaxations for signomial optimization , SIAM J. Optim. 26 (2016), no. 2, 1147–1173.
- 5[5] by same author, Relative entropy optimization and its applications , Math. Program., Ser. A 161 (2017), no. 1-2, 1–32.
- 6[6] M. Dressler, J. Heuer, H. Naumann, and T. de Wolff, Global optimization via the dual SONC cone and linear programming , Proc. 45th International Symposium on Symbolic and Algebraic Computation, 2020, pp. 138–145.
- 7[7] M. Dressler, S. Iliman, and T. de Wolff, An approach to constrained polynomial optimization via nonnegative circuit polynomials and geometric programming , J. Symbolic Comput. 91 (2019), 149–172.
- 8[8] M. Dressler, A. Kurpisz, and T. de Wolff, Optimization over the Boolean hypercube via sums of nonnegative circuit polynomials , 43rd International Symposium on Mathematical Foundations of Computer Science, LIP Ics. Leibniz Int. Proc. Inform., vol. 117, Schloss Dagstuhl, 2018, pp. Art. No. 82, 17. MR 3854037
