Exponential convexifying of polynomials
Krzysztof Kurdyka, Katarzyna Kuta, Stanis{\l}aw Spodzieja

TL;DR
This paper proves that multiplying a positive polynomial by an exponential function with a sufficiently large exponent makes it strongly convex on convex semialgebraic sets, aiding in finding polynomial critical points.
Contribution
It introduces a method to convexify polynomials via exponential transformations, extending to unbounded sets under certain positivity conditions.
Findings
Existence of an exponent N for strong convexity on bounded sets.
Extension to unbounded sets with additional positivity assumptions.
Application to critical point search of polynomials.
Abstract
Let be a convex closed and semialgebraic set and let be a polynomial positive on . We prove that there exists an exponent , such that for any the function is strongly convex on . When is unbounded we have to assume also that the leading form of is positive in . We obtain strong convexity of on possibly unbounded , provided is sufficiently large, assuming only that is positive on . We apply these results for searching critical points of polynomials on convex closed semialgebraic sets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Differential Equations and Dynamical Systems · Functional Equations Stability Results · Advanced Optimization Algorithms Research
Exponential convexifying of polynomials
Krzysztof Kurdyka
Krzysztof Kurdyka, Laboratoire de Mathematiques (LAMA) Université Savoie Mont Blanc, UMR-5127 de CNRS 73-376 Le Bourget-du-Lac cedex FRANCE
,
Katarzyna Kuta
Katarzyna Kuta, Faculty of Mathematics and Computer Science, University of Łódź, S. Banacha 22, 90-238 Łódź, POLAND
and
Stanisław Spodzieja
Stanisław Spodzieja, Faculty of Mathematics and Computer Science, University of Łódź, S. Banacha 22, 90-238 Łódź, POLAND
Abstract.
Let be a convex closed and semialgebraic set and let be a polynomial positive on . We prove that there exists an exponent , such that for any the function is strongly convex on . When is unbounded we have to assume also that the leading form of is positive in . We obtain strong convexity of on possibly unbounded , provided is sufficiently large, assuming only that is positive on . We apply these results for searching critical points of polynomials on convex closed semialgebraic sets.
Key words and phrases:
Polynomial, semialgebraic set, convex function, exponential function, lower critical point.
2010 Mathematics Subject Classification:
Primary 11E25, 12D15; Secondary 26B25.
1. Introduction
In [3] we considered several questions concerning convexification of a polynomial which is positive on a closed convex set . One of the main results in [3], is the following [Theorem 5.1]: if is a compact set than there exists a positive integer such that the function
[TABLE]
is strongly convex on . Moreover, explicit estimates for the exponent were given in [3]. They depend on the diameter of , the size of coefficients of the polynomial and on the minimum of on . In fact a stronger version of (1.1) was given in [3]; there exists an integer , which can be explicitly estimated, such that the polynomials
[TABLE]
are strongly convex on . The fact that can be chosen independent of was crucial for a construction of an algorithm which for a given polynomial , positive in the convex compact semialgebraic set , produces a sequence starting from an arbitrary point , defined by induction: , i.e., is the unique point of at which has a global minimum on . The sequence converges to a lower critical point of on (see [3, Theorem 7.5]).
In the case of non-compact closed convex set the results mentioned above require an additional assumption, that the leading form of , satisfy
[TABLE]
Under this assumption we have that: if a polynomial is positive on then for any there exists such that for each , , the polynomial is strongly convex on .
The assumption (1.2) is necessary for local convexity of in a neighborhood of infinity, see [3, Proposition 6.3]. However, this assumption is not sufficient to obtain convexity of the polynomial for some fixed independent of . For instance the polynomial , has this property, cf. [3, Example 4.5].
The main goal of this paper is to study convexification of polynomials functions by exponential factors of the form or by double exponential of the form . Surprisingly they play distinct roles. We set
[TABLE]
and prove the following (see Theorem 2.3 and Corollary 2.4): if a polynomial is positive on a compact and convex set , than there exists effectively computed number such that for any and the function is strongly convex on .
If is not compact, we obtain the above assertions under the assumption (1.2), see Theorem 3.3. In general the assumption (1.2) can not be ommited as we show in Example 3.6.
Surprisingly convexification in the noncompact case without assumption (1.2) is possible using double exponential factors. Namely in Theorems 4.1 and 4.6 we prove that: if is a convex and closed semialgebraic set and is a polynomial positive on , then for any there exists effectively computed number such that for any and any , , the function
[TABLE]
is strongly convex on .
In the case when is a convex and closed set, but non necessary semialgebraic, the result still holds (Theorems 4.4 and 4.7) under an additional assumption
[TABLE]
In the above theorems one can replace by the function
[TABLE]
It turns out that convexification of polynomials using exponential function is somehow more natural and powerful than the convexification by the factors of the form done in [3]. In particular it applies also to the noncompact case and the explicit formulae for the exponent are nicer.
We believe that the results mentioned above could be of interest, also to study o-minimal structures expanded by the exponent function. It fits particularly to the structure semialgebraic sets expanded by the exponent function. The remarkable fact that is indeed an o-minimal structure was established by A. Wilkie [5]. It would be interesting to explain a different power of exponential and double exponential for convexification.
The main difficulty when determining explicitely the number such that the function is strongly convex on a convex compact set , comes from an effective estimation of the number in (1.3) and the number . Using results of G. Jeronimo, D. Perrucci, E. Tsigaridas [2] we show in Theorem 2.6) and Theorem 2.3 how it is feasible when is a compact semialgebraic set described by polynomial inequalities with integer coefficients and is also a polynomial with integer coefficients (see Theorem 2.7).
As an application to optimization we propose an algorithm which produces, starting from an arbitrary point , a sequence which tends to a lower critical point of a polynomial restricted to or to infinity. We assume that is a closed convex semialgebraic set and a polynomial which is bounded from below on . Then by adding to an appropriate constant we may assume that on . If is unbounded we assume also condition (1.2). Hence by the above mentioned theorems we obtain strong convexity of for . Let us choose any and set by induction: . Then we prove that the sequence tends to a lower critical point of a polynomial restricted to or to infinity. Note that computing , that is minimizing on , is usually easier since the function is convex. This type of algorithm, based on convexification, is called sometimes proximal, see for instance [1]. Observe that computing the critical point of involves only algebraic equations.
The paper is organized as follows. In Section 2 we prove that the function in one variable is strongly convex on a closed interval , provided for and some and is sufficiently large. We also estimate from above the number . In Section 2.2 we consider this problem in the several variables case on a compact convex set (see Theorem 2.3). In Sections 3 and 4 we consider the case when the set is not compact.
2. Convexifying polynomials
2.1. Convexifying -functions in one variable
In this section we prove that if is a function of class positive on a closed interval (not necessary compact), then for large enough the function is strongly convex on .
Let be function. For any and we define the following function:
[TABLE]
For positive numbers we put
[TABLE]
Lemma 2.1**.**
Let be a function of class , which is positive on a closed interval . Let be such that
[TABLE]
and
[TABLE]
Assume that and
[TABLE]
then
[TABLE]
for , thus is strongly convex on .
Proof.
By definition of we have
[TABLE]
Hence, from the assumptions we obtain
[TABLE]
for .Note that the function
[TABLE]
attains its minimum, equal , at the point . Thus for we have
[TABLE]
for any . Therefore
[TABLE]
which implies that is strongly convex on . ∎
From Lemma 2.1 we immediately obtain
Corollary 2.2**.**
Let be a function of class on a closed interval . Let be such that
[TABLE]
and
[TABLE]
Than for any and any the function
[TABLE]
is strongly convex on . In particular the function
[TABLE]
is strongly convex on .
2.2. Convexifying polynomials in several variables
We will show that the function in variables is strongly convex on a compact convex set , provided is a polynomial positive on and is suficiently large.
Let be a real polynomial in of the form
[TABLE]
where and for (we assume that ). For we denote
[TABLE]
Theorem 2.3**.**
Let be a polynomial which is positive on a compact and convex set . Let and
[TABLE]
Than for any , any and any real the function is strongly convex on .
Proof.
Let
[TABLE]
where denotes the standard scalar product on Set
[TABLE]
Clearly, the family of all curves , where describes all affine lines in Denote by the set of all such that the line parametrized by intersects the set X. Then is a compact set and
[TABLE]
We will prove that for any and the function is strongly convex on
[TABLE]
Because is a compact and convex set, so is a closed interval or only one point.
It is obvious that for the set is an interval, which contains the point 0 or it is equal to . Denote this interval by (under convention ). Then
[TABLE]
Let be of the form (2.7). Than for we have . Let us fix . Then
[TABLE]
and
[TABLE]
Consequently,
[TABLE]
Take any , then
[TABLE]
then for and , we have and
[TABLE]
So, by Lemma 2.1 we get that for and is strongly convex on , provided . ∎
From Theorem 2.3 we obtain the following corollary.
Corollary 2.4**.**
Let and let be a compact and convex set. Let and let be a constant such that
[TABLE]
Than for any and any , the function
[TABLE]
is strongly convex on .
By a similar argument as in the proof of Theorem 2.3, we obtain the following fact.
Remark 2.5**.**
Let be a function of class and let be a compact and convex set. Assume that are numbers satisfying
[TABLE]
and the first and second directional derivatives of in directions of vectors of length , are bounded by on . Then the function
[TABLE]
is strongly convex on .
2.3. Convexifying polynomials with integer coefficients
For actual applications of Theorem 2.3 it is important to compute the number for a given convex semialgebraic set and a polynomial which is positive on . Hence the main difficulty is to compute (or rather estimate) and . This actually possible if we suppose that has integer coefficients and is described by equations and inequalities with integer coefficients.
More precisely, let , , be a compact semialgebraic set of the form
[TABLE]
where . Under the above notations G. Jeronimo, D. Perrucci, E. Tsigaridas in [2] proved that
Theorem 2.6**.**
Let be polynomials with degrees bound by an even integer and coefficients of absolute values at most , and let . If for and of the form (2.9) is compact, then
[TABLE]
For a positive real number and positive integers we put
[TABLE]
From Theorems 2.6 and 2.3 we immediately obtain
Theorem 2.7**.**
Let be a compact and convex semialgebraic set of the form (2.9) and let be polynomials with degrees bound by an even integer and coefficients of absolute values at most . Set
[TABLE]
Then
[TABLE]
Moreover, if for , then for any , and for any the function
[TABLE]
is strongly convex on .
Proof.
By Theorem 2.6 we have . Let
[TABLE]
and let . Then is a compact semialgebraic set defined by polynomial equations and inequalities of degrees bounded by . Moreover, the absolute values of coefficients of those polynomials and are bounded by . Then, by Theorem 2.7,
[TABLE]
and consequently we obtain (2.10). Summing up, Theorem 2.3 gives the assertion. ∎
3. Convexifying polynomials on non-compact sets
In this section we will show that the function in variables is strongly convex on a closed and convex set (not necessary compact), provided the polynomial takes values larger than a certain number , the leading form of a polynomial has only positive values and is suficiently large.
3.1. Convexifying polynomials in one variable
For a polynomial of the form , , , we put
[TABLE]
Lemma 3.1**.**
Let be a polynomial of degree which is positive on a closed interval (not necessary compact). Let be a positive number such that
[TABLE]
Let be a polynomial of the form
[TABLE]
and let be a polynomial of the form
[TABLE]
for and . Then for , where and , we have
[TABLE]
Proof.
Consider the following quadratic function in
[TABLE]
Then its discrirminant is of the form . Take , and . Then we have
[TABLE]
for , . On the other hand for , . So for and we deduce the assertion. ∎
Theorem 3.2**.**
Let be a polynomial of degree and let be a closed interval (not necessary compact). Assume that there exists such that
[TABLE]
Let and . Then for any , , and any the function
[TABLE]
is strongly convex on .
Proof.
It suffices to observe that and apply Lemma 3.1. ∎
3.2. Convexifying polynomials in several variables
Theorem 3.3**.**
Let be a convex closed set. Assume that is a polynomial of degree which is positive on ,
[TABLE]
and there exists such that
[TABLE]
Then there exists such that for any integer and any the function is strongly convex on .
Proof.
Take any line of the form , where , and . Then
[TABLE]
Then
[TABLE]
where . Consider the function in the square bracket as a quadratic function in . Then its discriminant is of the form
[TABLE]
Note that and are the first and the second directional derivatives of at in the direction and .
Observe that there exists such that for any we have . Indead, it suffices to prove that for any and any , we have
[TABLE]
If for then the set is compact and the inequality follows from the assumption that for . Indead, let for , . Since is compact, then
[TABLE]
for . This gives (3.4).
Consider the case when for , and let
[TABLE]
where is the unit sphere in , i.e., .
Let be a polynomial of the form (2.7). We set
[TABLE]
Then . If then we set
[TABLE]
and
[TABLE]
In the further part of the proof we will need the following lemma.
Lemma 3.4**.**
If and , then and for any such that .
Proof.
Put
[TABLE]
Since , then
[TABLE]
and since for , then for . Moreover, , so . On the other hand
[TABLE]
for . This gives the assertion of Lemma 3.4. ∎
Take , and then for we have
[TABLE]
for .
For , we have
[TABLE]
Since for ,
[TABLE]
then for and we have
[TABLE]
for . This gives (3.4). Moreover, there exists such that
[TABLE]
for any and .
From (3.4) and the above there follows that for any and such that . Since for , then if . This gives the assertion. ∎
By analogous argument as for Theorem 3.3 and under notations of the proof we obtain the following corollary.
Corollary 3.5**.**
Let be a polynomial of degree and let be a convex and closed set. Under assumptions of Theorem 3.3 and notations of the proof, for any , and , and any the function is strongly convex on . In particular the function
[TABLE]
is strongly convex on .
The assumption (3.2) that for , in Theorem 3.3, can not be omited as the following example shows.
Example 3.6**.**
Let be a polynomial of the form
[TABLE]
Since for then we easily see that
[TABLE]
Note that , and the leading form has nontrivial zeroes.
Now take any and . Then for we have
[TABLE]
and
[TABLE]
Hence for sufficiently large ,
[TABLE]
therefore can not be a convex function.
The assumption (3.2) in Theorem 3.3, cannot be replaced by a condition . Indead, consider a modification of the previous example of the form
[TABLE]
where . Then and the function is not convex for any by the previous argument.
It turns out that the use of a double exponential function leads to a convexity of an appropriate function on . We show it in the next section.
4. Double exponential convexifying polynomials
In this section, without the assumption that the leading form of a polynomial in variables has only positive values, we will show that the function is strongly convex on a closed and convex semialgebraic set (not necessary compact), provided the polynomial takes positive values on and is suficiently large.
Theorem 4.1**.**
Let be a closed and convex semialgebraic set, and let be a polynomial which has only positive values on . Then there exists such that for any the function is strongly convex on .
Proof.
Let be of the form (2.7), . Then , , the first and second directional derivatives of in directions of vectors of length at , are bounded by , where
[TABLE]
Take an affine line in of the form
[TABLE]
where and the set is defined in (2.8). Then , , and . Let write the second derivative of in the form
[TABLE]
where
[TABLE]
The discriminant of the polynomial is of the form
[TABLE]
So, by the choice of the number , we have
[TABLE]
Since the set is semialgebraic and , then by Hörmander-Łojasiewicz inequality, see eg. [4, Corollary 2.4], there exist , where , , depend on and the complexity of , (i.e., degrees and the number of polynomials describing ) such that
[TABLE]
Moreover, the numbers are effectively computable. By the above,
[TABLE]
If is large enough, then for any we have
[TABLE]
Therefore , so for any . Consequently
[TABLE]
for . Note that , hence there exists such that for . Moreover, the number can be chosen independet of . This gives the assertion. ∎
Remark 4.2**.**
The number in Theorem 4.1 can be effectively computed, provided we can estimate the constant . More precisely, under notations in the proof, if , then for we have
[TABLE]
for
[TABLE]
If additionally , then the above inequality holds for any .
Remark 4.3**.**
We cannot omit the assumption in Theorem 4.1 that the set is semialgebraic. For instance if and , then on , but the function is not convex on for any .
Assuming that on , for some , we can omit the assumption in Theorem 4.1 on semialgebraicity of . More precisely, by a similar argument as in the proof of Theorem 4.1 we obtain
Theorem 4.4**.**
Let be a closed and convex set, and let be a polynomial such that for and some . Then there exists such that for any the function is strongly convex on .
Remark 4.5**.**
By a similar argument as for the proof of Theorem 4.4 we obtain that the assertion of this Theorem occurs not only for the function but also for the function . More precisely, we have:
Let be a closed and convex set, and let be a polynomial such that for and some . Then there exists such that for any the function is strongly convex on .
A similar argument as for Theorem 4.1 gives the following theorems.
Theorem 4.6**.**
Let be a convex closed semialgebraic set and let . If is a polynomial such that
[TABLE]
then there exists such that for any integer and any , , the function is strongly convex on .
Moreover, there exists such that the function
[TABLE]
is strongly convex on , provided , .
Theorem 4.7**.**
Let be a convex closed set and let . Assume that is a polynomial of degre such that there exists such that
[TABLE]
Then there exists such that for any integer and any , the function is strongly convex on .
Moreover, there exists such that the function
[TABLE]
is strongly convex on , provided , .
It is impossible to obtain in the above theorem, such that the function is convex for any as the following example shows.
Example 4.8**.**
Let be a polynomial of the form
[TABLE]
Analogously as in Example 3.6 we see that
[TABLE]
and the leading form has nontrivial zeroes.
Now take any and . Then for , we have
[TABLE]
Since
[TABLE]
and
[TABLE]
then we easily see that for sufficiently large . So, can not be a convex function.
Remark 4.9**.**
It is worth noting that the use of triple exponential convexifying of a polynomial does not improve convexity of the function regardless of .
5. Algorithm for searching lower critical points
5.1. Searching lower critical points in a compact set
In this part we give an algorithm which produces, starting from an arbitrary point, a sequence of points converging to a lower critical point of a polynomial on a convex compact semialgebraic set. A similar algorithm was proposed in [3].
Let be a closed set and let be a function of class in a neighborhood of . We denote the set of lower critical points of the function on the set by It is obvious that the set of ordinary critical points of the function is contained in the set
Our algorithm for approximation of lower critical points of is based on the iteration of computation of the smallest value of the strongly convex function on the convex and compact set . More precisely, let
[TABLE]
Take any polynomial of the form (2.7). Let
[TABLE]
and let
[TABLE]
Then we have
[TABLE]
and from Corollary 2.4, we have that for any , the function
[TABLE]
is -strongly convex on for some . Since we are looking for lower critical points of , so without loss of generality, we may assume that , therefore
[TABLE]
is -strongly convex function in for any .
Any strictly convex function defined on a compact and convex set has the unique point, denoted by , in which the function has the minimal value on the set . Therefore, chosing an arbitrary point , we can determine by induction a sequence , , in the following way
[TABLE]
Theorem 5.1**.**
Let be a compact convex semialgebraic set and a positive polynomial on . Let be a sequence defined as with Then the limit
[TABLE]
exist and
The proof of Theorem 5.1 follows word by word the proof of Theorem 6.5 in [3], where we should use the following three lemmas instead of the corresponding lemmas in [3].
Lemma 5.2**.**
For any , we have
[TABLE]
Lemma 5.3**.**
For any we have
[TABLE]
In particular the sequence is decreasing.
Proof.
Since is strongly convex, the definition of implies that the function
[TABLE]
decrease, so . Again by the fact that is -strictly convex, we get
[TABLE]
This gives the assertion. ∎
We can also addapt the following lemma ([3, Lemma 6.3]).
Lemma 5.4**.**
Let be a function such that and on for some and Assume that is strictly convex on Then Hence
Remark 5.5**.**
The function is defined by using the function . However, to determine the minimum value of this function on a compact convex semialgebraic set it is enough to solve only polynomial equations and inequalities. More precisely, the set is the union of a finite collection of basic semialgebraic sets, so we may assume that
[TABLE]
where . Then
[TABLE]
Therefore, when applying Lagrange Multipliers or Karush-Kuhn-Tucker Theorem to compute the point it is enought to solve a system of polynomial equations and inequalities.
5.2. Searching lower critical points in an unbounded set
Let be a convex and closed semialgebraic set. Let be a polynomial of degree of the form (2.7) and let be the leading form of . Assume that .
Then by Theorem 3.3, we may effectively compute a real number such that the function for is strongly convex on . Moreover, for , , so we have
[TABLE]
Then we may uniquely determine the sequence
[TABLE]
Analogous argument as for Theorem 5.1 gives the following theorem.
Theorem 5.6**.**
Let be a sequence defined by (5.2), Then the limit
[TABLE]
exist and
Remark 5.7**.**
If is a closed and convex semialgebraic set and a polynomial is positive on and it is proper on (i.e., ), then by Theorem 4.6 one can repeat the argument from Theorem 5.1 and obtain a sequence such that .
If we assume only that on , then the sequence can tend to infinity. Moreover, in the construction of we have to change step by step.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Attouch, J. Bolte, P. Redont, A. Soubeyran, Proximal alternating minimization and projection methods for nonconvex problems: an approach based on the Kurdyka-Łojasiewicz inequality . Math. Oper. Res. 35 (2010), no. 2, 438–457.
- 2[2] G. Jeronimo, D. Perrucci, E. Tsigaridas, On the minimum of a polynomial function on a basic closed semialgebraic set and applications . SIAM J. Optim. 23 (2013), no. 1, 241–255.
- 3[3] K. Kurdyka, S. Spodzieja, Convexifying positive polynomials and sums of squares approximation . SIAM J. Optim. 25 (2015), no. 4, 2512–2536.
- 4[4] K. Kurdyka, S. Spodzieja, A, Szlachcińska, Metric properties of semialgebraic mappings . Discrete Comput. Geom. 55 (2016), no. 4, 786–800.
- 5[5] A.Wilkie, Model completeness results for expansions of the ordered field of real numbers by restricted Pfaffian functions and the exponential function. J. Amer. Math. Soc. 9 (1996), no. 4, 1051–1094.
