Algorithm for Optimization and Interpolation based on Hyponormality
C\'edric Josz

TL;DR
This paper introduces a unified linear algebra-based algorithm leveraging hyponormality to solve complex polynomial optimization and exponential interpolation problems, extending existing methods and avoiding algebraic geometry.
Contribution
It presents the first algorithm capable of extracting global solutions from complex polynomial optimization problems and offers an alternative to Prony's method for interpolation.
Findings
Successfully extracts global solutions for complex polynomial optimization.
Provides an algebraic alternative to Prony's method for interpolation.
Demonstrates enforcement of hyponormality to improve relaxation solutions.
Abstract
On one hand, consider the problem of finding global solutions to a polynomial optimization problem and, on the other hand, consider the problem of interpolating a set of points with a complex exponential function. This paper proposes a single algorithm to address both problems. It draws on the notion of hyponormality in operator theory. Concerning optimization, it seems to be the first algorithm that is capable of extracting global solutions from a polynomial optimization problem where the variables and data are complex numbers. It also applies to real polynomial optimization, a special case of complex polynomial optimization, and thus extends the work of Henrion and Lasserre implemented in GloptiPoly. Concerning interpolation, the algorithm provides an alternative to Prony's method based on the Autonne-Takagi factorization and it avoids solving a Vandermonde system. The algorithm and…
| Truncated data | Shift operators | Experiment |
|---|---|---|
| General case (only Hermitian) | Existence not guaranteed | Example 6.1 |
| Hermitian but not hyponormal | do not commute | Example 6.2 |
| Hyponormal | commute | Example 6.3 |
| Enforced hyponormality | are made to commute | Example 6.4 |
| Hermitian Toeplitz | Unitary | Example 6.5 |
| Hermitian Hankel | Real symmetric | Example 6.6 |
| Complex Hankel | Complex symmetric | Example 6.7 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPolynomial and algebraic computation · Advanced Optimization Algorithms Research · Numerical Methods and Algorithms
Algorithm for Optimization and Interpolation based on Hyponormality
Cédric Josz111Laboratory for Analysis and Architecture of Systems (LAAS), French National Center for Scientific Research (CNRS), 7, avenue du Colonel Roche, Toulouse, 31000, France (). The research was funded by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement 666981 TAMING).
Abstract
On one hand, consider the problem of finding global solutions to a polynomial optimization problem and, on the other hand, consider the problem of interpolating a set of points with a complex exponential function. This paper proposes a single algorithm to address both problems. It draws on the notion of hyponormality in operator theory. Concerning optimization, it seems to be the first algorithm that is capable of extracting global solutions from a polynomial optimization problem where the variables and data are complex numbers. It also applies to real polynomial optimization, a special case of complex polynomial optimization, and thus extends the work of Henrion and Lasserre implemented in GloptiPoly. Concerning interpolation, the algorithm provides an alternative to Prony’s method based on the Autonne-Takagi factorization and it avoids solving a Vandermonde system. The algorithm and its proof are based exclusively on linear algebra. They are devoid of notions from algebraic geometry, contrary to existing methods for interpolation. The algorithm is tested on a series of examples, each illustrating a different facet of the approach. One of the examples demonstrates that hyponormality can be enforced numerically to strenghten a convex relaxation and to force its solution to have rank one.
keywords:
Autonne-Takagi factorization, Cholesky factorization, Hankel matrix, hyponormality, moment problem, Toeplitz matrix.
AMS:
49M20, 65F99, 47N10.
\slugger
mmsxxxxxxxx–x
1 Introduction
Consider the problem of finding global solutions to the following complex polynomial optimization problem
[TABLE]
where we use the multi-index notation for , , and stands for the conjugate of . As usual, denotes the set of complex numbers and will denote the set of real numbers. The functions are real-valued polynomials so that in the above sums only a finite number of coefficients and are nonzero and they satisfy and . The feasibility set is defined as
[TABLE]
We define its degree to be
[TABLE]
where is the maximal degree in either the conjugate or non-conjugate powers of the polynomial . Note that this is different from the degree of the polynomial, which is related to the sum of the conjugate and non-conjugate powers, i.e. .
To solve (1), Molzahn and I proposed [43] proposed a semidefinite programming relaxation hierarchy in complex numbers which generalizes Lasserre’s hierarchy [49, 50, 60, 61] and relies on the recent results [25, 68]. It consists in the following primal-dual problems
[TABLE]
where stands for positive semidefinite and the integer is the truncation order. The moment matrix is defined by
[TABLE]
and the localizing matrix is defined by
[TABLE]
A polynomial is a Hermitian sum of squares, i.e. it belongs to , if it is of the form
[TABLE]
This is equivalent to where stands for positive semidefinite (see [43] for an explanation). In the above equation, stands for modulus of a complex number.
If one were to convert a complex polynomial optimization problem to real numbers and apply Lasserre’s hierarchy, the moment matrix would be times bigger asymptotically (for a large number of variables), hence the advantage of the complex hierarchy we’ve just described. This comes at the cost of a potentially lower optimal value at a given truncation order . Significant numerical advantages can be seen on the optimal power flow in electrical engineering [12, 57, 43] on instances with several thousand variables and constraints.
Global convergence is guaranteed in the presence of a sphere constraint, i.e. . At the cost of an additional variable, any complex polynomial optimization problem with compact feasible set can be solved by this approach, as explained in [43]. In that work, conditions for extracting global minimizers were given, but a general procedure for extracting them was left for future work. One of the objectives of this paper is to fill this gap. Specifically, we propose an algorithm to extract an atomic measure from the truncated data , i.e. that satisfies
[TABLE]
The atoms are then global solutions to the polynomial optimization problem. There exists no method in the present literature that achieves this to the best of our knowledge. Our algorithm also applies to real polynomial optimization, for which a method already exists [39] and was implemented in Gloptipoly [40]. A variant to that method was later proposed in [53]. We next consider a seemingly unrelated problem for which the same algorithm applies.
Consider the following sum of complex exponential functions
[TABLE]
composed of weights and frequencies (using the shorthand where stands for transpose). Say we want to interpolate a set of imposed values with such a function
[TABLE]
In other words, the problem consists in computing weights and frequencies that match the interpolation values. As is well known (e.g.,[47]), these values satisfy
[TABLE]
where
[TABLE]
We use the notation and stands for the Dirac measure at . The interpolation values are thus the moments on of the measure . In this setting, we can consider the following moment matrix
[TABLE]
which is a complex Hankel matrix (i.e. for all such that ). Our algorithm extracts the sought measure from this matrix by using the Autonne-Takagi factorization [77, 7], also known as symmetric singular value decomposition, which applies to complex symmetric matrices. To the best of our knowledge, this factorization has not been used in this context. For a thorough survey on the applications of complex symmetry, see [31]; for recent development on the Autonne-Takagi factorization, see [41]. Another feature of our algorithm is that it avoids solving a Vandermonde linear system, as in the recent preprint [36]. Existing methods were initiated by Baron Gaspard Riche de Prony in 1795 [26], and have been subject to formidable developments [9, 62, 71, 5, 58, 46] and applications [8, 32, 64, 63, 70, 76]. One application is to recover a damped sinusoidal function of real variable of the form
[TABLE]
from a small number of its evaluations, where
- •
: amplitude
- •
: damping
- •
: angular frequency
- •
: phase shift.
Indeed, such a function is special case of a complex exponential function presented above since . We briefly recall Prony’s method in the univariate setting of (9)-(10) above. It consists in defining the polynomial
[TABLE]
which, for all , satisfies
[TABLE]
so that its coefficients lie in the Kernel of the Hankel matrix in (13). The Kernel is unidimensional and , so the coefficients are uniquely determined. The frequencies of the interpolation function are given by the its roots, and the weights can then be deduced by the Vandermonde system
[TABLE]
This paper is organized as follows. Section 2 summarizes the contributions of this paper. Section 3 provides some background on hyponormality and compares the notion in infinite and finite dimensions. Section 4 reviews a result on the moment problem in complex numbers and provides a new result for Toeplitz matrices. The same machinery is then applied to the truncated moment problem which arises in exponential interpolation. The constructive proofs of Section 4 yield an algorithm for optimization and interpolation in Section 5. It is illustrated on various examples in Section 6. Finally, Section 7 concludes our work.
2 Contributions
The main contribution of this paper is to apply hyponormality to polynomial optimization and exponential interpolation. This has not been considered in past literature to the best of our knowledge. We obtain the following new results using hyponormality.
We propose a procedure to extract global solutions from a polynomial optimization problem when using the moment/sum-of-squares hierarchy in complex numbers (algorithm in Section 5). 2. 2.
We propose a variant to Prony’s method based on the Autonne-Takagi factorization using the same algorithm in Section 5, thereby unifying two a priori unrelated problems: optimization and interpolation. 3. 3.
We propose a moment/sum-of-squares hierarchy for complex polynomial optimization in which hyponormality is enforced via convex constraints and illustrate it on an example (Example 6.4 in Section 6). 4. 4.
We propose a solution to the truncated moment problem on a semi-algebraic set when the data forms a Toeplitz matrix (Theorem 2 in Section 4). We find that Toeplitz and Hankel matrices play analogeous role with respect to the moment problem. 5. 5.
We analyse the different properties of shift operators associated to truncated data (eg. unitary, symmetric) and relate them to applications in optimization and interpolation (examples 1 through 7 in Section 6).
3 Joint hyponormality
We recall some fundamental notions in operator theory in order to discuss joint hyponormality. Let denote the set of linear bounded operators acting on a complex Hilbert space . We convene that the inner product is conjugate-linear in its first variable, and linear in its second. A notion of positivity can be defined for an element of , which is that for all , and which we denote . In addition, the commutator of is defined as . Finally, let denote the adjoint of . Given these notations, an operator is said to be
- •
normal if ;
- •
subnormal if it can be extended to a normal operator on a larger Hilbert space ;
- •
hyponormal if .
The following implications hold
[TABLE]
The first implication is obvious. Concerning the second, let denote the orthogonal projection of on . Then, for all , we have and thus, as projections are self-adjoint. Following [20], we have
[TABLE]
where stands for the norm induced by the inner product. The notions of subnormality and hyponormality were introduced by Halmos [35] in 1950 when studying the unilateral shift operator, that is
[TABLE]
where is the Hilbert space of square-summable sequences of complex numbers indexed by the natural numbers . Its adjoint is
[TABLE]
It is not normal since for all , we have . However, it admits the following normal extension
[TABLE]
where is the Hilbert space of square-summable sequences indexed by the integers . By definition, is subnormal, and, as a consequence, it is hyponormal. We can check that for all , we have . Hyponormality has been subject to various developments since it was first introduced, see for example [14, 22, 27, 28, 29, 56, 78, 69, 15].
In this paper, we will define a shift operator for each variable (optimization variable or variable of interpolation function). We will thus rely on an extension of hyponormality to multiple operators. Following the definition of Athavale [6], operators are jointly hyponormal if
[TABLE]
in the sense that for all , there holds
[TABLE]
This is equivalent to111This can seen with the Schur complement. If and is invertible, then
[TABLE]
but it is stronger than requiring to be hyponormal for all , that is
[TABLE]
The key ingredient to making the notion of joint hyponormality relevant for practical purposes is that in finite dimensions, the trace of is equal to zero, and thus
[TABLE]
This brings about the following equivalences in finite dimension which will be used throughout the paper:
are jointly hyponormal. 2. 2.
The inequality in (22) is an equality. 3. 3.
commute pair-wise. 4. 4.
For all pairs with , it holds that
[TABLE] 5. 5.
There exists a unitary matrix (i.e. where denotes the identity) and diagonal matrices such that
[TABLE]
These equivalences bring a new perspective on the moment problem to the best of our knowledge. They were implicitely used in our preprint [43]. It allows one to deal with a kind of truncated moment problem that arises in practice, but which has not been given much attention to from a theoretical perspective.
4 Truncated moment problem
The moment problem is an old yet active subject of research [37, 3, 48, 18, 17, 20, 66, 72, 16, 19, 45, 44, 74, 73, 23, 54, 38, 10, 59, 52, 2]. We now focus on a kind of moment problem that arises in practice. Consider an integer and an infinite sequence of complex numbers . In the context of polynomial optimization, we are interested in knowing whether there exists a positive Borel measure supported on the semi-algebraic set such that
[TABLE]
Should it exist, we are also interested in computing the measure from its moments . In the context of exponential interpolation, a complex-valued measure supported on is presumed to exist such that
[TABLE]
and we are solely interested in computing the measure from its moments.
In previous work [18, 16, 19], the authors raise the question of whether there exists a positive Borel measure supported on the semi-algebraic set such that
[TABLE]
This corresponds to a degree truncation, as opposed to a square truncation as above. The discrepancy is illustrated in Figure 1 below. It calls for different notions of moment matrices, as shown in Figure 2. The moment matrix resulting from square truncation is referred to as pruned complex moment matrix in [51], but the moment problem with square truncation is not considered. The moment problem with degree truncation is equivalent in some sense (see [19, Theorem 5.2]) to the even-dimensional real moment problem, i.e. where we seek a measure on a real semi-algebraic set such that
[TABLE]
given a real sequence . In contrast, the square truncation we consider captures the real truncated moment problem as the special case (both even- and odd-dimensional). It corresponds to the case where the moment data forms a Hankel matrix (see [43, Corollary 3.9] and Theorem 2 below).
In Theorem 1 and Theorem 2 below, solutions to the moment problem with square truncation are given using the notion of hyponormality presented in Section 3. The proofs are constructive and the algorithm proposed in Section 5 replicates each step of the proofs. To introduce the results, we need some notation. In the multivariate case, we say that truncated data is Hermitian if for all . Given an integer , a -atomic measure is sum of Dirac measures in distinct points (called atoms) with nonzero weights. It is said to be positive if all the weights are positive, and supported on a set if the atoms lie in the set. In the setting of interpolation which we treat after Theorem 1 and Theorem 2, the weights are complex numbers.
Theorem 1**.**
[43*, Theorem 3.8]**
Consider some complex numbers with forming a Hermitian matrix. Assume that contains the constraint for some radius . Then there exists a positive -atomic measure* supported on such that
[TABLE]
if and only if
and ; 2. 2.
; 3. 3.
.
Moreover, is the unique -atomic measure satisfying (33), and for each , it has exactly atoms that are zeros of . In the univariate case , Condition 3 must be replaced by
[TABLE]
Proof.
We provide a sketch of the proof. We focus on the “if” part, as it is important in the sequel. The positive semidefinite moment matrix of rank can be factorized as where stands for adjoint, i.e. conjugate transpose. This can be achieved with the Cholesky factorization [13]. The rows of are indexed by and the columns are are indexed by . Thanks to Conditions 1 and 2, there exists complex matrices of order , called shift operators, such that for each , we have
[TABLE]
Here, denotes the -column of and is the row vector of size that contains only zeros apart from 1 in position . We now explain why the shift operators exist. Consider the finite-dimensional Hilbert space . Condition 2 implies that . Given some complex numbers , Condition 1 implies that
[TABLE]
where denote the 2-norm of a vector . As a result, given two possibly different sets of coefficients and , if , then . In other words, each element of has a unique image by , which makes the shift well-defined. In fact, it is bounded by the radius .
Condition 3 implies that for all , we have
[TABLE]
As discussed in Section 3, this makes the shifts jointly hyponormal. Thus there exists a unitary matrix such that where is a diagonal matrix for each . Bearing in mind that , we have for all that
[TABLE]
where denote the columns of and . As a result, eigenvalues of the shift operators correspond to the support of a measure, and their eigenvectors yield the weights of a measure. Precisely, the following measure
[TABLE]
solves the truncated problem up to order , i.e. (33). The uniqueness of the measure can easily be deduced from Lemma 3 and Lemma 4 in the Appendix. ∎
We will say that the truncated moment data is hyponormal if it satisfies Condition 3 of Theorem 1. Indeed, this condition corresponds to the joint hyponormality of the shift operators. We propose to enforce the joint hyponormality of the shift operators in the complex hierarchy by requiring the truncated data to be hypornormal. This yields the following primal problem
[TABLE]
and its dual counterpart
[TABLE]
where a polynomial belongs to if it is of the form where . Recall that for complex polynomials, the coefficients are unique (see Section 3.2, footnote 4 in [43] for a comparison with real polynomials). Thus the complex polynomial can be identified with the Hermitian matrix . This is what we do in the semidefinite constraints in the above dual problem. In Example 6.4, we apply this hierarchy of relaxations to a bivariate optimization problem. Compared to the complex moment/sum-of-squares hierarchy, it solves this particular problem at a lower order.
There exists several identifiable cases where the shift operators are naturally jointly hyponormal. It was noticed in [43, Corollary 3.9] that it is the case when the truncated data forms a Hermitian Hankel matrix (thus real-valued). This corresponds exactly to the truncated data generated by the Lasserre hierarchy for real polynomial optimization. In that case, the shift operators are real symmetric. Below, we show that hyponormality is also guaranteed if we assume that the truncated data forms a Toeplitz matrix. This result has not been presented in the literature to the best of our knowledge. In the multivariate case, we say that the truncated data is Hankel if for all such that , and we say that the truncated data is Toeplitz if for all such that . In other words, only depends on in a Hankel matrix, and it only depends on in a Toeplitz matrix.
Theorem 2**.**
Consider some complex numbers with forming either a Hermitian Toeplitz matrix or a Hermitian Hankel matrix. There exists a positive -atomic measure supported on such that
[TABLE]
if and only if
and ; 2. 2.
.
Moreover, is the unique -atomic measure satisfying (42), and for each , the measure has exactly atoms that are zeros of .
Proof.
() Same as proof as in [43, Theorem 3.8]. () The proof is similar to that of Theorem 1. We focus on the areas where it differs and only consider the case of Toeplitz matrices. The Toeplitz property implies that the shift operators in the proof of Theorem 1 are well-defined and unitary. Indeed, for all complex numbers , it holds that
[TABLE]
Using the same argument as in the proof of Theorem 1, the shift is well-defined. In addition, it is isometric and thus satisfies . Now, observe that is a pair-wise commuting tuple of operators on the Hilbert space . As a consequence, is also a pair-wise commuting tuple of operators. Indeed, if two invertible square matrices and commute, so do and (since ), and so do and (since ). It follows that are jointly hyponormal. The rest of the proof is identical to that of Theorem 1. ∎
In the univariate case with support equal to the full space , the truncated moment problem in Theorem 2 with Toeplitz data is actually the truncated trigonometric moment problem. A solution to this problem has been given by [42, P. 211], [4, Theorem I.I.12], and [21, Theorem 6.12]. It can be stated as follows. A Toeplitz matrix with positive upper left element is positive semidefinite if and only if it is represented by a positive Borel measure. In other words, the rank need not be preserved (Condition 2 of Theorem 2) for there to exist a measure. The trigonometric moment problem has been considered more recently in [30, 1, 81].
In the setting of interpolation, the shift operators are more simple to study since we assume the existence of a representing measure (i.e. ). The measure is uniquely determined by its moments up to degree , and thus the interpolation problem has a unique solution with exponentials. In addition, the rank is preserved, i.e. . These claims can be easily be deduced from Lemma 3 and Lemma 4 in the Appendix. We next prove that shift operators associated to the Hankel moment matrix are guaranteed to exist, that they are simultaneously diagonalizable, and that they are complex symmetric.
The moment matrix is of Hankel type and is thus complex symmetric. According to the Autonne-Takagi factorization [7] [77, Theorem II] which applies to any square complex symmetric matrix, there exists a unitary matrix (i.e. ) and a diagonal matrix with real nonnegative entries such that . Note that the diagonal values of are the eigenvalues of , whose rank is equal to that of . Defining , we may in fact write that where the rows of are indexed by and the columns are are indexed by . In addition, since our data is represented by the measure , we know that where the rows of are indexed by and the columns are are indexed by . Here, is a complex number such that where is a complex weight of the measure . As a result, and thus and have the same ranges. Hence, there exists an invertible matrix such that . Thus , or, in other words, . According to Lemma 3 and Lemma 4 in the Appendix, the range of is equal to , so that we in fact have for all . As a result, . Going back to the relationship , we have in particular that for all . Defining , we have that , and thus . The shift operators are thus for . Remembering that , we have
[TABLE]
so that the sought measure is entirely determined
[TABLE]
We conclude by proving that the shifts are complex symmetric. Since , we have that . Consider some complex numbers and . Let and and compute
[TABLE]
whence .
5 Algorithm
Input:
- •
number of variables
- •
truncation order
- •
moment matrix of rank
Output:
- •
-atomic measure
Below, the notation either stands either for conjugate transpose or transpose depending on whether the algorithm is being applied to polynomial optimization () or interpolation (). In the case of real polynomial optimization, .
Factorize the moment matrix into . The rows of are indexed by and the columns of are indexed by . 2. 2.
Find a subset of the columns of that generate the column space of . Let denote their indexes. 3. 3.
Compute the shift operators by applying them only to the column basis, i.e. for . 4. 4.
Generate some random and diagonalize the matrix with . Let denote the columns of . 5. 5.
Compute the measure .
In the case of polynomial optimization, the atoms of the measure are global solutions. In the case of interpolation, the arguments of the atoms are the frequencies (modulo for their imaginary parts) and the weights of the measure are the weights of the complex exponential sum.
6 Numerical experiments
We use Matlab 2015b, CVX [24, 33], and SeDuMi [75]. The Cholesky factorization of positive semidefinite matrices is computed via an eigendecomposition followed by a QR factorization. The Autonne-Takagi factorization is computed using the implementation of Guo, Luk, Xu, and Piao [80, 79, 34, 55]. Another algorithm for this factorization is discussed in [11]. Table 1 below summarizes the experiments. Each of them illustrates a different property of the shift operators that arises in applications.
Example** 6.1**** (Nonexistent shifts).**
Consider the following truncated data:
[TABLE]
The existence of shift operators is guaranteed if Conditions 1 and 2 of Theorem 1 hold. Condition 2 is satified because . However, Condition 1 is not satisfied. The spectrum of is equal to but there does not exist such that . Indeed, the characteristic polynomial of with indeterminate is equal to
[TABLE]
[TABLE]
The moment matrix can be factorized as where
[TABLE]
*There does not exist a shift operator acting on , i.e. a scalar , such that and . That would imply that and , which is absurd. *
Example** 6.2**** (Non-hyponormal shifts).**
Consider the following truncated data which was randomly generated:
[TABLE]
[TABLE]
We chose the data so that and is invertible. As result, holds as long a is big enough. Hence Condition 1 of Theorem 1 holds. In addition, we chose the data so that the rank is preserved, i.e. and so Condition 2 holds. All the conditions of Theorem 1 are thus satisfied, apart from Condition 3. We now show that Condition 3 fails to hold, and thus no atomic measure may be extracted from the data.
We have the Cholesky decomposition where
[TABLE]
[TABLE]
for which the column basis indexed by is readily identified. We then obtain the shift operators
[TABLE]
and
[TABLE]
For example, the image by of the column of indexed is equal to the column of indexed by :
[TABLE]
Using an eigendecomposition, we find that
[TABLE]
and
[TABLE]
where stands for spectrum. This shows that Condition 3 of Theorem 1 does not hold, that is to say, and are not jointly hyponormal.
Example** 6.3**** (Complex polynomial optimization over an ellipse).**
Minimize
[TABLE]
over subject to
[TABLE]
[TABLE]
The feasible set of this example is taken from [43, Example 3.2], itself originally from [67]. It is represented in Figure 3.
The first order complex relaxation is not defined because the variables appear to the second power (notice that ). The second order relaxation yields the value 1.00047 and a spectral decomposition indicates that and . Thus Condition 2 of Theorem 1 does not hold. The third order relaxation yields 1.93291 and the following moment matrix:
[TABLE]
[TABLE]
Using a spectral decomposition, we find that . In addition, we find the Cholesky decomposition where
[TABLE]
[TABLE]
for which the column basis indexed by is readily identified. We then obtain the shift operators
[TABLE]
For example, the image by of the column of indexed by is equal to the column of indexed by :
[TABLE]
Using an eigendecomposition, we find that
[TABLE]
and
[TABLE]
This implies that and are jointly hyponormal and all three conditions of Theorem 1 hold. An atomic measure can be extracted from the truncated data. Due to hyponormality, we may diagonalize the shifts simultaneously. Taking a random linear combination, we have
[TABLE]
with
[TABLE]
and
[TABLE]
The first coordinate of the atoms is given by the diagonal of
[TABLE]
The second coordinate of the atoms is given by the diagonal of
[TABLE]
The atoms can now be read by looking at the first diagonal elements, then the second diagonal elements:
[TABLE]
With denoting the column of indexed by , and with and denoting the first and second colums of , the respective weights of the atoms are
[TABLE]
Example** 6.4**** (Enforcing joint hyponormality in a convex relaxation).**
Minimize
[TABLE]
over subject to
[TABLE]
[TABLE]
The feasible set of this example is the same as in Example 6.3 and Figure 3.
The first order complex relaxation is not defined because the variables appear to the second power (notice that ). The second order relaxation yields the value 0.155089 and the following moment matrix:
[TABLE]
A spectral decomposition reveals that . In addition, we have the Cholesky decomposition where
[TABLE]
for which the column basis indexed by is readily identified. We then obtain the shift operators
[TABLE]
[TABLE]
Using an eigendecomposition, we find that
[TABLE]
and
[TABLE]
This shows that Condition 3 of Theorem 1 does not hold and that and are not hyponormal. Thus no measure can be extracted from the data.
The third order complex relaxation yields the value 0.428175 and the moment matrix satisfies . Thus a Dirac measure can be extracted, with support and weight . Instead of computing the third order relaxation, we enforce hyponormality by adding the following constraint in the second order relaxation:
[TABLE]
We then obtain the value 0.428175 and the following moment matrix:
[TABLE]
which satisfies . In addition, we have the Cholesky decomposition where
[TABLE]
The shift operators can be read from and they are equal to
[TABLE]
Using an eigendecomposition, we find that
[TABLE]
and
[TABLE]
Jointly hyponormality has been successfully enforced. It has reduced the rank of the moment matrix from rank 3 to rank 1. The eigenvalues of and are their values, thus we find the Dirac measure with support and weight , precisely as in the third order relaxation. Notice that in the third order relaxation, there is one conic constraint of size and one of size . In the second order relaxation augmented with (71), there is one conic constraint of size , one of size , and one of size .
Example** 6.5**** (Complex polynomial optimization on the torus).**
Minimize
[TABLE]
over subject to
[TABLE]
The feasible set is represented in Figure 4.
The first and second order complex relaxations are not defined, while the third relaxation yields the value and the following moment matrix:
[TABLE]
[TABLE]
Notice that the Toeplitz property holds. For instance, the term indexed by is equal to the term indexed by (their common value is ). The moment matrix satisfies and . While Condition 2 in Theorem 2 does not hold since , the rank is preserved between the current truncation order () and the previous (). This suffices to guarantee the extraction of -atomic measure, i.e with 2 atoms. However, the atoms will not necessarily lie in the feasible set, so we must check for this in the end. We have the Cholesky decomposition where
[TABLE]
[TABLE]
for which the row basis indexed by is readily identified. We then obtain the shift operator
[TABLE]
which is unitary
[TABLE]
We have where
[TABLE]
and
[TABLE]
whence the two atoms are
[TABLE]
Their respective weights are
[TABLE]
It is easy to check that the atoms found are third roots of unity. In the fourth order relaxation, all conditions of Theorem 2 hold, and we find the optimal value and the same solutions.
Example** 6.6**** (Real polynomial optimization).**
Minimize
[TABLE]
over subject to
[TABLE]
This example is taken from [50, Ex. 5]. Henrion and Lasserre’s method for extracting global minimizers is illustrated on this example in [39, Section 2.3]. The feasible set is represented in Figure 5.
The first order relaxation yields the value -3.0000 and a spectral decomposition indicates that and . Thus, the rank is not preserved. The third order relaxation yields -2.0000 and the following moment matrix:
[TABLE]
in which the rank is preserved since . Notice that the Hankel property holds. For instance the term indexed by is equal to the term indexed by (their common value is 4.9625). We have the Cholesky decomposition where
[TABLE]
for which the row basis indexed by is readily identified. We then obtain the shift operators
[TABLE]
which are symmetric matrices. For example, the image by of the column of indexed by is equal to the column of indexed by :
[TABLE]
Taking a random linear combination, we have
[TABLE]
where
[TABLE]
The first coordinate of the atoms is given by the diagonal of
[TABLE]
while the second coordinate of the atoms is given by the diagonal of
[TABLE]
The atoms can now be read
[TABLE]
With denoting the column of indexed by , and with denoting the colums of , the respective weights of the atoms are
[TABLE]
Example** 6.7**** (Prony’s method via Autonne-Takagi factorization).**
Consider the following complex exponential function where :
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Its real part is represented in Figure 6.
We reconstruct this function by evaluating it at for and . We start with and increment until the rank is preserved, i.e. . We have and . Thus we stop at the second order. The second order Hankel moment matrix contains the evaluations for . Precisely, and we have
[TABLE]
[TABLE]
The Autonne-Takagi factorization yields where
[TABLE]
[TABLE]
in which the row basis indexed by can be identified. We then obtain the complex symmetric shift operators
[TABLE]
[TABLE]
For example, the image by of the column of indexed by is equal to the column of indexed by :
[TABLE]
Taking a random linear combination, we get
[TABLE]
where
[TABLE]
[TABLE]
We confirm that the inverse of is equal to its transpose since
[TABLE]
The first coordinate of the atoms is given by the diagonal of
[TABLE]
while the second coordinate of the atoms is given by the diagonal of
[TABLE]
The atoms can now be read
[TABLE]
With denoting the column of indexed by , and with and denoting the first and second colums of , the respective weights of the atoms are
[TABLE]
Taking the complex logarithm of the atoms, we obtain
[TABLE]
As for the weights, they can be written
[TABLE]
which leads to the following function of and
[TABLE]
[TABLE]
[TABLE]
The two terms in the sum can be visualized in Figure 7.
7 Conclusion
An algorithm is proposed for finding global solutions to polynomial optimization problems and for exponential interpolation. It is founded on the notion of hyponormality and is related to the truncated moment problem. Various numerical applications are provided along with graphical illustrations.
Acknowledgements
Many thanks to Paulo Ricardo Arantes Gilz, Didier Henrion, Jean Bernard Lasserre, and Mihai Putinar for fruitful discussions.
Appendix A First Lemma
Lemma 3**.**
If are distinct points of , then are linearly independent vectors, where .
Proof.
Consider some complex numbers such that
[TABLE]
Given , define the Lagrange interpolation polynomial
[TABLE]
where is an index such that . It satisfies if and if . The degree of is equal to . Thus we may multiply the equation in (97) by to obtain
[TABLE]
Summing over all yields . ∎
Appendix B Second Lemma
Lemma 4**.**
If are linearly independent, and , then where denotes the range.
Proof.
If , then and . Conversly, an element of the span with belongs to the the range of if there exists such that
[TABLE]
which is equivalent to each of the next three lines:
[TABLE]
[TABLE]
[TABLE]
Since has rank , its transpose has rank . Thus there exists a desired . Likewise, belongs to the the range of if there exists such that
[TABLE]
Since has rank , its conjugate transpose has rank . Thus there exists a desired . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] X. Li A. and S. Ranga , Szegö Polynomials and the Truncated Trigonometric Moment Problem , The Ramanujan Journal, 12 (2006), pp. 461–472.
- 2[2] J. Agler and J. E. Mc Carthy , Pick Interpolation and Hilbert Function Spaces , American Mathematical Soc., 2002.
- 3[3] N.I. Akhiezer , The Classical Moment Problem and Some Related Questions in Analysis , Hafner Publ. Co., New York, 1965.
- 4[4] N. I. Akhiezer and M. Krein , Some Questions in the Theory of Moments , Transl. Math. Monographs 2, 58 (1962), pp. 164–168.
- 5[5] F. Andersson, Marcus Carlsson, and M. V. de Hoop , Nonlinear Approximation of Functions in Two Dimensions by Sums of Wave Packets , Appl. Comput. Harmon. Anal., 29 (2010), pp. 198––213.
- 6[6] A. Athavale , On Joint Hyponormality of Operators , Proceedings of the American Mathematical Society, 103 (1988).
- 7[7] L. Autonne , Sur les Matrices Hypohermitiennes et les Matrices Unitaires , Annales de l’Université de Lyon, Nouvelle Série, I. Sciences, Médecine. Fascicule 38, (1915).
- 8[8] T. Bäckström , Vandermonde Factorization of Toeplitz Matrices and Applications in Filtering and Warping , IEEE Trans. on Signal Processing, 61 (2013).
