Positive semidefinite interval of matrix pencil and its applications for the generalized trust region subproblems
Van-Bong Nguyen, Thi Ngan Nguyen

TL;DR
This paper characterizes the positive semidefinite interval of matrix pencils and applies these findings to simplify solving the hard case of generalized trust region subproblems.
Contribution
It provides a detailed analysis of the positive semidefinite interval of matrix pencils, especially when matrices are not simultaneously diagonalizable, and applies this to improve GTRS solutions.
Findings
The interval $I_{\succeq}(A,B)$ can be empty, a singleton, or an interval.
When both matrices are singular, the interval can be decomposed into block diagonals.
The approach simplifies solving the hard case of GTRS by reducing it to linear systems or smaller GTRS problems.
Abstract
We are concerned with finding the set of real values such that the matrix pencil is positive semidefinite. If are not simultaneously diagonalizable via congruence (SDC), either is empty or has only one value When are SDC, if not empty, can be a singleton or an interval. Especially, if is an interval and at least one of the matrices is nonsingular then its interior is the positive definite interval If are both singular, then even is an interval, its interior may not be but are then decomposed to block diagonals of submatrices with nonsingular such that Applying the hard-case of the generalized trust-region subproblem (GTRS) can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Matrix Theory and Algorithms · graph theory and CDMA systems
Positive semidefinite interval of matrix pencil and its applications for the generalized trust region subproblems
Thi Ngan Nguyen
Van-Bong Nguyen
Department of Mathematics, Tay Nguyen University, 632090, Vietnam
Abstract
We are concerned with finding the set of real values such that the matrix pencil is positive semidefinite. If are not simultaneously diagonalizable via congruence (SDC), either is empty or has only one value When are SDC, if not empty, can be a singleton or an interval. Especially, if is an interval and at least one of the matrices is nonsingular then its interior is the positive definite interval If are both singular, then even is an interval, its interior may not be but are then decomposed to block diagonals of submatrices with nonsingular such that Applying the hard-case of the generalized trust-region subproblem (GTRS) can be dealt with by only solving a system of linear equations or reduced to the easy-case of a GTRS of smaller size.
keywords:
Simultaneously diagonalizable via congruence , Trust region subproblem , Generalized trust region subproblem, Positive semidefinite interval , Matrix pencil
MSC:
[2010] 15A18, 15A21, 15A22 , 15A23, 15A27 , 90C20, 90C26
1 Introduction
Let be the space of real symmetric matrices and In this paper we compute the set
[TABLE]
and then apply it for solving the following optimization problem
[TABLE]
where are dimension vectors, This problem is referred as the generalized trust region subproblem (GTRS) since it contains the trust region subproblem (TRS), when is the identity matrix, and as a special case.
Our study is inspired by the following results obtained by Moré [10]: i) a vector is a global optimizer of the GTRS (1) if and only if and there exists such that
[TABLE]
and ii) if the set defined by is nonempty then it is an open interval and the function is strictly decreasing on unless uniquely solved from (2) is constant on These two results suggest that can be found efficiently whenever is computed. Unfortunately, in literature, results on are very limited. The earliest study on computing was, perhaps, by Caron and Gould [3], where the authors suppose that is positive semidefinite and is of rank one or two. Song [11] computed under the assumption that where denotes the range of a matrix. Moré [10], Adachi et al. [1] computed under the assumption that Out of above conditions, computing is still an open question so far. However, if is then the closure of and computing is much easier since the matrices are then simultaneously diagonalizable via congruence (SDC), i.e., there exists a nonsingular matrix such that and are all diagonal, please see [9]. Moré [10] proposed an algorithm for finding under the assumption that is nonempty, while the case is referred as the hard-case. The assumption is so strictive that it restricts the GTRS (1) into a very special class which always attains a unique optimal solution. Moreover, if then are SDC and the GTRS (1) is equivalently reformulated as a convex second-order cone programming (SOCP) [2]. This result is then extended that any bounded GTRS can be reformulated as an SOCP even the SDC condition fails [5]. The assumption also allows to solve GTRS (1) by solving only one eigenpair of a generalized eigenvalue problem of dimension [1]. A more recent result by Jiang and Li can solve GTRS (1) even in linear time but under a stricter condition that there exist and such that here is the smallest eigenvalue of [6]. Hsia et al. [8] solve the GTRS (1) when are SDC, while the case when are not SDC is still referred as the hard-case.
In this paper, we show that not only when but also even the interval can be computed by only solving a generalized eigenvalue problem of dimension. Specifically, we show that if are not SDC then either is empty or has only one point: Such a value can be found efficiently and then checked whether If are SDC and is nonsingular, then the matrix pencil can always be decomposed into the form
[TABLE]
where are symmetric matrices, are distinct eigenvalues of the matrix The set is then computed quickly since is equivalent to for all If is singular and is nonsingular, we can decompose to the taking the forms where are symmetric of the same size, is nonsingular such that if then otherwise Especially, if has more than one point, then and please see Corollary 1 below. If are SDC and both are singular, then there always exists a nonsingular matrix such that are decomposed to either the following form
[TABLE]
or
[TABLE]
where is diagonal and, in both cases, are diagonal and of the same size, is nonsingular. If are decomposed to (5) and has even one diagonal negative element then Otherwise, in both cases, with nonsingular. To apply for solving GTRS (1), we consider the set of candidates of Lagrange multipliers
If (1) has no optimal solution since it is unbounded from below; 2. 2.
If is singleton: we need only to solve linear equations to check whether 3. 3.
If is an interval and then there is a unique optimal Lagrange multiplier If is not an endpoint of a bisection algorithm can find in the interior of since the function is strictly decreasing on 4. 4.
If is an interval and are converted to the form either (4) or (5) and In this case, the GTRS (1) is either unbounded below or reduced to a GTRS of variables with matrices such that
2 Computing the positive semidefinite interval
In this section, we show computing in two separate cases: are SDC and are not SDC. For the former case, we need first the following result.
Lemma 1** ([7]).**
Let and be nonsingular. Then are SDC if and only if there is a nonsingular real matrix such that is a real diagonal matrix.
Now, if are SDC and is nonsingular, by Lemma 1, there is a nonsingular matrix such that
[TABLE]
is a diagonal matrix, where are the distinct eigenvalues of is the identity matrix of size and We can suppose without loss of generality that Using together with the following result we show how to simultaneously decompose and into block diagonals.
Lemma 2** ([13]).**
Let be a Jordan matrix of form
[TABLE]
where are Jordan blocks associated with eigenvalue and
[TABLE]
For a symmetric matrix if is symmetric, then is block diagonal and with
[TABLE]
Observe that and is symmetric. Lemma 2 indicates that is a block diagonal matrix with the same partition as That is
[TABLE]
where is real symmetric matrices of size for every We now have
[TABLE]
Both (6) and (7) show that are now decomposed into the same block structure and the matrix pencil now becomes
[TABLE]
The requirement is then equivalent to
[TABLE]
Using (9) we compute as follows.
Theorem 1**.**
Suppose are SDC and is nonsingular.
If then 2. 2.
If then 3. 3.
If is indefinite then
- (i)
if and for some then
- (ii)
if is indefinite and then
- (iii)
in other cases, that is either are indefinite for some or for some or is indefinite and for some then
Proof.
If then The inequality (9) is then equivalent to Since we need only This shows 2. 2.
Similarly, if then The inequality (9) is then equivalent to Then 3. 3.
The case is indefinite:
- (i)
if and for some the inequality (9) then implies
[TABLE]
Since we have
- (ii)
if is indefinite and for some The inequality (9) then implies
[TABLE]
Since we have
- (iii)
if are indefinite, (9) implies and This cannot happen since If and for some then
[TABLE]
implying This also cannot happen since Finally, if is indefinite and for some Again, by (9),
[TABLE]
implying This also cannot happen. So in these all three cases.
∎
The proof of Theorem 1 indicates that if are SDC, is nonsingular and is an interval then is nonempty. In that case we have please see [10]. If is singular and is nonsingular, we have the following result.
Theorem 2**.**
Suppose are SDC, is singular and is nonsingular. Then
- (i)
there always exists a nonsingular matrix such that
[TABLE]
[TABLE]
where are symmetric of the same size, is nonsingular; 2. (ii)
if then Otherwise,
Proof.
(i) Since is symmetric and singular, there is an orthogonal matrix that puts into the form
[TABLE]
such that is a nonsingular symmetric matrix of size where Let Since are SDC, are SDC too (the converse also holds true). We can write in the following form
[TABLE]
such that is a symmetric matrix of size is a matrix, is symmetric of size and, importantly, Indeed, if then Then we can choose a nonsingular matrix written in the same partition as such that both are diagonal and is of the form
[TABLE]
where is nonsingular. This implies On the other hand,
[TABLE]
is diagonal implying that and so
[TABLE]
This cannot happen since is nonsingular.
Let be an orthogonal matrix such that where is a nonsingular diagonal matrix of size and and set We then have
[TABLE]
where and are of size and respectively. Let
[TABLE]
We can verify that
[TABLE]
and, by (11),
[TABLE]
We denote and rewrite the matrices as follows
[TABLE]
We now consider whether it can happen that We note that are SDC. We can choose a nonsingular congruence matrix written in the form
[TABLE]
such that not only the matrices are diagonal but also is remained a nonsingular submatrix at the northwest corner. That is
[TABLE]
is diagonal and is nonsingular diagonal of size This implies that Then
[TABLE]
is diagonal implying that
[TABLE]
are diagonal. Note that is nonsingular, must be nonsingular. But then with nonsingular is a contradiction. It therefore holds that Then
[TABLE]
with as desired.
(ii) We note first that is nonsingular so is If then if and only if So it holds in that case Otherwise, is either indefinite or negative definite then ∎
The proofs of Theorems 1 and 2 reveal the following important result.
Corollary 1**.**
Suppose are SDC and either or is nonsingular. Then is nonempty if and only if has more than one point.
If are both singular, they can be decomposed to either of the following form.
Lemma 3** ([12]).**
For any there always exists a nonsingular matrix that puts to
[TABLE]
such that is nonsingular diagonal of size and puts to of either form
[TABLE]
or
[TABLE]
where is symmetric of dimension is a matrix, and is a nonsingular diagonal matrix of dimension
It is easy to verify that are SDC if and only if are SDC.
Lemma 4** ([12]).**
- i)
If takes the form (12) then are SDC if and only if are SDC and 2. ii)
If take the form (13) then are SDC if and only if are SDC and or does not exist, i.e.,
Now suppose that are SDC, Lemmas 3 and 4 allow to assume without loss of generality that are already SDC. That is
[TABLE]
or
[TABLE]
where are of the same size and diagonal, is nonsingular and if takes the form (12) or (13) and then or if takes the form (13) and does not exist then Now we can compute as follows.
Theorem 3**.**
- (i)
If take the from then
- (ii)
If take the from then if and otherwise.
We note that is nonsingular, is therefore computed by Theorem 1. Especially, if has more than one point, then see Corollary 1.
In the rest of this section we consider when are not SDC. We need first to show that if are not SDC, then either is empty or has only one point. The proof of Lemma 5 is easy, we omit it.
Lemma 5**.**
If are positive semidefinite then are SDC.
Lemma 6**.**
If are not SDC then either is empty or has only one element.
Proof.
Suppose in contrary that has more than one elements, then we can choose such that and By Lemma 5, are SDC, i.e., there is a nonsingular matrix such that are diagonal. Then is diagonal because and Since is also diagonal. That is are SDC and we get a contradiction. ∎
To know when is empty or has one element, we need the following result.
Lemma 7** ([13]).**
Let B be nonsingular. Let have the real Jordan normal form , where are Jordan blocks corresponding to real eigenvalues of and are Jordan blocks for pairs of complex conjugate roots of . Then there exists a nonsingular matrix such that
[TABLE]
[TABLE]
where
Theorem 4**.**
Let be as in Lemma 7 and are not SDC. The followings hold.
- (i)
if then 2. (ii)
if and there is a real eigenvalue of such that then
[TABLE] 3. (iii)
if (i) and (ii) do not occur then
Proof.
It is sufficient to prove only (iii). Lemma 7 allows us to decompose and to the forms (17) and (16), respectively. Since are not SDC, at least one of the following cases must occur.
Case 1 *There is a Jordan block such that and * We then consider the following principal minor of
[TABLE]
If then Since so If then always contains the following not positive semidefinite principal minor of size
[TABLE]
So
Case 2 There is a Jordan block such that and We then consider
[TABLE]
This matrix always contains either a principal minor of size or a principal minor of size
[TABLE]
Both are not positive semidefinite for any ∎
Similarly, we have the following result. We omit its proof.
Theorem 5**.**
Let be not SDC. Suppose is nonsingular and has real Jordan normal form , where are Jordan blocks corresponding to real eigenvalues of and are Jordan blocks for pairs of complex conjugate roots of .
- (i)
If then 2. (ii)
If and there is a real eigenvalue of such that then 3. (iii)
If cases and do not occur then
Finally, if and are not SDC and both singular. Lemma 3 indicates that and can be simultaneously decomposed to and in either (12) or (13). If and take the forms (12) and then where are not SDC and is nonsingular. In this case we apply Theorem 4 to compute If and take the forms (13) and In this case, if is not positive definite then Otherwise, where are not SDC and is nonsingular, again we can apply Theorem 4. Therefore we need only to consider the case with noting that
Theorem 6**.**
Given are not SDC and singular such that and take the forms in either (12) or (13) with Suppose that Then, if and then
Proof.
We consider and in (13), the form in (12) is considered similarly. Suppose in contrary that and Since has more than one point, by Lemma 6, are SDC. Let be a nonsingular matrix such that are diagonal, then is a diagonal matrix. Moreover, is nonsingular, we have please see Corollary 1. Then for because Let we then have
[TABLE]
We note that is singleton implying and so On the other hand, since is nonsingular diagonal and the first columns of the matrix are linearly independent. One of the following cases must occur: i) the columns of the right side submatrix are linearly independent and at least one column, suppose is a linear combination of the columns of the matrix
[TABLE]
where is the th column of the matrix or ii) the columns of the right side submatrix are linearly dependent. If the case i) occurs then there are scalars which are not all zero such that
[TABLE]
Equation (18) implies that
[TABLE]
which further implies
[TABLE]
This cannot happen with and This contradiction shows that If the case ii) happens then there always exists a nonsingular matrix such that
[TABLE]
where is a full column-rank matrix. Let
[TABLE]
we have and so This implies and the right side submatrix is full column-rank. We return to the case i). ∎
3 Application for the GTRS
In this section we find an optimal Lagrangian multiplier
[TABLE]
together with an optimal solution for the GTRS (1). We need first to recall the following optimality conditions for the GTRS (1).
Lemma 8** ([10]).**
A vector is an optimal solution to GTRS (1) if and only if there exists such that
[TABLE]
In fact, the conditions (19) and (22) are necessary and sufficient for the GTRS (1) to be bounded below [14]. However, a bounded GTRS may have no optimal solution, see, for example [1]. The conditions (20)-(21) are thus added to guarantee the existence of an optimal solution to GTRS (1). To check whether a satisfies conditions (19)-(21) we need to apply the following result.
Lemma 9** ([10]).**
Suppose is nonempty. If is the solution of (19), and if the function is defined on by
[TABLE]
then is strictly decreasing on unless is constant on
We note that can be a nonempty interval while is empty. Fortunately, our Corollary 1 shows that this case happens only when and are both singular. Then we can decompose to and apply Theorem 3 if are SDC, Theorem 4 or Theorem 5 or Theorem 6 if are not SDC.
By Lemma 8, if the GTRS (1) has no optimal solution, it is even unbounded from below [14]. If but has only one element: Then we need only to solve the linear system (19)-(21) to check whether If is an interval, suppose where and may be then are SDC. We assume that The equation (19) is then of the following simple form
[TABLE]
Solving (23) for a fixed is very simple, the main duty is thus finding in the following cases.
If at least one of the matrices and is nonsingular then and The GTRS (1) then attains a unique optimal solution at an optimal Lagrange multiplier [10]. We first check whether If not, we apply Lemma 9 for finding such that Observe from (23) that is constant on only when for all This case can be dealt with easily since is then constant. Otherwise, is strictly decreasing on and we have the following results.
Lemma 10** ([4],[1]).**
Suppose the Slater condition holds for the GTRS (1), i.e., there exists such that and
- (a)
If on and then 2. (b)
If on then 3. (c)
If changes its sign on then 4. (d)
If on then cannot be
The case (d) indicates that if and then 2. 2.
If both and are singular, by Lemma 4, are decomposed to the form either (14) or (15), and are now in the form
[TABLE]
or
[TABLE]
where are nonzero. We have
[TABLE]
where please see Theorem 3 and Corollary 1. We note also that if for then and we can apply Lemma 10. Otherwise, and the GTRS (1) may have no optimal solution. We deal with this case as follows.
If take the form (24), the equations (23) become
[TABLE]
If for then (1) is reduced to a GTRS of variables with matrices such that We then apply Lemma 10 for it. Otherwise, either (26) has no solution for all or it has solutions at only one Then we check easily whether
If take the form (25), the equations (23) become
[TABLE]
By the same arguments as above, either (1) is then reduced to a GTRS of variables with matrices
[TABLE]
such that We then apply Lemma 10 for it. Otherwise, either (27) has no solution for all or it has solutions at only one
4 Conclusion and remarks
In this paper, we showed that for a given pair of real symmetric matrices the set of real values such that the matrix pencil is positive semidefinite is always computable by solving the generalized eigenvalue problem of dimension. The computation is considered in two separated cases: If are not simultaneously diagonalizable via congruence (SDC), is either empty or singleton, while if are SDC, can be empty, singleton or an interval. In case is an interval, if either or is nonsingular, is the closure of the positive definite interval Otherwise, are decomposable to block diagonals of submatrices with nonsingular such that is now the closure of With in hand, we are able to solve the generalized trust region subproblem (1) not only in the easy-case when is nonempty but also in the hard-case by only solving the linear equations. Our result needs only to solve the generalized eigenvalue problem of dimension compared with an generalized eigenvalue problem in [1]. On the other hand, we can completely deal with the hard-case of the GTRS (1), which was an open problem in [10] and [8].
References
- [1] S. Adachi, Y. Nakatsukasa, Eigenvalue-based algorithm and analysis for nonconvex QCQP with one constraint, Math. Program., Ser. A 173 (2019) 79-116.
- [2] A. Ben-Tal, D. Hertog, Hidden conic quadratic representation of some nonconvex quadratic optimization problems, Math. Program., 143 (2014) 1-29.
- [3] R.J. Caron, N.I.M. Gould, Finding a positive semidefinite interval for a parametric matrix, Linear Algebra Appl., Vol. 76 (1986) 19-29.
- [4] J. M. Feng, G. X. Lin, R. L. Sheu, Y. Xia, Duality and solutions for quadratic programming over single non-homogeneous quadratic constraint, J. Glob. Optim. 54(2) (2012) 275-293.
- [5] R. Jiang, D. Li, B. Wu, SOCP reformulation for the generalized trust region subproblem via a canonical form of two symmetric matrices, Math. Program., Ser. A, 169 (2018) 531-563.
- [6] R. Jiang, D. Li, A Linear-Time Algorithm for Generalized Trust Region Subproblems, SIAM J. Optim. Vol. 30, No. 1 (2020) 915-932.
- [7] W. Greub, Linear Algebra, 1st ed., Springer-Verlag (1958); Heidelberger Taschenbücker, Bd. 179 (1976).
- [8] Y. Hsia, G. X. Lin, R. L. Sheu, A revisit to quadratic programming with one inequality quadratic constraint via matrix pencil, Pac J Optim 10(3) (2014) 461-481.
- [9] R. A. Horn, C. R. Johnson, Matrix Analysis, Cambridge University Press, Cambridge, UK (1990).
- [10] J. J. Moré, Generalization of the trust region problem, Optim. Methods Softw., 2 (1993) 189-209.
- [11] H. Song, Positive semidefifinite intervals for matrix pencils, University of Windsor (2004).
- [12] T. N. Nguyen, V. B. Nguyen, T. H. Le, R. L. Sheu, On Simultaneous Diagonalization via Congruence of Real Symmetric Matrices and Applications, https://arxiv.org/abs/2004.06360v5
- [13] F. Uhlig, A canonical form for a pair of real symmetric matrices that generate a nonsingular pencil, Linear Algebra Appl., 14 (1976) 189-209.
- [14] Y. Xia, S. Wang, R. L. Sheu, S-Lemma with Equality and Its Applications, Math. Program., Ser. A, Vol. 156, Issue 1-2 (2016) 513-547.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Adachi, Y. Nakatsukasa, Eigenvalue-based algorithm and analysis for nonconvex QCQP with one constraint, Math. Program., Ser. A 173 (2019) 79-116.
- 2[2] A. Ben-Tal, D. Hertog, Hidden conic quadratic representation of some nonconvex quadratic optimization problems, Math. Program., 143 (2014) 1-29.
- 3[3] R.J. Caron, N.I.M. Gould, Finding a positive semidefinite interval for a parametric matrix, Linear Algebra Appl., Vol. 76 (1986) 19-29.
- 4[4] J. M. Feng, G. X. Lin, R. L. Sheu, Y. Xia, Duality and solutions for quadratic programming over single non-homogeneous quadratic constraint, J. Glob. Optim. 54(2) (2012) 275-293.
- 5[5] R. Jiang, D. Li, B. Wu, SOCP reformulation for the generalized trust region subproblem via a canonical form of two symmetric matrices, Math. Program., Ser. A, 169 (2018) 531-563.
- 6[6] R. Jiang, D. Li, A Linear-Time Algorithm for Generalized Trust Region Subproblems, SIAM J. Optim. Vol. 30, No. 1 (2020) 915-932.
- 7[7] W. Greub, Linear Algebra, 1st ed., Springer-Verlag (1958); Heidelberger Taschenbücker, Bd. 179 (1976).
- 8[8] Y. Hsia, G. X. Lin, R. L. Sheu, A revisit to quadratic programming with one inequality quadratic constraint via matrix pencil, Pac J Optim 10(3) (2014) 461-481.
