An efficient global optimization algorithm for maximizing the sum of two generalized Rayleigh quotients
Xiaohui Wang, Longfei Wang, Yong Xia

TL;DR
This paper introduces a branch-and-bound algorithm for globally maximizing the sum of two generalized Rayleigh quotients, reformulating the problem into a one-dimensional optimization with SDP subproblems, and demonstrating superior efficiency over existing heuristics.
Contribution
The paper presents a novel branch-and-bound method that explicitly overestimates the objective using dual SDP subproblems for efficient global optimization.
Findings
The proposed algorithm outperforms recent SDP-based heuristics in efficiency.
Reformulation reduces the problem to a one-dimensional optimization with SDP subproblems.
Numerical results confirm the effectiveness of the method.
Abstract
Maximizing the sum of two generalized Rayleigh quotients (SRQ) can be reformulated as a one-dimensional optimization problem, where the function value evaluations are reduced to solving semi-definite programming (SDP) subproblems. In this paper, we first use the dual SDP subproblem to construct an explicit overestimation and then propose a branch-and-bound algorithm to globally solve (SRQ). Numerical results demonstrate that it is even more efficient than the recent SDP-based heuristic algorithm.
| n | “two-stage” algorithm NRX | Our new algorithm | ||||
|---|---|---|---|---|---|---|
| time(s) | iter. | time(s) | iter. | |||
| 30 | 58.84 | 233.6 | 11.93 | 50.1 | ||
| 50 | 98.19 | 320.8 | 16.80 | 58.6 | ||
| 80 | 192.09 | 400.9 | 31.59 | 68.7 | ||
| 100 | 299.23 | 459.3 | 44.08 | 71.4 | ||
| 120 | 493.83 | 536.9 | 62.52 | 71.3 | ||
| 150 | 915.29 | 609.4 | 108.95 | 75.8 | ||
| 180 | 1519.09 | 634.0 | 186.84 | 81.2 | ||
| 200 | 2118.18 | 672.2 | 262.78 | 86.6 | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Polynomial and algebraic computation · graph theory and CDMA systems
∎
11institutetext: X. Wang 22institutetext: School of Astronautics, Beihang University, Beijing, 100191, P. R. China 22email: [email protected] 33institutetext: L.F. Wang 44institutetext: Y. Xia 66institutetext: State Key Laboratory of Software Development Environment, LMIB of the Ministry of Education, School of Mathematics and System Sciences, Beihang University, Beijing 100191, P. R. China 66email: [email protected] (L.F. Wang); [email protected] (Y. Xia)
An efficient global optimization algorithm for maximizing the sum of two generalized Rayleigh quotients
††thanks: This research was supported by National Natural Science Foundation of China under grants 11471325 and 11571029.
Xiaohui Wang
Longfei Wang
Yong Xia
(Received: date / Accepted: date)
Abstract
Maximizing the sum of two generalized Rayleigh quotients (SRQ) can be reformulated as a one-dimensional optimization problem, where the function value evaluations are reduced to solving semi-definite programming (SDP) subproblems. In this paper, we first use the optimal value of the dual SDP subproblem to construct a new saw-tooth-type overestimation. Then, we propose an efficient branch-and-bound algorithm to globally solve (SRQ), which is shown to find an -approximation optimal solution of (SRQ) in at most O iterations. Numerical results demonstrate that it is even more efficient than the recent SDP-based heuristic algorithm.
Keywords:
: fractional programming, Rayleigh quotient, semidefinite programming, branch and bound.
MSC:
90C32 90C26 90C22
1 Introduction
The problem of maximizing the sum of two generalized Rayleigh quotients
[TABLE]
with positive definite matrices and , has recent applications in the multi-user MIMO system PG and the sparse Fisher discriminant analysis in pattern recognition DFB ; FM ; WZW . Without loss of generality, we can assume that is identity. Otherwise, we reformulate (1) as a problem in terms of by substituting . Moreover, since the objective function in (1) is homogeneous, (SRQ) can be further recast as the following sphere-constrained optimization problem, which is first proposed by Zhang Hong ; HZ :
[TABLE]
where denotes the -norm throughout this paper.
The single generalized Rayleigh quotient optimization problem (i.e., (SRQ) with ) is related to the classical eigenvalue problem and solved in polynomial time ZYL . However, to our best knowledge, whether the general (SRQ) (or (P)) can be efficiently solved in polynomial time remains open. Actually, as shown in [Hong , Example 1.1], there could exist a few local non-global maximizers of (P). Moreover, even finding the critical point of (P) is nontrivial, see Hong ; HZ .
Recently, (P) is reformulated as the problem of maximizing the following one-dimensional function NRX :
[TABLE]
where is related to a non-convex quadratic optimization:
[TABLE]
and the lower and upper bounds
[TABLE]
are the smallest and the largest generalized eigenvalues of the matrix pencil respectively. In order to solve the one-dimensional problem (2), a “two-stage” heuristic algorithm is proposed in NRX by first subdividing into coarse intervals such that each one contains a local maximizer of and then applying the quadratic fit line search An ; Baz ; Lu in each interval. For any given , (or ) can be evaluated by solving an equivalent semi-definite programming (SDP) formulation, according to an extended version of S-Lemma in [Poly , Proposition 4.1, see also [D , Theorem 5.17]]. Finally, for the returned optimal solution , the optimal vector solution of (P) is recovered by a rank-one decomposition procedure [NRX , Theorem 3]. Though this “two-stage” algorithm could find the global solutions of the tested examples, it is still a heuristic algorithm since the function is not guaranteed to be quasi-concave. Besides, there is no meaningful stopping criterion for the “two-stage” algorithm. That is, we cannot estimate the gap between the obtained solution and the global maximizer of (P1).
In this paper, we propose an easy-to-evaluate function for upper bounding . It provides saw-tooth-curve upper bounds of over , which are used to establish an efficient branch-and-bound algorithm. We further show that the new algorithm returns an -approximation optimal solution of (P1) in at most iterations. Numerical results show that the new algorithm is even much more efficient than the “two-stage” heuristic algorithm NRX .
The remainder of this paper is organized as follows. In Section 2, we give some preliminaries on the evaluation of . In Section 3, we propose an easy-to-compute upper bounding function, which provides saw-tooth-curve upper bounds of . In Section 4, we establish a new branch-and-bound algorithm and estimate the worst-case computational complexity. In Section 5, we do numerical comparison experiments, which demonstrate the efficiency of our new algorithm. Conclusions are made in Section 6.
Throughout the paper, denotes the optimal objective value of the problem . We use to stand for a positive (negative) semi-definite matrix . The positive definite matrix is denoted by . Let and be the maximal and minimal eigenvalue of , respectively. The inner product of two matrices and is denoted by trace. For a real number , returns the largest integer less than or equal to .
2 Preliminaries
In the section, we first show how to evaluate . Then, we present the “two-stage” algorithm NRX to maximize (2). Finally, we discuss how to get the optimal vector solution of (P) from the maximizer of .
Lifting to (since ) yields the primal SDP relaxation of the optimization problem of evaluating for any given :
[TABLE]
The conic dual problem of is
[TABLE]
which coincides with the Lagrangian dual problem of .
It is trivial to see that has an interior feasible solution, i.e., the Slater’s condition holds. We can verify that, for any satisfying
[TABLE]
the Slater’s condition holds for , i.e., there is an such that and . Therefore, under the assumption (5), strong duality holds for , that is, and both optimal values are attained.
Under the assumption (5), by further applying the extended version of S-Lemma in [Poly , Proposition 4.1, see also [D , Theorem 5.17]], we can show that the strong duality holds for the optimization problem of evaluating , i.e., . For more details, we refer to NRX .
Next, we present the “two-stage” algorithm proposed in NRX for solving (2). Firstly, it partitions into a rather coarse mesh and then collects all subintervals containing an interior local maximizer. In the second stage, the quadratic fit method Baz ; An ; Lu is applied to find a corresponding local maximizer in each subinterval that has been collected in the first stage. Finally, the optimal solution is selected from all these obtained local maximizers. In this paper, we will not present the detailed quadratic fit line search subroutine, which can be found in NRX . One of the reason is that the algorithm in the first stage is already quite time-consuming.
The “two-stage” scheme proposed in NRX
-
- Step 1.
Given Let and for . If is not an integer, set for .
- Step 2.
For collect all the three-point pattern such that .
- Step 3.
Call the quadratic fit line search subroutine (with a smaller tolerance than ) to find a corresponding local maximizer in each three-point pattern .
- Step 4.
Select the best maximizer among , , and all the local maximizers found in Step 3.
Suppose (2) is solved, let be the returned maximizer. If , the feasible region of (3) is reduced to
[TABLE]
which contains only the unit eigenvector corresponding to the maximal eigenvalue. In this case, is actually a maximum eigenvalue problem. On the other hand, suppose , the optimal vector solution of (P) is recovered from the equivalent based on the rank one constraint, by using a rank-one procedure similar to that in SZ ; Y , see details in NRX .
There is an alternative approach to recover the optimal solution of (P). Let be the optimal solution of the dual problem . It is not difficult to verify that
[TABLE]
Consequently, the optimal vector solution of (P) is the unit eigenvector corresponding to the maximum eigenvalue of .
3 Saw-tooth upper bounds
In this section, we propose an easy-to-evaluate upper bounding function, which provides saw-tooth upper bounds for over .
Let be a partition of , where and .
Consider the interval with (so that ). Solve with and denote the optimal solutions by and , respectively. Then, we have , , and
[TABLE]
For any , it follows from the strong duality that
[TABLE]
Similarly, we have
[TABLE]
Now, we obtain an upper bounding function of over :
[TABLE]
which is a concave function as and are both linear functions. It provides the following upper bound of over :
[TABLE]
Problem (10) is a convex program. Moreover, it has a closed-form solution.
Theorem 1
Under the assumption , an upper bound of over is given by
[TABLE]
where
[TABLE]
Proof
The trivial proof is omitted as both and are linear functions and is the unique solution of the equation .
Finally, we also have a simple estimation of the upper bound .
Theorem 2
For any , we have
[TABLE]
Proof
The inequality (15) follows from the definition (7) and the facts that and (as ).
Remark 1
The estimation (15) is independent of . Therefore, it can be satisfied for the extended case .
4 A saw-tooth branch-and-bound algorithm
In this section, we first propose a branch-and-bound algorithm based on the new saw-tooth-curve upper bounds and then establish the worst-case computational complexity of the new algorithm.
Our algorithm works on a list
[TABLE]
The initial list is . In each iteration, we first select the interval from the -list that provides the maximal upper bound (10). Then, we insert the mid-point into the -list (16) and increase by one. The process is repeated until the stopping criterion is reached. The detailed algorithm is presented as follows.
The saw-tooth branch-and-bound algorithm
-
- Step 0.
Given the approximation error . Compute , (4), and . Initialize the iteration number .
Let . Solve () to obtain the optimal solution . Then, and let , .
Let . If , stop and return as an approximate maximizer. Otherwise, solve () to obtain the optimal solution . Then, . If , update and . Set and .
- Step 1.
Let . Solve () and obtain the optimal solution . Then, . If , update and .
- Step 2.
According to Theorem 1, compute the upper bounds:
[TABLE]
Update and .
- Step 3
Find . If , stop and return as an approximate maximizer. Otherwise, update and go to Step 1.
Theoretically, we can show that our new algorithm returns an -approximation optimal solution of (P1) in at most iterations. Here, we call an -approximation optimal solution of (P1) if it is feasible and satisfies
[TABLE]
Theorem 3
The above algorithm terminates in at most steps and returns an -approximation optimal solution of (P1).
Proof
If the algorithm terminates at Step 0, that is,
[TABLE]
then for any , it follows from the inequality (15) in Theorem 2 that
[TABLE]
Therefore, we have
[TABLE]
It follows that is an -approximation optimal solution of (P1).
Now, we suppose that the algorithm does not terminate at Step 0. Consider in the -th iteration of the algorithm. If , then the interval will be not selected to partition. In the following, we assume . According to the inequality (15) in Theorem 2, for any , we have
[TABLE]
Since and , according to the stopping criterion, the algorithm terminates when
[TABLE]
Therefore, there are at most elements in . Since the number of elements of increases by one in each iteration, the algorithm stops in steps.
Let be the approximation solution returned by the algorithm. We have
[TABLE]
To show that is an -approximation optimal solution of (P1), it is sufficient to prove that
[TABLE]
Let . According to the inequality (15) in Theorem 2, for any , we obtain
[TABLE]
Therefore, we have
[TABLE]
where the equality (19) follows from (17). Then, we obtain (18). The proof is complete.
5 Computational Experiments
We test the new branch-and-bound algorithm for solving (P1) on the same numerical examples as in NRX . The SDP subproblems are solved by SDPT3 within CVX Boyd . Since there is no unified stopping criterion in the “two-stage” heuristic algorithm NRX , we just report the number of function evaluations (i.e., solving the SDP subproblems) in the first stage, with the setting used in NRX . For our algorithm, we set .
The first example is taken from [Hong , Example 3.2]. It has many local non-global maximizers.
Example 1
Let
In this case, . The “two-stage” algorithm NRX gives an approximation solution The number of function evaluations in the first stage is . Our algorithm returns an -approximation optimal solution, , in iterations.
The second example in NRX is taking from [Hong , Example 3.1], where the optimal solution of is achieved at the right-hand side end-point .
Example 2
**
In this case, . The number of function evaluations in the first stage of the “two-stage” algorithm NRX is . While our algorithm finds in iterations.
Example 3** (NRX , Example 3)**
Let
[TABLE]
In this case, The “two-stage” algorithm NRX gives an approximation solution The number of function evaluations in the first stage is . Our algorithm returns an -approximation optimal solution, , in iterations.
Example 4** (NRX , Example 4)**
Let
The searching interval is The optimal solution is the left-hand side end-point . The number of function evaluations in the first stage is . Our algorithm returns an -approximation optimal solution, , in iterations.
Example 5** (NRX , Example 5)**
*Let
*
The searching interval of this example is The “two-stage” algorithm NRX gives an approximation solution The number of function evaluations in the first stage is . Our algorithm returns an -approximation optimal solution, , in iterations.
In addition to Examples 2-5 reported above, our algorithm highly outperforms the “two-stage” algorithm NRX . For Example 1, our algorithm is also competitive. Notice that our algorithm is an exact algorithm and the “two-stage” algorithm NRX is heuristic.
Finally, we test more examples where the data are chosen randomly as follows. Each component of the symmetric matrices and is uniformly distributed in . We generate , where is a randomly generated lower bi-diagonal matrix with each nonzero element being uniformly distributed in and is a constant number to guarantee the positive definiteness of and . For each dimension varying from to , we independently run the “two-stage” algorithm NRX and our new algorithm ten times and report in Table 1 the average numerical results including the time in seconds and the number of iterations. It follows from the limited numerical results that our new global optimization algorithm highly outperforms the “two-stage” heuristic algorithm.
6 Conclusions
The recent SDP-based heuristic algorithm for maximizing the sum of two generalized Rayleigh quotients (SRQ) is based on the one-dimensional parametric reformulation where each functional evaluation corresponds to solving a semi-definite programming (SDP) subproblem. In this paper, we propose an efficient branch-and-bound algorithm to globally solve (SRQ) based on the new-developed saw-tooth overestimating approach. It is shown to find an -approximation optimal solution of (SRQ) in at most O iterations. Numerical results demonstrate that it is much more efficient than the recent SDP-based heuristic algorithm.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Antoniou, A., Lu, W. S.: Practical optimization: algorithms and engineering applications, Springer Science+ Business Media, LLC ( 2007 ) 2007 (2007)
- 2(2) Bazaraa, M.S., Sherali, H.D., Shetty, C. M.: Nonliear programming: theory and algorithms, Third Edition. John Wiley and Sons, Inc., Hoboken, New Jersey ( 2006 ) 2006 (2006)
- 3(3) Dundar, M.M., Fung, G., Bi, J., Sandilya, S., Rao, B.: Sparse fisher discriminant analysis for computer aided detection. Proceedings of SIAM International Conference on Data Mining ( 2005 ) 2005 (2005)
- 4(4) Fung, E., Michael, K. Ng.: On sparse fisher discriminant method for microarray data analysis. Bioinformation 2 , 230 − 234 ( 2007 ) 2 230 234 2007 2,230-234(2007)
- 5(5) Grant, M., Boyd, S.: CVX: MATLAB software for disciplined convex programming, Version 2.1, http://cvxr.com/cvx, ( 2015 ) 2015 (2015) .
- 6(6) Luenberger, D. G., Ye, Y.: Linear and nonlinear programming, Third Edition. Springer Science+Business Media, LLC. ( 2008 ) 2008 (2008)
- 7(7) Nguyen, V.B., Sheu, R.L., Xia Y.: Maximizing the sum of a generalized Rayleigh quotient and another Rayleigh quotient on the unit sphere via semidefinite programming, J. Glob. Optim., 64 ( 2 ) , 399 − 416 ( 2016 ) 64 2 399 416 2016 64(2),399-416(2016)
- 8(8) Pólik, I., Terlaky, T.: A servey of S-lemma. SIAM review. 49 ( 3 ) , 371 − 418 ( 2007 ) 49 3 371 418 2007 49(3),371-418(2007)
