Polyhedral approximations of the semidefinite cone and their application
Yuzhu Wang, Akihiro Tanaka, Akiko Yoshise

TL;DR
This paper introduces a new sparse polyhedral approximation of the semidefinite cone using expanded SD bases, improving efficiency in solving semidefinite relaxations of combinatorial problems.
Contribution
It proposes an expanded SD basis for polyhedral approximation that maintains sparsity and enhances computational efficiency in semidefinite programming.
Findings
Approximation contains diagonally dominant matrices
Approximation is contained in scaled diagonally dominant matrices
Methods outperform existing approaches in efficiency
Abstract
We develop techniques to construct a series of sparse polyhedral approximations of the semidefinite cone. Motivated by the semidefinite (SD) bases proposed by Tanaka and Yoshise (2018), we propose a simple expansion of SD bases so as to keep the sparsity of the matrices composing it. We prove that the polyhedral approximation using our expanded SD bases contains the set of all diagonally dominant matrices and is contained in the set of all scaled diagonally dominant matrices. We also prove that the set of all scaled diagonally dominant matrices can be expressed using an infinite number of expanded SD bases. We use our approximations as the initial approximation in cutting plane methods for solving a semidefinite relaxation of the maximum stable set problem. It is found that the proposed methods with expanded SD bases are significantly more efficient than methods using other existing…
| Method | Number of cuts added at each iteration | Solver | ||
| LP cut | SOCP cut | |||
| CPDD | 2 | 0 | Gurobi | |
| CPSDB | 2 | 0 | Gurobi | |
| CPSDD | 2 | 0 | Mosek | |
| SDSOS | 2 | 1 | Mosek | |
| n | p | CPSDD0/SDSOS0 | CPSDD | SDSOS | SDP | ||||
|---|---|---|---|---|---|---|---|---|---|
| Value | Time (s) | (5 min) | (10 min) | (5 min) | (10 min) | Value | Time (s) | ||
| 150 | 0.3 | 105.70 | 0.95 | 38.91 | 37.02 | 40.97 | 37.38 | 20.44 | 105.46 |
| 150 | 0.8 | 31.78 | 1.00 | 10.07 | 9.66 | 9.70 | 9.31 | 6.00 | 110.63 |
| 200 | 0.3 | 140.47 | 3.14 | 70.48 | 55.52 | 75.46 | 61.31 | 23.73 | 549.63 |
| 200 | 0.8 | 40.92 | 3.14 | 12.10 | 11.29 | 12.17 | 11.38 | 6.45 | 497.55 |
| 250 | 0.3 | 176.25 | 6.60 | 115.41 | 93.81 | 119.67 | 99.99 | 26.78 | 1562.52 |
| 250 | 0.8 | 51.87 | 6.79 | 17.36 | 15.30 | 17.43 | 15.39 | 7.18 | 1553.63 |
| 300 | 0.3 | 210.32 | 13.05 | 160.42 | 138.60 | 162.77 | 143.12 | (29.13) | (32300.60) |
| 300 | 0.8 | 60.97 | 13.31 | 21.71 | 17.77 | 22.66 | 18.50 | (7.65) | (20586.02) |
| n | p | CPDD0 | CPDD | CPSDB0 | CPSDB | ||||
|---|---|---|---|---|---|---|---|---|---|
| Value | Time (s) | (5 min) | (10 min) | Value | Time (s) | (5 min) | (10 min) | ||
| 150 | 0.3 | 117 | 0.06 | 76.76 | 67.51 | 107.29 | 0.24 | 36.80 | 35.12 |
| 150 | 0.8 | 46 | 0.05 | 13.70 | 12.71 | 32.76 | 0.28 | 9.51 | 9.06 |
| 200 | 0.3 | 157 | 0.1 | 113.28 | 104.07 | 142.25 | 0.52 | 55.07 | 48.18 |
| 200 | 0.8 | 54 | 0.11 | 17.39 | 16.07 | 42.14 | 0.57 | 11.58 | 11.00 |
| 250 | 0.3 | 194 | 0.17 | 154.75 | 146.20 | 178.30 | 0.84 | 91.88 | 73.24 |
| 250 | 0.8 | 68 | 0.17 | 28.02 | 22.26 | 53.22 | 1.00 | 14.76 | 13.57 |
| 300 | 0.3 | 230 | 0.26 | 183.89 | 174.02 | 212.97 | 1.29 | 133.83 | 110.95 |
| 300 | 0.8 | 78 | 0.24 | 47.87 | 32.28 | 62.47 | 1.36 | 18.11 | 16.05 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Polyhedral approximations of the semidefinite cone and their application††thanks:
An earlier version of this paper was entitled “Polyhedral approximations of the semidefinite cone and their applications.” This research was supported by the Japan Society for the Promotion of Science through a Grant-in-Aid for Challenging Exploratory Research (17K18946) and a Grant-in-Aid for Scientific Research ((B)19H02373) of the Ministry of Education, Culture, Sports, Science and Technology of Japan.
Yuzhu Wang, Akihiro Tanaka and Akiko Yoshise
Graduate School of Systems and Information Engineering, University of Tsukuba, Tsukuba, Ibaraki 305-8573, Japan. email: [email protected]
Central Research Institute of Electric Power Industry, Yokosuka, Kanagawa 240-0196, Japan. email: [email protected] Corresponding author. Faculty of Engineering, Information and Systems, University of Tsukuba, Tsukuba, Ibaraki 305-8573, Japan. email: [email protected]
(May 2019
Revised December 2020)
Abstract
We develop techniques to construct a series of sparse polyhedral approximations of the semidefinite cone. Motivated by the semidefinite (SD) bases proposed by Tanaka and Yoshise (2018), we propose a simple expansion of SD bases so as to keep the sparsity of the matrices composing it. We prove that the polyhedral approximation using our expanded SD bases contains the set of all diagonally dominant matrices and is contained in the set of all scaled diagonally dominant matrices. We also prove that the set of all scaled diagonally dominant matrices can be expressed using an infinite number of expanded SD bases. We use our approximations as the initial approximation in cutting plane methods for solving a semidefinite relaxation of the maximum stable set problem. It is found that the proposed methods with expanded SD bases are significantly more efficient than methods using other existing approximations or solving semidefinite relaxation problems directly.
Key words: Semidefinite optimization problems; Conic optimization problems; Polyhedral approximation; Semidefinite bases; Expanded semidefinite bases.
AMS subject classifications: 90C05, 90C22, 90C25
1 Introduction
A semidefinite optimization problem (SDP) is an optimization problem in variables in the space of symmetric matrices with a linear objective function and linear constraints over the semidefinite cone. We denote the space of symmetric matrices as and the semidefinite cone as . Accordingly, we can readily define an SDP in the standard form, as
[TABLE]
where , , (), and is the inner product over .
SDPs are powerful tools that provide convex relaxations for combinatorial and nonconvex optimizations, such as the max-cut problem (e.g., [19], [12]) and the k-equipartition problem (e.g., [46], [23]). Some of these relaxations can even attain the optimum, as shown in [31] and [24]. Interested readers may find details about SDPs and their relaxations in [46], [42] and [32].
A cone is called proper if it has a non-empty interior and is closed, pointed (i.e., ), and convex. It is known that the SDP cone is a proper cone [9]. By replacing the semidefinite constraint with a general conic constraint in (1) (say, a proper cone ), one can obtain a general class of problems, namely, conic optimization problems. The class of conic optimization problems has been an active field of study because it contains many popular classes of problems, including linear optimization problems (LPs), second-order cone programs (SOCPs), SDPs, and copositive programs. Copositive programs have been shown capable of providing tight lower bounds for combinatorial and quadratic optimization problems, as described in the survey paper by Dür [17] and the recent work of Arima et al. [3], [25], [4], etc. It has been shown that a copositive relaxation sometimes gives a highly accurate approximate solution for some combinatorial problems under certain conditions [5], [11]. However, the copositive program and its dual problem are both NP-hard (see, e.g., [16] and [36]).
SDPs are also attractive because they can be solved in polynomial time to any desired precision. There are state-of-the-art solvers, such as SDPA [47], SeDuMi [40], SDPT3 [43], and Mosek [35], but their computations become difficult when the size of the SDP becomes large. To overcome this deficiency, for example, one may use preprocessing to reduce the size of the SDPs, which leads to facial reduction methods [37], [38] and [44]. As another idea, one may generate relaxations of SDPs and solve them as easily handled optimization problems, e.g., LPs and SOCPs, which leads to cutting plane methods. We will focus on these latter methods.
The cutting plane method solves an SDP by transforming it into an optimization problem (e.g., an LP or an SOCP), adding cutting planes at each iteration to cut the current approximate solution out of the feasible region in the next iterations and to get close to the optimal value. The cutting plane method was first used on the traveling-salesman problem, by Dantzig, Fulkerson, and Johnson [13], [14] in 1954. It was used in 1958 by Gomory [20] to solve integer linear programming problems. As SDPs became popular, it came to be used on them as well; see, for instance, Krishnan and Mitchell [28], [30] and [29], and Konno et al. [27]. Kobayashi and Takano [26] applied it to a class of mixed-integer SDPs. In [1], Ahmadi, Dash, and Hall applied it to nonconvex polynomial optimization problems and copositive programs.
In the above-mentioned cutting plane methods for SDPs, the semidefinite constraint in (1) is first relaxed to , where , and an initial relaxation of the SDP is obtained. If is polyhedral, the initial relaxation may give an LP; if is given by second-order constraints, the initial relaxation becomes an SOCP. To improve the performance of these cutting plane methods, we consider generating initial relaxations for SDPs that are both tight and computationally efficient and focus on approximations of .
Many approximations of have been proposed on the basis of its well-known properties. Kobayashi and Takano [26] used the fact that the diagonal elements of semidefinite matrices are nonnegative. Konno et al. [27] imposed an assumption that all diagonal elements of the variable in the SDPs appearing in their iterative algorithm are bounded by a constant. The sets of diagonally dominant matrices and scaled diagonally dominant matrices are known to be cones contained in , (see, e.g., [22] and [1] for details). The inclusive relation among them has been studied in, e.g., [7] and [8]. Ahmadi et al. [1] and [2] used these sets as initial approximations of their cutting plane method. Boman et al. [10] defined the factor width of a semidefinite matrix, and Permenter and Parrilo used it to generate approximations of , which they applied to facial reduction methods in [37].
Tanaka and Yoshise defined various bases of , wherein each basis consists of semidefinite matrices, called semidefinite (SD) bases, and used them to devise approximations of [41]. They showed that the conical hull of SD bases and its dual cone give inner and outer polyhedral approximations of , respectively. On the basis of the SD bases, they also developed techniques to determine whether a given matrix is in the semidefinite plus nonnegative cone , which is the Minkowski sum of and the nonnegative matrices cone . In this paper, we focus on the fact that SD bases are sometimes sparse, i.e., the number of nonzero elements in a matrix is relatively small, and hence, it is not so computationally expensive to solve polyhedrally approximated problems in such SD bases. We call such an approximation, a sparse polyhedral approximation, and propose efficient sparse approximations of .
The goal of this paper is to construct tight and sparse polyhedral approximations of by using SD bases in order to solve hard conic optimization problems, e.g., doubly nonnegative (DNN, or ) and semidefinite plus nonnegative () optimization problems. The contributions of this paper are summarized as follows.
- •
This paper gives the relation between the conical hull of sparse SD bases and the set of diagonally dominant matrices. We propose a simple expansion of SD bases without losing the sparsity of the matrices and prove that one can generate a sparse polyhedral approximation of that contains the set of diagonally dominant matrices and is contained in the set of scaled diagonally dominant matrices.
- •
The expanded SD bases are used by cutting plane methods for a semidefinite relaxation of the maximum stable set problem. It is found that the proposed methods with expanded SD bases are significantly more efficient than methods using other approximations or solving semidefinite relaxation problems directly.
The organization of this paper is as follows. Various approximations of are introduced in section 2, including those based on the factor width by Boman et al. [10], diagonal dominance by Ahmadi et al. [1], and SD bases by Tanaka and Yoshise [41]. The main results of this paper, i.e., an expansion of SD bases and an analysis of its theoretical properties, are provided in section 3. In section 4, we introduce the cutting plane method using different approximations of for calculating upper bounds of the maximum stable set problem. We also describe the results of numerical experiments and evaluate the efficiency of the proposed method with expanded SD bases.
2 Some approximations of the semidefinite cone
2.1 Factor width approximation
In [10], Boman et al. defined a concept called factor width.
Definition 2.1**.**
(Definition 1 in [10])* The factor width of a real symmetric matrix is the smallest integer such that there exists a real matrix where and each column of contains at most nonzero elements.*
For , we can also define
[TABLE]
It is obvious that the factor width is only defined for semidefinite matrices, because for every matrix in Definition 2.1, the decomposition implies that . Therefore, for every , the set of matrices with a factor width of at most gives an inner approximation of :
2.2 Diagonal dominance approximation
In [1] and [2], the authors approximated the cone with the set of diagonally dominant matrices and the set of scaled diagonally dominant matrices.
Definition 2.2**.**
The set of diagonally dominant matrices and the set of scaled diagonally dominant matrices are defined as follows:
[TABLE]
It is easy to see that is a convex cone and is a cone in . As a consequence of the Gershgorin circle theorem [18], we have the relation . Ahmadi et al. [1] defined as the set of vectors in with at most nonzeros, each equal to or . They also defined a set of matrices . Barker and Carlson [6] proved the following theorem.
Theorem 2.3**.**
(Barker and Carlson [6])* *
The conical hull of a given set is defined as , where is the set of nonnegative integers. A cone generated in this way by a finite number of elements is called finitely generated. Theorem 2.3 implies that has extreme rays; thus, it is a finitely generated cone.
A cone is polyhedral if for some . The following theorem follows from the results of Minkowski [34] and Weyl [45].
Theorem 2.4**.**
(Minkowski-Weyl theorem, see Corollary 7.1a in [39])* A convex cone is polyhedral if and only if it is finitely generated.*
The above theorem ensures that is a polyhedral cone. Using the expression in Theorem 2.3, Ahmadi et al. proved that optimization problems over can be solved as LPs. They also proved that optimization problems over can be solved as SOCPs. They designed a column generation method using and to obtain a series of inner approximations of . As for the relation between the factor width and diagonal dominance, useful results were presented in [10] and in [2], which gives a relation between and the set of matrices with a factor width of at most .
Lemma 2.5**.**
(See [10] and Theorem 8 in [2])* *
Note that Definition 2.1 implies that the set is convex for any , and we obtain the following corollary as Lemma 2.5:
Corollary 2.6**.**
The set is a convex cone.
2.3 SD basis approximation
Tanaka and Yoshise defined semidefinite (SD) bases [41].
Definition 2.7**.**
(Definitions 1 and 2 in [41])* Let denotes the vector with a at the th coordinate and [math] elsewhere, and let be the identity matrix. Then*
[TABLE]
is called an SD basis of Type I, and
[TABLE]
is called an SD basis of Type II. Matrices in SD bases Type I and II are defined as
[TABLE]
As shown in [41], and are subsets of and bases of . Given a set , we define the dual cone of as . The conical hull of and its dual give an inner and an outer polyhedral approximation of , as follows.
Definition 2.8**.**
Let be the identity matrix. The inner and outer approximations of by using SD bases are defined as
[TABLE]
By Definition 2.7, we know that . Since is a convex cone, we have . By Lemma 1.7.3 in [32], we know that is self-dual; that is, . Accordingly, we can conclude that .
Remark 2.9**.**
In [41], and are defined as and using an orthogonal matrix instead of the identity matrix . In fact, for any orthogonal matrix ,
[TABLE]
also give other bases and generalizations of and . However, as we will see in section 4, we use the matrices in the bases as in optimization problems of the form
[TABLE]
which is equivalent to
[TABLE]
Therefore, we consider that the generalizations and are not essential throughout this paper and omit those descriptions from subsequent sections to simplify the presentation.
3 Expansion of SD bases
When we use the SD bases for approximating , the sparsity of the matrices in those bases is quite important in terms of computational efficiency. As we mentioned in Remark 2.9, for any orthogonal matrix , and give generalizations of the SD bases. However, it is hard to choose an appropriate orthogonal matrix (except for the identity matrix ) to keep the sparsity of the matrices and in (2). In this section, we try to extend the definition of the SD bases in order to obtain various sparse SD bases which will lead us to sparse polyhedral approximations of .
3.1 SD bases and their relations with and
First, we give a lemma that provides an expression of by using SD bases. The lemma is a direct corollary of the fact that any has nonnegative eigenvalues and a corresponding orthogonal basis of eigenvectors.
Lemma 3.1**.**
[TABLE]
where is the set of orthogonal matrices in .
Lemma 3.1 gives a way to approximate by changing the matrix when creating SD bases. However, a dense matrix may lead to a dense formulation of the approximation using SD basis, which is unattractive from the standpoint of computational efficiency.
Note that we can easily see that the set , the conical hull of the sparse SD bases and , is equivalent to . Thus, we obtain the following proposition as a corollary of Theorem 2.3.
Proposition 3.2**.**
[TABLE]
3.2 Expansion of SD bases without losing sparsity
The previous section shows that we can obtain a sparse polyhedral approximation of by using the SD bases. In this section, we try to extend the definition of the SD bases in order to obtain various sparse polyhedral approximations of .
Definition 3.3**.**
Let be the identity matrix. Define the expansion of the SD basis with one parameter as
[TABLE]
The proposition below ensures that the expansion of the SD bases also gives bases of .
Proposition 3.4**.**
Let be the identity matrix. For any , is a set of independent matrices and thus a basis of .
Proof.
Let . Accordingly, for , we have
[TABLE]
and for every , we also have
[TABLE]
Suppose that there exist such that
[TABLE]
Then, by (3) and (4), we see that
[TABLE]
Since is a set of linearly independent matrices, all the coefficients for in (5) should be [math]. Thus, we have
[TABLE]
Since , by (9) we have
[TABLE]
[TABLE]
The above leads us to conclude that is a set of linearly independent matrices. ∎
If we let , then it is straightforward that . If we let be other real numbers, we may obtain different SD bases. The following proposition gives the condition for generating different expanded SD bases.
Proposition 3.5**.**
Let be the identity matrix. Suppose that and . Then, for every ,
[TABLE]
Proof.
For , let us define
[TABLE]
Note that if , then
[TABLE]
For every , we can write as a linear combination of :
[TABLE]
Since , Proposition 3.4 ensures that is linearly independent, and hence, the expression (13) for is unique.
Suppose that . In this case, all the coefficients in (13) should be nonnegative, which implies that
[TABLE]
From the last inequality in (14), we have either
[TABLE]
For case (i), from the first and second inequalities of (14), we have and , which implies and contradicts the assumption . A similar contradiction is obtained for case (ii). Thus, we have . ∎
3.3 Expression of with expanded SD bases
As we have seen in Corollary 2.6, the set is a convex cone. This fact ensures that as a corollary of Theorem 2.3, the conical hull of the union of the extended SD bases on coincides with and hence, the set of scaled diagonally dominant matrices :
Corollary 3.6**.**
[TABLE]
3.4 Notes on the parameter
Here, we discuss the choice for the parameter to increase the “volume” of the polyhedral approximation of the semidefinite cone . For any and , by Definition 3.3, we can calculate the Frobenius norm of :
[TABLE]
According to Proposition 3.5, by changing , one can obtain different polyhedral approximations. However, we can see that
[TABLE]
and by Definitions 2.7 and 3.3, we have
[TABLE]
This shows that, if or , the new matrix will become close to the existing matrices, e.g. , , and , and the “volume” of the polyhedral approximation of the semidefinite cone will also be close to the “volume” of the existing inner approximation of .
To give an illustrative explanation of the above discussion, here we consider the specific case
[TABLE]
and draw some figures in with coordinate and . Fig. 1 [a] shows the set of in . The red arrow in Fig. 1 [b] shows the extreme rays with and . The conical hull of these extreme rays is and its cross section with is illustrated as the blue area. To avoid generating a new matrix that is close to the existing matrices, we should choose an such that the angle between and existing matrices are equal, as illustrated in Fig. 1 [c].
We expand this idea to the case of generating a matrix . Given an , we can define the angles between matrices in the expanded SD bases and SD bases Type I and II for every , as follows:
[TABLE]
Thus, we have
[TABLE]
Similarly, we have
[TABLE]
In general, to obtain a large enough inner approximation with limited parameters, we prefer an that makes , which means that the new matrix will be in the middle of and on the boundary of . Similarly, we can obtain by calculating , and . By solving these equalities, we find that
[TABLE]
The expansions with these parameters are expected to provide generally large inner approximations for .
4 Cutting plane methods for the maximum stable set problem
Conic optimization problems, including SDPs and copositive programs, have been shown to provide tight bounds for NP-hard combinatorial and noconvex optimization problems. Here, we consider applying approximations of to one of those NP-hard problems, the maximum stable set problem. A stable set of a graph is a set of vertices in , such that there is no edge connecting any pair of vertices in the set. The maximum stable set problem aims to find the stability number, i.e. the number of vertices of the largest stable set of , namely .
De Klerk and Pasechnik [15] proposed a copositive programming formulation to obtain the exact stability number of a graph with vertices:
[TABLE]
where is the all-ones vector, is the adjacency matrix of graph , and is the dual cone of the copositive cone .
Although problem (4) is a conic optimization problem, it is still difficult since determining whether or not is NP-hard [16]. A natural approach is to relax this problem to a more tractable optimization problem. From the definition of each cone, we can see the validity of the following inclusions:
[TABLE]
By replacing with , one can obtain an SDP relaxation of (4):
[TABLE]
Solving this SDP is not as easy as it seems to be; in fact, we could not obtain a useful result of (4) after 6 hours of calculation using the state-of-the-art SDP solver Mosek for a random generalized problem when . Combining the expanded SD bases with the cutting plane method, we apply the approximations of to (4) and solve it by calculating a series of more tractable problems.
Let satisfy and replace by in (4). Then, we obtain a relaxation of (4):
[TABLE]
Usually, the relaxed problem (4) is expected to be easier to solve and to give us a better upper bound of problem (4) from its optimal solution . To get a better upper bound, we select some eigenvectors with negative eigenvalues of an optimal solution of problem (4), say , by adding cutting planes
[TABLE]
to (4), and obtain a new optimization problem
[TABLE]
Notice that the optimal solution of problem (4) is cut from the feasible region of problem (4) since . On the other hand, since , every feasible solution of (4) is feasible for (4), and hence problem (4) is a relaxation of problem (4). These facts ensure that problem (4) is a tighter relaxation of problem (4) than problem (4). By repeating this procedure, we are able to obtain a series of nonincreasing upper bounds of (4). Since the eigenvectors are usually dense, we only have to add eigenvectors corresponding to up to the second smallest eigenvalues to at every iteration, which increases computational efficiency.
As for the selection of the initial relaxation , we are ready to use the approximations of based on the expanded SD bases. Let be the set of parameters calculated in Section 3.4, and let denote the conical hull of expanded SD bases using :
[TABLE]
Then, as has been described in the previous sections, we have
[TABLE]
If or is selected to be , the corresponding relaxed problem in the cutting plane procedure becomes an LP, which allows us to use powerful state-of-the-art LP solvers, such as Gurobi [21]. Ahmadi et. al. [1] showed that when is selected, the relaxations turn out to be SOCPs. Although provides a tighter relaxation than either or , the latter two relaxations are expected to have a lower computational cost. In addition, in [1], Ahmadi et al. also proposed an SOCP-based cutting plane approach, named SDSOS, which adds SOCP cuts at every iteration. We conducted experiments to compare the efficiencies of those cutting plane methods using different approximations and SDSOS. The specifications of the experimental methods are summarized in Table 1.
We tested these methods on the Erds-Rényi graphs , randomly generated by Ahmadi et al. in [1], where is the number of vertices and every pair of vertices has an edge with probability . All experiments were performed with MATLAB 2018b on a Windows PC with an Intel(R) Core(TM) i7-6700 CPU running at 3.4 GHz and 16 GB of RAM. The LPs were solved using Gurobi Optimizer 8.0.0 [21] and the SOCPs and SDPs are solved using Mosek Optimizer 9.0 [35].
Fig. 2 shows the result for an instance with and . The x-axis is the number of iterations, and the y-axis is the gap between the upper bounds of each method and the SDP bound obtained by (4); the gap is computed by for the obtained upper bound at ’s iteration and the SDP bound obtained by solving (4) directly.
As can be seen in this figure, the accuracy of CPDD is the worst among the four methods at each iteration. CPSDB achieves almost the same upper bounds as CPSDD and SDSOS, which shows that the proposed polyhedral approximation is promising for obtaining a solution close to the non-polyhedral approximation of . Although SDSOS adds an extra SOCP cut at every iteration and takes longer to solve, the accuracy of SDSOS does not seem to be affected and is not so different from the accuracy of CPSDD at each iteration.
Fig. 3 shows the relation between the computation time and the gap of each method for the same instance. Although its accuracy is not necessarily the best at every iteration, it seems that CPSDB is the most efficient method. CPSDB attains an upper bound whose gap is within s, while CPSDD and SDSOS attain upper bounds whose gap is after the same amount of time. The difference might come from that the subproblems of CPSDB are sparse LPs at earlier iterations and the computations are relatively cheaper than those of CPSDD and SDSOS whose subproblems are SOCPs.
Table 2 and 3 give the bounds of iterative methods and the SDP bound for all the instances. In Table 2, the CPSDD0/SDSOS0 column shows the first upper bound obtained by CPSDD and SDSOS, i.e., the upper bound obtained by solving the same SOCP before adding any cutting plane. The (5 min) and (10 min) columns of CPSDD (SDSOS) show the upper bounds obtained after minutes and after minutes of the CPSDD (SDSOS) computation, respectively. The SDP column shows the SDP bound obtained by solving (4).
Similarly, in Table 3, the CPDD0 and CPSDB0 columns show the first upper bounds obtained by CPDD and CPSDB, respectively, before adding any cutting plane. The (5 min) and (10 min) columns of CPDD (CPSDB) show the upper bounds obtained after minutes and after minutes of the CPDD (CPSDB) computation, respectively.
Note that we failed to solve SDPs (4) for instances having nodes within our time limit . In Table 2, the Value and Time (s) columns of SDP with show the results obtained in [1] for these two instances, as a reference.
As can be seen in Table 2 and 3, for all instances, the values of CPSDD0/SDSOS0 are better than the values of CPSDB0 and CPDD0. These results correspond to the inclusion relationship of initial approximations (20). We can also see that the values of CPSDB0 are almost the same as those of CPSDD0/SDSOS0 for all instances, while the values of CPDD0 are much worse than others. For all instances, CPSDB seems to be significantly more efficient than all other methods. For example, for instance with and , after min of calculation, CPSDB obtained an upper bound of , while CPSDD and SDSOS got upper bounds greater than and CPDD got a bound of more than .
At present, solving a large SDP, e.g., one with more than nodes requires a significant amount of computational time. The cutting plane method CPSDB with our polyhedral approximation is a promising way of obtaining efficient upper bounds of such large SDPs in a moderate time.
5 Concluding remarks
We developed techniques to construct a series of sparse polyhedral approximations of the semidefinite cone. We provided a way to approximate the semidefinite cone by using SD bases and proved that the set of diagonally dominant matrices can be expressed with sparse SD bases. We proposed a simple expansion of SD bases that keeps the sparsity of the matrices that compose it. We gave the conditions for generating linearly independent matrices in expanded SD bases as well as for generating an expansion different from the existing one. We showed that the polyhedral approximation using our expanded SD bases contains the set of diagonally dominant matrices and is contained in the set of scaled diagonally dominant matrices. We also proved that the set of scaled diagonally dominant matrices can be expressed using an infinite number of expanded SD bases.
The polyhedral approximations were applied to the cutting plane method for solving a semidefinite relaxation of the maximum stable set problem. The results of the numerical experiments showed that the method with our expanded SD bases is more efficient than other methods (see Fig. 3); improving the efficiency of our method still remains an important study issue.
One future direction of study is to increase the number of vectors in the definition of the SD bases. The current SD bases are defined as a set of matrices . If we use three vectors, as in , we might obtain another inner approximation that remains relatively sparse when the dimension is large.
Another future direction is to focus on the factor width of a matrix. The cone of matrices with factor width at most was introduced in order to give another expression of the set of scaled diagonally dominant matrices. By considering a larger width , we may obtain a larger inner approximation of the semidefinite cone , although it would not be polyhedral, or even characterized by using SOCP constraints. Finding efficient ways to solve approximation problems over such cones might be an interesting challenge.
Also, our expanded SD bases can be applied to some other difficult problems. Mixed integer nonlinear programming has recently become popular in many practical applications. In [33], Lubin et al. proposed a cutting plane framework for mixed integer convex optimization problems. In [26], Kobayashi and Takano proposed a branch and bound cutting plane method for mixed integer SDPs. It would be interesting to see whether the approximations of proposed in this paper could be used to improve the efficiency of those methods.
Acknowledgments
The authors would like to sincerely thank the anonymous reviewers for their thoughtful and valuable comments which have significantly improved the paper. Among others, one of the reviewers pointed out Remark 2.9 which helped the authors to simplify the presentation of the paper. This research was supported by the Japan Society for the Promotion of Science through a Grant-in-Aid for Challenging Exploratory Research (17K18946) and a Grant-in-Aid for Scientific Research ((B)19H02373) from the Ministry of Education, Culture, Sports, Science and Technology of Japan.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. A. Ahmadi, S. Dash, and G. Hall , Optimization over structured subsets of positive semidefinite matrices via column generation , Discrete Optimization, 24 (2017), pp. 129–151.
- 2[2] A. A. Ahmadi and A. Majumdar , DSOS and SDSOS optimization: more tractable alternatives to sum of squares and semidefinite optimization , SIAM Journal on Applied Algebra and Geometry, 3 (2019), pp. 193–230.
- 3[3] N. Arima, S. Kim, and M. Kojima , A quadratically constrained quadratic optimization model for completely positive cone programming , SIAM Journal on Optimization, 23 (2013), pp. 2320–2340.
- 4[4] N. Arima, S. Kim, M. Kojima, and K.-C. Toh , A robust Lagrangian-DNN method for a class of quadratic optimization problems , Computational Optimization and Applications, 66 (2017), pp. 453–479.
- 5[5] N. Arima, S. Kim, M. Kojima, and K.-C. Toh , Lagrangian-conic relaxations, part i: A unified framework and its applications to quadratic optimization problems , Pacific Journal of Optimization, 14 (2018), pp. 161–192.
- 6[6] G. Barker and D. Carlson , Cones of diagonally dominant matrices , Pacific Journal of Mathematics, 57 (1975), pp. 15–32.
- 7[7] A. Berman and R. J. Plemmons , Nonnegative Matrices in the Mathematical Sciences , vol. 9, Siam, 1994.
- 8[8] L. Bishan, L. Lei, M. Harada, H. Niki, and M. J. Tsatsomeros , An iterative criterion for H-matrices , Linear Algebra and Its Applications, 271 (1998), pp. 179–190.
