A modified generalized shift-splitting preconditioner for nonsymmetric saddle point problems
Zhengge Huang, Ligong Wang, Zhong Xu, Jingjing Cui

TL;DR
This paper introduces a modified generalized shift-splitting preconditioner and iteration method for nonsymmetric saddle point problems, proving their convergence and demonstrating superior numerical performance over existing methods.
Contribution
The paper develops a new MGSSP preconditioner and iteration method, extending previous work, with proven convergence and improved efficiency for nonsymmetric saddle point problems.
Findings
MGSSP iteration is unconditionally convergent and semi-convergent.
Numerical results show MGSSP outperforms existing methods.
MGSSP preconditioner is more effective than other preconditioners.
Abstract
For the nonsymmetric saddle point problems with nonsymmetric positive definite (1,1) parts, the modified generalized shift-splitting (MGSSP) preconditioner as well as the MGSSP iteration method are derived in this paper, which generalize the MSSP preconditioner and the MSSP iteration method newly developed by Huang and Su (J. Comput. Appl. Math. 2017), respectively. The convergent and semi-convergent analysis of the MGSSP iteration method are presented, and we prove that this method is unconditionally convergent and semi-convergent. In addition, some spectral properties of the preconditioned matrix are carefully analyzed. Numerical results demonstrate the robustness and effectiveness of the MGSSP preconditioner and the MGSSP iteration method, and also illustrate that the MGSSP iteration method outperforms the GSS and GMSS iteration methods, and the MGSSP preconditioner is superior to…
| Method | ||||
|---|---|---|---|---|
| 16 | 32 | 64 | ||
| 20 | 51 | 125 | ||
| 2.7 | 5 | 1.5 | ||
| GSS | IT | 58 | 72 | 102 |
| CPU | 0.2556 | 1.1583 | 21.2715 | |
| RES | 8.79e-07 | 8.68e-07 | 9.80e-07 | |
| 22 | 36 | 38 | ||
| 16 | 8.3 | 5.9 | ||
| GMSS | IT | 66 | 73 | 89 |
| CPU | 0.4955 | 1.5732 | 26.4811 | |
| RES | 8.45e-07 | 9.09e-07 | 9.50e-07 | |
| 0.2 | 0.5 | 0.2 | ||
| 0.1 | 0.1 | 0.1 | ||
| MGSSP | IT | 21 | 21 | 21 |
| CPU | 0.1514 | 0.7012 | 10.1562 | |
| RES | 9.88e-07 | 9.85e-07 | 9.57e-07 |
| 16 | IT | 121 | 9 | 9 | 15 | 13 | 7 |
|---|---|---|---|---|---|---|---|
| CPU | 0.1550 | 0.0447 | 0.1505 | 0.1838 | 0.1705 | 0.0837 | |
| RES | 7.21e-07 | 5.61e-07 | 3.29e-07 | 3.29e-07 | 6.69e-07 | 3.78e-07 | |
| 32 | IT | 264 | 10 | 9 | 15 | 14 | 7 |
| CPU | 3.8574 | 0.5004 | 0.4703 | 0.8234 | 0.7600 | 0.3831 | |
| RES | 9.74e-07 | 2.54e-07 | 7.25e-07 | 7.63e-07 | 9.24e-07 | 9.67e-07 | |
| 48 | IT | 429 | 10 | 10 | 16 | 15 | 8 |
| CPU | 24.7021 | 3.7594 | 3.5617 | 6.2255 | 5.7171 | 2.8951 | |
| RES | 9.95e-07 | 6.29e-07 | 1.77e-07 | 6.33e-07 | 5.21e-07 | 1.78e-07 | |
| 64 | IT | – | 11 | 10 | 16 | 15 | 8 |
| CPU | – | 22.5881 | 21.3997 | 33.6562 | 31.2381 | 16.2309 | |
| RES | – | 3.75e-07 | 2.58e-07 | 8.29e-07 | 8.93e-07 | 2.50e-07 |
| 16 | IT | 115 | 8 | 8 | 17 | 17 | 6 |
|---|---|---|---|---|---|---|---|
| CPU | 0.1326 | 0.0982 | 0.0850 | 0.2155 | 0.2371 | 0.0783 | |
| RES | 9.50e-07 | 4.54e-07 | 1.56e-07 | 5.89e-07 | 4.29e-07 | 5.96e-07 | |
| 32 | IT | 240 | 9 | 8 | 17 | 17 | 7 |
| CPU | 3.4868 | 0.4959 | 0.4568 | 0.8974 | 0.8876 | 0.4244 | |
| RES | 9.34e-07 | 2.10e-07 | 6.49e-07 | 8.09e-07 | 4.93e-07 | 7.16e-07 | |
| 48 | IT | 367 | 9 | 9 | 18 | 17 | 7 |
| CPU | 20.4798 | 3.3953 | 3.3951 | 6.8142 | 6.4413 | 2.7642 | |
| RES | 9.80e-07 | 4.38e-07 | 1.36e-07 | 3.97e-07 | 6.37e-07 | 1.40e-07 | |
| 64 | IT | 495 | 9 | 9 | 18 | 17 | 7 |
| CPU | 81.8770 | 18.4334 | 18.5499 | 37.0634 | 35.4719 | 15.1190 | |
| RES | 9.73e-07 | 6.88e-07 | 2.15e-07 | 4.82e-07 | 7.42e-07 | 2.18e-07 |
| 16 | IT | 246 | 9 | 10 | 51 | 54 | 7 |
|---|---|---|---|---|---|---|---|
| CPU | 0.3743 | 0.1904 | 0.1345 | 0.5309 | 0.8654 | 0.1071 | |
| RES | 9.65e-07 | 8.26e-07 | 3.04e-07 | 9.10e-07 | 7.90e-07 | 8.61e-07 | |
| 32 | IT | 429 | 9 | 10 | 55 | 56 | 7 |
| CPU | 7.2934 | 0.4691 | 0.5084 | 2.6743 | 3.2658 | 0.4067 | |
| RES | 9.88e-07 | 8.40e-07 | 3.19e-07 | 8.57e-07 | 8.40e-07 | 8.72e-07 | |
| 48 | IT | – | 9 | 10 | 57 | 58 | 7 |
| CPU | – | 3.5469 | 3.8431 | 21.1369 | 23.6098 | 2.7461 | |
| RES | – | 8.61e-07 | 3.21e-07 | 9.91e-07 | 9.05e-07 | 8.50e-07 | |
| 64 | IT | – | 9 | 10 | 57 | 57 | 7 |
| CPU | – | 18.9807 | 20.9074 | 113.9468 | 117.1580 | 15.2981 | |
| RES | – | 7.78e-07 | 2.58e-07 | 9.56e-07 | 9.58e-07 | 7.56e-08 |
| Method | ||||
|---|---|---|---|---|
| 16 | 32 | 64 | ||
| 13 | 29 | 66 | ||
| 39 | 53 | 60 | ||
| GSS | IT | 85 | 136 | 230 |
| CPU | 0.3211 | 2.1355 | 47.4292 | |
| RES | 9.48e-07 | 9.83e-07 | 9.73e-07 | |
| 16 | 18 | 24 | ||
| 75 | 134.4 | 240 | ||
| GMSS | IT | 143 | 213 | 337 |
| CPU | 0.8641 | 4.5574 | 100.4047 | |
| RES | 9.86e-07 | 9.90e-07 | 9.98e-07 | |
| 0.02 | 0.01 | 0.05 | ||
| 0.1 | 0.05 | 0.1 | ||
| MGSSP | IT | 21 | 21 | 21 |
| CPU | 0.0990 | 0.6706 | 11.0860 | |
| RES | 9.53e-07 | 9.54e-07 | 9.54e-07 |
| 16 | IT | 145 | 9 | 8 | 15 | 13 | 6 |
|---|---|---|---|---|---|---|---|
| CPU | 0.2146 | 0.0447 | 0.0968 | 0.1838 | 0.1705 | 0.0813 | |
| RES | 7.95e-07 | 5.61e-07 | 2.16e-07 | 3.29e-07 | 6.69e-07 | 8.81e-07 | |
| 32 | IT | 278 | 10 | 9 | 15 | 14 | 7 |
| CPU | 4.1297 | 0.5004 | 0.4388 | 0.8234 | 0.7600 | 0.4251 | |
| RES | 9.79e-07 | 2.54e-07 | 1.46e-07 | 7.63e-07 | 9.24e-07 | 2.09e-07 | |
| 48 | IT | 366 | 10 | 9 | 16 | 15 | 7 |
| CPU | 20.2558 | 3.7594 | 3.4238 | 6.2255 | 5.7171 | 2.6904 | |
| RES | 9.71e-07 | 6.29e-07 | 4.26e-07 | 6.33e-07 | 5.21e-07 | 5.66e-07 | |
| 64 | IT | 465 | 11 | 9 | 16 | 15 | 8 |
| CPU | 76.1434 | 22.5881 | 18.1854 | 33.6562 | 31.2381 | 16.5291 | |
| RES | 9.71e-07 | 3.75e-07 | 8.37e-07 | 8.29e-07 | 8.93e-07 | 9.27e-08 |
| 16 | IT | 122 | 9 | 9 | 19 | 19 | 7 |
|---|---|---|---|---|---|---|---|
| CPU | 0.1422 | 0.1298 | 0.1276 | 0.3788 | 0.3205 | 0.1223 | |
| RES | 8.71e-07 | 5.68e-07 | 2.67e-07 | 7.02e-07 | 5.33e-07 | 1.62e-07 | |
| 32 | IT | 237 | 10 | 9 | 19 | 19 | 7 |
| CPU | 3.3291 | 0.6218 | 0.5728 | 1.1392 | 1.1254 | 0.4491 | |
| RES | 9.87e-07 | 2.31e-07 | 5.53e-07 | 6.67e-07 | 5.62e-07 | 2.83e-07 | |
| 48 | IT | 350 | 10 | 9 | 19 | 19 | 7 |
| CPU | 19.3841 | 4.1896 | 3.8573 | 7.9254 | 8.3686 | 3.1636 | |
| RES | 9.99e-07 | 3.20e-07 | 7.60e-07 | 7.45e-07 | 5.74e-07 | 3.81e-07 | |
| 64 | IT | 461 | 10 | 9 | 19 | 19 | 7 |
| CPU | 76.1620 | 23.0578 | 20.8402 | 44.0056 | 42.1010 | 17.2456 | |
| RES | 9.82e-07 | 3.91e-07 | 9.15e-07 | 8.17e-07 | 6.00e-07 | 4.58e-07 |
| 16 | IT | 250 | 10 | 10 | 59 | 59 | 7 |
|---|---|---|---|---|---|---|---|
| CPU | 0.4184 | 0.1320 | 0.1333 | 0.8935 | 0.9098 | 0.1191 | |
| RES | 9.42e-07 | 5.70e-07 | 4.30e-07 | 8.22e-07 | 8.03e-07 | 8.47e-07 | |
| 32 | IT | 419 | 10 | 10 | 60 | 60 | 7 |
| CPU | 7.0626 | 0.6292 | 0.6296 | 3.3032 | 3.3277 | 0.4901 | |
| RES | 9.85e-07 | 5.31e-07 | 3.93e-07 | 9.43e-07 | 9.14e-07 | 8.11e-07 | |
| 48 | IT | – | 10 | 10 | 60 | 60 | 7 |
| CPU | – | 4.1788 | 4.3580 | 23.9170 | 23.6870 | 3.0734 | |
| RES | – | 5.75e-07 | 4.28e-07 | 9.46e-07 | 9.15e-07 | 8.72e-07 | |
| 64 | IT | – | 10 | 10 | 60 | 60 | 7 |
| CPU | – | 23.6671 | 24.2199 | 130.4922 | 130.1184 | 16.9088 | |
| RES | – | 6.43e-07 | 4.83e-07 | 9.40e-07 | 9.07e-07 | 9.34e-07 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Electromagnetic Scattering and Analysis · Advanced Numerical Methods in Computational Mathematics
A modified generalized shift-splitting preconditioner for nonsymmetric saddle point problems
††thanks: Supported by the National Natural Science Foundation of China (No. 11171273) and Innovation Foundation for Doctor Dissertation of Northwestern Polytechnical University (No. CX201628).
Zhengge Huang, Ligong Wang111Corresponding author. , Zhong Xu and Jingjing Cui
Department of Applied Mathematics, School of Science, Northwestern Polytechnical University,
Xi’an, Shaanxi 710072, People’s Republic of China.
E-mails: [email protected]; [email protected](or [email protected]);
[email protected]; [email protected]
Abstract
For the nonsymmetric saddle point problems with nonsymmetric positive definite (1,1) parts, the modified generalized shift-splitting (MGSSP) preconditioner as well as the MGSSP iteration method are derived in this paper, which generalize the MSSP preconditioner and the MSSP iteration method newly developed by Huang and Su (J. Comput. Appl. Math. 2017), respectively. The convergent and semi-convergent analysis of the MGSSP iteration method are presented, and we prove that this method is unconditionally convergent and semi-convergent. In addition, some spectral properties of the preconditioned matrix are carefully analyzed. Numerical results demonstrate the robustness and effectiveness of the MGSSP preconditioner and the MGSSP iteration method, and also illustrate that the MGSSP iteration method outperforms the GSS and GMSS iteration methods, and the MGSSP preconditioner is superior to the shift-splitting (SS), generalized SS (GSS), modified SS (MSS) and generalized MSS (GMSS) preconditioners for the GMRES method for solving the nonsymmetric saddle point problems.
Key Words: Nonsymmetric saddle point problem, Modified generalized shift-splitting, Convergence, Semi-convergence, Spectral properties. AMS Subject Classification (2010): 65F08, 65F10.
1 Introduction
In a wide variety of scientific and engineering applications, such as mixed finite element approximation of elliptic partial differential equations, the image reconstruction and registration, computational fluid dynamics, weighted least-squares problems, networks computer graphics, constrained optimization and son on [2, 16, 27], we need to solve the following nonsymmetric saddle point problems of the form
[TABLE]
where is nonsymmetric positive definite, is a rectangular matrix, and are given vectors, with . Here, denotes the transpose of . The system of linear equations (1) is also termed as a Karush-Kuhn-Tucker (KKT) system, or an augmented system [28, 25]. For a wider class of saddle point problems, the readers can refer to [13].
Since the matrices and are large and sparse in general, the iteration methods are often much more suitable for solving it than direct methods. When is of full column rank, a large variety of effective iterative methods based on matrix splitting as well as their numerical properties have been investigated in the literature. For example, Golub et al. [29] developed the SOR-like method, and in the sequel, Bai et al. [10, 11] extended the SOR-like method to the generalized SOR (GSOR) method and the parameterized inexact Uzawa method, respectively. For SOR-like methods established recently, see [30, 39]. Based on the Uzawa method presented by Bramble et al. and Elman and Golub in [15, 26], Bai et al. [10, 11], Dai et al. [23] and Ma and Zheng [37] employed the Uzawa-type methods and so forth in recent years. Besides, Bai et al. put forward the well-known Hermitian and skew-Hermitian splitting (HSS) methods [7] and its variants [6, 8, 9, 5]. On the basis of the shift-splitting (SS) of a non-Hermitian matrix [12], Cao et al. [17] derived the SS iteration method as well as the SS preconditioner for the nonsingular saddle point problems, and Chen and Ma [20] and Cao et al. [18] generalized the SS iteration method and obtained the generalized SS (GSS) iteration method. To increase the convergence rate of the GSS iteration method, Huang and Su [33] newly developed the modified shift-splitting (MSSP) iteration method.
If in (1) is rank deficient, then the coefficient matrix in (1) is singular, and we call (1) the singular saddle point problem. Some iteration methods and preconditioning techniques for solving singular saddle point problems have been proposed in the recent literature, see, e.g., [35, 42, 41, 34]. Zheng et al. [43] proposed some sufficient conditions for the semi-convergence of the GSOR method and determined the optimal iteration parameters. Bai [3] derived some necessary and sufficient conditions to assure the semi-convergence of the HSS method. Chen et al. [21] and Cao et al. [19] investigated the generalized shift-splitting iteration method for singular saddle point problems. Very recently, Dou et al. [24] introduced the modifying the parameterized inexact Uzawa (PIU) for singular saddle point problems, and Zheng and Lu [44] proved the semi-convergence of the upper and lower triangular (ULT) splitting iteration method for singular saddle point problems.
Recently, based on the preconditioner [12] studied for a class of non-Hermtian positive definite linear systems, Cao et al. [17] presented a shift-splitting (SS) preconditioner
[TABLE]
for the saddle point problem (1), where is a positive constant and is the identity matrix. The authors also proved the corresponding SS iteration method is unconditional convergent.
On the basis of the shift-splitting (SS) preconditioner [17], Chen and Ma [20] and Cao et al. [18] replaced the parameter in (2,2)-block of the SS preconditioner by another parameter , and employed the generalized SS (GSS) preconditioner of the form
[TABLE]
where , and is the identity matrix. It is easy to see that is a special case of when . Numerical results in [21, 20] confirmed that the GSS preconditioner is superior to the SS preconditioner.
Very recently, based on the well-known Hermitian and skew-Hermitian splitting (HSS) of the matrix : , where , , and similar to the shift-splitting [17, 12], the modified shift-splitting (MSS) preconditioner [45] was proposed for nonsymmetric saddle point problem (1), the form of is:
[TABLE]
with being a constant and being the identity matrix with appropriate dimension.
In the sequel, by replacing the parameter in (2,2)-block in the MSS preconditioner by another parameter , Huang et al. [32] established the generalized MSS (GMSS) preconditioner. They discussed the corresponding GMSS iteration method is convergent and semi-convergent under proper conditions, and showed that the GMSS iteration method and the GMSS preconditioner are better than the MSS iteration method and the MSS preconditioner, respectively by numerical experiments.
In order to increase the convergence rate of the GSS method for the nonsingular saddle point problems with symmetric positive definite (1,1) parts, Huang and Su [33] newly developed the modified shift-splitting (MSSP) preconditioner of the form:
[TABLE]
with being a constant and being the identity matrix with appropriate dimension, which derived from the following modified shift-splitting of the saddle point matrix :
[TABLE]
The authors in [33] theoretically verified the corresponding MSSP iteration method is unconditional convergent and estimated the bounds of the eigenvalues of the iteration matrix of the MSSP iteration method. Numerical experiments illustrated that the MSSP preconditioner outperforms the SS and GSS preconditioners for the nonsingular saddle point problems with symmetric positive definite (1,1) parts.
To further accelerate the convergence rates of the GSS and the GMSS preconditioned GMRES methods for the saddle point problems with nonsymmetric positive definite (1,1) parts, a new preconditioner which is referred to as the modified generalized shift-splitting (MGSSP) preconditioner is developed for nonsymmetric saddle point problems in this paper. Theoretical analysis also shows that the corresponding splitting iteration method is convergent and semi-convergent unconditionally. Besides, we investigate the spectral properties of the corresponding preconditioned matrix and show that it has clustered eigenvalue distribution by choosing proper parameters. Numerical experiments are presented to confirm the effectiveness of the MGSSP iteration method and the MGSSP preconditioned GMRES method for solving the nonsymmetric saddle point problems.
The outline of this paper is organized as follows. In Section 2, we propose the MGSSP iteration method which induces the MGSSP preconditioner. The unconditional convergent and semi-convergent properties of the MGSSP iteration method will be proved in Sections 3 and 4, respectively. The spectral properties of the MGSSP preconditioned matrix are obtained correspondingly in Section 5. We examine the feasibility and effectiveness of the MGSSP iteration method and the MGSSP preconditioned GMRES method for solving the nonsymmetric nonsingular and singular saddle point problems by numerical experiments in Section 6. Finally, a brief conclusion will be given to end this work in Section 7.
Throughout this paper, and represent the minimum eigenvalue and the spectral radius of the matrix , respectively. denotes the conjugate transpose of either a vector or a matrix.
2 The modified generalized shift-splitting (MGSSP) preconditioner and its implementation
In this section, inspired by the ideas of [20, 18, 33], we develop a new splitting called the modified generalized shift-splitting (MGSSP) of the nonsymmetric saddle point matrix by combining the generalized splitting-splitting and the modified shift-splitting of the saddle point matrix as follows.
[TABLE]
where , are two constants and is the unit matrix with appropriate dimension. Then, the splitting (2) naturally leads to the following modified generalized shift-splitting iteration method for solving the nonsymmetric saddle point problem (1):
The modified generalized shift-splitting (MGSSP) iteration method: Let and be two given constants. Given an initial guess . For , until converges, compute
[TABLE]
Hence the MGSSP iteration method can be written in the following fixed point form
[TABLE]
where
[TABLE]
is the iteration matrix.
It should be noted that any matrix splitting not only can automatically lead to a splitting iteration method, but also can naturally induce a splitting preconditioner for the Krylov subspace methods. The splitting preconditioner corresponds to the MGSSP iteration (2) is given by
[TABLE]
which is called the MGSSP preconditioner for the nonsymmetric saddle point matrix .
At each step of the MGSSP iteration (3) or applying the MGSSP preconditioner within a Krylov subspace method, we need to solve a linear system with as the coefficient matrix. That is to say, we need to solve a linear system of the form
[TABLE]
where and with and . It is not difficult to check that
[TABLE]
It follows from the decomposition of in (5) that
[TABLE]
Therefore, we can derive the following algorithmic version of the MGSSP iteration method.
Algorithm 2.1 For a given vector , the vector can be computed by (6) according to the following steps:
(1) compute ;
(2) solve ;
(3) compute .
From Algorithm 2.1, it is known that at each iteration, a linear system with the coefficient matrix only needs to be solved. However, it may be very costly and impractical in actual implementations because of the sparsity pattern of . Fortunately, the matrix is positive definite for all and . Therefore, we can employ the Krylov subspace method, such as the GMRES method to solve the sub-linear systems with the coefficient matrix by a prescribed accuracy. In addition, it can be solved by some direct methods, such as the sparse LU factorization. What we want to pose here is that we always use the sparse LU factorization to solve this problem in our paper.
3 Convergence of the MGSSP iteration method for nonsingular saddle point problems
The main purpose of this section is to study the convergence properties of the MGSSP iteration method by analyzing the spectral properties of the iteration matrix. Before doing this, we derive some lemmas which will be useful in the following proofs.
Lemma 3.1**.**
[11]* Both roots of the complex quadratic equation are less than one in modulus if and only if , where denotes the conjugate complex of .*
Lemma 3.2**.**
Let be a positive definite matrix, be of full column rank, and and be two given constants. If is an eigenvalue of the iteration matrix , then .
Proof. Let be an eigenvalue of the iteration matrix of the MGSSP iteration method, and be the corresponding eigenvector. Then it holds that
[TABLE]
After proper manipulations, we obtain
[TABLE]
Now we will give the proof by contradiction. If , then from (7), it has and , which lead to and . Thus we get by the positive definiteness of , and therefore and , a contradiction. In addition, if , then it follows from the second equation of (7) that . Substituting this relation into the first equation of (7) gives , then is due to the fact that is nonsingular, which yields that , a contradiction.
Lemma 3.3**.**
Assume that the conditions in Lemma 3.2 are satisfied. Let be an eigenvalue of the iteration matrix of the MGSSP iteration method and , with and , be the corresponding eigenvector. Then . Moreover, if , then .
Proof. If , then from the second equation of (7), we have . Inasmuch as and , we derive . This contradicts to the assumption that is an eigenvector. Furthermore, if , then it follows from the first equation of (7) that
[TABLE]
Since , the definition does make sense. Premultiplying (8) with gives
[TABLE]
where . Since is positive definite, . It follows from (9) that
[TABLE]
Thus, we completes our proof of Lemma 3.3.
Theorem 3.1**.**
Assume the conditions in Lemma 3.2 are satisfied. Let be an eigenvalue of the iteration matrix of the MGSSP iteration method and , with and , be the corresponding eigenvector. Then the MGSSP iteration method converges to the exact solution of the saddle point problem (1) for all and .
Proof. By making use of Lemma 3.2, we have , then from the second equation of (7), it has
[TABLE]
substituting it into the first equation of (7) results in
[TABLE]
By making use of Lemma 3.3, it holds that . Denote
[TABLE]
By multiplying on (10) from the left, we have
[TABLE]
Having mind that is positive definite, we get and , which lead to by and . Hence, (11) can be rewritten as , where
[TABLE]
If , then (11) can be expressed as
[TABLE]
Solving the two roots of (12), we obtain
[TABLE]
Lemma 3.2 implies that , then
[TABLE]
Now we turn to prove under the condition . According to Lemma 3.1, we know that if and only if . After some manipulations, we derive
[TABLE]
and
[TABLE]
Hence, is valid if and only if
[TABLE]
which is equivalent to
[TABLE]
Since , , , and , it holds that
[TABLE]
which implies that (13) holds true, i.e., and therefore . Hence, the MGSSP iteration method is convergent for any and . This proof is completed.
4 Semi-convergence of the MGSSP iteration method for singular saddle point problems
When the saddle point matrix is nonsingular, the MGSSP iteration scheme (3) converges to the exact solution of (1) for any initial vector if and only if , whereas for the singular matrix , we have . In this section, we assume that the sub-matrix in (1) is rank deficient and discuss the semi-convergence of the MGSSP iteration method for solving the singular saddle point problems.
To analyze the semi-convergent properties of the MGSSP iteration method, we present the following lemma which describes the semi-convergence property about the iteration scheme (3) when is singular.
Lemma 4.1**.**
[14]* The iteration scheme (3) is semi-convergent if and only if the following two conditions are satisfied:
(i) , or equivalently, , where is the iteration matrix;
(ii) the pseudo-spectral radius of is less than , i.e.,*
[TABLE]
where is the spectral set of the matrix . Here, we denote the null space, the index and the rank of by , and , respectively.
Lemma 4.1 describes the semi-convergence property about the iteration scheme (3) when is singular. Therefore, to get the semi-convergence property of the MGSSP iteration method, only the two conditions in Lemma 4.1 need to verify. We consider these two conditions in Lemmas 4.2 and 4.3, respectively.
Lemma 4.2**.**
Let be nonsymmetric positive definite, be rank deficient and be given constants. Then, the iteration matrix of the MGSSP iteration method satisfies , or equivalent
[TABLE]
where is the iteration matrix of the MGSSP iteration method defined as in (3).
Proof. Inasmuch as , Equation (14) holds if
[TABLE]
It is easy to see that . Thus we only need to prove
[TABLE]
Let , then it has . Denote by . After suitable manipulations, we have
[TABLE]
i.e.,
[TABLE]
Since , it holds that , i.e.,
[TABLE]
Since is positive definite, from the first equation of (16) we can easily get . Then substituting this relationship into the second equation of (16), we obtain , which leads to . Taking into , we obtain . Hence, the first equation of (15) becomes
[TABLE]
Substituting into yields . Since , , it has . This results in , then we get . Thus, , i.e.,
[TABLE]
The conclusion follows by (17).
In the sequel, we show that the iteration scheme (3) satisfies the condition (ii) in Lemma 4.1. Let be the singular decomposition of matrix , where
[TABLE]
with , being two unitary matrices and being a singular value of .
We introduce a block diagonal matrix
[TABLE]
which is a unitary matrix, and the iteration matrix is unitary similar to the matrix . Hence, the matrix has the same spectrum with the matrix . Thus we only need to analyze the pseudo-spectral radius of the matrix now.
Denoting , then it holds that
[TABLE]
Then, from Equation (18), holds if and only if .
Note that can be viewed as the iteration matrix of the MGSSP iteration method applied to the nonsymmetric nonsingular saddle point problem
[TABLE]
where and .
implies . By making use of the proof of Theorem 3.1, we derive the following result.
Lemma 4.3**.**
Let be nonsymmetric positive definite, be rank deficient and be two given constants. Then, the pseudo-spectral radius of the matrix is less than 1, i.e., for all and .
It follows from Lemmas 4.2 and 4.3 that two conditions in Lemma 4.1 are satisfied. Thus, the semi-convergence of the MGSSP iteration method for solving nonsymmetric singular saddle point problems can be obtained in the following theorem.
Theorem 4.1**.**
Let be nonsymmetric positive definite, be rank deficient and be two given constants. Then the MGSSP iteration method is semi-convergent for solving the nonsymmetric singular saddle point problem (1) for all and .
5 Spectral analysis of the MGSSP preconditioned matrix
The MGSSP iteration method is a stationary iteration method. Although the unconditional convergence and semi-convergence properties of the MGSSP iteration method are studied in Theorem 3.1 and Theorem 4.1, respectively, the convergence (semi-convergence) rates of the MGSSP iteration method may be slow even with the optimal parameters. To accelerate the convergence (semi-convergence) rates of the MGSSP iteration method, we consider applying the preconditioning techniques. In general, the eigenvalue and eigenvector distributions of the preconditioned matrix relate closely to the convergence rates of Krylov subspace methods. Therefore, it is of significance to investigate the spectral properties of the preconditioned matrix . In this section, some spectral properties of the preconditioned matrix are studied.
Theorem 5.1**.**
Let the MGSSP preconditioner be defined as in (4) and be an eigenpair of the preconditioned matrix . Then if is of full column rank and , then
[TABLE]
where and denote the real part and the imaginary part of , respectively. If is rank deficient and , then . Besides, if is rank deficient and , then or satisfies the Inequalities (19). If , then the eigenvalues of the preconditioned matrix satisfy
[TABLE]
where
[TABLE]
and are real numbers and is one of the square roots of , with
[TABLE]
and
[TABLE]
and the second root of is . The eigenvalues satisfy the following inequality:
[TABLE]
When , it holds that
[TABLE]
i.e., for , the eigenvalues of the preconditioned matrix tend to scatter near the point as ; and when , it has
[TABLE]
That is, for , the eigenvalues of the preconditioned matrix tend to scatter near the point and the point as .
*In addition, the eigenvalues of tend to scatter near the points
, and as and .*
Proof. Let be an eigenpair of the preconditioned matrix , we consider the eigenvalue problem , where , then it holds that
[TABLE]
By simple manipulations, we get
[TABLE]
i.e.,
[TABLE]
If has full column rank and , then it follows from the second equation of (24) that and therefore , which contradicts to the assumption that is an eigenvector. Hence . If is of full column rank and , then from the second equation of (24), we have and
[TABLE]
Owing to , it holds that the definition does make sense. Premultiplying Equation (25) with and utilizing the symbols defined as in (21) give
[TABLE]
It is easy to verify that as . Besides, (26) implies that
[TABLE]
Since
[TABLE]
it is not difficult to derive (19).
If is rank deficient and , then from the second equation of (24), we derive . Additionally, if is rank deficient and , then it holds that or , by virtue of the second equation of (24). Similar to the derivation of (19), we also deduce (19) as is rank deficient, and .
In the sequel, we assume that . Then and . Otherwise, it follows from the second equation of (24) that , a contradiction. From the second equation of (24) we can easily get . Then substituting this relationship into the first equation of (24) gives
[TABLE]
Multiplying on Equation (27) from the left and utilizing the symbols defined as in (21) give
[TABLE]
which can be equivalently transformed into the following equation
[TABLE]
By solving Equation (28), we obtain its two roots as follows:
[TABLE]
where and are given by (22). Applying (22) leads to
[TABLE]
which yields that
[TABLE]
It is evident that an upper bound of is , with being bounded as follows:
[TABLE]
which leads to
[TABLE]
Furthermore, it is not difficult to verify that as , and therefore for , as . Moreover, if , then it follows from (22) that and , thus
[TABLE]
which means that for , the eigenvalues of the preconditioned matrix tend to scatter near the point and the point as . Additionally, it is easily seen that the eigenvalues of tend to scatter near the points and as and .
Remark 5.1**.**
It follows from Theorem 5.1 that
[TABLE]
as , and , and if is of full column rank and , then from (19), we infer that , where is an eigenpair of the preconditioned matrix . Thus all eigenvalues of have positive real parts and lie in a positive box as is of full column rank, which may result in fast convergence of Krylov subspace acceleration. Besides, from the proof of Theorem 5.1, it can be seen that when and , it holds that or ; when , as for . This implies that the MGSSP preconditioned matrix with proper parameters and has much denser spectrum distribution compared with the saddle point matrix . This means that when the MGSSP preconditioner is applied for the GMRES method, the rate of convergence (semi-convergence) can be improved considerably. This fact is further confirmed by the numerical results presented in Tables 2-4 and Tables 6-8 of Section 6. What is more, since
[TABLE]
then from (32), we have that
[TABLE]
Then all eigenvalues of are located in a circle centered at with radius .**
Owing to the fact the convergence of Krylov subspace methods is not only dependent on the eigenvalue distribution of the preconditioned matrix, but also on the corresponding eigenvectors of the preconditioned matrix [1, 4] except for the case that the preconditioned matrix is symmetric. We next discuss the eigenvector distribution of in the following theorem.
Theorem 5.2**.**
*Let the MGSSP preconditioner be defined as in (4). If is of full column rank and , then the preconditioned matrix has linearly independent eigenvectors, and if is of full column rank and , then the preconditioned matrix has linearly independent eigenvectors. If is rank deficient and , then the preconditioned matrix has linearly independent eigenvectors, and if is rank deficient and , then the preconditioned matrix has linearly independent eigenvectors. There are
1) eigenvectors of the form \left(\begin{array}[]{c}u_{l}\\ 0\\ \end{array}\right) that correspond to the eigenvalue as , where are arbitrary linearly independent vectors;
2) If is of full column rank and , eigenvectors of the form \left(\begin{array}[]{c}u_{l}^{1}\\ \frac{(2\lambda-1)B^{T}u_{l}^{1}}{\lambda\beta}\\ \end{array}\right) that correspond to the eigenvalues , where satisfy .
3) If is rank deficient and , eigenvectors of the form \left(\begin{array}[]{c}u_{l}^{1}\\ \frac{(2\lambda-1)B^{T}u_{l}^{1}}{\lambda\beta}\\ \end{array}\right) that correspond to the eigenvalues , where satisfy ; and eigenvectors \left(\begin{array}[]{c}0\\ v_{l}^{2}\\ \end{array}\right) that correspond to the eigenvalue [math], where satisfy .*
Proof. Let be an eigenvalue of the preconditioned matrix and \left(\begin{array}[]{c}u\\ v\\ \end{array}\right) be the corresponding eigenvector. To investigate the eigenvector distribution, we consider Equation (24) as follows:
[TABLE]
Firstly, we consider has full column rank. If , then it follows from the second equation of (33) that and therefore , which contradicts to the assumption that is an eigenvector. Hence . If , then from (33) we can easily get and . If , then Equation (33) is always true for the case of . Hence, there are linearly independent eigenvectors \left(\begin{array}[]{c}u_{l}\\ 0\\ \end{array}\right) corresponding to the eigenvalue as , where are arbitrary linearly independent vectors. If , then and , a contradiction.
Next, we consider the case . It follows from the second equation of (33) that . Substituting this relation into the first equation of (33) results in
[TABLE]
If there exists which satisfies (34), there will be linearly independent eigenvectors \left(\begin{array}[]{c}u_{l}^{1}\\ v_{l}^{1}\\ \end{array}\right) corresponding to the eigenvalues . Here, satisfy and the forms of are
[TABLE]
If is rank deficient, then is an eigenvalue of . If , then from (33), it holds that and , which lead to and therefore is due to the fact that is positive definite, then we have and . Recalling that is rank deficient, then there exists which satisfies , hence there will be linearly independent eigenvectors \left(\begin{array}[]{c}0\\ v_{l}^{2}\\ \end{array}\right) corresponding to the eigenvalue [math], where satisfy . With a quite similar strategy utilized in the case that has full column rank, we also can obtain the eigenvectors that correspond to and are the same as those for the case that is of full column rank.
Now, we show that the eigenvectors are linearly independent when is of full column rank and . Let and be two vectors with . Then, we need to show that
[TABLE]
holds if and only if the vectors and both are zero vectors. Recall that in (35) the first matrix arises from the case in 1), and the second matrix from the case in 2). Multiplying both sides of (35) from left with leads to
[TABLE]
Then, by subtracting (35) from (36), it holds that
[TABLE]
Since the eigenvalues and \left(\begin{array}[]{c}u_{l}^{1}\\ v_{l}^{1}\\ \end{array}\right) are linearly independent, we infer that . Because of the linear independence of , it follows that . Therefore, the eigenvectors are linearly independent.
In the sequel, we verify the eigenvectors are linearly independent when is rank deficient and . Let , and be three vectors with and , and
[TABLE]
It is necessary for us to prove that (37) holds if and only if the vectors , and are all zero vectors, where the first matrix consists of the eigenvectors corresponding to the eigenvalue for the case 1), and the second and the third matrices consist of those for the case 3). Premultiplying (37) with and going through the same algebraic operations as before, we also obtain
[TABLE]
Inasmuch as and are linearly independent, it holds that . Then it has
[TABLE]
As the vectors are also linearly independent, we have . Thus, (37) becomes to
[TABLE]
Since are linearly independent, we have . As a result, it holds that the eigenvectors are linearly independent.
Finally, we prove the eigenvectors are linearly independent when is rank deficient and . Let and be two vectors with . Then, we need to show that
[TABLE]
holds if and only if the vectors and both are zero vectors. Since are linearly independent, we infer that . Because of the linear independence of , it follows that . Consequently, the above eigenvectors are linearly independent.
6 Numerical experiments
In this section, two numerical examples are used to verify the performance of the MGSSP iteration method and the MGSSP preconditioned GMRES method. In the meanwhile, we compare the MGSSP iteration method with the GSS and GMSS methods, and also compare the MGSSP preconditioner with the SS, GSS, MSS and GMSS preconditioners for the GMRES method according to the number of iterations (denoted by “IT”) and the elapsed CPU times (denoted by “CPU”). All codes are run in MATLAB (version R2016a) and all experiments are performed on an Intel(R) Pentium(R) CPU G3240T 2.70 GHz, 4.0GB memory and XP operating system. In our implementations, the linear systems , and involved in the SS, GSS and MGSSP iteration, respectively are solved exactly by the the LU factorization. In addition, the linear systems and contained in the MSS and the GMSS iteration are solved exactly by the Cholesky factorization.
In our numerical experiments, we choose the right-hand side vector so that the exact solution of the saddle point problem (1) is . All experiments are started from the initial vector , terminated once the current iterate satisfies
[TABLE]
and we use “–” to indicate that the corresponding iteration method does not satisfy the prescribed stopping criterion until iteration steps.
Example 6.1**.**
Consider the nonsymmetric nonsingular saddle point problem structured as (1) with the following coefficient sub-matrices [36]:
[TABLE]
where denotes the Kronecker product symbol and is the discretization mesh size.
In Table 1, we list the parameters involved in the tested methods which are chosen to be the experimentally found optimal ones that minimize the total number of iteration steps for those methods, as well as the numerical results of the GSS, GMSS and MGSSP iteration methods when with respect to different grids , and . Moreover, numerical results of the GMRES method and the preconditioned GMRES methods incorporated with the SS, GSS, MSS, GMSS and the MGSSP preconditioners are listed in Tables 2-4 for , and on different uniform grids, respectively.
In order to better understand the numerical results in Table 1, convergence history of the GSS, GMSS and MGSSP iteration methods with experimental optimal parameters are depicted in Figure 1. To further confirm the effectiveness of the MGSSP preconditioned GMRES method, we plot the IT of the three preconditioned GMRES methods with parameters from 0.1 to 10 with step size 0.1 in Figure 2. For more investigations, the eigenvalue distributions of the original matrix and the five preconditioned matrices with and for and are displayed in Figure 3.
Looking into Tables 1-4 and Figures 1-3 one may make the following observations.
- •
From Table 1, it can be observed that the experimental optimal parameters of the MGSSP iteration method are more stable compared with those of other two methods. Besides, the results in Table 1 imply that the MGSSP iteration method is superior to the other two methods from the point view of the IT and CPU times, and the IT of the MGSSP iteration method remains constant under the experimental optimal parameters with the increasing of the problem size.
- •
By comparing the results in Tables 2-4, it can be seen that without preconditioning, the GMRES method converges very slow even invalid within iteration steps for larger linear systems. All aforementioned preconditioners can largely accelerate the convergence rate of the GMRES method. The proposed MGSSP preconditioned GMRES method performs better than other five preconditioned GMRES methods as it requires less IT and CPU times. Another observation which can be pointed out is that, the convergence behavior of the MGSSP preconditioned GMRES method is not sensitive to , in the sense the iterations barely change.
- •
Figure 1 indicates that the three tested methods converge while the MGSSP iteration method returns better numerical results than the GSS and the GMSS iteration methods. From Figure 2, as we expected for Example 6.1, we see that the MGSSP preconditioned GMRES method outperforms the other two methods with the changing of , and show that our proposed preconditioner is more effective and practical for solving the nonsymmetric nonsingular saddle point problems, in comparison with the other preconditioners. Additionally, as seen from Figure 3, the preconditioned matrix has more clustered eigenvalues than the other ones. This means that the MGSSP preconditioner outperforms the other five preconditioners for the GMRES method, which is congruous with the results of Table 2.
Example 6.2**.**
Consider the nonsymmetric singular saddle point problem structured as (1) with the following coefficient sub-matrices [40]:
[TABLE]
where
[TABLE]
Here denotes the Kronecker product and is the discretization meshsize. The iterations of all tested methods are terminated once the current iterate satisfies (38) or the maximum prescribed number of iterations is exceeded.
Table 5 reports the iteration counts, CPU times and relative residual (RES) of the tested iteration methods with respect to different values of the problem size for . We adopt the parameters of the tested methods to be the experimentally found optimal ones. From Table 5, we observe that although all tested methods succeed in producing approximate solutions in all cases, the MGSSP iteration method outperforms other two methods in terms of the IT and CPU times, and the advantage of the MGSSP iteration method becomes more pronounced as the system size increases.
With respect to different sizes of the coefficient matrix, we list the numerical results of the SS, GSS, MSS, GMSS and MGSSP preconditioned GMRES methods with different values of (, and ) in Tables 6-8, respectively. From Tables 6-8, we can conclude some observations as follows. Firstly, the GMRES method does not converge when and becomes large. Secondly, the five preconditioners can improve the convergence behavior of the GMRES method, but the MGSSP preconditioned GMRES method returns better numerical results than the other preconditioned GMRES methods in terms of IT and CPU time. Lastly, the MSS and GMSS preconditioned GMRES methods have worse convergence behaviors as becomes small.
The graphs of RES(log10) against number of iterations of in Table 5 for three different sizes are displayed in Figure 4. As observed in Figure 4, the MGSSP iteration method leads to much better performance than the GSS and the GMSS iteration methods. It is worthy noting that the IT of the GSS and the GMSS iteration methods increase when becomes large, but this is not true for the MGSSP iteration method.
In order to compare effects of the GSS, GMSS, and the MGSSP preconditioned GMRES methods in terms of the parameters and , we test these methods with and plot the IT of the three preconditioned GMRES methods with from 0.1 to 10 with step size 0.1 in Figure 5. From Figure 5, we can obtain the same results as those of Figure 2.
In order to better investigate the performance of the tested preconditioned GMRES methods, Figure 6 depicts the eigenvalue distributions of the saddle point matrix , the SS, GSS, MSS, GMSS and MGSSP preconditioned matrices with and . These subfigures clearly show that the preconditioned matrices have more tightly clustered eigenvalues than the original matrix. Moreover, the eigenvalues of the MGSSP preconditioned matrix are much tighter than the other ones. These observations imply that the MGSSP preconditioned GMRES method has better numerical performance than other preconditioned GMRES methods and it can act as an efficient preconditioner for solving the nonsymmetric singular saddle point problem by the preconditioned GMRES method.
7 Conclusions
For nonsymmetric saddle point problems, by combining the GSS and MSSP of a matrix, we establish a modified generalized shift-splitting (MGSSP) iteration method and the corresponding preconditioner called the MGSSP preconditioner in this paper. The unconditional convergence and semi-convergence of the MGSSP iteration method for solving nonsingular and singular saddle point problems, respectively are discussed. Moreover, eigenproperties of the preconditioned matrix are described. Numerical results given in Section 6 illustrate that the efficiency of the MGSSP iteration method and the MGSSP preconditioner for saddle point problems with nonsymmetric positive definite (1,1) parts, and confirm that they outperform some existing ones.
We should point out that the MGSSP preconditioner may not have the optimality property, i.e., the iteration counts depend on the parameters and (see Figures 2 and 5). Besides, admittedly, the choices of the optimal parameters of the MGSSP iteration method and the MGSSP preconditioned GMRES method is a challenging problem that deserves further study. For most iterative methods, this work is very complicated. Nevertheless, by adopting certain approximation strategies, there have been practically useful formula for obtaining nearly optimal iteration parameters; see [38, 22, 31]. To further investigations, we would like to study how to further improve the MGSSP preconditioner and choose the optimal parameters for the MGSSP iteration method.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Z.-Z. Bai, Sharp error bounds of some Krylov subspace methods for non-Hermitian linear systems, Appl. Math. Comput. 109 (2000), 273-285.
- 2[2] Z.-Z. Bai, Structured preconditioners for nonsingular matrices of block two-by-two structures, Math. Comp. 75 (2006), 791-815.
- 3[3] Z.-Z. Bai, On semi-convergence of Hermitian and skew-Hermitian splitting methods for singular linear systems, Computing 89 (2010), 171-197.
- 4[4] Z.-Z. Bai, Motivations and realizations of Krylov subspace methods for large sparse linear systems, J. Comput. Appl. Math. 283 (2015), 71-78.
- 5[5] Z.-Z. Bai and G. H. Golub, Accelerated Hermitian and skew-Hermitian splitting iteration methods for saddle-point problems, IMA J. Numer. Anal. 27 (2007), 1-23.
- 6[6] Z.-Z. Bai, G. H. Golub, L.-Z. Lu and J.-F. Yin, Block triangular and skew-Hermitian splitting methods for positive-definite linear systems, SIAM J. Sci. Comput. 26 (2005), 844-863.
- 7[7] Z.-Z. Bai, G. H. Golub and M. K. Ng, Hermitian and skew-Hermitian splitting methods for non-Hermitian positive definite linear systems, SIAM J. Matrix Anal. Appl. 24 (2003), 603-626.
- 8[8] Z.-Z. Bai, G. H. Golub and M. K. Ng, On inexact Hermitian and skew-Hermitian splitting methods for non-Hermitian positive definite linear systems, Linear Algebra Appl. 428 (2008), 413-440.
