A modified generalized shift-splitting preconditioner for nonsymmetric   saddle point problems

Zhengge Huang; Ligong Wang; Zhong Xu; Jingjing Cui

arXiv:1701.04157·math.NA·January 17, 2017·Numer. Algorithms

A modified generalized shift-splitting preconditioner for nonsymmetric saddle point problems

Zhengge Huang, Ligong Wang, Zhong Xu, Jingjing Cui

PDF

Open Access

TL;DR

This paper introduces a modified generalized shift-splitting preconditioner and iteration method for nonsymmetric saddle point problems, proving their convergence and demonstrating superior numerical performance over existing methods.

Contribution

The paper develops a new MGSSP preconditioner and iteration method, extending previous work, with proven convergence and improved efficiency for nonsymmetric saddle point problems.

Findings

01

MGSSP iteration is unconditionally convergent and semi-convergent.

02

Numerical results show MGSSP outperforms existing methods.

03

MGSSP preconditioner is more effective than other preconditioners.

Abstract

For the nonsymmetric saddle point problems with nonsymmetric positive definite (1,1) parts, the modified generalized shift-splitting (MGSSP) preconditioner as well as the MGSSP iteration method are derived in this paper, which generalize the MSSP preconditioner and the MSSP iteration method newly developed by Huang and Su (J. Comput. Appl. Math. 2017), respectively. The convergent and semi-convergent analysis of the MGSSP iteration method are presented, and we prove that this method is unconditionally convergent and semi-convergent. In addition, some spectral properties of the preconditioned matrix are carefully analyzed. Numerical results demonstrate the robustness and effectiveness of the MGSSP preconditioner and the MGSSP iteration method, and also illustrate that the MGSSP iteration method outperforms the GSS and GMSS iteration methods, and the MGSSP preconditioner is superior to…

Tables8

Table 1. Table 1: Numerical results for the three iteration methods with v = 0.1 𝑣 0.1 v=0.1 .

Method		$p$
		16	32	64
	$α_{e x p}$	20	51	125
	$β_{e x p}$	2.7	5	1.5
GSS	IT	58	72	102
	CPU	0.2556	1.1583	21.2715
	RES	8.79e-07	8.68e-07	9.80e-07
	$α_{e x p}$	22	36	38
	$β_{e x p}$	16	8.3	5.9
GMSS	IT	66	73	89
	CPU	0.4955	1.5732	26.4811
	RES	8.45e-07	9.09e-07	9.50e-07
	$α_{e x p}$	0.2	0.5	0.2
	$β_{e x p}$	0.1	0.1	0.1
MGSSP	IT	21	21	21
	CPU	0.1514	0.7012	10.1562
	RES	9.88e-07	9.85e-07	9.57e-07

Table 2. Table 2: Numerical results for the six preconditioned GMRES methods with v = 1 𝑣 1 v=1 , α = 0.6 𝛼 0.6 \alpha=0.6 and β = 0.8 𝛽 0.8 \beta=0.8 .

$p$		$I$	$𝒫_{S S}$	$𝒫_{G S S}$	$𝒫_{M S S}$	$𝒫_{G M S S}$	$𝒫_{M G S S P}$
16	IT	121	9	9	15	13	7
	CPU	0.1550	0.0447	0.1505	0.1838	0.1705	0.0837
	RES	7.21e-07	5.61e-07	3.29e-07	3.29e-07	6.69e-07	3.78e-07
32	IT	264	10	9	15	14	7
	CPU	3.8574	0.5004	0.4703	0.8234	0.7600	0.3831
	RES	9.74e-07	2.54e-07	7.25e-07	7.63e-07	9.24e-07	9.67e-07
48	IT	429	10	10	16	15	8
	CPU	24.7021	3.7594	3.5617	6.2255	5.7171	2.8951
	RES	9.95e-07	6.29e-07	1.77e-07	6.33e-07	5.21e-07	1.78e-07
64	IT	–	11	10	16	15	8
	CPU	–	22.5881	21.3997	33.6562	31.2381	16.2309
	RES	–	3.75e-07	2.58e-07	8.29e-07	8.93e-07	2.50e-07

Table 3. Table 3: Numerical results for the six preconditioned GMRES methods with v = 0.1 𝑣 0.1 v=0.1 , α = 1 𝛼 1 \alpha=1 and β = 0.8 𝛽 0.8 \beta=0.8 .

$p$		$I$	$𝒫_{S S}$	$𝒫_{G S S}$	$𝒫_{M S S}$	$𝒫_{G M S S}$	$𝒫_{M G S S P}$
16	IT	115	8	8	17	17	6
	CPU	0.1326	0.0982	0.0850	0.2155	0.2371	0.0783
	RES	9.50e-07	4.54e-07	1.56e-07	5.89e-07	4.29e-07	5.96e-07
32	IT	240	9	8	17	17	7
	CPU	3.4868	0.4959	0.4568	0.8974	0.8876	0.4244
	RES	9.34e-07	2.10e-07	6.49e-07	8.09e-07	4.93e-07	7.16e-07
48	IT	367	9	9	18	17	7
	CPU	20.4798	3.3953	3.3951	6.8142	6.4413	2.7642
	RES	9.80e-07	4.38e-07	1.36e-07	3.97e-07	6.37e-07	1.40e-07
64	IT	495	9	9	18	17	7
	CPU	81.8770	18.4334	18.5499	37.0634	35.4719	15.1190
	RES	9.73e-07	6.88e-07	2.15e-07	4.82e-07	7.42e-07	2.18e-07

Table 4. Table 4: Numerical results for the six preconditioned GMRES methods with v = 0.01 𝑣 0.01 v=0.01 , α = 1.2 𝛼 1.2 \alpha=1.2 and β = 1.5 𝛽 1.5 \beta=1.5 .

$p$		$I$	$𝒫_{S S}$	$𝒫_{G S S}$	$𝒫_{M S S}$	$𝒫_{G M S S}$	$𝒫_{M G S S P}$
16	IT	246	9	10	51	54	7
	CPU	0.3743	0.1904	0.1345	0.5309	0.8654	0.1071
	RES	9.65e-07	8.26e-07	3.04e-07	9.10e-07	7.90e-07	8.61e-07
32	IT	429	9	10	55	56	7
	CPU	7.2934	0.4691	0.5084	2.6743	3.2658	0.4067
	RES	9.88e-07	8.40e-07	3.19e-07	8.57e-07	8.40e-07	8.72e-07
48	IT	–	9	10	57	58	7
	CPU	–	3.5469	3.8431	21.1369	23.6098	2.7461
	RES	–	8.61e-07	3.21e-07	9.91e-07	9.05e-07	8.50e-07
64	IT	–	9	10	57	57	7
	CPU	–	18.9807	20.9074	113.9468	117.1580	15.2981
	RES	–	7.78e-07	2.58e-07	9.56e-07	9.58e-07	7.56e-08

Table 5. Table 5: Numerical results for the three iteration methods with v = 0.1 𝑣 0.1 v=0.1 .

Method		$p$
		16	32	64
	$α_{e x p}$	13	29	66
	$β_{e x p}$	39	53	60
GSS	IT	85	136	230
	CPU	0.3211	2.1355	47.4292
	RES	9.48e-07	9.83e-07	9.73e-07
	$α_{e x p}$	16	18	24
	$β_{e x p}$	75	134.4	240
GMSS	IT	143	213	337
	CPU	0.8641	4.5574	100.4047
	RES	9.86e-07	9.90e-07	9.98e-07
	$α_{e x p}$	0.02	0.01	0.05
	$β_{e x p}$	0.1	0.05	0.1
MGSSP	IT	21	21	21
	CPU	0.0990	0.6706	11.0860
	RES	9.53e-07	9.54e-07	9.54e-07

Table 6. Table 6: Numerical results for the six preconditioned GMRES methods with v = 1 𝑣 1 v=1 , α = 0.6 𝛼 0.6 \alpha=0.6 and β = 0.8 𝛽 0.8 \beta=0.8 .

$p$		$I$	$𝒫_{S S}$	$𝒫_{G S S}$	$𝒫_{M S S}$	$𝒫_{G M S S}$	$𝒫_{M G S S P}$
16	IT	145	9	8	15	13	6
	CPU	0.2146	0.0447	0.0968	0.1838	0.1705	0.0813
	RES	7.95e-07	5.61e-07	2.16e-07	3.29e-07	6.69e-07	8.81e-07
32	IT	278	10	9	15	14	7
	CPU	4.1297	0.5004	0.4388	0.8234	0.7600	0.4251
	RES	9.79e-07	2.54e-07	1.46e-07	7.63e-07	9.24e-07	2.09e-07
48	IT	366	10	9	16	15	7
	CPU	20.2558	3.7594	3.4238	6.2255	5.7171	2.6904
	RES	9.71e-07	6.29e-07	4.26e-07	6.33e-07	5.21e-07	5.66e-07
64	IT	465	11	9	16	15	8
	CPU	76.1434	22.5881	18.1854	33.6562	31.2381	16.5291
	RES	9.71e-07	3.75e-07	8.37e-07	8.29e-07	8.93e-07	9.27e-08

Table 7. Table 7: Numerical results for the six preconditioned GMRES methods with v = 0.1 𝑣 0.1 v=0.1 , α = 1.8 𝛼 1.8 \alpha=1.8 and β = 1.5 𝛽 1.5 \beta=1.5 .

$p$		$I$	$𝒫_{S S}$	$𝒫_{G S S}$	$𝒫_{M S S}$	$𝒫_{G M S S}$	$𝒫_{M G S S P}$
16	IT	122	9	9	19	19	7
	CPU	0.1422	0.1298	0.1276	0.3788	0.3205	0.1223
	RES	8.71e-07	5.68e-07	2.67e-07	7.02e-07	5.33e-07	1.62e-07
32	IT	237	10	9	19	19	7
	CPU	3.3291	0.6218	0.5728	1.1392	1.1254	0.4491
	RES	9.87e-07	2.31e-07	5.53e-07	6.67e-07	5.62e-07	2.83e-07
48	IT	350	10	9	19	19	7
	CPU	19.3841	4.1896	3.8573	7.9254	8.3686	3.1636
	RES	9.99e-07	3.20e-07	7.60e-07	7.45e-07	5.74e-07	3.81e-07
64	IT	461	10	9	19	19	7
	CPU	76.1620	23.0578	20.8402	44.0056	42.1010	17.2456
	RES	9.82e-07	3.91e-07	9.15e-07	8.17e-07	6.00e-07	4.58e-07

Table 8. Table 8: Numerical results for the six preconditioned GMRES methods with v = 0.01 𝑣 0.01 v=0.01 , α = 1.85 𝛼 1.85 \alpha=1.85 and β = 1.75 𝛽 1.75 \beta=1.75 .

$p$		$I$	$𝒫_{S S}$	$𝒫_{G S S}$	$𝒫_{M S S}$	$𝒫_{G M S S}$	$𝒫_{M G S S P}$
16	IT	250	10	10	59	59	7
	CPU	0.4184	0.1320	0.1333	0.8935	0.9098	0.1191
	RES	9.42e-07	5.70e-07	4.30e-07	8.22e-07	8.03e-07	8.47e-07
32	IT	419	10	10	60	60	7
	CPU	7.0626	0.6292	0.6296	3.3032	3.3277	0.4901
	RES	9.85e-07	5.31e-07	3.93e-07	9.43e-07	9.14e-07	8.11e-07
48	IT	–	10	10	60	60	7
	CPU	–	4.1788	4.3580	23.9170	23.6870	3.0734
	RES	–	5.75e-07	4.28e-07	9.46e-07	9.15e-07	8.72e-07
64	IT	–	10	10	60	60	7
	CPU	–	23.6671	24.2199	130.4922	130.1184	16.9088
	RES	–	6.43e-07	4.83e-07	9.40e-07	9.07e-07	9.34e-07

Equations187

\displaystyle\mathcal{A}u=\left(\begin{array}[]{cc}A&B\\ -B^{T}&0\\ \end{array}\right)\left(\begin{array}[]{c}x\\ y\\ \end{array}\right)=\left(\begin{array}[]{c}f\\ -g\\ \end{array}\right)\equiv{b},

\displaystyle\mathcal{A}u=\left(\begin{array}[]{cc}A&B\\ -B^{T}&0\\ \end{array}\right)\left(\begin{array}[]{c}x\\ y\\ \end{array}\right)=\left(\begin{array}[]{c}f\\ -g\\ \end{array}\right)\equiv{b},

\displaystyle\mathcal{P}_{SS}=\frac{1}{2}\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\alpha I\\ \end{array}\right)

\displaystyle\mathcal{P}_{SS}=\frac{1}{2}\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\alpha I\\ \end{array}\right)

\displaystyle\mathcal{P}_{GSS}=\frac{1}{2}\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right),

\displaystyle\mathcal{P}_{GSS}=\frac{1}{2}\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right),

\displaystyle\mathcal{P}_{MSS}=\frac{1}{2}\left(\begin{array}[]{cc}\alpha I+2H&B\\ -B^{T}&\alpha I\\ \end{array}\right)

\displaystyle\mathcal{P}_{MSS}=\frac{1}{2}\left(\begin{array}[]{cc}\alpha I+2H&B\\ -B^{T}&\alpha I\\ \end{array}\right)

\displaystyle\mathcal{P}_{MSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\alpha I\\ \end{array}\right)

\displaystyle\mathcal{P}_{MSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\alpha I\\ \end{array}\right)

\displaystyle\mathcal{A}=\mathcal{P}_{MSSP}-\mathcal{Q}_{MSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\alpha I\\ \end{array}\right)-\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\alpha I\\ \end{array}\right).

\displaystyle\mathcal{A}=\mathcal{P}_{MSSP}-\mathcal{Q}_{MSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\alpha I\\ \end{array}\right)-\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\alpha I\\ \end{array}\right).

\displaystyle\mathcal{A}=\mathcal{P}_{MGSSP}-\mathcal{Q}_{MGSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)-\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right),

\displaystyle\mathcal{A}=\mathcal{P}_{MGSSP}-\mathcal{Q}_{MGSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)-\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right),

\displaystyle\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}x^{(k+1)}\\ y^{(k+1)}\\ \end{array}\right)=\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}x^{(k)}\\ y^{(k)}\\ \end{array}\right)+\left(\begin{array}[]{c}f\\ -g\\ \end{array}\right).

\displaystyle\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}x^{(k+1)}\\ y^{(k+1)}\\ \end{array}\right)=\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}x^{(k)}\\ y^{(k)}\\ \end{array}\right)+\left(\begin{array}[]{c}f\\ -g\\ \end{array}\right).

\displaystyle\left(\begin{array}[]{c}x^{(k+1)}\\ y^{(k+1)}\\ \end{array}\right)=\mathcal{T}(\alpha,\beta)\left(\begin{array}[]{c}x^{(k)}\\ y^{(k)}\\ \end{array}\right)+\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)^{-1}\left(\begin{array}[]{c}f\\ -g\\ \end{array}\right),

\displaystyle\left(\begin{array}[]{c}x^{(k+1)}\\ y^{(k+1)}\\ \end{array}\right)=\mathcal{T}(\alpha,\beta)\left(\begin{array}[]{c}x^{(k)}\\ y^{(k)}\\ \end{array}\right)+\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)^{-1}\left(\begin{array}[]{c}f\\ -g\\ \end{array}\right),

\displaystyle\mathcal{T}(\alpha,\beta)=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)^{-1}\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right)

\displaystyle\mathcal{T}(\alpha,\beta)=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)^{-1}\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right)

\displaystyle\mathcal{P}_{MGSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right),

\displaystyle\mathcal{P}_{MGSSP}=\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right),

\displaystyle\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)z=r,

\displaystyle\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)z=r,

\displaystyle\mathcal{P}_{MGSSP}=\left(\begin{array}[]{cc}I&\frac{2}{\beta}B\\ 0&I\\ \end{array}\right)\left(\begin{array}[]{cc}\alpha I+2A+\frac{4}{\beta}BB^{T}&0\\ 0&\beta I\\ \end{array}\right)\left(\begin{array}[]{cc}I&0\\ -\frac{2}{\beta}B^{T}&I\\ \end{array}\right).

\displaystyle\mathcal{P}_{MGSSP}=\left(\begin{array}[]{cc}I&\frac{2}{\beta}B\\ 0&I\\ \end{array}\right)\left(\begin{array}[]{cc}\alpha I+2A+\frac{4}{\beta}BB^{T}&0\\ 0&\beta I\\ \end{array}\right)\left(\begin{array}[]{cc}I&0\\ -\frac{2}{\beta}B^{T}&I\\ \end{array}\right).

\displaystyle\left(\begin{array}[]{c}z_{1}\\ z_{2}\\ \end{array}\right)=\left(\begin{array}[]{cc}I&0\\ \frac{2}{\beta}B^{T}&I\\ \end{array}\right)\left(\begin{array}[]{cc}\alpha I+2A+\frac{4}{\beta}BB^{T}&0\\ 0&\beta I\\ \end{array}\right)^{-1}\left(\begin{array}[]{cc}I&-\frac{2}{\beta}B\\ 0&I\\ \end{array}\right)\left(\begin{array}[]{c}r_{1}\\ r_{2}\\ \end{array}\right).

\displaystyle\left(\begin{array}[]{c}z_{1}\\ z_{2}\\ \end{array}\right)=\left(\begin{array}[]{cc}I&0\\ \frac{2}{\beta}B^{T}&I\\ \end{array}\right)\left(\begin{array}[]{cc}\alpha I+2A+\frac{4}{\beta}BB^{T}&0\\ 0&\beta I\\ \end{array}\right)^{-1}\left(\begin{array}[]{cc}I&-\frac{2}{\beta}B\\ 0&I\\ \end{array}\right)\left(\begin{array}[]{c}r_{1}\\ r_{2}\\ \end{array}\right).

\displaystyle\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}u\\ v\\ \end{array}\right)=\lambda\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}u\\ v\\ \end{array}\right).

\displaystyle\left(\begin{array}[]{cc}\alpha I+A&B\\ -B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}u\\ v\\ \end{array}\right)=\lambda\left(\begin{array}[]{cc}\alpha I+2A&2B\\ -2B^{T}&\beta I\\ \end{array}\right)\left(\begin{array}[]{c}u\\ v\\ \end{array}\right).

{(α I + A) u + B v = λ (α I + 2 A) u + 2 λ B v, - B^{T} u + β v = - 2 λ B^{T} u + λ β v .

{(α I + A) u + B v = λ (α I + 2 A) u + 2 λ B v, - B^{T} u + β v = - 2 λ B^{T} u + λ β v .

(α I + A) u = λ (α I + 2 A) u .

(α I + A) u = λ (α I + 2 A) u .

λ = \frac{( α + a ) + ib}{( α + 2 a ) + 2 ib},

λ = \frac{( α + a ) + ib}{( α + 2 a ) + 2 ib},

∣ λ ∣ = \frac{( α + a ) ^{2} + b ^{2}}{( α + 2 a ) ^{2} + 4 b ^{2}} < 1.

∣ λ ∣ = \frac{( α + a ) ^{2} + b ^{2}}{( α + 2 a ) ^{2} + 4 b ^{2}} < 1.

v = \frac{( 2 λ - 1 ) B ^{T} u}{( λ - 1 ) β},

v = \frac{( 2 λ - 1 ) B ^{T} u}{( λ - 1 ) β},

λ^{2} (α β I + 2 β A + 4 B B^{T}) u - λ (2 α β I + 3 β A + 4 B B^{T}) u + (α β I + β A + B B^{T}) u = 0.

λ^{2} (α β I + 2 β A + 4 B B^{T}) u - λ (2 α β I + 3 β A + 4 B B^{T}) u + (α β I + β A + B B^{T}) u = 0.

a + ib = \frac{u ^{*} A u}{u ^{*} u}, c = \frac{u ^{*} B B ^{T} u}{u ^{*} u} \geq 0 .

a + ib = \frac{u ^{*} A u}{u ^{*} u}, c = \frac{u ^{*} B B ^{T} u}{u ^{*} u} \geq 0 .

λ^{2} (α β + 2 β a + 4 c + 2 β bi) - λ (2 α β + 3 β a + 4 c + 3 β bi) + (α β + β a + c + β bi) = 0.

λ^{2} (α β + 2 β a + 4 c + 2 β bi) - λ (2 α β + 3 β a + 4 c + 3 β bi) + (α β + β a + c + β bi) = 0.

ϕ = \frac{2 α β + 3 β a + 4 c + 3 β bi}{α β + 2 β a + 4 c + 2 β bi}, ψ = \frac{α β + β a + c + β bi}{α β + 2 β a + 4 c + 2 β bi} .

ϕ = \frac{2 α β + 3 β a + 4 c + 3 β bi}{α β + 2 β a + 4 c + 2 β bi}, ψ = \frac{α β + β a + c + β bi}{α β + 2 β a + 4 c + 2 β bi} .

λ^{2} - λ \frac{2 α + 3 a + 3 bi}{α + 2 a + 2 bi} + \frac{α + a + bi}{α + 2 a + 2 bi} = 0.

λ^{2} - λ \frac{2 α + 3 a + 3 bi}{α + 2 a + 2 bi} + \frac{α + a + bi}{α + 2 a + 2 bi} = 0.

λ = 1 or λ = \frac{α + a + bi}{α + 2 a + 2 bi} .

λ = 1 or λ = \frac{α + a + bi}{α + 2 a + 2 bi} .

∣ λ ∣ = \frac{α + a + bi}{α + 2 a + 2 bi} = \frac{( α + a ) ^{2} + b ^{2}}{( α + 2 a ) ^{2} + 4 b ^{2}} < 1.

∣ λ ∣ = \frac{α + a + bi}{α + 2 a + 2 bi} = \frac{( α + a ) ^{2} + b ^{2}}{( α + 2 a ) ^{2} + 4 b ^{2}} < 1.

ϕ - \overset{ˉ}{ϕ} ψ = \frac{2 α β ^{2} a + 6 α β c + 3 β ^{2} a ^{2} + 13 β a c + 12 c ^{2} + 3 β ^{2} b ^{2} + 3 β b c i}{( α β + 2 β a + 4 c ) ^{2} + 4 β ^{2} b ^{2}}

ϕ - \overset{ˉ}{ϕ} ψ = \frac{2 α β ^{2} a + 6 α β c + 3 β ^{2} a ^{2} + 13 β a c + 12 c ^{2} + 3 β ^{2} b ^{2} + 3 β b c i}{( α β + 2 β a + 4 c ) ^{2} + 4 β ^{2} b ^{2}}

1 - ∣ ψ ∣^{2} = \frac{2 α β ^{2} a + 6 α β c + 3 β ^{2} a ^{2} + 14 β a c + 15 c ^{2} + 3 β ^{2} b ^{2}}{( α β + 2 β a + 4 c ) ^{2} + 4 β ^{2} b ^{2}} .

1 - ∣ ψ ∣^{2} = \frac{2 α β ^{2} a + 6 α β c + 3 β ^{2} a ^{2} + 14 β a c + 15 c ^{2} + 3 β ^{2} b ^{2}}{( α β + 2 β a + 4 c ) ^{2} + 4 β ^{2} b ^{2}} .

∣2 α β^{2} a + 6 α β c + 3 β^{2} a^{2} + 13 β a c + 12 c^{2} + 3 β^{2} b^{2} + 3 β b c i ∣

∣2 α β^{2} a + 6 α β c + 3 β^{2} a^{2} + 13 β a c + 12 c^{2} + 3 β^{2} b^{2} + 3 β b c i ∣

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMatrix Theory and Algorithms · Electromagnetic Scattering and Analysis · Advanced Numerical Methods in Computational Mathematics

Full text

A modified generalized shift-splitting preconditioner for nonsymmetric saddle point problems

††thanks: Supported by the National Natural Science Foundation of China (No. 11171273) and Innovation Foundation for Doctor Dissertation of Northwestern Polytechnical University (No. CX201628).

Zhengge Huang, Ligong Wang111Corresponding author. , Zhong Xu and Jingjing Cui

Department of Applied Mathematics, School of Science, Northwestern Polytechnical University,

Xi’an, Shaanxi 710072, People’s Republic of China.

E-mails: [email protected]; [email protected](or [email protected]);

[email protected]; [email protected]

Abstract

For the nonsymmetric saddle point problems with nonsymmetric positive definite (1,1) parts, the modified generalized shift-splitting (MGSSP) preconditioner as well as the MGSSP iteration method are derived in this paper, which generalize the MSSP preconditioner and the MSSP iteration method newly developed by Huang and Su (J. Comput. Appl. Math. 2017), respectively. The convergent and semi-convergent analysis of the MGSSP iteration method are presented, and we prove that this method is unconditionally convergent and semi-convergent. In addition, some spectral properties of the preconditioned matrix are carefully analyzed. Numerical results demonstrate the robustness and effectiveness of the MGSSP preconditioner and the MGSSP iteration method, and also illustrate that the MGSSP iteration method outperforms the GSS and GMSS iteration methods, and the MGSSP preconditioner is superior to the shift-splitting (SS), generalized SS (GSS), modified SS (MSS) and generalized MSS (GMSS) preconditioners for the GMRES method for solving the nonsymmetric saddle point problems.

Key Words: Nonsymmetric saddle point problem, Modified generalized shift-splitting, Convergence, Semi-convergence, Spectral properties. AMS Subject Classification (2010): 65F08, 65F10.

1 Introduction

In a wide variety of scientific and engineering applications, such as mixed finite element approximation of elliptic partial differential equations, the image reconstruction and registration, computational fluid dynamics, weighted least-squares problems, networks computer graphics, constrained optimization and son on [2, 16, 27], we need to solve the following nonsymmetric saddle point problems of the form

[TABLE]

where $A\in{\mathbb{R}^{m\times{m}}}$ is nonsymmetric positive definite, $B\in{\mathbb{R}^{m\times{n}}}$ is a rectangular matrix, $p\in\mathbb{R}^{m}$ and $q\in\mathbb{R}^{n}$ are given vectors, with $n\leq{m}$ . Here, $B^{T}$ denotes the transpose of $B$ . The system of linear equations (1) is also termed as a Karush-Kuhn-Tucker (KKT) system, or an augmented system [28, 25]. For a wider class of saddle point problems, the readers can refer to [13].

Since the matrices $A$ and $B$ are large and sparse in general, the iteration methods are often much more suitable for solving it than direct methods. When $B$ is of full column rank, a large variety of effective iterative methods based on matrix splitting as well as their numerical properties have been investigated in the literature. For example, Golub et al. [29] developed the SOR-like method, and in the sequel, Bai et al. [10, 11] extended the SOR-like method to the generalized SOR (GSOR) method and the parameterized inexact Uzawa method, respectively. For SOR-like methods established recently, see [30, 39]. Based on the Uzawa method presented by Bramble et al. and Elman and Golub in [15, 26], Bai et al. [10, 11], Dai et al. [23] and Ma and Zheng [37] employed the Uzawa-type methods and so forth in recent years. Besides, Bai et al. put forward the well-known Hermitian and skew-Hermitian splitting (HSS) methods [7] and its variants [6, 8, 9, 5]. On the basis of the shift-splitting (SS) of a non-Hermitian matrix [12], Cao et al. [17] derived the SS iteration method as well as the SS preconditioner for the nonsingular saddle point problems, and Chen and Ma [20] and Cao et al. [18] generalized the SS iteration method and obtained the generalized SS (GSS) iteration method. To increase the convergence rate of the GSS iteration method, Huang and Su [33] newly developed the modified shift-splitting (MSSP) iteration method.

If $B$ in (1) is rank deficient, then the coefficient matrix $\mathcal{A}$ in (1) is singular, and we call (1) the singular saddle point problem. Some iteration methods and preconditioning techniques for solving singular saddle point problems have been proposed in the recent literature, see, e.g., [35, 42, 41, 34]. Zheng et al. [43] proposed some sufficient conditions for the semi-convergence of the GSOR method and determined the optimal iteration parameters. Bai [3] derived some necessary and sufficient conditions to assure the semi-convergence of the HSS method. Chen et al. [21] and Cao et al. [19] investigated the generalized shift-splitting iteration method for singular saddle point problems. Very recently, Dou et al. [24] introduced the modifying the parameterized inexact Uzawa (PIU) for singular saddle point problems, and Zheng and Lu [44] proved the semi-convergence of the upper and lower triangular (ULT) splitting iteration method for singular saddle point problems.

Recently, based on the preconditioner [12] studied for a class of non-Hermtian positive definite linear systems, Cao et al. [17] presented a shift-splitting (SS) preconditioner

[TABLE]

for the saddle point problem (1), where $\alpha$ is a positive constant and $I$ is the identity matrix. The authors also proved the corresponding SS iteration method is unconditional convergent.

On the basis of the shift-splitting (SS) preconditioner [17], Chen and Ma [20] and Cao et al. [18] replaced the parameter $\alpha$ in (2,2)-block of the SS preconditioner by another parameter $\beta$ , and employed the generalized SS (GSS) preconditioner of the form

[TABLE]

where $\alpha\geq{0}$ , $\beta>0$ and $I$ is the identity matrix. It is easy to see that $\mathcal{P}_{SS}$ is a special case of $\mathcal{P}_{GSS}$ when $\alpha=\beta$ . Numerical results in [21, 20] confirmed that the GSS preconditioner is superior to the SS preconditioner.

Very recently, based on the well-known Hermitian and skew-Hermitian splitting (HSS) of the matrix $A$ : $A=H+S$ , where $H=\frac{1}{2}(A+A^{T})$ , $S=\frac{1}{2}(A-A^{T})$ , and similar to the shift-splitting [17, 12], the modified shift-splitting (MSS) preconditioner [45] was proposed for nonsymmetric saddle point problem (1), the form of $\mathcal{P}_{MSS}$ is:

[TABLE]

with $\alpha>0$ being a constant and $I$ being the identity matrix with appropriate dimension.

In the sequel, by replacing the parameter $\alpha$ in (2,2)-block in the MSS preconditioner by another parameter $\beta$ , Huang et al. [32] established the generalized MSS (GMSS) preconditioner. They discussed the corresponding GMSS iteration method is convergent and semi-convergent under proper conditions, and showed that the GMSS iteration method and the GMSS preconditioner are better than the MSS iteration method and the MSS preconditioner, respectively by numerical experiments.

In order to increase the convergence rate of the GSS method for the nonsingular saddle point problems with symmetric positive definite (1,1) parts, Huang and Su [33] newly developed the modified shift-splitting (MSSP) preconditioner of the form:

[TABLE]

with $\alpha>0$ being a constant and $I$ being the identity matrix with appropriate dimension, which derived from the following modified shift-splitting of the saddle point matrix $\mathcal{A}$ :

[TABLE]

The authors in [33] theoretically verified the corresponding MSSP iteration method is unconditional convergent and estimated the bounds of the eigenvalues of the iteration matrix of the MSSP iteration method. Numerical experiments illustrated that the MSSP preconditioner outperforms the SS and GSS preconditioners for the nonsingular saddle point problems with symmetric positive definite (1,1) parts.

To further accelerate the convergence rates of the GSS and the GMSS preconditioned GMRES methods for the saddle point problems with nonsymmetric positive definite (1,1) parts, a new preconditioner which is referred to as the modified generalized shift-splitting (MGSSP) preconditioner is developed for nonsymmetric saddle point problems in this paper. Theoretical analysis also shows that the corresponding splitting iteration method is convergent and semi-convergent unconditionally. Besides, we investigate the spectral properties of the corresponding preconditioned matrix and show that it has clustered eigenvalue distribution by choosing proper parameters. Numerical experiments are presented to confirm the effectiveness of the MGSSP iteration method and the MGSSP preconditioned GMRES method for solving the nonsymmetric saddle point problems.

The outline of this paper is organized as follows. In Section 2, we propose the MGSSP iteration method which induces the MGSSP preconditioner. The unconditional convergent and semi-convergent properties of the MGSSP iteration method will be proved in Sections 3 and 4, respectively. The spectral properties of the MGSSP preconditioned matrix are obtained correspondingly in Section 5. We examine the feasibility and effectiveness of the MGSSP iteration method and the MGSSP preconditioned GMRES method for solving the nonsymmetric nonsingular and singular saddle point problems by numerical experiments in Section 6. Finally, a brief conclusion will be given to end this work in Section 7.

Throughout this paper, $\lambda_{\min}(A)$ and $\rho(A)$ represent the minimum eigenvalue and the spectral radius of the matrix $A$ , respectively. $(.)^{*}$ denotes the conjugate transpose of either a vector or a matrix.

2 The modified generalized shift-splitting (MGSSP) preconditioner and its implementation

In this section, inspired by the ideas of [20, 18, 33], we develop a new splitting called the modified generalized shift-splitting (MGSSP) of the nonsymmetric saddle point matrix $\mathcal{A}$ by combining the generalized splitting-splitting and the modified shift-splitting of the saddle point matrix $\mathcal{A}$ as follows.

[TABLE]

where $\alpha\geq{0}$ , $\beta>0$ are two constants and $I$ is the unit matrix with appropriate dimension. Then, the splitting (2) naturally leads to the following modified generalized shift-splitting iteration method for solving the nonsymmetric saddle point problem (1):

The modified generalized shift-splitting (MGSSP) iteration method: Let $\alpha\geq{0}$ and $\beta>0$ be two given constants. Given an initial guess $(x^{(0)^{T}},y^{(0)^{T}})^{T}$ . For $k=0,1,2,\cdots$ , until $(x^{(k)^{T}},y^{(k)^{T}})^{T}$ converges, compute

[TABLE]

Hence the MGSSP iteration method can be written in the following fixed point form

[TABLE]

where

[TABLE]

is the iteration matrix.

It should be noted that any matrix splitting not only can automatically lead to a splitting iteration method, but also can naturally induce a splitting preconditioner for the Krylov subspace methods. The splitting preconditioner corresponds to the MGSSP iteration (2) is given by

[TABLE]

which is called the MGSSP preconditioner for the nonsymmetric saddle point matrix $\mathcal{A}$ .

At each step of the MGSSP iteration (3) or applying the MGSSP preconditioner $\mathcal{P}_{MGSSP}$ within a Krylov subspace method, we need to solve a linear system with $\mathcal{P}_{MGSSP}$ as the coefficient matrix. That is to say, we need to solve a linear system of the form

[TABLE]

where $z=(z_{1}^{T},z_{2}^{T})^{T}$ and $r=(r_{1}^{T},r_{2}^{T})^{T}$ with $z_{1},r_{1}\in{\mathbb{R}}^{m}$ and $z_{2},r_{2}\in{\mathbb{R}}^{n}$ . It is not difficult to check that

[TABLE]

It follows from the decomposition of $\mathcal{P}_{MGSSP}$ in (5) that

[TABLE]

Therefore, we can derive the following algorithmic version of the MGSSP iteration method.

Algorithm 2.1 For a given vector $r=(r_{1}^{T},r_{2}^{T})^{T}$ , the vector $z=(z_{1}^{T},z_{2}^{T})^{T}$ can be computed by (6) according to the following steps:

(1) compute $t_{1}=r_{1}-\frac{2}{\beta}Br_{2}$ ;

(2) solve $(\alpha I+2A+\frac{4}{\beta}BB^{T})z_{1}=t_{1}$ ;

(3) compute $z_{2}=\frac{1}{\beta}(2B^{T}z_{1}+r_{2})$ .

From Algorithm 2.1, it is known that at each iteration, a linear system with the coefficient matrix $\alpha I+2A+\frac{4}{\beta}BB^{T}$ only needs to be solved. However, it may be very costly and impractical in actual implementations because of the sparsity pattern of $\alpha I+2A+\frac{4}{\beta}BB^{T}$ . Fortunately, the matrix $\alpha I+2A+\frac{4}{\beta}BB^{T}$ is positive definite for all $\alpha\geq{0}$ and $\beta>0$ . Therefore, we can employ the Krylov subspace method, such as the GMRES method to solve the sub-linear systems with the coefficient matrix $\alpha I+2A+\frac{4}{\beta}BB^{T}$ by a prescribed accuracy. In addition, it can be solved by some direct methods, such as the sparse LU factorization. What we want to pose here is that we always use the sparse LU factorization to solve this problem in our paper.

3 Convergence of the MGSSP iteration method for nonsingular saddle point problems

The main purpose of this section is to study the convergence properties of the MGSSP iteration method by analyzing the spectral properties of the iteration matrix. Before doing this, we derive some lemmas which will be useful in the following proofs.

Lemma 3.1.

[11]* Both roots of the complex quadratic equation $x^{2}-\phi x+\psi=0$ are less than one in modulus if and only if $|\phi-\bar{\phi}\psi|+|\psi|^{2}<1$ , where $\bar{\phi}$ denotes the conjugate complex of $\phi$ .*

Lemma 3.2.

Let $A\in{\mathbb{R}}^{m\times{m}}$ be a positive definite matrix, $B\in{\mathbb{R}}^{m\times{n}}$ be of full column rank, and $\alpha\geq{0}$ and $\beta>0$ be two given constants. If $\lambda$ is an eigenvalue of the iteration matrix $\mathcal{T}(\alpha,\beta)$ , then $\lambda\neq{\pm 1}$ .

Proof. Let $\lambda$ be an eigenvalue of the iteration matrix $\mathcal{T}(\alpha,\beta)$ of the MGSSP iteration method, and $(u^{*},v^{*})^{*}\in{\mathbb{C}}^{m+n}$ be the corresponding eigenvector. Then it holds that

[TABLE]

After proper manipulations, we obtain

[TABLE]

Now we will give the proof by contradiction. If $\lambda=1$ , then from (7), it has $Au+Bv=0$ and $B^{T}u=0$ , which lead to $u=-A^{-1}Bv$ and $B^{T}A^{-1}Bv=0$ . Thus we get $Bv=0$ by the positive definiteness of $A^{-1}$ , and therefore $v=0$ and $u=-A^{-1}Bv=0$ , a contradiction. In addition, if $\lambda=-1$ , then it follows from the second equation of (7) that $v=\frac{3B^{T}u}{2\beta}$ . Substituting this relation into the first equation of (7) gives $\bar{A}u=(2\alpha I+3A+\frac{9BB^{T}}{2\beta})u=0$ , then $u=0$ is due to the fact that $\bar{A}$ is nonsingular, which yields that $v=\frac{3B^{T}u}{2\beta}=0$ , a contradiction. $\blacksquare$

Lemma 3.3.

Assume that the conditions in Lemma 3.2 are satisfied. Let $\lambda$ be an eigenvalue of the iteration matrix $\mathcal{T}(\alpha,\beta)$ of the MGSSP iteration method and $\mathbf{u}=(u^{*},v^{*})^{*}\in{\mathbb{C}}^{m+n}$ , with $u\in{\mathbb{C}}^{m}$ and $v\in{\mathbb{C}}^{n}$ , be the corresponding eigenvector. Then $u\neq 0$ . Moreover, if $v=0$ , then $|\lambda|<1$ .

Proof. If $u=0$ , then from the second equation of (7), we have $(\lambda-1)\beta v=0$ . Inasmuch as $\lambda\neq{1}$ and $\beta>0$ , we derive $v=0$ . This contradicts to the assumption that $\mathbf{u}=(u^{*},v^{*})^{*}$ is an eigenvector. Furthermore, if $v=0$ , then it follows from the first equation of (7) that

[TABLE]

Since $u\neq{0}$ , the definition $\frac{u^{*}}{u^{*}u}$ does make sense. Premultiplying (8) with $\frac{u^{*}}{u^{*}u}$ gives

[TABLE]

where $a+ib=\frac{u^{*}Au}{u^{*}u}$ . Since $A$ is positive definite, $a>0$ . It follows from (9) that

[TABLE]

Thus, we completes our proof of Lemma 3.3. $\blacksquare$

Theorem 3.1.

Assume the conditions in Lemma 3.2 are satisfied. Let $\lambda$ be an eigenvalue of the iteration matrix $\mathcal{T}(\alpha,\beta)$ of the MGSSP iteration method and $\mathbf{u}=(u^{*},v^{*})^{*}\in{\mathbb{C}}^{m+n}$ , with $u\in{\mathbb{C}}^{m}$ and $v\in{\mathbb{C}}^{n}$ , be the corresponding eigenvector. Then the MGSSP iteration method converges to the exact solution of the saddle point problem (1) for all $\alpha\geq{0}$ and $\beta>0$ .

Proof. By making use of Lemma 3.2, we have $\lambda\neq{1}$ , then from the second equation of (7), it has

[TABLE]

substituting it into the first equation of (7) results in

[TABLE]

By making use of Lemma 3.3, it holds that $u\neq{0}$ . Denote

[TABLE]

By multiplying $\frac{u^{*}}{u^{*}u}$ on (10) from the left, we have

[TABLE]

Having mind that $A$ is positive definite, we get $a>0$ and $c\geq{0}$ , which lead to $\alpha\beta+2\beta a+4c+2\beta bi\neq{0}$ by $\alpha\geq{0}$ and $\beta>0$ . Hence, (11) can be rewritten as $\lambda^{2}-\phi\lambda+\psi=0$ , where

[TABLE]

If $c=0$ , then (11) can be expressed as

[TABLE]

Solving the two roots of (12), we obtain

[TABLE]

Lemma 3.2 implies that $\lambda\neq{1}$ , then

[TABLE]

Now we turn to prove $|\lambda|<1$ under the condition $c>0$ . According to Lemma 3.1, we know that $|\lambda|<1$ if and only if $|\phi-\bar{\phi}\psi|+|\psi|^{2}<1$ . After some manipulations, we derive

[TABLE]

and

[TABLE]

Hence, $|\phi-\bar{\phi}\psi|+|\psi|^{2}<1$ is valid if and only if

[TABLE]

which is equivalent to

[TABLE]

Since $a>0$ , $c>{0}$ , $b^{2}\geq{0}$ , $\alpha\geq{0}$ and $\beta>0$ , it holds that

[TABLE]

which implies that (13) holds true, i.e., $|\phi-\bar{\phi}\psi|+|\psi|^{2}<1$ and therefore $|\lambda|<1$ . Hence, the MGSSP iteration method is convergent for any $\alpha\geq{0}$ and $\beta>0$ . This proof is completed. $\blacksquare$

4 Semi-convergence of the MGSSP iteration method for singular saddle point problems

When the saddle point matrix $\mathcal{A}$ is nonsingular, the MGSSP iteration scheme (3) converges to the exact solution of (1) for any initial vector if and only if $\rho(\mathcal{T}(\alpha,\beta))<1$ , whereas for the singular matrix $\mathcal{A}$ , we have $\rho(\mathcal{T}(\alpha,\beta))\geq 1$ . In this section, we assume that the sub-matrix $B$ in (1) is rank deficient and discuss the semi-convergence of the MGSSP iteration method for solving the singular saddle point problems.

To analyze the semi-convergent properties of the MGSSP iteration method, we present the following lemma which describes the semi-convergence property about the iteration scheme (3) when $\mathcal{A}$ is singular.

Lemma 4.1.

[14]* The iteration scheme (3) is semi-convergent if and only if the following two conditions are satisfied:

(i) $index(I-T)=1$ , or equivalently, $rank((I-T)^{2})=rank(I-T)$ , where $T=I-GM$ is the iteration matrix;

(ii) the pseudo-spectral radius of $T$ is less than $1$ , i.e.,*

[TABLE]

where $\sigma(T)$ is the spectral set of the matrix $T$ . Here, we denote the null space, the index and the rank of $A$ by $null(A)$ , $index(A)$ and $rank(A)$ , respectively.

Lemma 4.1 describes the semi-convergence property about the iteration scheme (3) when $\mathcal{A}$ is singular. Therefore, to get the semi-convergence property of the MGSSP iteration method, only the two conditions in Lemma 4.1 need to verify. We consider these two conditions in Lemmas 4.2 and 4.3, respectively.

Lemma 4.2.

Let $A$ be nonsymmetric positive definite, $B$ be rank deficient and $\alpha\geq{0},\beta>0$ be given constants. Then, the iteration matrix $\mathcal{T}(\alpha,\beta)$ of the MGSSP iteration method satisfies $index(I-\mathcal{T}(\alpha,\beta))=1$ , or equivalent

[TABLE]

where $\mathcal{T}(\alpha,\beta)$ is the iteration matrix of the MGSSP iteration method defined as in (3).

Proof. Inasmuch as $\mathcal{T}(\alpha,\beta)=\mathcal{P}_{MGSSP}^{-1}\mathcal{Q}_{MGSSP}=I-\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ , Equation (14) holds if

[TABLE]

It is easy to see that $null(\mathcal{P}_{MGSSP}^{-1}\mathcal{A})\subseteq null((\mathcal{P}_{MGSSP}^{-1}\mathcal{A})^{2})$ . Thus we only need to prove

[TABLE]

Let $x=(x_{1}^{*},x_{2}^{*})^{*}\in{\mathbb{C}^{m+n}}\in null((\mathcal{P}_{MGSSP}^{-1}\mathcal{A})^{2})$ , then it has $(\mathcal{P}_{MGSSP}^{-1}\mathcal{A})^{2}x=0$ . Denote by $y=\mathcal{P}_{MGSSP}^{-1}\mathcal{A}x$ . After suitable manipulations, we have

[TABLE]

i.e.,

[TABLE]

Since $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}y=(\mathcal{P}_{MGSSP}^{-1}\mathcal{A})^{2}x=0$ , it holds that $\mathcal{A}y=0$ , i.e.,

[TABLE]

Since $A$ is positive definite, from the first equation of (16) we can easily get $y_{1}=-A^{-1}By_{2}$ . Then substituting this relationship into the second equation of (16), we obtain $B^{T}A^{-1}By_{2}=0$ , which leads to $By_{2}=0$ . Taking $By_{2}=0$ into $y_{1}=-A^{-1}By_{2}$ , we obtain $y_{1}=0$ . Hence, the first equation of (15) becomes

[TABLE]

Substituting $y_{1}=0$ into $y_{2}$ yields $y_{2}=-\frac{1}{\beta}B^{T}x_{1}$ . Since $By_{2}=0$ , $-\frac{1}{\beta}BB^{T}x_{1}=0$ , it has $x_{1}^{*}BB^{T}x_{1}=0$ . This results in $B^{T}x_{1}=0$ , then we get $y_{2}=-\frac{1}{\beta}B^{T}x_{1}=0$ . Thus, $y=\mathcal{P}_{MGSSP}^{-1}\mathcal{A}x=0$ , i.e.,

[TABLE]

The conclusion follows by (17). $\blacksquare$

In the sequel, we show that the iteration scheme (3) satisfies the condition (ii) in Lemma 4.1. Let $B=U(B_{r},0)V^{*}$ be the singular decomposition of matrix $B$ , where

[TABLE]

with $U\in{\mathbb{C}^{m\times{m}}}$ , $V\in{\mathbb{C}^{n\times{n}}}$ being two unitary matrices and $\sigma_{i}$ $(i=1,2,\cdots,r)$ being a singular value of $B$ .

We introduce a block diagonal matrix

[TABLE]

which is a $(m+n)\times{(m+n)}$ unitary matrix, and the iteration matrix $\mathcal{T}(\alpha,\beta)$ is unitary similar to the matrix $\hat{\mathcal{T}}(\alpha,\beta)=P^{*}\mathcal{T}(\alpha,\beta)P$ . Hence, the matrix $\mathcal{T}(\alpha,\beta)$ has the same spectrum with the matrix $\hat{\mathcal{T}}(\alpha,\beta)$ . Thus we only need to analyze the pseudo-spectral radius of the matrix $\hat{\mathcal{T}}(\alpha,\beta)$ now.

Denoting $\hat{A}=U^{*}AU$ , then it holds that

[TABLE]

Then, from Equation (18), $\gamma(\hat{\mathcal{T}}(\alpha,\beta))<1$ holds if and only if $\rho(\tilde{\mathcal{T}}(\alpha,\beta))<1$ .

Note that $\tilde{\mathcal{T}}(\alpha,\beta)$ can be viewed as the iteration matrix of the MGSSP iteration method applied to the nonsymmetric nonsingular saddle point problem

[TABLE]

where $\hat{A}=U^{*}AU$ and $\hat{y},\hat{g}\in{\mathbb{R}^{r}}$ .

$\rho(\tilde{\mathcal{T}}(\alpha,\beta))<1$ implies $\gamma(\mathcal{T}(\alpha,\beta))=\gamma(\hat{\mathcal{T}}(\alpha,\beta))<1$ . By making use of the proof of Theorem 3.1, we derive the following result.

Lemma 4.3.

Let $A$ be nonsymmetric positive definite, $B$ be rank deficient and $\alpha\geq{0},\beta>0$ be two given constants. Then, the pseudo-spectral radius of the matrix $\gamma(\alpha,\beta)$ is less than 1, i.e., $\mathcal{V}(\mathcal{T}(\alpha,\beta))<1$ for all $\alpha\geq{0}$ and $\beta>0$ .

It follows from Lemmas 4.2 and 4.3 that two conditions in Lemma 4.1 are satisfied. Thus, the semi-convergence of the MGSSP iteration method for solving nonsymmetric singular saddle point problems can be obtained in the following theorem.

Theorem 4.1.

Let $A$ be nonsymmetric positive definite, $B$ be rank deficient and $\alpha\geq{0},\beta>0$ be two given constants. Then the MGSSP iteration method is semi-convergent for solving the nonsymmetric singular saddle point problem (1) for all $\alpha\geq{0}$ and $\beta>0$ .

5 Spectral analysis of the MGSSP preconditioned matrix

The MGSSP iteration method is a stationary iteration method. Although the unconditional convergence and semi-convergence properties of the MGSSP iteration method are studied in Theorem 3.1 and Theorem 4.1, respectively, the convergence (semi-convergence) rates of the MGSSP iteration method may be slow even with the optimal parameters. To accelerate the convergence (semi-convergence) rates of the MGSSP iteration method, we consider applying the preconditioning techniques. In general, the eigenvalue and eigenvector distributions of the preconditioned matrix relate closely to the convergence rates of Krylov subspace methods. Therefore, it is of significance to investigate the spectral properties of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ . In this section, some spectral properties of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ are studied.

Theorem 5.1.

Let the MGSSP preconditioner be defined as in (4) and $(\lambda,(u^{*},v^{*})^{*})$ be an eigenpair of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ . Then if $B$ is of full column rank and $B^{T}u=0$ , then

[TABLE]

where $Re(\lambda)$ and $Im(\lambda)$ denote the real part and the imaginary part of $\lambda$ , respectively. If $B$ is rank deficient and $u=0$ , then $\lambda=0$ . Besides, if $B$ is rank deficient and $B^{T}u=0$ , then $\lambda=0$ or $\lambda$ satisfies the Inequalities (19). If $B^{T}u\neq 0$ , then the eigenvalues of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ satisfy

[TABLE]

where

[TABLE]

and $z_{1},z_{2}$ are real numbers and $z_{1}+iz_{2}$ is one of the square roots of $a_{2}+b_{2}i$ , with

[TABLE]

and

[TABLE]

and the second root of $a_{2}+b_{2}i$ is $-(z_{1}+iz_{2})$ . The eigenvalues $\lambda_{\pm}$ satisfy the following inequality:

[TABLE]

When $\beta\rightarrow{0_{+}}$ , it holds that

[TABLE]

i.e., for $\alpha>0$ , the eigenvalues of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ tend to scatter near the point $(\frac{1}{2},0)$ as $\beta\rightarrow{0_{+}}$ ; and when $\alpha\rightarrow{0_{+}}$ , it has

[TABLE]

That is, for $\beta>0$ , the eigenvalues of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ tend to scatter near the point $(\frac{1}{2},0)$ and the point $(\frac{\beta a_{1}c_{1}+2c_{1}^{2}}{(\beta a_{1}^{2}+2c_{1})^{2}+\beta^{2}b_{1}^{2}},-\frac{\beta b_{1}c_{1}}{(\beta a_{1}^{2}+2c_{1})^{2}+\beta^{2}b_{1}^{2}})$ as $\alpha\rightarrow{0_{+}}$ .

*In addition, the eigenvalues of $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ tend to scatter near the points

$(\frac{\alpha_{0}\beta_{0}^{2}a_{1}+2\beta_{0}^{2}(a_{1}^{2}+b_{1}^{2})+12\beta_{0}a_{1}c_{1}+(\alpha_{0}\beta_{0}+4c_{1})(4c_{1}+z_{1})+2\beta_{0}(a_{1}z_{1}+|b_{1}z_{2}|)}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]}$ , $\frac{(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})(z_{2}+\beta_{0}b_{1})-2\beta_{0}b_{1}(\beta_{0}a_{1}+4c_{1}+z_{1})}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]})$ and $(\frac{\alpha_{0}\beta_{0}^{2}a_{1}+2\beta_{0}^{2}(a_{1}^{2}+b_{1}^{2})+12\beta_{0}a_{1}c_{1}+(\alpha_{0}\beta_{0}+4c_{1})(4c_{1}-z_{1})-2\beta_{0}(a_{1}z_{1}+|b_{1}z_{2}|)}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]},\frac{(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})(\beta_{0}b_{1}-z_{2})-2\beta_{0}b_{1}(\beta_{0}a_{1}+4c_{1}-z_{1})}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]})$ as $\alpha\rightarrow\alpha_{0}$ and $\beta\rightarrow\beta_{0}$ $(0\leq\alpha_{0}<+\infty,0<\beta_{0}<+\infty)$ .*

Proof. Let $(\lambda,(u^{*},v^{*})^{*})$ be an eigenpair of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ , we consider the eigenvalue problem $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}\eta=\lambda\eta$ , where $\eta=(u^{*},v^{*})^{*}$ , then it holds that

[TABLE]

By simple manipulations, we get

[TABLE]

i.e.,

[TABLE]

If $B$ has full column rank and $u=0$ , then it follows from the second equation of (24) that $\lambda v=0$ and therefore $v=0$ , which contradicts to the assumption that $(u^{*},v^{*})^{*}$ is an eigenvector. Hence $u\neq{0}$ . If $B$ is of full column rank and $B^{T}u=0$ , then from the second equation of (24), we have $v=0$ and

[TABLE]

Owing to $u\neq 0$ , it holds that the definition $\frac{u^{*}}{u^{*}u}$ does make sense. Premultiplying Equation (25) with $\frac{u^{*}}{u^{*}u}$ and utilizing the symbols defined as in (21) give

[TABLE]

It is easy to verify that $\lambda\rightarrow\frac{1}{2}$ as $\alpha\rightarrow 0_{+}$ . Besides, (26) implies that

[TABLE]

Since

[TABLE]

it is not difficult to derive (19).

If $B$ is rank deficient and $u=0$ , then from the second equation of (24), we derive $\lambda=0$ . Additionally, if $B$ is rank deficient and $B^{T}u=0$ , then it holds that $\lambda=0$ or $v=0$ , $\lambda\neq{0}$ by virtue of the second equation of (24). Similar to the derivation of (19), we also deduce (19) as $B$ is rank deficient, $v=0$ and $\lambda\neq{0}$ .

In the sequel, we assume that $B^{T}u\neq{0}$ . Then $\lambda\neq{0}$ and $u\neq{0}$ . Otherwise, it follows from the second equation of (24) that $B^{T}u={0}$ , a contradiction. From the second equation of (24) we can easily get $v=\frac{(2\lambda-1)B^{T}u}{\lambda\beta}$ . Then substituting this relationship into the first equation of (24) gives

[TABLE]

Multiplying $\frac{u^{*}}{u^{*}u}$ on Equation (27) from the left and utilizing the symbols defined as in (21) give

[TABLE]

which can be equivalently transformed into the following equation

[TABLE]

By solving Equation (28), we obtain its two roots as follows:

[TABLE]

where $z_{1}$ and $z_{2}$ are given by (22). Applying (22) leads to

[TABLE]

which yields that

[TABLE]

It is evident that an upper bound of $\left|\lambda_{\pm}-\frac{1}{2}\right|^{2}$ is $f(a_{1},b_{1},c_{1})$ , with $a_{1},b_{1},c_{1}$ being bounded as follows:

[TABLE]

which leads to

[TABLE]

Furthermore, it is not difficult to verify that $z_{1},z_{2}\rightarrow 0$ as $\beta\rightarrow{0_{+}}$ , and therefore for $\alpha>0$ , $\lambda_{+},\lambda_{-}\rightarrow\frac{1}{2}$ as $\beta\rightarrow{0_{+}}$ . Moreover, if $\alpha\rightarrow{0_{+}}$ , then it follows from (22) that $z_{1}\rightarrow\beta a_{1}$ and $z_{2}\rightarrow\beta b_{1}$ , thus

[TABLE]

which means that for $\beta>0$ , the eigenvalues of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ tend to scatter near the point $(\frac{1}{2},0)$ and the point $(\frac{\beta a_{1}c_{1}+2c_{1}^{2}}{(\beta a_{1}^{2}+2c_{1})^{2}+\beta^{2}b_{1}^{2}},-\frac{\beta b_{1}c_{1}}{(\beta a_{1}^{2}+2c_{1})^{2}+\beta^{2}b_{1}^{2}})$ as $\alpha\rightarrow{0_{+}}$ . Additionally, it is easily seen that the eigenvalues of $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ tend to scatter near the points $(\frac{\alpha_{0}\beta_{0}^{2}a_{1}+2\beta_{0}^{2}(a_{1}^{2}+b_{1}^{2})+12\beta_{0}a_{1}c_{1}+(\alpha_{0}\beta_{0}+4c_{1})(4c_{1}+z_{1})+2\beta_{0}(a_{1}z_{1}+|b_{1}z_{2}|)}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]},\frac{(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})(z_{2}+\beta_{0}b_{1})-2\beta_{0}b_{1}(\beta_{0}a_{1}+4c_{1}+z_{1})}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]})$ and $(\frac{\alpha_{0}\beta_{0}^{2}a_{1}+2\beta_{0}^{2}(a_{1}^{2}+b_{1}^{2})+12\beta_{0}a_{1}c_{1}+(\alpha_{0}\beta_{0}+4c_{1})(4c_{1}-z_{1})-2\beta_{0}(a_{1}z_{1}+|b_{1}z_{2}|)}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]},\frac{(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})(\beta_{0}b_{1}-z_{2})-2\beta_{0}b_{1}(\beta_{0}a_{1}+4c_{1}-z_{1})}{2[(\alpha_{0}\beta_{0}+2\beta_{0}a_{1}+4c_{1})^{2}+4\beta_{0}^{2}b_{1}^{2}]})$ as $\alpha\rightarrow\alpha_{0}$ and $\beta\rightarrow\beta_{0}$ $(0\leq\alpha_{0}<+\infty,0<\beta_{0}<+\infty)$ . $\blacksquare$

Remark 5.1.

It follows from Theorem 5.1 that

[TABLE]

as $\alpha\geq{0}$ , $\beta>0$ and $B^{T}u\neq{0}$ , and if $B$ is of full column rank and $B^{T}u={0}$ , then from (19), we infer that $Re(\lambda)>0$ , where $(\lambda,(u^{*},v^{*})^{*})$ is an eigenpair of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ . Thus all eigenvalues of $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ have positive real parts and lie in a positive box as $B$ is of full column rank, which may result in fast convergence of Krylov subspace acceleration. Besides, from the proof of Theorem 5.1, it can be seen that when $B^{T}u=0$ and $\alpha\rightarrow{0_{+}}$ , it holds that $\lambda\rightarrow\frac{1}{2}$ or $\lambda=0$ ; when $B^{T}u\neq 0$ , $\lambda\rightarrow(\frac{1}{2},0)$ as $\beta\rightarrow{0_{+}}$ for $\alpha\geq 0$ . This implies that the MGSSP preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ with proper parameters $\alpha$ and $\beta$ has much denser spectrum distribution compared with the saddle point matrix $\mathcal{A}$ . This means that when the MGSSP preconditioner is applied for the GMRES method, the rate of convergence (semi-convergence) can be improved considerably. This fact is further confirmed by the numerical results presented in Tables 2-4 and Tables 6-8 of Section 6. What is more, since

[TABLE]

then from (32), we have that

[TABLE]

Then all eigenvalues of $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ are located in a circle centered at $(\frac{1}{2},0)$ with radius $\frac{1}{2}$ .**

Owing to the fact the convergence of Krylov subspace methods is not only dependent on the eigenvalue distribution of the preconditioned matrix, but also on the corresponding eigenvectors of the preconditioned matrix [1, 4] except for the case that the preconditioned matrix is symmetric. We next discuss the eigenvector distribution of $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ in the following theorem.

Theorem 5.2.

*Let the MGSSP preconditioner $\mathcal{P}_{MGSSP}$ be defined as in (4). If $B$ is of full column rank and $\alpha=0$ , then the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ has $m+i$ $(0\leq i\leq m)$ linearly independent eigenvectors, and if $B$ is of full column rank and $\alpha>0$ , then the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ has $i$ $(0\leq i\leq m)$ linearly independent eigenvectors. If $B$ is rank deficient and $\alpha=0$ , then the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ has $m+i+j$ $(0\leq i\leq{m},1\leq j\leq{n})$ linearly independent eigenvectors, and if $B$ is rank deficient and $\alpha>0$ , then the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ has $i+j$ $(0\leq i\leq{m},1\leq j\leq{n})$ linearly independent eigenvectors. There are

1) $m$ eigenvectors of the form $\left(\begin{array}[]{c}u_{l}\\ 0\\ \end{array}\right)$ $(1\leq l\leq m)$ that correspond to the eigenvalue $\frac{1}{2}$ as $\alpha=0$ , where $u_{l}\neq{0}$ $(1\leq l\leq m)$ are arbitrary linearly independent vectors;

2) If $B$ is of full column rank and $\alpha>0$ , $i$ $(0\leq{i}\leq m)$ eigenvectors of the form $\left(\begin{array}[]{c}u_{l}^{1}\\ \frac{(2\lambda-1)B^{T}u_{l}^{1}}{\lambda\beta}\\ \end{array}\right)$ $(1\leq l\leq i)$ that correspond to the eigenvalues $\lambda\neq\frac{1}{2}$ , where $u_{l}^{1}$ $(1\leq l\leq i)$ satisfy $\lambda\beta Au_{l}^{1}=\beta\lambda^{2}(\alpha I+2A)u_{l}^{1}+(2\lambda-1)^{2}BB^{T}u_{l}^{1}$ .

3) If $B$ is rank deficient and $\alpha>0$ , $i$ $(0\leq{i}\leq m)$ eigenvectors of the form $\left(\begin{array}[]{c}u_{l}^{1}\\ \frac{(2\lambda-1)B^{T}u_{l}^{1}}{\lambda\beta}\\ \end{array}\right)$ $(1\leq l\leq i)$ that correspond to the eigenvalues $\lambda\neq\frac{1}{2},0$ , where $u_{l}^{1}$ $(1\leq l\leq i)$ satisfy $\lambda\beta Au_{l}^{1}=\beta\lambda^{2}(\alpha I+2A)u_{l}^{1}+(2\lambda-1)^{2}BB^{T}u_{l}^{1}$ ; and $j$ $(1\leq j\leq{n})$ eigenvectors $\left(\begin{array}[]{c}0\\ v_{l}^{2}\\ \end{array}\right)$ $(1\leq{l}\leq{j})$ that correspond to the eigenvalue [math], where $v_{l}^{2}\neq{0}$ $(1\leq{l}\leq{j})$ satisfy $Bv_{l}^{2}=0$ .*

Proof. Let $\lambda$ be an eigenvalue of the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ and $\left(\begin{array}[]{c}u\\ v\\ \end{array}\right)$ be the corresponding eigenvector. To investigate the eigenvector distribution, we consider Equation (24) as follows:

[TABLE]

Firstly, we consider $B$ has full column rank. If $u=0$ , then it follows from the second equation of (33) that $\lambda v=0$ and therefore $v=0$ , which contradicts to the assumption that $(u^{*},v^{*})^{*}$ is an eigenvector. Hence $u\neq{0}$ . If $\lambda=\frac{1}{2}$ , then from (33) we can easily get $\alpha u=0$ and $v=0$ . If $\alpha=0$ , then Equation (33) is always true for the case of $\lambda=\frac{1}{2}$ . Hence, there are $m$ linearly independent eigenvectors $\left(\begin{array}[]{c}u_{l}\\ 0\\ \end{array}\right)$ $(l=1,2,\cdots,m)$ corresponding to the eigenvalue $\frac{1}{2}$ as $\alpha=0$ , where $u_{l}$ $(l=1,2,\cdots,m)$ are arbitrary linearly independent vectors. If $\alpha>0$ , then $u=0$ and $v=0$ , a contradiction.

Next, we consider the case $\lambda\neq\frac{1}{2}$ . It follows from the second equation of (33) that $v=\frac{(2\lambda-1)B^{T}u}{\lambda\beta}$ . Substituting this relation into the first equation of (33) results in

[TABLE]

If there exists $u\neq{0}$ which satisfies (34), there will be $i$ $(1\leq i\leq m)$ linearly independent eigenvectors $\left(\begin{array}[]{c}u_{l}^{1}\\ v_{l}^{1}\\ \end{array}\right)$ $(1\leq{l}\leq{i})$ corresponding to the eigenvalues $\lambda\neq\frac{1}{2}$ . Here, $u_{l}^{1}\neq{0}$ $(1\leq{l}\leq{i})$ satisfy $\lambda\beta Au_{l}^{1}=\beta\lambda^{2}(\alpha I+2A)u_{l}^{1}+(2\lambda-1)^{2}BB^{T}u_{l}^{1}$ and the forms of $v_{l}^{1}$ $(1\leq{l}\leq{i})$ are

[TABLE]

If $B$ is rank deficient, then $\lambda=0$ is an eigenvalue of $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ . If $\lambda=0$ , then from (33), it holds that $B^{T}u=0$ and $Au=-Bv$ , which lead to $B^{T}A^{-1}Bv=0$ and therefore $Bv=0$ is due to the fact that $A^{-1}$ is positive definite, then we have $Au=0$ and $u=0$ . Recalling that $B$ is rank deficient, then there exists $v\neq{0}$ which satisfies $Bv=0$ , hence there will be $j$ $(1\leq j\leq{n})$ linearly independent eigenvectors $\left(\begin{array}[]{c}0\\ v_{l}^{2}\\ \end{array}\right)$ $(1\leq{l}\leq{j})$ corresponding to the eigenvalue [math], where $v_{l}^{2}\neq{0}$ $(1\leq{l}\leq{j})$ satisfy $Bv_{l}^{2}=0$ . With a quite similar strategy utilized in the case that $B$ has full column rank, we also can obtain the eigenvectors that correspond to $\lambda=\frac{1}{2}$ and $\lambda\neq{0},\frac{1}{2}$ are the same as those for the case that $B$ is of full column rank.

Now, we show that the $m+i$ eigenvectors are linearly independent when $B$ is of full column rank and $\alpha=0$ . Let $c^{(1)}=[c_{1}^{(1)},c_{2}^{(1)},\cdots,c_{m}^{(1)}]$ and $c^{(2)}=[c_{1}^{(2)},c_{2}^{(2)},\cdots,c_{i}^{(2)}]$ be two vectors with $0\leq i\leq{m}$ . Then, we need to show that

[TABLE]

holds if and only if the vectors $c^{(1)}$ and $c^{(2)}$ both are zero vectors. Recall that in (35) the first matrix arises from the case $\lambda_{l}=\frac{1}{2}$ $(l=1,2,\cdots,m)$ in 1), and the second matrix from the case $\lambda_{l}\neq\frac{1}{2}$ $(l=1,2,\cdots,i)$ in 2). Multiplying both sides of (35) from left with $2\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ leads to

[TABLE]

Then, by subtracting (35) from (36), it holds that

[TABLE]

Since the eigenvalues $\lambda_{l}\neq\frac{1}{2}$ and $\left(\begin{array}[]{c}u_{l}^{1}\\ v_{l}^{1}\\ \end{array}\right)$ $(1\leq l\leq{i})$ are linearly independent, we infer that $c_{l}^{(2)}=0$ $(l=1,2,\cdots,i)$ . Because of the linear independence of $u_{l}$ $(l=1,2,\cdots,m)$ , it follows that $c_{l}^{(1)}=0$ $(l=1,2,\cdots,m)$ . Therefore, the $m+i$ eigenvectors are linearly independent.

In the sequel, we verify the $m+i+j$ eigenvectors are linearly independent when $B$ is rank deficient and $\alpha=0$ . Let $c^{(1)}=[c_{1}^{(1)},c_{2}^{(1)},\cdots,c_{m}^{(1)}]$ , $c^{(2)}=[c_{1}^{(2)},c_{2}^{(2)},\cdots,c_{i}^{(2)}]$ and $c^{(3)}=[c_{1}^{(3)},c_{2}^{(3)},\cdots,c_{j}^{(3)}]$ be three vectors with $0\leq i\leq{m}$ and $1\leq j\leq{n}$ , and

[TABLE]

It is necessary for us to prove that (37) holds if and only if the vectors $c^{(1)}$ , $c^{(2)}$ and $c^{(3)}$ are all zero vectors, where the first matrix consists of the eigenvectors corresponding to the eigenvalue $\frac{1}{2}$ for the case 1), and the second and the third matrices consist of those for the case 3). Premultiplying (37) with $2\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ and going through the same algebraic operations as before, we also obtain

[TABLE]

Inasmuch as $\lambda_{l}\neq\frac{1}{2}$ and $u_{l}^{1}$ $(1\leq l\leq{i})$ are linearly independent, it holds that $c_{l}^{(2)}=0$ $(l=1,2,\cdots,i)$ . Then it has

[TABLE]

As the vectors $v_{l}^{2}$ $(l=1,2,\cdots,j)$ are also linearly independent, we have $c_{l}^{(3)}=0$ $(l=1,2,\cdots,j)$ . Thus, (37) becomes to

[TABLE]

Since $u_{l}$ $(l=1,2,\cdots,m)$ are linearly independent, we have $c_{l}^{(1)}=0$ $(l=1,2,\cdots,m)$ . As a result, it holds that the $m+i+j$ eigenvectors are linearly independent.

Finally, we prove the $i+j$ eigenvectors are linearly independent when $B$ is rank deficient and $\alpha>0$ . Let $c^{(1)}=[c_{1}^{(1)},c_{2}^{(1)},\cdots,c_{i}^{(1)}]$ and $c^{(2)}=[c_{1}^{(2)},c_{2}^{(2)},\cdots,c_{j}^{(2)}]$ be two vectors with $0\leq i\leq{m},1\leq{j}\leq{n}$ . Then, we need to show that

[TABLE]

holds if and only if the vectors $c^{(1)}$ and $c^{(2)}$ both are zero vectors. Since $u_{l}^{1}$ $(1\leq l\leq{i})$ are linearly independent, we infer that $c_{l}^{(1)}=0$ $(l=1,2,\cdots,i)$ . Because of the linear independence of $v_{l}^{2}$ $(l=1,2,\cdots,j)$ , it follows that $c_{l}^{(2)}=0$ $(l=1,2,\cdots,j)$ . Consequently, the above $i+j$ eigenvectors are linearly independent. $\blacksquare$

6 Numerical experiments

In this section, two numerical examples are used to verify the performance of the MGSSP iteration method and the MGSSP preconditioned GMRES method. In the meanwhile, we compare the MGSSP iteration method with the GSS and GMSS methods, and also compare the MGSSP preconditioner with the SS, GSS, MSS and GMSS preconditioners for the GMRES method according to the number of iterations (denoted by “IT”) and the elapsed CPU times (denoted by “CPU”). All codes are run in MATLAB (version R2016a) and all experiments are performed on an Intel(R) Pentium(R) CPU G3240T 2.70 GHz, 4.0GB memory and XP operating system. In our implementations, the linear systems $(\alpha I+A+\frac{1}{\alpha}BB^{T})x=b$ , $(\alpha I+A+\frac{1}{\beta}BB^{T})x=b$ and $(\alpha I+2A+\frac{4}{\beta}BB^{T})x=b$ involved in the SS, GSS and MGSSP iteration, respectively are solved exactly by the the LU factorization. In addition, the linear systems $(\alpha I+2H+\frac{1}{\alpha}BB^{T})x=b$ and $(\alpha I+2H+\frac{1}{\beta}BB^{T})x=b$ contained in the MSS and the GMSS iteration are solved exactly by the Cholesky factorization.

In our numerical experiments, we choose the right-hand side vector $b$ so that the exact solution of the saddle point problem (1) is $(1,1,\cdots,1)^{T}$ . All experiments are started from the initial vector $\mathbf{x}^{(0)}=(x^{(0)T},y^{(0)T})^{T}=(0,0,\cdots,0)^{T}$ , terminated once the current iterate $\mathbf{x}^{(k)}$ satisfies

[TABLE]

and we use “–” to indicate that the corresponding iteration method does not satisfy the prescribed stopping criterion until $500$ iteration steps.

Example 6.1.

Consider the nonsymmetric nonsingular saddle point problem structured as (1) with the following coefficient sub-matrices [36]:

[TABLE]

where $\otimes$ denotes the Kronecker product symbol and $h=\frac{1}{p+1}$ is the discretization mesh size.

In Table 1, we list the parameters involved in the tested methods which are chosen to be the experimentally found optimal ones that minimize the total number of iteration steps for those methods, as well as the numerical results of the GSS, GMSS and MGSSP iteration methods when $v=0.1$ with respect to different grids $16\times 16$ , $32\times 32$ and $64\times 64$ . Moreover, numerical results of the GMRES method and the preconditioned GMRES methods incorporated with the SS, GSS, MSS, GMSS and the MGSSP preconditioners are listed in Tables 2-4 for $v=1$ , $0.1$ and $0.01$ on different uniform grids, respectively.

In order to better understand the numerical results in Table 1, convergence history of the GSS, GMSS and MGSSP iteration methods with experimental optimal parameters are depicted in Figure 1. To further confirm the effectiveness of the MGSSP preconditioned GMRES method, we plot the IT of the three preconditioned GMRES methods with parameters $\alpha=\beta$ from 0.1 to 10 with step size 0.1 in Figure 2. For more investigations, the eigenvalue distributions of the original matrix $\mathcal{A}$ and the five preconditioned matrices with $\alpha=0.6$ and $\beta=0.8$ for $v=1$ and $p=32$ are displayed in Figure 3.

Looking into Tables 1-4 and Figures 1-3 one may make the following observations.

•

From Table 1, it can be observed that the experimental optimal parameters of the MGSSP iteration method are more stable compared with those of other two methods. Besides, the results in Table 1 imply that the MGSSP iteration method is superior to the other two methods from the point view of the IT and CPU times, and the IT of the MGSSP iteration method remains constant under the experimental optimal parameters with the increasing of the problem size.

•

By comparing the results in Tables 2-4, it can be seen that without preconditioning, the GMRES method converges very slow even invalid within $500$ iteration steps for larger linear systems. All aforementioned preconditioners can largely accelerate the convergence rate of the GMRES method. The proposed MGSSP preconditioned GMRES method performs better than other five preconditioned GMRES methods as it requires less IT and CPU times. Another observation which can be pointed out is that, the convergence behavior of the MGSSP preconditioned GMRES method is not sensitive to $p$ , in the sense the iterations barely change.

•

Figure 1 indicates that the three tested methods converge while the MGSSP iteration method returns better numerical results than the GSS and the GMSS iteration methods. From Figure 2, as we expected for Example 6.1, we see that the MGSSP preconditioned GMRES method outperforms the other two methods with the changing of $\alpha$ , and show that our proposed preconditioner is more effective and practical for solving the nonsymmetric nonsingular saddle point problems, in comparison with the other preconditioners. Additionally, as seen from Figure 3, the preconditioned matrix $\mathcal{P}_{MGSSP}^{-1}\mathcal{A}$ has more clustered eigenvalues than the other ones. This means that the MGSSP preconditioner outperforms the other five preconditioners for the GMRES method, which is congruous with the results of Table 2.

Example 6.2.

Consider the nonsymmetric singular saddle point problem structured as (1) with the following coefficient sub-matrices [40]:

[TABLE]

where

[TABLE]

Here $\otimes$ denotes the Kronecker product and $h=\frac{1}{p+1}$ is the discretization meshsize. The iterations of all tested methods are terminated once the current iterate $\mathbf{x}^{(k)}$ satisfies (38) or the maximum prescribed number of iterations $k_{max}=500$ is exceeded.

Table 5 reports the iteration counts, CPU times and relative residual (RES) of the tested iteration methods with respect to different values of the problem size $p$ for $v=0.1$ . We adopt the parameters of the tested methods to be the experimentally found optimal ones. From Table 5, we observe that although all tested methods succeed in producing approximate solutions in all cases, the MGSSP iteration method outperforms other two methods in terms of the IT and CPU times, and the advantage of the MGSSP iteration method becomes more pronounced as the system size increases.

With respect to different sizes of the coefficient matrix, we list the numerical results of the SS, GSS, MSS, GMSS and MGSSP preconditioned GMRES methods with different values of $v$ ( $v=1$ , $v=0.1$ and $v=0.01$ ) in Tables 6-8, respectively. From Tables 6-8, we can conclude some observations as follows. Firstly, the GMRES method does not converge when $v=0.01$ and $p$ becomes large. Secondly, the five preconditioners can improve the convergence behavior of the GMRES method, but the MGSSP preconditioned GMRES method returns better numerical results than the other preconditioned GMRES methods in terms of IT and CPU time. Lastly, the MSS and GMSS preconditioned GMRES methods have worse convergence behaviors as $v$ becomes small.

The graphs of RES(log10) against number of iterations of in Table 5 for three different sizes are displayed in Figure 4. As observed in Figure 4, the MGSSP iteration method leads to much better performance than the GSS and the GMSS iteration methods. It is worthy noting that the IT of the GSS and the GMSS iteration methods increase when $p$ becomes large, but this is not true for the MGSSP iteration method.

In order to compare effects of the GSS, GMSS, and the MGSSP preconditioned GMRES methods in terms of the parameters $\alpha$ and $\beta$ , we test these methods with $\alpha=\beta$ and plot the IT of the three preconditioned GMRES methods with $\alpha$ from 0.1 to 10 with step size 0.1 in Figure 5. From Figure 5, we can obtain the same results as those of Figure 2.

In order to better investigate the performance of the tested preconditioned GMRES methods, Figure 6 depicts the eigenvalue distributions of the saddle point matrix $\mathcal{A}$ , the SS, GSS, MSS, GMSS and MGSSP preconditioned matrices with $v=0.1$ and $p=32$ . These subfigures clearly show that the preconditioned matrices have more tightly clustered eigenvalues than the original matrix. Moreover, the eigenvalues of the MGSSP preconditioned matrix are much tighter than the other ones. These observations imply that the MGSSP preconditioned GMRES method has better numerical performance than other preconditioned GMRES methods and it can act as an efficient preconditioner for solving the nonsymmetric singular saddle point problem by the preconditioned GMRES method.

7 Conclusions

For nonsymmetric saddle point problems, by combining the GSS and MSSP of a matrix, we establish a modified generalized shift-splitting (MGSSP) iteration method and the corresponding preconditioner called the MGSSP preconditioner in this paper. The unconditional convergence and semi-convergence of the MGSSP iteration method for solving nonsingular and singular saddle point problems, respectively are discussed. Moreover, eigenproperties of the preconditioned matrix are described. Numerical results given in Section 6 illustrate that the efficiency of the MGSSP iteration method and the MGSSP preconditioner for saddle point problems with nonsymmetric positive definite (1,1) parts, and confirm that they outperform some existing ones.

We should point out that the MGSSP preconditioner may not have the optimality property, i.e., the iteration counts depend on the parameters $\alpha$ and $\beta$ (see Figures 2 and 5). Besides, admittedly, the choices of the optimal parameters of the MGSSP iteration method and the MGSSP preconditioned GMRES method is a challenging problem that deserves further study. For most iterative methods, this work is very complicated. Nevertheless, by adopting certain approximation strategies, there have been practically useful formula for obtaining nearly optimal iteration parameters; see [38, 22, 31]. To further investigations, we would like to study how to further improve the MGSSP preconditioner and choose the optimal parameters for the MGSSP iteration method.

Bibliography45

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Z.-Z. Bai, Sharp error bounds of some Krylov subspace methods for non-Hermitian linear systems, Appl. Math. Comput. 109 (2000), 273-285.
2[2] Z.-Z. Bai, Structured preconditioners for nonsingular matrices of block two-by-two structures, Math. Comp. 75 (2006), 791-815.
3[3] Z.-Z. Bai, On semi-convergence of Hermitian and skew-Hermitian splitting methods for singular linear systems, Computing 89 (2010), 171-197.
4[4] Z.-Z. Bai, Motivations and realizations of Krylov subspace methods for large sparse linear systems, J. Comput. Appl. Math. 283 (2015), 71-78.
5[5] Z.-Z. Bai and G. H. Golub, Accelerated Hermitian and skew-Hermitian splitting iteration methods for saddle-point problems, IMA J. Numer. Anal. 27 (2007), 1-23.
6[6] Z.-Z. Bai, G. H. Golub, L.-Z. Lu and J.-F. Yin, Block triangular and skew-Hermitian splitting methods for positive-definite linear systems, SIAM J. Sci. Comput. 26 (2005), 844-863.
7[7] Z.-Z. Bai, G. H. Golub and M. K. Ng, Hermitian and skew-Hermitian splitting methods for non-Hermitian positive definite linear systems, SIAM J. Matrix Anal. Appl. 24 (2003), 603-626.
8[8] Z.-Z. Bai, G. H. Golub and M. K. Ng, On inexact Hermitian and skew-Hermitian splitting methods for non-Hermitian positive definite linear systems, Linear Algebra Appl. 428 (2008), 413-440.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

A modified generalized shift-splitting preconditioner for nonsymmetric saddle point problems

1 Introduction

2 The modified generalized shift-splitting (MGSSP) preconditioner and its implementation

3 Convergence of the MGSSP iteration method for nonsingular saddle point problems

Lemma 3.1**.**

Lemma 3.2**.**

Lemma 3.3**.**

Theorem 3.1**.**

4 Semi-convergence of the MGSSP iteration method for singular saddle point problems

Lemma 4.1**.**

Lemma 4.2**.**

Lemma 4.3**.**

Theorem 4.1**.**

5 Spectral analysis of the MGSSP preconditioned matrix

Theorem 5.1**.**

Remark 5.1**.**

Theorem 5.2**.**

6 Numerical experiments

Example 6.1**.**

Example 6.2**.**

7 Conclusions

Lemma 3.1.

Lemma 3.2.

Lemma 3.3.

Theorem 3.1.

Lemma 4.1.

Lemma 4.2.

Lemma 4.3.

Theorem 4.1.

Theorem 5.1.

Remark 5.1.

Theorem 5.2.

Example 6.1.

Example 6.2.