On Optimal Algebraic Multigrid Methods
Luis Garc\'ia Ramos, Reinhard Nabben

TL;DR
This paper introduces a new spectral characterization approach to derive optimal interpolation operators for algebraic multigrid methods applied to Hermitian positive definite systems, improving efficiency and robustness.
Contribution
It presents a spectral-based method for obtaining optimal interpolation operators, offering a simpler and more general approach compared to previous $A$-norm based techniques.
Findings
Optimal interpolation operators are derived using spectral characterization.
Operators are optimal with respect to $A$-norm, spectral radius, and condition number.
The method applies to both symmetric and non-symmetric two-grid methods.
Abstract
In this note we present an alternative way to obtain optimal interpolation operators for two-grid methods applied to Hermitian positive definite linear systems. Falgout and Vassilevski in [SIAM J. Numer. Anal, 42 (2004), pp. 1669-1693] and Zikatanov [Numer. Linear Algebra Appl., 15 (2008), pp. 439-454] have characterized the -norm of the error propagation operator of algebraic multigrid methods. These results have been recently used by Xu and Zikatanov [Acta Numer., 26 (2017), pp. 591-721] and Brannick, Cao et al. [SIAM J. Sci. Comp, 40 (2018), pp. 591-721] to determine optimal interpolation operators. Here we use a characterization not of the -norm but of the spectrum of the error propagation operator of two-grid methods, which was proved by Garc\'ia Ramos, Nabben and Kehl and holds for arbitrary matrices. For Hermitian positive definite systems this result leads to optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Methods in Computational Mathematics · Matrix Theory and Algorithms · Numerical methods for differential equations
On Optimal Algebraic Multigrid Methods
Luis García Ramos111 Technische Universität Berlin, Institut für Mathematik, Straße des 17. Juni 136, D-10623 Berlin, Germany ({garcia, nabben}@math.tu-berlin.de).
Reinhard Nabben111 Technische Universität Berlin, Institut für Mathematik, Straße des 17. Juni 136, D-10623 Berlin, Germany ({garcia, nabben}@math.tu-berlin.de).
Abstract
In this note we present an alternative way to obtain optimal interpolation operators for two-grid methods applied to Hermitian positive definite linear systems. In [5, 10] the -norm of the error propagation operator of algebraic multigrid methods is characterized. These results are just recently used in [9, 3] to determine optimal interpolation operators. Here we use a characterization not of the -norm but of the spectrum of the error propagation operator of two-grid methods, which was proved in [6]. This characterization holds for arbitrary matrices. For Hermitian positive definite systems this result leads to optimal interpolation operators with respect to the -norm in a short way, moreover, it also leads to optimal interpolation operators with respect to the spectral radius. For the symmetric two-grid method (with pre- and post-smoothing) the optimal interpolation operators are the same. But for a two-grid method with only post-smoothing the optimal interpolations (and hence the optimal algebraic multigrid methods) can be different. Moreover, using the characterization of the spectrum, we can show that the found optimal interpolation operators are also optimal with respect to the condition number of the multigrid preconditioned system.
keywords:
multigrid, optimal interpolation operator, two-grid methods
AMS:
65F10, 65F50, 65N22, 65N55.
1 Introduction
Typical multigrid methods to solve the linear system
[TABLE]
where is an matrix, consist of two ingredients, the smoothing and the coarse grid correction. The smoothing is typically done by a few steps of a basic stationary iterative method, like the Jacobi or Gauss-Seidel method. For the coarse grid correction, a prolongation or interpolation operator and a restriction operator are needed. The coarse grid matrix is then defined as
[TABLE]
Here we always assume that and are non-singular. The multigrid or algebraic multigrid (AMG) error propagation matrix is then given by
[TABLE]
where and are smoothers, and are the number of pre- and post-smoothing steps respectively, and is the coarse grid correction matrix. The multigrid method is convergent if and only if the spectral radius of the error propagation matrix is less than one. Alternatively, the norm of the error propagation matrix can be considered, where is a consistent matrix norm, and in this case one has
[TABLE]
The aim of algebraic multigrid methods is to balance the interplay between smoothing and coarse grid correction steps. However, most of the existing AMG methods first fix a smoother and then optimize a certain quantity to choose the interpolation and restriction .
To simplify the analysis, we assume that there exists a non-singular matrix such that
[TABLE]
it can be shown that such a non-singular matrix exists if the spectral radius of is less than one, see e.g. [2]. Note that the matrix can be written as
[TABLE]
where the matrix is known as the multigrid preconditioner, i.e., is an approximation of . Therefore, eigenvalue estimates of are of interest and they lead to estimates for the eigenvalues of .
The following theorem, proved by García Ramos, Kehl and Nabben in [6], gives a characterization of the spectrum of , denoted by , and hence a characterization of the spectrum of the general error propagation matrix .
Theorem 1**.**
Let be non-singular, and let and such that is non-singular. Moreover, let and be such that that the matrices in (1.3) and are non-singular. Then the following statements hold:
- (a)
The multigrid preconditioner in (1.4) is non-singular. 2. (b)
If are matrices such that the columns of and form orthonormal bases of and (the orthogonal complements of and in the Euclidean inner product) respectively, then the matrices and are non-singular and the spectrum of is given by
[TABLE]
We will apply this theorem to Hermitian positive definite (HPD) matrices to determine the optimal interpolation operators of AMG methods with respect to the spectral radius of the error propagation matrix. For HPD matrices, optimal interpolation operators with respect to the -norm have been obtained recently in [9, 3]. We will show that the optimal interpolation operators with respect to the spectral radius for the symmetric/symmetrized multigrid method (with pre- and post-smoothing) and the optimal interpolation operator with respect to the -norm are the same. But for multigrid with only a post-smoothing step the optimal interpolation operators with respect to the spectral radius and -norm (and hence the optimal algebraic multigrid methods) can be different. Using Theorem 1 we can also show that the interpolation operators with respect to the spectral radius are also optimal with respect to the condition number of the multigrid preconditioned system.
2 Optimal interpolation for Hermitian positive definite matrices
Let be HPD and recall that the norm induced by (or -norm) is defined for and by
[TABLE]
and
[TABLE]
We will study the following two-grid methods given by the error propagation operators
[TABLE]
and the symmetrized version
[TABLE]
Thus we are using . The range of , i.e. , is called the coarse space . We assume that the smoother is fixed and let and vary with respect to the choice of the interpolation operator . In addition, we assume that the smoother satisfies
[TABLE]
which is equivalent to the condition
[TABLE]
see, e.g., [8]. Given a fixed smoother such that , many AMG methods are designed to minimize or a related quantity. We say an interpolation operator is optimal if it minimizes . In view of the equality
[TABLE]
proved by Falgout and Vassilevski in [4], we can conclude that an optimal interpolation operator also minimizes . Zikatanov proved in [10, Lemma 2.3] (see also [5, Theorem 4.1]) that
[TABLE]
where is a quantity depending on the coarse space, defined by
[TABLE]
Here is the symmetrized smoother and . Although this equality has been known for a long time, only recently it was used to determine optimal prolongation operators formulated in terms of eigenvectors, which lead to a minimal value of for a given smoother (see [9, 3]). We will give an alternative proof of this result using the characterization of the eigenvalues of the multigrid iteration operator given in Theorem 1.
We consider first the more general error propagation matrix in (1.2) with and . Let be the range of the interpolation operator , and be a matrix with orthonormal columns that span (the orthogonal complement of with respect to the Euclidean inner product). Then Theorem 1 leads to
[TABLE]
In what follows, given a matrix with real eigenvalues we will denote by and the maximum and minimum eigenvalues of respectively.
Assuming that is Hermitian positive definite and that is at most one, we have . In order to find an optimal interpolation operator for the error propagation matrix, we need to first find
[TABLE]
and then find an interpolation operator such that . The following lemma solves the first problem.
Lemma 2**.**
Let be Hermitian positive definite and let be the eigenpairs of the generalized eigenvalue problem
[TABLE]
where
[TABLE]
Then
[TABLE]
which is achieved by
[TABLE]
where the columns of are orthogonal in the Euclidean inner product and satisfy .
Proof.
Let with . By the Courant-Fischer theorem we obtain
[TABLE]
Thus, if is the set of subspaces of of dimension , we have
[TABLE]
and the maximum is attained by choosing a matrix such that the columns of are orthogonal in the Euclidean inner product and satisfy . ∎
The previous lemma is the main tool to obtain the optimal interpolation operators.
Theorem 3**.**
Let and as in (1.3) be Hermitian positive definite. Let be the eigenpairs of , where , and suppose that . Then
[TABLE]
An optimal interpolation operator is given by
[TABLE]
Proof.
Since , we have that
[TABLE]
Note that the eigenvalues are the same as the in Lemma 2. According to Lemma 2, we need to find vectors which are orthogonal to the eigenvectors of the generalized eigenvalue problem . Now, consider the vectors . The are also eigenvectors of the generalized eigenvalue problem . Moreover, the vectors are eigenvectors of the generalized eigenvalue problem . But the are -orthogonal (the are eigenvectors of the Hermitian matrix ). Thus, the , are orthogonal to the in the Euclidean inner product and the interpolation operator given by (2.7) is the corresponding minimizer. ∎
Now, we consider and defined in (2.1) and (2.2). Again and can be written as
[TABLE]
for some matrices and in . A straightforward computation shows that is Hermitian, and by [1, Lemma 2.11] we have
[TABLE]
Moreover, the maximal eigenvalue of satisfies , see e.g. [8, Theorem 3.16]. We then obtain
[TABLE]
The matrix in (1.3) is given by
[TABLE]
With (2.3) we have that is Hermitian positive definite. We obtain the following corollary.
Corollary 4**.**
Let be Hermitian positive definite. Let such is Hermitian positive definite, and let be as in (2.9), and let be the eigenpairs of , where , Then
[TABLE]
An optimal interpolation operator is given by
[TABLE]
Proof.
We have that is positive definite and . By Theorem 3 we obtain the desired result. ∎
Next, let us consider the non-symmetric multigrid method defined implicitly by , in (2.1). We use a Hermitian positive definite smoother . The matrix in (1.3) is given by
[TABLE]
Hence
[TABLE]
Therefore, it is not clear which of or equals the spectral radius. One way to overcome this problem is scaling. Note that we have for all Hermitian positive defnite matrices and and for all matrices
[TABLE]
Hence, the Hermitian smoother
[TABLE]
satisfies
[TABLE]
With Theorem 1 and we then have
[TABLE]
thus
[TABLE]
Note that (2.12) is equivalent to being positive semidefinite. This discussion leads to the following corollary.
Corollary 5**.**
Let be Hermitian positive definite. Let such is Hermitian positive definite. Let . Let be the eigenvalues of and let , , be the corresponding eigenvectors. Then
[TABLE]
An optimal interpolation operator is given by
[TABLE]
Proof.
The matrix is Hermitian positive definite. Moreover, since is also Hermitian positive definite the eigenvalues of are less then one. Thus, with Theorem 1, . So, with Theorem 3 we obtain (2.13) and (2.14). ∎
Now we will compare the optimal interpolation with respect to the -norm as given in Corollary 4, with the optimal interpolation with respect to the spectral radius as given in Corollary 5. Using and Hermitian positive definite, the vectors used in Corollary 4 are eigenvectors of
[TABLE]
while in Corollary 4 we use the eigenvectors of
[TABLE]
But is just a polynomial in , where the polynomial is given by
[TABLE]
Thus, the eigenvectors of both matrices are the same. Moreover, the eigenvalues are related by the above polynomial. Hence, the eigenvectors corresponding to the smallest eigenvalues of are the same eigenvectors that correspond to the smallest eigenvalues of . In consequence, the optimal interpolation in Corollary 4 and Corollary 5 are the same, if we assume that is Hermitian positive definite.
Next, let us have a closer look to the non-symmetric two-grid method and avoid scaling. We assume that the smoother is Hermitian and leads to a convergent scheme, i.e.
[TABLE]
which implies Thus, for the matrix we have as above
[TABLE]
Let
[TABLE]
Then we have and with Theorem 1
[TABLE]
But . To get an upper bound for the minimal spectral radius of over all interpolation we consider the matrix . Our next theorem deals with this case.
Theorem 6**.**
Let be Hermitian positive definite, and let be Hermitian such that . Let , and let be the eigenpairs of with . Then
[TABLE]
The spectral radius can be achieved by the interpolation operator
[TABLE]
Proof.
The proof follows immediately from the above arguments. ∎
Note that the above Theorems correspond to clear statements: the optimal interpolation operators are given by those eigenvectors of for which the smoothing is slowest to converge.
3 The optimal interpolation with respect to the condition number
Note that for symmetric multigrid where is Hermitian positive definite the largest eigenvalue of is one (see e.g. [7]). As seen in the proof of Corollary 5, the same holds for when we assume that is Hermitian positive definite. The later assumption can be obtained by scaling, however, this scaling affects the spectral radius of the error propagation matrix. But for the condition number of the multigrid preconditioned system, this scaling has no effect.
Theorem 1 characterizes the spectrum of and . Following the arguments above, where we found optimal interpolation operators, such that and are maximal, we obtain that the same interpolation operators are optimal with respect to the condition number of the preconditioned system. This leads to the next result.
Theorem 7**.**
Let be Hermitian positive definite. Let such is Hermitian positive definite. Let be as in (2.9). Let be the eigenpairs of , where . Then
[TABLE]
An optimal interpolation operator is given by
[TABLE]
Our final result gives the optimal interpolation operator for the non-symmetric two-grid method with respect to the condition number .
Theorem 8**.**
Let be Hermitian positive definite. Let be Hermitian positive definite such that Let , and let be the eigenpairs of where . Then
[TABLE]
An optimal interpolation operator is given by
[TABLE]
Note, that in all cases of the previous sections any other interpolation operator with is also optimal.
4 Conclusion
As mentioned in [9], the in AMG methods can also be understood as an for Abstract Multigrid Methods. Here we contributed to the theory of abstract multigrid methods by presenting alternate derivations of previously known results and by establishing new results. Building on a result from [6] which gives a characterization of the spectrum of the error propagation operator and the preconditioned system of two-grid methods, we derived optimal interpolation operators with respect to the -norm and the spectral radius of the error propagation operator matrix in a short way. We also showed that these interpolation operators are optimal with respect to the condition number of the preconditioned system.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Benzi, A. Frommer, R. Nabben, and D. B. Szyld , Algebraic theory of multiplicative Schwarz methods , Numer. Math., 89 (2001), pp. 605–639, doi: 10.1007/s 002110100275 , https://doi.org/10.1007/s 002110100275 . · doi ↗
- 2[2] M. Benzi and D. B. Szyld , Existence and uniqueness of splittings for stationary iterative methods with applications to alternating methods , Numer. Math., 76 (1997), pp. 309–321, doi: 10.1007/s 002110050265 , https://doi.org/10.1007/s 002110050265 . · doi ↗
- 3[3] J. Brannick, F. Cao, K. Kahl, R. D. Falgout, and X. Hu , Optimal interpolation and compatible relaxation in classical algebraic multigrid , SIAM J. Sci. Comput., 40 (2018), pp. A 1473–A 1493, doi: 10.1137/17M 1123456 , https://doi.org/10.1137/17M 1123456 . · doi ↗
- 4[4] R. D. Falgout and P. S. Vassilevski , On generalizing the algebraic multigrid framework , SIAM J. Numer. Anal., 42 (2004), pp. 1669–1693, doi: 10.1137/S 0036142903429742 , https://doi.org/10.1137/S 0036142903429742 . · doi ↗
- 5[5] R. D. Falgout, P. S. Vassilevski, and L. T. Zikatanov , On two-grid convergence estimates , Numer. Linear Algebra Appl., 12 (2005), pp. 471–494, doi: 10.1002/nla.437 , https://doi.org/10.1002/nla.437 . · doi ↗
- 6[6] L. García Ramos, R. Kehl, and R. Nabben , Projections, deflation and multigrid for non symmetric matrices , submitted, (2018).
- 7[7] Y. Notay , Algebraic Theory of Two-Grid Methods , Numer. Math. Theory Methods Appl., 8 (2015), pp. 168–198.
- 8[8] P. S. Vassilevski , Multilevel block factorization preconditioners , Springer, New York, 2008. Matrix-based analysis and algorithms for solving finite element equations.
