Efficient Reduction of Compressed Unitary plus Low-rank Matrices to Hessenberg form
Roberto Bevilacqua, Gianna M. Del Corso, Luca Gemignani

TL;DR
This paper introduces efficient numerical methods for reducing a compressed unitary plus low-rank matrix to Hessenberg form, enabling faster eigenvalue computations by exploiting structured decompositions and bulge chasing techniques.
Contribution
It develops a novel structured decomposition called LFR for such matrices and provides a fast reduction algorithm with $O(n^2 k)$ complexity, improving eigenvalue computation efficiency.
Findings
Reduction cost is $O(n^2 k)$ arithmetic operations.
LFR decomposition enables efficient Hessenberg reduction.
Eigenvalues can be computed using a fast QR algorithm after reduction.
Abstract
We present fast numerical methods for computing the Hessenberg reduction of a unitary plus low-rank matrix , where is a unitary matrix represented in some compressed format using parameters and and are matrices with . At the core of these methods is a certain structured decomposition, referred to as a LFR decomposition, of as product of three possibly perturbed unitary Hessenberg matrices of size . It is shown that in most interesting cases an initial LFR decomposition of can be computed very cheaply. Then we prove structural properties of LFR decompositions by giving conditions under which the LFR decomposition of implies its Hessenberg shape. Finally, we describe a bulge chasing scheme for converting the initial LFR decomposition of into the LFR decomposition of a Hessenberg matrix by…
| n | ||||
|---|---|---|---|---|
| 32 | 8.2e+01 | 2.2e-17 | 3.9e-17 | 4.3e-19 |
| 64 | 1.4e+02 | 1.5e-17 | 5.2e-17 | 5.1e-19 |
| 128 | 2.7e+02 | 7.7e-18 | 6.0e-17 | 2.0e-19 |
| 256 | 5.2e+02 | 5.5e-18 | 1.3e-16 | 1.4e-19 |
| 512 | 1.0e+03 | 3.2e-18 | 2.2e-16 | 1.4e-19 |
| n | ||||
|---|---|---|---|---|
| 32 | 7.6e+04 | 1.2e-17 | 4.9e-17 | 7.0e-22 |
| 64 | 6.0e+05 | 1.3e-17 | 5.7e-17 | 2.1e-22 |
| 128 | 4.5e+06 | 5.5e-18 | 7.6e-17 | 6.6e-24 |
| 256 | 3.6e+07 | 6.6e-18 | 1.3e-16 | 1.5e-24 |
| 512 | 2.7e+08 | 2.3e-18 | 2.2e-16 | 2.6e-25 |
| n | ||||
|---|---|---|---|---|
| 64 | 1.5e+02 | 7.4e-18 | 4.4e-17 | 2.3e-18 |
| 128 | 2.9e+02 | 3.2e-18 | 5.6e-17 | 1.2e-18 |
| 256 | 5.5e+02 | 2.5e-18 | 9.6e-17 | 4.1e-19 |
| 512 | 1.1e+03 | 1.8e-18 | 1.6e-16 | 5.0e-19 |
| n | ||||
|---|---|---|---|---|
| 64 | 6.2e+05 | 6.6e-18 | 5.2e-17 | 5.3e-18 |
| 128 | 4.9e+06 | 4.5e-18 | 6.8e-17 | 1.8e-18 |
| 256 | 3.8e+07 | 2.2e-18 | 9.2e-17 | 5.5e-19 |
| 512 | 2.9e+08 | 2.3e-18 | 1.6e-16 | 8.0e-19 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\headers
Efficient Reduction of Compressed Unitary plus Low-rank Matrices to Hessenberg formR. Bevilacqua, G.M. Del Corso, L. Gemignani
Efficient Reduction of Compressed Unitary plus Low-rank Matrices to Hessenberg form††thanks: The research of the last two authors was partially supported by GNCS project “Tecniche innovative per
problemi di algebra lineare” and by the project sponsored by University of Pisa under the grant PRA-2017-05.
R. Bevilacqua Dipartimento di Informatica, Università di Pisa, Pisa, Italy, [email protected]
G.M. Del Corso Dipartimento di Informatica, Università di Pisa, Pisa, Italy, [email protected]
L. Gemignani Dipartimento di Informatica, Università di Pisa, Pisa, Italy, [email protected]
Abstract
We present fast numerical methods for computing the Hessenberg reduction of a unitary plus low-rank matrix , where is a unitary matrix represented in some compressed format using parameters and and are matrices with . At the core of these methods is a certain structured decomposition, referred to as a LFR decomposition, of as product of three possibly perturbed unitary Hessenberg matrices of size . It is shown that in most interesting cases an initial LFR decomposition of can be computed very cheaply. Then we prove structural properties of LFR decompositions by giving conditions under which the LFR decomposition of implies its Hessenberg shape. Finally, we describe a bulge chasing scheme for converting the initial LFR decomposition of into the LFR decomposition of a Hessenberg matrix by means of unitary transformations. The reduction can be performed at the overall computational cost of arithmetic operations using storage. The computed LFR decomposition of the Hessenberg reduction of can be processed by the fast QR algorithm presented in [8] in order to compute the eigenvalues of within the same costs.
keywords:
Hessenberg reduction, Rank-structured matrices, QR Method, Bulge chasing, CMV matrix, Complexity.
{AMS}
65F15
1 Introduction
Eigenvalue computations for small rank modifications of unitary matrices represented in some compressed format is a classical topic in structured numerical linear algebra. Matrices of the form where is a unitary block diagonal matrix and , , arise commonly in the numerical treatment of structured (generalized) eigenvalue problems [1, 2]. In particular any unitary plus low-rank matrix can be reduced in this form by a similarity (unitary) transformation and additionally matrices of this form can be directly generated by linearization techniques based on interpolation schemes applied for the solution of nonlinear eigenvalue problems [6, 18, 9, 7]. The class of unitary block upper Hessenberg matrices perturbed in the first block row or in the last block column includes block companion linearizations of matrix polynomials. These matrices are also related with computational problems involving orthogonal matrix polynomials on the unit circle [22, 21]. Constructing the sequence of orthogonal polynomials w.r.t a different basis modifies the compressed format of the unitary part by replacing the block Hessenberg shape with the block CMV shape [11, 19, 20]. Semiinfinite block upper Hessenberg and CMV unitary matrices are commonly used to represent unitary operators on a separable Hilbert space [3, 12]. Finite truncations of these matrices are unitary block Hessenberg/CMV matrices modified in the last row or column.
In most numerical methods Hessenberg reduction by unitary similarity transformations is the first step towards eigenvalue computation. Recently a fast reduction algorithm specifically tailored for block companion matrices has been presented in [5] whereas some efficient algorithms for dealing with block unitary diagonal plus small rank matrices have been developed in [17]. In particular, these latter algorithms are two-phase: in the first phase the matrix is reduced in a banded form employing a block CMV-like format to represent the unitary part. The second phase amounts to incrementally annihilate the lower subdiagonals of by means of Givens rotations which are gathered in order to construct a data-sparse compressed representation of the final Hessenberg matrix . The representation involves data storage consisting of vectors of length and Givens rotations. This compression is usually known as a Givens–Vector representation [24, 25], and it can also be explicitly resolved to produce a generators-based representation [14, 15]. However, a major weakness of this approach is that both these two compressed formats are not suited to be exploited in the design of fast specialized eigensolvers for unitary plus low rank matrices using ops only.
In this paper we describe a novel backward stable algorithm for computing the Hessenberg reduction of general matrices of the form , where is unitary block diagonal or unitary block upper Hessenberg or block CMV with block size and, in the case is unitary block upper Hessenberg or block CMV, we have the additional requirement that . These families include most of the important cases arising in applications.
This algorithm circumvents the drawback of the method proposed in [17] by introducing a different data-sparse compressed representation of the final Hessenberg matrix which is effectively usable in fast eigenvalue schemes. In particular, the representation is suited for the fast eigensolver for unitary plus low rank matrices developed in [8]. Our derivation is based on three key ingredients or building blocks:
A condensed representation of the matrix (or of a matrix unitary similar to ) which can be specified as , where is the product of unitary lower Hessenberg matrices, is the product of unitary upper Hessenberg matrices and the middle factor is the identity matrix perturbed in the first rows.
In the case matrix is block upper Hessenberg or block diagonal we can obtain the representation in a simple way that we clarify in Section 2.2 and 2.3. In the case is unitary block CMV matrix we provide a suitable extension of the well known factorization of CMV matrices as product of two block diagonal unitary matrices that are both the direct sum of or unitary blocks (compare with [20] and the references given therein). Specifically, block CMV matrices with blocks of size are -banded unitary matrices allowing a ’staircase-shaped’ profile. It is shown that a block CMV matrix with blocks of size admits a factorization as product of two unitary block diagonal matrices with diagonal blocks. It follows that the block CMV matrix can be decomposed as the product of a unitary lower Hessenberg matrix multiplied by a unitary upper Hessenberg matrix. 2. 2.
An embedding technique which for a given triple associated with makes it possible to construct a larger matrix which is still unitary plus rank and it can be factored as , where is the product of unitary lower Hessenberg matrices, is the product of unitary upper Hessenberg matrices and the middle factor is unitary block diagonal plus rank with some additional properties. 3. 3.
A theoretical result which provides conditions under which a matrix specified in the form turns out to be Hessenberg.
Combining together these ingredients allows the design of a specific bulge-chasing strategy for converting the factored representation of into the decomposition of an upper Hessenberg matrix unitarily similar to . The final representation of thus involves data storage consisting of vectors of length and Givens rotations. The reduction to Hessenberg form turns out to have the same asymptotic complexity of eigensolvers for unitary plus low rank matrices and furthermore, this representation is suited to be used directly by the fast eigensolver for unitary plus low rank matrices developed in [8].
The paper is organized as follows. In Section 2 we introduce the representations of unitary plus rank matrices by devising fast algorithms for transforming a matrix into its format provided that belongs to some special classes. In Section 3 we investigate the properties of representations of unitary plus rank Hessenberg matrices and we describe a suitable technique to embed the matrix into a larger matrix by mantaining its structural properties. In Section 4 we present our algorithm which modifies the representation of by computing the corresponding representation of a unitarily similar Hessenberg matrix. Finally, numerical experiments are discussed in Section 5 whereas conclusions and future work are drawn in Section 6.
2 The Format of Unitary plus Rank- Matrices
In this section we introduce a suitable compressed representation of unitary plus rank- matrices which can be exploited for the design of fast Hessenberg reduction algorithms.
Definition 2.1**.**
A unitary plus rank- matrix can be represented in the LFR format if there is a triple of matrices such that:
; 2. 2.
* is the product of unitary lower Hessenberg matrices;* 3. 3.
* is the product of unitary upper Hessenberg matrices;* 4. 4.
* is a unitary plus rank** matrix, where is a block diagonal unitary matrix of the form Q=\left[\begin{array}[]{c|c}I_{k}&\\ \hline\cr&\hat{Q}\end{array}\right], with unitary Hessenberg and .*
In the sequel of this section we present some fast algorithms for computing the format of a unitary plus rank- matrix specified as follows:
- •
, , and is unitary block CMV with block size ;
- •
, , and is unitary block upper Hessenberg with block size ;
- •
, , and is unitary block diagonal with block size .
These three cases cover the most interesting structures of low-rank perturbation of unitary matrices. In the general case of unitary matrices, where it is not known the spectral factorization of the unitary part or the unitary matrix cannot be represented in terms of a linear number of parameters, we cannot expect to recover the eigenvalues – even only of the unitary part – in .
In the following sections we investigates into the above three cases.
2.1 Small Rank Modifications of Unitary Block CMV Matrices
A block analogue of the CMV form of unitary matrices has been introduced in [17, 3].
Definition 2.2** (CMV shape).**
A unitary matrix is said to be CMV structured with block size if there exist non-singular matrices and , respectively upper and lower triangular, such that
[TABLE]
or
[TABLE]
*where the symbol has been used to identify (possibly) nonzero blocks. *
Block CMV matrices are associated with matrix orthogonal polynomials on the unit circle and the structure of the matrix depends on the choice of the starting basis of the set of matrix polynomials to be orthogonalized. In particular, fits the block structure shown in Definition 2.2 if or are considered. In what follows for the sake of simplicity we always assume that satisfies the block structure (1). Furthermore, in order to simplify the notation we often assume that is a multiple of , so the above structures fit “exactly” in the matrix. However, this is not restrictive and the theory presented here continues to hold in greater generality. In practice, one can deal with the more general case by allowing the blocks in the bottom-right corner of the matrix to be smaller.
Notice that a matrix in CMV form with blocks of size is, in particular, -banded. The CMV structure with blocks of size has been proposed as a generalization of what the tridiagonal structure is for Hermitian matrices in [11] and [19]. A further analogy between the scalar and the block case is derived from the Nullity Theorem [16] that is here applied to unitary matrices.
Lemma 2.3** (Nullity Theorem).**
Let be a unitary matrix of size . Then
[TABLE]
where and and are subsets of . If an we have
[TABLE]
From Lemma 2.3 applied to a block CMV structured matrix of block size we find that for :
[TABLE]
which gives
[TABLE]
Pictorially we are observing rank constraints on the following blocks
[TABLE]
and by similar arguments on the corresponding blocks in the upper triangular portion.
In the scalar case with these conditions make it possible to find a factorization of the CMV matrix as product of two block diagonal matrices usually referred to as the classical Schur parametrization [10]. Similarly, here we introduce a block counterpart of the Schur parametrization which gives a useful tool to encompass the structural properties of block CMV representations.
Lemma 2.4** (CMV factorization).**
Let be a unitary CMV structured matrix with blocks of size as defined in Definition 2.2. Then can be factored in two block diagonal unitary matrices of the form:
[TABLE]
*such that has rows and columns and all the other blocks have rows and columns and bandwidth with both and triangular matrices of full rank. Moreover, each matrix admitting such a factored form is in turn CMV. *
Proof 2.5**.**
The proof of this result is constructive, and can be obtained by performing a block QR decomposition. We notice that if we compute a QR decomposition of the top-left block of we have
[TABLE]
where identifies the blocks that have been altered by the transformation and the block in position can be assumed to be the identity matrix. Notice that in the first row the blocks in the second and third columns have to be zero due to being unitary, and that the block is nonsingular upper triangular since it inherits the properties of .
We can continue this process by computing the QR factorization of . Notice that, from the application of the Nullity Theorem 2.3 the block identified by in the picture has rank at most . This also holds for all the other blocks for the same kind. In particular, computing the QR factorization of the first columns and left-multiplying by will put to zero also the block on the right of . We will then get the following factorization:
[TABLE]
*where we notice that, as before, the block is nonsingular upper triangular and that some blocks in the upper part have been set to zero thanks to the unitary property. The process can then be continued until the end of the matrix, providing a factorization of as product of two unitary block diagonal matrices, that is . This factorization can further be simplified by means of a block diagonal scaling with , and unitary matrices determined so that the blocks are of bandwidth , that is the outermost blocks in and are triangular. For the sake of illustration consider and let be a QR decomposition of . By setting we obtain that and, moreover, from it follows that the block of in position also exhibits a lower triangular structure. The construction of the remaining blocks , , proceeds in a similar way. *
Pictorially, the above result gives the following structure of and :
[TABLE]
Now, let us assume that a matrix is such that , , and is unitary block CMV with block size . By replacing with its block diagonal factorization we obtain that . Since the left-hand and the right-hand side matrices are unitary banded it follows that they can both be factored as the product of unitary Hessenberg matrices. Hence, we have the following.
Theorem 2.6**.**
*Let be such that , , and is unitary block CMV with the block structure shown in Equation 1. Then can be represented in the LFR format as where , , is the decomposition provided in Lemma 2.4 and , . *
The overall cost of computing this condensed LFR representation of the unitary plus rank- matrix is flops using memory storage.
2.2 Small Rank Modifications of Unitary Block Hessenberg Matrices
The class of perturbed unitary block Hessenberg matrices includes the celebrated block companion forms which are the basic tool in the construction of matrix linearizations of matrix polynomials. To be specific let be a matrix such that , , and is unitary block upper Hessenberg with block size . A compressed LFR format of a matrix unitarily similar to can be computed as follows. First of all we can suppose that all the subdiagonal blocks , , are upper triangular. If not we consider the unitary block diagonal matrix defined by where , and is a QR decomposition of the matrix , . Then the matrix is such that and is unitary block upper Hessenberg with block size and , . Hence, the matrix is banded with lower bandwidth and therefore the factorization gives a suitable LFR representations of . Summing up we have the following.
Theorem 2.7**.**
*Let be such that , , and is unitary block upper Hessenberg with block size . Then there exists a unitary block diagonal matrix , , such that can be represented in the LFR format as where , and , . *
The overall cost of computing this condensed LFR representation of the unitary plus rank- matrix is flops using memory storage.
2.3 Small Rank Modifications of Unitary Block Diagonal Matrices
The unitary block diagonal matrix reduces to a unitary diagonal matrix up to a similarity transformation which can be performed within operations. The interest toward the properties of block CMV matrices is renewed in [17] where a general scheme is proposed to transform a unitary diagonal plus a rank matrix into a block CMV structured matrix plus a rank perturbation located in the first rows only. More specifically we have the following [17].
Theorem 2.8**.**
*Let be a unitary diagonal matrix and of full rank . Then, there exists a unitary matrix such that is CMV structured with block size and the block structure shown in Definition 2.2 and for some . The matrices and can be computed with operations. *
By applying Theorem 2.8 to the matrix pair we find that there exists a unitary matrix such that is CMV structured with block size and . In view of Lemma 2.4 this yields
[TABLE]
where Since the left-hand and the right-hand side matrices are unitary banded it follows that they can both be factored as the product of unitary Hessenberg matrices. In this way we obtain the next result.
Theorem 2.9**.**
*Let be such that with , and unitary diagonal. Then there exists a unitary matrix such that has the block CMV structure shown in Definition 2.2 and for some . Moreover, can be represented in the LFR format as where , , is the factorization of provided in Lemma 2.4 and , . *
The overall cost of computing this condensed LFR representation of the unitary plus rank- matrix is flops using memory storage.
In the next sections we investigate the properties of the Hessenberg reduction of a matrix given in the format.
3 Factored Representations of Hessenberg Matrices
In this section we investigate suitable conditions under which a factored representation , where is the product of unitary lower Hessenberg matrices, is the product of unitary upper Hessenberg matrices and the middle factor is unitary plus rank, specifies a matrix in Hessenberg form. In Section 4 we will discuss the chasing algorithm for reducing, by unitary similarity, a matrix of the form to Hessenberg form maintaining the factorization and enforcing the properness of the factor to avoid breakdown of the subsequent iterations.
A key ingredient is the properness of the generalized Hessenberg factors.
Definition 3.1**.**
*A matrix is called -upper Hessenberg if when . Similarly, is called -lower Hessenberg if when . In addition, when is -upper Hessenberg (-lower Hessenberg) and the outermost entries are non-zero, that is, (), , then the matrix is called proper. *
Note that for a Hessenberg matrix is proper iff it is unreduced. Also, a -upper Hessenberg matrix is proper iff . Similarly a -lower Hessenberg matrix is proper iff .
An important property of any unitary upper Hessenberg matrix is that it can be represented as product of elementary transformations, i.e., where with G_{\ell}=\left[\begin{array}[]{cc}\alpha_{\ell}&\beta_{\ell}\\ -\beta_{\ell}&\bar{\alpha}_{\ell}\end{array}\right], , , , are unitary Givens rotations and with . In this way the matrix is stored by two vectors of length formed by the elements , and . The same representation also extends to unitary -upper Hessenberg matrices specified as the product of unitary upper Hessenberg matrices multiplied on the right by a unitary diagonal matrix which is the identity matrix modified in the last diagonal entry. Lower unitary Hessenberg matrices can be parametrized similarly as .
Another basic property of unitary plus rank matrices is the existence of suitable embeddings which maintain their structural properties. The embedding turns out to be crucial to ensure the properness of the factor and guarantee the safe application of implicit iterations. The embedding is also important for the bulge chasing algorithm as we explain in the next section. The following result is first proved in [8] and here specialized to a matrix of the form determined in Theorems 2.6, 2.7 and 2.9.
Theorem 3.2**.**
Let be such that , where and are unitary and . Let , , be the economic QR factorization of . Let , , be defined as
[TABLE]
Then it holds
* is unitary;* 2. 2.
the matrix given by
[TABLE]
satisfies
[TABLE]
Proof 3.3**.**
Property 1 follows by direct calculations from
[TABLE]
For Property 2 we find that
[TABLE]
The unitary matrices and given in Theorems 2.6, 2.7 and 2.9 are -Hessenberg matrices. The same clearly holds for the larger matrices and occurring in the factorization of . The next result is the main contribution of this section and it provides conditions under which a matrix specified as a product , where is a unitary -lower Hessenberg matrix is a unitary -upper Hessenberg matrix and is a unitary matrix plus a rank correction, is in Hessenberg form.
In fact, once we apply the embedding described by Theorem 3.2 to , the matrix obtained, , is no more in the LFR format since the middle factor is not in the prescribed format required by Definition 2.1. Moreover is not a proper matrix, making implicit iterations subject to breakdown.
Theorem 3.4**.**
Let , , be two unitary matrices, where is a proper unitary -lower Hessenberg matrix and is a unitary -upper Hessenberg matrix. Let be a block diagonal unitary upper Hessenberg matrix of the form Q=\left[\begin{array}[]{c|c}I_{k}&\\ \hline\cr&\hat{Q}\end{array}\right], with unitary Hessenberg. Let be a unitary plus rank matrix with . Suppose that the matrix satisfies the block structure
[TABLE]
*Then is an upper Hessenberg matrix. *
Proof 3.5**.**
From Lemma 2.3 we find that is nonsingular due to the properness of .
Now, let us consider the matrix . This matrix is unitary with a -quasiseparable structure below the -th upper diagonal. Indeed, for any we have
[TABLE]
Applying Lemma 2.3 we have , implying that also . Since is non singular, we conclude that , .
From this observation we can then find a set of generators and a -upper Hessenberg matrix such that so that [13].
Then we can recover the rank correction from the left-lower corner of obtaining
[TABLE]
since . Notice that is upper Hessenberg as it is the product of a -upper Hessenberg matrix by a -upper Hessenberg matrix. Moreover, we find that since . From the block structure of there follows that
[TABLE]
which gives
[TABLE]
*Hence and therefore which concludes the proof. *
4 The Bulge Chasing Algorithm
In this section we present a bulge-chasing algorithm relying upon Theorem 3.4 to compute the Hessenberg reduction of the matrix given as in Theorem 3.2, i.e., the embedding of . We recall that and are the factors of the economic factorization of .
Let us set
[TABLE]
so that we have
[TABLE]
Observe that and, moreover, which implies . In the preprocessing phase we initialize
[TABLE]
Notice that is a unitary -lower Hessenberg matrix and is a unitary -upper Hessenberg matrix and, therefore, they can both be represented by the product of Hessenberg matrices. This property will be maintained under the bulge chasing process. In the cases considered in this paper, we rely on the additional structure of namely that is also -upper Hessenberg as we can observe from Theorems 2.6, 2.7 and 2.9.
In this section we make use of the following technical result.
Lemma 4.1**.**
Let be a unitary Hessenberg matrix. Let , be a unitary Hessenberg obtained as a sequence of ascending or descending Givens transformations acting on two consecutive rows, i.e. if is lower Hessenberg or if is upper Hessenberg. Then, there exist a unitary Hessenberg matrix (with the same orientation as ) and a unitary Hessenberg matrix such that where
- •
* if is -lower Hessenberg,*
- •
* if is -upper Hessenberg,*
*and has the same orientation of . *
Proof 4.2**.**
We prove the Lemma only in the case is lower Hessenberg and is -upper Hessenberg. We need to move each of the Givens rotations of on the right of . The first Givens rotations of , namely , when applied to do not destroy the -lower Hessenberg structure of , so that still -lower Hessenberg. When we apply to a bulge is produced in position , and we need to apply a rotation on the first two columns of to remove the bulge, i.e. , similarly we can remove each of the remaining Givens rotations. At step we have . The last Givens produces a bulge in position which can be removed by the rotation acting on the columns . We do not need to rotate the columns with indices between and , so that
[TABLE]
*We can similarly prove the remaining three cases. *
The reduction of in Hessenberg form proceeds in three steps according to Theorem 3.4. The first two steps amount to determine a different representation of the same matrix . In particular after these two steps the rank-correction inside the brackets is confined to the first -rows, while the factor on the left of the representation is substituted by a factor which is proper, and still with the lower -Hessenberg structure. The third step is a bulge-chasing scheme to complete the Hessenberg reduction.
(QR decomposition of ) We compute the full QR factorization of . Since is full rank the matrix is invertible and, moreover, the matrix can be taken as a -lower Hessenberg proper matrix (see Lemma 2.4 of [8]). We can write
[TABLE]
Then the matrix is such that
[TABLE]
Notice that is a unitary -upper Hessenberg matrix. Indeed, we have that
[TABLE]
where and since . Therefore, it holds which, for the block diagonal structure of , turns out to be -upper Hessenberg. 2. 2.
(Block decomposition of ) We compute the full QR factorization of . Specifically we determine a unitary matrix such that , and such can be taken in -lower Hessenberg form (see Lemma 2.4 of [8]). The matrix
[TABLE]
where is a unitary -upper Hessenberg matrix, due to the fact that is -upper Hessenberg and is lower triangular. We obtain that
[TABLE]
which gives
[TABLE]
Applying times Lemma 4.1, observing that is banded (i.e. simultaneously -upper and -lower Hessenberg) we can factorize where is a unitary -lower Hessenberg matrix and L_{1}=\left[\begin{array}[]{c|c}I_{k}&\\ \hline\cr&\hat{L}_{1}\end{array}\right] where is a unitary -upper Hessenberg matrix. It follows that
[TABLE]
Where the matrix satisfies \widehat{U}_{2}=\left[\begin{array}[]{c|c}I_{k}&\\ \hline\cr&\tilde{U}_{2}\end{array}\right] where is a unitary -upper Hessenberg matrix, and , where . Observe that and, moreover is nonsingular, because is proper. From Lemma 2.3 this implies the properness of . This property is maintained in the subsequent steps of the reduction process so that the final matrix is guaranteed to be proper as prescribed in Theorem 3.4.
At the end of this step the enlarged matrix has been reduced to a product of a proper -lower Hessenberg matrix on the left, a unitary factor corrected in the first rows i.e., the term inside the brackets, and a -upper Hessenberg matrix, i.e., . Step 3 consists of the reduction of to Hessenberg form so that the final matrix will be unitarly similar to and in the format. 3. 3.
(Hessenberg reduction of ) We now need to work on the representation of in equation (4) to reduce the inner matrix in Hessenberg form by means of a bulge-chasing procedure. Indeed Theorem 3.4 ensures that the matrix obtained will be in the LFR format and in Hessenberg form. These transformations will not affect the properness of the -lower Hessenberg term on the left.
For the sake of illustration let us consider the first step. Let us determine a unitary upper Hessenberg matrix such that
[TABLE]
Then setting , we have
[TABLE]
The application of on the right of the matrix by computing creates a bulge formed by an additional segment above the last nonzero superdiagonal of . This segment can be annihilated by a novel unitary upper Hessenberg matrix whose active part works on the left of by acting on the rows of indices 2 through . We can then apply a similarity transformation to remove the bulge
[TABLE]
where . The active part of , the matrix , acts on the right of producing a bulge which can be zeroed by a unitary upper Hessenberg matrix working on rows from to of . Then, the matrix
[TABLE]
has a bulge on the rows of indices through which can be chased away by a sequence of transformations having the same structure as above. Note that the rank correction of the unitary matrix inside the brackets is never affected by these transformations so that, at the end of the process, we have unitarily reduced to the LFR format in Definition 2.1. Also the zeros in the last rows are preserved.
The cost analysis is rather standard for matrix algorithms based on chasing operations [4].
Step 1 requires to compute the economic QR decomposition of a matrix of size and to multiply a unitary Hessenberg matrix specified as product of unitary Hessenberg matrices by vectors of size . The total cost is ops. 2. 2.
The cost of Step 2 is asymptotically the same. The construction of the factored representation of as well as the computation of and can still be performed using ops. 3. 3.
The dominant cost is the execution of Step 3. The zeroing of the sub-subdiagonal entries costs ops.
In the next section we provide algorithmic details and discuss the results of numerical experiments confirming the effectiveness and the robustness of our proposed approach.
5 Numerical Results
The structured Hessenberg reduction scheme described in the previous section has been implemented using MATLAB for numerical testing. The resulting algorithm basically amounts to manipulate chains of unitary Hessenberg matrices.
At step 1 of the structured Hessenberg reduction scheme we first compute the full QR factorization of the matrix . The matrix turns out to be the product of unitary upper Hessenberg matrices. Then we have to incorporate the unitary matrix on the right into the factored representations of and . The unitary matrix can always be represented as the product of at most elementary unitary transformations of size . Once this factorization is computed, we have to add each of these single transformations, one by one, on the right to the factored representations of and . This is accomplished by a sequence of turnover and fusion operations acting on the chains of elementary transformations in and (see [23] for the detailed description of these operations on elementary transformations).
At the beginning of step 2 the matrix is a -upper Hessenberg matrix, and is essentially determined by the product of two unitary -upper Hessenberg matrices that here we rename as . To reshape this factorization in the desired form in equation (3) we can apply times a reasoning similar to the one done in Lemma 4.1 to move each elementary transformation of on the left. In this way we find where \widetilde{Q}=\left[\begin{array}[]{c|c}I_{k}&\\ \hline\cr&\hat{Q}\end{array}\right] is the matrix appearing in (3). Since is formed by elementary transformations the reshaping costs ops. With a similar reasoning we can compute the representations of and where is -lower Hessenberg and , with unitary -upper Hessenberg.
The third phase of the structured Hessenberg reduction scheme basically amounts to reduce the matrix into a matrix of the form \left[\begin{array}[]{c|c}I_{k}&\\ \hline\cr&\tilde{U}_{2}\end{array}\right], with unitary Hessenberg. To be specific assume that and , where and are unitary upper Hessenberg matrices with the leading principal submatrix of order equal to the identity matrix. The overall reduction process splits into intermediate steps. At each step the first active elementary transformations of are annihilated (in this order). Each transformation is moved on the left by creating a bulge in the leftmost factor . This bulge is removed by applying a similarity transformation.
Let us consider the first step. Let denote the Schur parametrization of and similarly let that of . At this step we move left the first elementary transformations of each factor of the product , for example when moving the rotation in front of the resulting transformation acts on rows and while some of the rotations in and have changed. The final situation is as follows111As observed, we can use only a unitary diagonal matrix to keep track of all the diagonal contributions.
[TABLE]
where
[TABLE]
At this point we bring the bulge on the left of in equation (4) obtaining
[TABLE]
where is the product of a sequence of elementary transformations in ascending order acting on rows . The bulge is removed by chasing an elementary transformation at a time. For example to remove we apply the similarity transformation that will shift down the bulge of positions. So chasing step will be necessary to get rid of that first transformation. In this way the overall process is completed using ops. Note that the whole similarity transformation acts only on the first rows leaving untouched the null rows at the bottom of in equation (2).
Numerical experiments have been performed to confirm the computational properties of the proposed method. Among the three cases considered in Section 2 the last one, when the unitary part is block diagonal, is the most challenging since computing the starting LFR format costs vs the flops sufficient for the first two cases. The CMV reduction of the input unitary diagonal plus rank matrix is computed using the algorithm presented in [17] which is fast and backward stable. Our tests focus on the numerical performance of the Hessenberg reduction scheme provided in the previous section given the factors and satisfying Theorem 2.9. In the next tables we show the backward errors , and generated by our procedure. These errors are defined as follows:
is the error computed at the end of the first two preparatory steps. Given the matrix of size represented as in Theorem 3.2 we find the matrix of size obtained at the end of step 2. Denoting by the computed matrix, the error is
[TABLE] 2. 2.
is the classical *backward * error generated in the final step given by
[TABLE]
where is the matrix computed by multiplying all the factors obtained at the end of the third step, and is the product of the unitary transformations acting by similarity on the left and on the right of the matrix in the Hessenberg reduction phase. 3. 3.
is used to measure the Hessenberg structure of the matrix . It is
[TABLE]
where is the matrix formed by the elements on and below the -th diagonal of .
Next tables report these errors for different values of and .
The results of Table 1,2,3 and 4 show that the proposed algorithm is numerically backward stable.
In order to confirm the cost analysis of the algorithm we have also performed experiments taking fixed the size of the matrix. For matrices of size with varying from 2 to 16 we obtain that the measures of elapsed time satisfy
[TABLE]
This illustrates the linear growth of the cost with respect to , the size of the perturbation.
6 Conclusions and Future Work
In this paper we have presented a novel algorithm for the reduction in Hessenberg form of a unitary diagonal plus rank matrix. By exploiting the rank structure of the input matrix this algorithm achieves computational efficiency both with respect to the size of the matrix and the size of the perturbation as well as numerical accuracy. The algorithm complemented with the structured QR iteration described in [8] yields a fast and accurate eigensolver for unitary plus low rank matrices.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Amiraslani, R. M. Corless, and P. Lancaster , Linearization of matrix polynomials expressed in polynomial bases , IMA J. Numer. Anal., 29 (2009), pp. 141–157, https://doi.org/10.1093/imanum/drm 051 , http://dx.doi.org/10.1093/imanum/drm 051 . · doi ↗
- 2[2] P. Arbenz and G. H. Golub , On the spectral decomposition of Hermitian matrices modified by low rank perturbations with applications , SIAM J. Matrix Anal. Appl., 9 (1988), pp. 40–58, https://doi.org/10.1137/0609004 , http://dx.doi.org/10.1137/0609004 . · doi ↗
- 3[3] Y. Arlinskiĭ , Conservative discrete time-invariant systems and block operator CMV matrices , Methods Funct. Anal. Topology, 15 (2009), pp. 201–236.
- 4[4] J. Aurentz, T. Mach, L. Robol, R. Vandebril, and D. S. Watkins , Core-chasing algorithms for the eigenvalue problem , Fundamentals of Algorithms, SIAM, 2018.
- 5[5] J. Aurentz, T. Mach, L. Robol, R. Vandebril, and D. S. Watkins , Fast and backward stable computation of eigenvalues and eigenvectors of matrix polynomials , Math. Comp., 88 (2019), pp. 313–347, https://doi.org/10.1090/mcom/3338 , https://doi.org/10.1090/mcom/3338 . · doi ↗
- 6[6] A. P. Austin, P. Kravanja, and L. N. Trefethen , Numerical algorithms based on analytic function values at roots of unity , SIAM J. Numer. Anal., 52 (2014), pp. 1795–1821, https://doi.org/10.1137/130931035 , https://doi.org/10.1137/130931035 . · doi ↗
- 7[7] R. Bevilacqua, G. M. Del Corso, and L. Gemignani , A QR based approach for the nonlinear eigenvalue problem , Rendiconti Sem. Mat. Univ. Pol. Torino, 76 (2018), pp. 77–87.
- 8[8] R. Bevilacqua, G. M. D. Corso, and L. Gemignani , Fast QR iterations for unitary plus low rank matrices , 2018, https://arxiv.org/abs/ar Xiv:1810.02708 .
