Refined interlacing properties for zeros of paraorthogonal polynomials on the unit circle
K. Castillo, J. Petronilho

TL;DR
This paper extends known results on the interlacing of zeros of paraorthogonal polynomials on the unit circle, providing a unified approach that relates to matrices similar to unitary upper Hessenberg matrices with positive subdiagonals.
Contribution
It offers a simple, unified extension of interlacing properties for zeros of paraorthogonal polynomials on the unit circle.
Findings
Extended interlacing results for zeros of paraorthogonal polynomials.
Unified approach applicable to matrices similar to unitary upper Hessenberg matrices.
Clarified the relationship between polynomial zeros and matrix characteristics.
Abstract
The purpose of this note is to extend in a simple and unified way the known results on interlacing of zeros of paraorthogonal polynomials on the unit circle. These polynomials can be regarded as the characteristic polynomials of any matrix similar to an unitary upper Hessenberg matrix with positive subdiagonal elements.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Mathematical functions and polynomials · Electromagnetic Scattering and Analysis
Refined interlacing properties for zeros of paraorthogonal polynomials on the unit circle
K. Castillo
and
J. Petronilho
CMUC, Department of Mathematics, University of Coimbra, 3001-501 Coimbra, Portugal
Abstract.
The purpose of this note is to extend in a simple and unified way the known results on interlacing of zeros of paraorthogonal polynomials on the unit circle. These polynomials can be regarded as the characteristic polynomials of any matrix similar to an unitary upper Hessenberg matrix with positive subdiagonal elements.
Key words and phrases:
Paraorthogonal polynomials on the unit circle, zeros, unitary matrices, eigenvalues, interlacing, rank one perturbations.
2010 Mathematics Subject Classification:
15A42
1. Introduction and main result
The study of zeros of orthogonal polynomials on the real line (OPRL) can be regarded as an eigenvalue problem for Jacobi matrices111A symmetric tridiagonal matrix whose next-to-diagonal elements are positive (cf. [27, p. ]).. This allows us to go back to one of the most important single books in the nineteenth century, Cours d’analyse de l’École royale polytechnique (1821) by Cauchy to deduce, at least in the weak sense, the zero interlacing property of consecutive OPRL from the simplest form of the nowadays called Cauchy interlacing theorem. The search of more refined eigenvalue interlacing properties of Jacobi matrices was probably initiated by Cauchy himself in his work Sur l’ Équation à l’ Aide de Laquelle on Détermine les Inegalitées Séculaires des Mouvements des Planètes (1829) and later continued by several authors, including in the second half of the last century Wilkinson [45], Kahan [29], Golub [20], Hill and Parlett [26], and Bar-On [6]. In the same spirit, this work recovers one of the earliest approaches used to study zeros of paraorthogonal polynomials on the unit circle (POPUC), which is based on an eigenvalue problem for certain unitary matrices which bear many similarities with Jacobi matrices (cf. [31, 3, 23, 1, 25, 9, 16, 44, 7, 10, 11, 35, 36, 30, 39, 38, 40]).
Without wishing to delve into a historical discussion222The weakened orthogonality condition that POPUC satisfy appeared in [13, Equation ] as far as we can tell. While it is true that in Geronimus’ 1944 paper [18, Theorem IV] such polynomials were presented., as far as we know, the POPUC333In [13], Delsarte and Genin called to these polynomials (symmetric) predictor polynomials and its weakened orthogonality property quasiorthogonality. In [14], they refer to these polynomials as quasiorthogonal polynomials on the unit circle. This denomination could be also supported by the fact that in 1946 Geronimus regarding to these polynomials wrote that they “…play the same role here as the quasi-orthogonal polynomials of M. Riesz in the Hamburger problem.” (cf. [17, Remark I]). The denomination POPUC was coined in [28]. were introduced (in a somewhat hidden form) and successfully developed in a serie of papers by Delsarte and Genin at the end of the 1980’s [13, 15, 16], when they were working in signal processing. In [16], the authors focuses on the problem of computing the zeros of POPUC regarded as an eigenvalue problem for an unitary upper Hessenberg matrix with positive subdiagonal elements. Elegant and recent proofs of most interlacing properties of zeros of POPUC shared with OPRL are due to Simon [39] (cf. [40, Theorem ]) where the theory of rank one perturbations plays a central role. However, before such work (and references therein) the zeros of POPUC were studied by the Linear Algebra community based on ideas close to those of Simon but supported on more elementary facts. Further analysis of these ideas will allow us to easily extend the known results. Indeed, our main purpose is to prove and improve, in connection with the works of Delsarte and Genin on the subject, the known zero interlacing properties of POPUC, based on the development of the ideas discussed by Arbenz and Golub in [4, Section ]444Such ideas were pioneering employed in the present context by Bohnhorst in her Ph.D. thesis [7] defended in 1993 at the Bielefeld University under the supervision of Elsner..
Here and below, we mainly follow the notation of [35, 36, 40]. Denote by the open unit disk and by its boundary, i.e.,
[TABLE]
Let with () and . Set
[TABLE]
Define the -by- matrix
[TABLE]
where and are given explicitly by
[TABLE]
Any unitary -by- upper Hessenberg matrix with positive subdiagonal elements is uniquely parameterized by real numbers that compose the parameters of the array [22] (cf. [24] and [2, Proposition ]). The resulting matrix after this process is referred as the Schur parametric form of the original matrix. The factorization (1), which is unitarily similar to the Schur parametric form of an upper Hessenberg matrix with positive subdiagonal elements, was presented by Bunse-Gerstner and Elsner [9] (cf. [21, Section ] and [7, Definition and Lemma ]). The explicit unitary pentadiagonal or double-staircase form of (referred as Doppel-Treppen-Matrix in the original German source) was studied extensively by Bohnhorst [7], see Figure 1 for an -by- example (cf. [7, Equation ] and [30, Figure ]). The matrix becomes a very popular object in the Mathematical Physics and Orthogonal Polynomials communities after the work [11], specially after Simon’s monographs [35, 36] where it was called (improper) CMV matrix (cf. [38, 40]).
In order to make the notation more transparent, we write instead of . We choose the representation (1) instead of their unitary similar upper Hessenberg matrix for a technical reason related to the manner in which Lemma 2.1 below is presented. In the next definition and subsequently, denotes the identity matrix, whose order is made explicit or may be inferred from the context.
Definition 1.1** (cf. [39, Proposition ]).**
Let be the matrix given by (1), where () and . The (monic) polynomial defined by
[TABLE]
is the POPUC of degree associated with the array .
It is not difficult to see that the eigenvalues of are simple. This fact was observed in by Geronimus [18, Theorem IV] (cf. [17, Theorem III], [19, Theorem ] and [5, Theorem ]) using the connection between POPUC and OPUC. Note that if were in , then the corresponding characteristic polynomial would be an OPUC and their zeros would be in . A remarkable property of the eigenvectors of is the fact that all their components are nonzero (cf. [35, Chapter ] and references therein). This property is clearly valid also for the corresponding unitarily similar Hessenberg matrix.
Definition 1.2**.**
Two finite subsets and of interlace (resp. strictly interlace) whenever there exist points such that any closed arc (resp. open arc) on connecting two distinct elements of contains at last one element of , and vice versa.
We can now formulate our main result.
Theorem 1.1**.**
Let be a matrix given by (1), where () and . The following sentences hold:
- (i)
Let and define () and . Then the eigenvalues of and strictly interlace on for each . 2. (ii)
For each , let , and let be partitioned as
[TABLE]
* being the -by- leading principal submatrix of . For each , define recursively the numbers*555In [15] (cf. [16, Equation ]), Delsarte and Genin have shown that if the ’s (known as pseudo reflection coefficients) are given by (3), then the corresponding POPUC satisfy a three-term recurrence relation (cf. [12]). Bunse-Gerstner and He [10] have provided an illuminating discussion of the works of Delsarte and Genin on POPUC in matrix terms.**
[TABLE]
Set666 denotes the spectrum of .**
[TABLE]
where , with , and
[TABLE]
Then and have at most common eigenvalues. More precisely, and have as the set of common eigenvalues, being also given by the alternative expression
[TABLE]
Furthermore, the elements of the sets \sigma\big{(}\mathcal{C} \alpha_{n-1},b_{n})\big{)}\setminus{A} and \sigma\big{(}\mathcal{C}(\alpha_{0}, b_{m})\big{)}\,\cup\,{B} strictly interlace on .
Let be the POPUC of degree associated to the array . Since is a permutation matrix, it follows that . The sequence (all of whose zeros are roots of unity) produce, by geometric intuition, illuminating examples that fall within Theorem 1.1.
Example 1.1**.**
Let and be the POPUC associated to the arrays and , respectively. In this situation,
[TABLE]
and, therefore,
[TABLE]
In the notation of Theorem 1.1 we have , , , , and , where is obtained by using any of the expressions outlined in Theorem 1.1. Clearly, and have as the set of common eigenvalues and the elements of the sets and strictly interlace on , in concordance with sentence (ii) of Theorem 1.1.
Regarding Theorem 1.1, as far as we know, sentence (i) for was proved by Ammar, Gragg and Reichel [1, Proposition ], although the particular case is known since Geronimus’ work [18, Theorem IV] (cf. [17, Theorem III]). The sentence (i) for was proved by Bohnhorst in [7, Theorem ] (cf. [8, Theorem ]). In [39, Theorem ], Simon proved a weaker version of sentence (ii) that reads as follows: Strictly between any pair of eigenvalues of there is at least one eigenvalue of .
Corollary 1.1**.**
Let be a matrix given by (1), where () and . Let and define as in (4) for . Then and have at most one common eigenvalue. More precisely, either and have as (only) common eigenvalue and the elements of \sigma\big{(}\mathcal{C}(\alpha_{0},\dots,\alpha_{n-1},b_{n})\big{)}\setminus\{\overline{b}_{n}\gamma_{n-1}\} and \sigma\big{(}\mathcal{C}(\alpha_{0},\dots,\alpha_{n-2}, b_{n-1})\big{)} strictly interlace on , or else and have no common eigenvalues, and in such case is not an eigenvalue of either, and the elements of the sets \sigma\big{(}\mathcal{C}(\alpha_{0}, b_{n})\big{)} and \sigma\big{(}\mathcal{C}(\alpha_{0}, \dots,\alpha_{n-2},b_{n-1})\big{)}\cup\{\overline{b}_{n}\gamma_{n-1}\} strictly interlace on .
Proof.
Take in Theorem 1.1. Hence, (3) and (4) yield which, in turn, is equal to . Then either and if \overline{b}_{n}\gamma_{n-1}\in\sigma\big{(}\mathcal{C}(\alpha_{0},\dots,\alpha_{n-2},b_{n-1})\big{)}, or else and otherwise. The result follows immediately from sentence (ii) of Theorem 1.1. ∎
Corollary 1.1 was proved by Bohnhorst [7, p. ] (cf. [8, p. ]) and rediscovered by Simon [39, Theorem ]. It is worth noting that in view of Corollary 1.1 and besides the several and well-known practical consequences, POPUC answered the following open-ended question proposed by Turán as far as 1974 [42, Problem LXVI, p. 60]: “It is known that the zeros of the th orthogonal polynomial (with respect to a Lebesgue-integral function on an interval) separate the zeros of the th polynomial. What corresponds to this fact on the unit circle?”777We quote the English translation provided by Szüsz [43, Problem LXVI]..
2. Proof of Theorem 1.1
2.1. Some preliminary lemmas
Theorem 1.1 will be proved through the following sequence of lemmas.
Lemma 2.1**.**
Let and be unitary matrices of the same order and suppose that Then and have interlacing eigenvalues on . Moreover, assume that admits a decomposition and let be partitioned as
[TABLE]
* and being of the same order. Set , , and . Assume further that the eigenvalues of and are simple and . Then the elements of the sets U\setminus\big{(}U_{1}\,\cap\,U_{2}\big{)} and U_{1}\,\cup\,\big{(}U_{2}\backslash(U_{1}\,\cap\,U_{2})\big{)} strictly interlace on .*
Proof.
The first sentence of the lemma is the simplest form of a result due to Arbenz and Golub [4, Section ] (cf. [7, Theorem ] and [8, Theorem ])888It can be deduced directly using [32, p. ] and [27, Corollary ].. In order to deduce the second one, we first claim that
[TABLE]
Indeed, since , there exist nonzero vectors ( being the common order of and ) such that . Using the formula for the determinant of a rank one perturbation (cf. [34, Proposition ]), we may write for each 999 denotes the characteristic polynomial of .
[TABLE]
Let be the spectral decomposition of in which and . Thompson-McEnteggert’s formula for the adjugate [41] (cf. [33, Theorem ]) gives
[TABLE]
where the prime denotes the derivative. Combining (6) with (7) yields101010The eigenvalue interlacing already stated implies , and so and .
[TABLE]
We next claim that if 111111Given a set and , we define ; if , then ., then . We only prove that implies . (To prove that implies , we proceed similarly, as well as for proving that implies .) Indeed, suppose that and . Since there is a normalized eigenvector of associated with such that , we deduce
[TABLE]
hence , contrary to . Consequently, (5) follows from (8). Finally, it follows from (5) that the sets U\setminus\big{(}U_{1}\,\cap\,U_{2}\big{)} and U_{1}\,\cup\,\big{(}U_{2}\backslash(U_{1}\,\cap\,U_{2})\big{)} have no common elements, thus the second sentence of the lemma follows from the first one. ∎
Lemma 2.2**.**
Let be a unitary matrix and for a fixed let be the diagonal matrix obtained from the identity matrix by replacing the entry with a number on . Assume that and have the same order. Assume further that the eigenvalues of are simple and all its eigenvectors have a nonzero component at the position . Then and have strictly interlacing eigenvalues on .
Proof.
Without loss of generality we can assume that , and so with . Let be the spectral decomposition of in which and . Arguing as in the proof of Lemma 2.1 we have
[TABLE]
Let be the first component of the vector . Then
[TABLE]
Thus the result follows from (9) and the first sentence of Lemma 2.1. ∎
Lemma 2.3**.**
Let () and . The following sentences hold:
- (i)
Let be a diagonal matrix obtained from the -by-* identity matrix by replacing one of its diagonal entries with a number on . Then and have strictly interlacing eigenvalues on .* 2. (ii)
Let be partitioned as in (2). Then, for each , has no eigenvalues on .
Proof.
(i) The result follows directly from Lemma 2.2 and the fact that all the components of the eigenvectors of are nonzero.
(ii) Assume that is even. Note that is the -by- trailing principal submatrix of each of the matrices and , where . Suppose the assertion (ii) is false. Since and are unitary matrices, these matrices share all the eigenvalues of on , which contradicts sentence (i). If is odd, we argue in the same way noting that is the -by- trailing principal submatrix of each of the matrices and . ∎
Lemma 2.4**.**
Let () and . Let be partitioned as in (2), where . Let and define via (3) for each . Then and have at most common eigenvalues, which consist of the set of different solutions of the equation on .
Proof.
We begin by noting that
[TABLE]
for each . Indeed, by sentence (ii) of Lemma 2.3, is nonsingular, hence (10) follows from the equality (cf. [7, Equation ])
[TABLE]
after taking into account that the Schur complement of in is \zeta\mathcal{I}-\big{(}\mathcal{C}_{11}-\mathcal{C}_{12}(\mathcal{C}_{22}-\zeta\mathcal{I})^{-1}\mathcal{C}_{21}\big{)}. The result follows from (10) and the fact that for , with , and have no common eigenvalues (see e.g. [40, Theorem ]; alternatively, apply sentence (i) of Lemma 2.3). ∎
2.2. Proof of Theorem 1.1
(i) Let , , and , where is a -by- diagonal matrix. Note that
[TABLE]
Using (11) it is easily seen that 121212A different proof is given in [37, Theorem ].
[TABLE]
when is even. Similarly, the transpose of (11) leads to
[TABLE]
when is odd. The result follows from sentence (i) of Lemma 2.3.
(ii) Define the block diagonal matrix , where
[TABLE]
Hence
[TABLE]
when is odd, and
[TABLE]
when is even. Note that has simple eigenvalues (on ) by sentence (i) of Lemma 2.3. The result follows from Lemma 2.1, sentence (ii) of Lemma 2.3, and Lemma 2.4.
Acknowledgment
The authors thank the Bielefeld University Library for kindly sending them a hard copy of Birgit Bohnhorst’s Ph.D. Thesis. KC is supported by the Portuguese Government through the Fundação para a Ciência e a Tecnologia (FCT) under the grant SFRH/BPD/101139/2014. This work is partially supported by the Centre for Mathematics of the University of Coimbra – UID/MAT/00324/2013, funded by the Portuguese Government through FCT/MCTES and co-funded by the European Regional Development Fund through the Partnership Agreement PT2020. JP is also partially supported by Dirección General de Investigación Científica y Técnica, Ministerio de Economía y Competitividad of Spain under the project MTM2015–65888–C4–4–P.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. Ammar, W. Gragg, and L. Reichel. Constructing a unitary Hessenberg matrix from spectral data. In Numerical Linear Algebra, Digital Signal Processing and Parallel Algorithms (Leuven, 1988) , volume 70 of NATO Adv. Sci. Inst. Ser. F Compt. Systems Sci. , pages 385–395, Berlin, 1991. Springer.
- 2[2] G. S. Ammar and W. B. Gragg. Schur flow for orthogonal Hessenberg matrices. In Hamiltonian and gradient flows, algorithms and control , volume 3 of Fields Inst. Commun. , pages 27–34, Providence, RI, 1994. Amer. Math. Soc.
- 3[3] G. S. Ammar, W. B. Gragg, and L. Reichel. On the eigenproblem for orthogonal matrices. In 25th IEEE Conference on Decision and Control , pages 1963–1966, Athens, Greece, 1986.
- 4[4] P. Arbenz and G. H. Golub. On the spectral descomposition of Hermitian matrices modified by low rank perturbations with applications. SIAM J. Matrix Anal. Appl. , 9:40–58, 1988.
- 5[5] F. V. Atkinson. Discrete and continuous boundary problems , volume 8 of Mathematics in Science and Engineering . Academic Press, New York-London, 1964.
- 6[6] I. Bar-On. Interlacing properties of tridiagonal symmetric matrices with applications to parallel computing. SIAM J. Matrix Anal. Appl. , 17:548–562, 1996.
- 7[7] B. Bohnhorst. Beiträge zur numerischen Behandlung des unitären Eigenwertproblems . Ph D thesis, Fakultät für Mathematik, Universität Bielefeld, Bielefeld, Germany, 1993.
- 8[8] B. Bohnhorst, A. Bunse-Gerstner, and H. Faßbender. On the perturbation theory for unitary eigenvalue problems. SIAM J. Matrix Anal. Appl. , 21:809–824, 2000.
