
TL;DR
This paper simplifies Kummert's approach to realizing matrix-valued rational inner functions on the bidisk, extends it to isometric functions on the two-torus, and proves the existence of finite-dimensional realizations and special nilpotent structures.
Contribution
It provides a simplified proof of minimal unitary realizations, extends results to isometric functions, and establishes finite-dimensional realizations for matrix-valued Schur functions.
Findings
Every matrix-valued rational inner function in two variables has a minimal unitary transfer function realization.
Two-variable matrix-valued rational Schur functions have finite-dimensional contractive realizations.
Polynomial inner functions in two variables have transfer function realizations with nilpotent linear combinations.
Abstract
We give a simplified exposition of Kummert's approach to proving that every matrix-valued rational inner function in two variables has a minimal unitary transfer function realization. A slight modification of the approach extends to rational functions which are isometric on the two-torus and we use this to give a largely elementary new proof of the existence of Agler decompositions for every matrix-valued Schur function in two variables. We use a recent result of Dritschel to prove two variable matrix-valued rational Schur functions always have finite-dimensional contractive transfer function realizations. Finally, we prove that two variable matrix-valued polynomial inner functions have transfer function realizations built out of special nilpotent linear combinations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Kummert’s approach to realization on the bidisk
Greg Knese
Washington University in St. Louis
Department of Mathematics & Statistics
St. Louis, MO 63130
Abstract.
We give a simplified exposition of Kummert’s approach to proving that every matrix-valued rational inner function in two variables has a minimal unitary transfer function realization. A slight modification of the approach extends to rational functions which are isometric on the two-torus and we use this to give a largely elementary new proof of the existence of Agler decompositions for every matrix-valued Schur function in two variables. We use a recent result of Dritschel to prove two variable matrix-valued rational Schur functions always have finite-dimensional contractive transfer function realizations. Finally, we prove that two variable matrix-valued polynomial inner functions have transfer function realizations built out of special nilpotent linear combinations.
Key words and phrases:
Inner function, transfer function realization, Schur-Agler class, Agler decomposition, Schur class, bidisk, polydisk, bidisc, polydisc, Fejér-Riesz lemma
2010 Mathematics Subject Classification:
Primary 47A57; Secondary 32A17, 30H05, 30J05
Partially supported by NSF grant DMS-1900816
1. Introduction
The goal of this paper is to give a simple proof and several applications of the following theorem.
Theorem 1.1** (Main Theorem).**
Assume is rational with no poles in and satisfies on away from the zero set of the denominator of .
Then, there exist an integer and an isometric matrix such that
[TABLE]
where and are orthogonal projections with .
Above is the unit bidisk and is the two-torus (or bitorus). We shall call functions that satisfy the hypotheses of this theorem rational iso-inner functions. Formulas in the conclusion of this theorem such as (1.1), which are built out of block operators, will be called transfer function realizations (or TFRs). If the operator is a finite matrix we will call it a finite TFR and if we have extra information about the operator involved we will incorporate it into the terminology. For example, the above theorem asserts the existence of a “finite isometric TFR” for two variable rational iso-inner functions.
This theorem is due to Kummert in the square case [Kummert89]. Kummert’s theorem was ahead of its time and its proof was both ingenious and largely elementary. At the same time, Kummert’s argument seems complicated and the engineering terminology may obscure the underlying concepts for some, so one of our main goals is to give a simplified, conceptual, and entirely mathematical account of Kummert’s approach. We also give an algorithm for constructing the matrix . Motivation for doing so comes from recent interest in the wavelet community in transfer function formulas in one and several variables [CCCP]. We have presented generalizations of our simplified argument in a couple of papers [K11, GIK16], but the generalizations can also potentially obscure the underlying concepts. A minor adjustment allows us to treat the non-square case , which in turn allows us to give possibly the most elementary and direct proof of the following seminal theorem of Agler.
Theorem 1.2** (Agler [Agler1, Agler2]).**
Let be holomorphic and for all . Then, has a contractive TFR: there exists a contractive operator on some Hilbert space with block decomposition such that
[TABLE]
where and are pairwise orthogonal orthogonal projections which sum to the identity on the domain of .
Perhaps, the most important application of this theorem is a Pick interpolation theorem for holomorphic functions on the bidisk. For this and other applications we refer the reader to the book [AMbook] and the papers [AMcrelle, AMYmonotone, AMYcara, BT98].
Dritschel has recently proven a strong Fejér-Riesz type of result in two variables (Theorem 6.7) which makes it possible to prove that every two-variable rational function bounded by one in norm on (with no assumptions on boundary behavior) has a finite contractive TFR.
Theorem 1.3**.**
Let be rational with no poles in and assume for all . Then, there exists a contractive matrix such that
[TABLE]
where , are orthogonal projections with .
A very important bonus of Kummert’s approach is that it constructs the matrix in Theorem 1.1 with the minimal possible dimensions in a strong way. For a rational iso-inner function we can always make sense of for each fixed and this is a one variable rational iso-inner function (Lemma 4.3). If we have a formula as in Theorem 1.1 where the ranks of are then we can construct a transfer function realization for with size and a transfer function realization for with size . In the square case , this can be done optimally.
Theorem 1.4** (Kummert’s minimality theorem).**
Suppose is rational and inner. Then, one can choose in Theorem 1.1 so that the ranks of are simultaneously minimal: is the maximum of the minimal size of a unitary TFR for where varies over and is the maximum of the minimal size of a unitary TFR for where varies over .
In particular, among all possible unitary TFR’s for , neither nor can be smaller than those in Kummert’s construction. We will give a conceptual proof of Kummert’s minimality theorem, and clarify why this is the best possible result. Before the mathematical community knew of Kummert’s results, this result was reproven in the scalar case using the framework of Geronimo-Woerdeman [GW04] in [GKAPDE]. Later, Theorem 1.4 was also proven using Hilbert space methods in [BickelKnese]. The scalar minimality theorem was crucial in giving a characterization of two-variable rational matrix-monotone functions in [AMYmonotone]. It is also useful in proving determinantal representations for certain families of polynomials with no zeros in [GKdv].
We shall present a new application of the minimality theorem which has some relevance to the applications of this theory to wavelets in [wavelet, CCCP]. In these papers matrix-valued polynomial inner functions are of particular interest.
Theorem 1.5**.**
Let and assume on . Then, in Theorem 1.1 can be chosen with .
Note this means is nilpotent for every .
1.1. Guide to the reader
This paper is structured so that it can hopefully be read by a broad audience. We make no mention of systems theory terminology (except for “transfer function”) and we make no use of von Neumann inequalities and related operator theory originally used in the proof of Agler’s theorem. (We do discuss some of this for context in Section 6.) Our first goal is to quickly and simply prove Kummert’s Theorem 1.1 and explain how this proves Agler’s theorem. Some readers may be satisfied with this quick and mostly constructive approach to these results and can stop after Section 6. After that we introduce the technicalities necessary to prove Kummert’s minimality theorem and give an application to inner polynomials. We include an appendix with extra background.
1.2. Acknowledgments
This article overlaps with the interesting article of J. Ball [Ball] in some ways: both survey Agler decompositions on the bidisk/polydisk but Ball’s article follows Kummert’s original argument closely. Ball’s paper also discusses connections to the engineering literature and several other classes holomorphic functions. The present article and author owe a great debt to Professor Ball for disseminating Kummert’s argument to the mathematical community.
This article was motivated by the workshop “Mathematical Challenges of Structured Function Systems” at the Erwin Schrödinger Institute. I thank ESI as well as the workshop organizers (M. Charina, K. Gröchenig, M. Putinar, and J. Stöckler). The article [wavelet] was helpful in preparing this paper. I thank M. Dritschel for reading an early draft of this paper. I also thank K. Bickel for suggesting to me to write this paper. Finally, I sincerely thank the referee for several suggestions which greatly improved this paper.
Contents
2. Finite-dimensional transfer function realizations
One of the fundamental things that Agler did in his original proof of Theorem 1.2 was connect TFRs to certain formulas now called Agler decompositions which involved positive semi-definite kernels. The following theorem establishes some basic equivalences about finite TFRs and finite-dimensional Agler decompositions which hold not just on but any polydisk . Note that “matrix” below always refers to a finite matrix.
Theorem 2.1** (Equivalences Theorem).**
Let be a function.
The following are equivalent:
- (1)
There exists a contractive matrix such that
[TABLE]
where , for some pairwise orthogonal projections with . 2. (2)
There exist matrix functions and a constant contractive matrix such that
[TABLE] 3. (3)
There exist matrix functions such that
[TABLE]
We also have the following bonuses:
**B1: **
Assuming (1)-(3), , are all rational and for all . If we assume at the outset that is holomorphic, then item (3) need only hold initially on an open set in order for it to hold globally.
**B2: **
The that works in (1) also works in (2).
**B3: **
We also get equivalences if we replace “contractive” in (1) and (2) with “isometric” and with [math] in (3). In this case, is iso-inner and analytic outside the zeros of .
Proof.
. It helps to define . Let be the projection matrix for the block corresponding to . Then, the equation in (2) can be written as
[TABLE]
for . Block-by-block this says
[TABLE]
which yields and then .
. We simply define . Then, (2.1) holds because
[TABLE]
. The given equation implies
[TABLE]
Let and . Then,
[TABLE]
and this rearranges exactly into the equation in (3).
. This is known as a lurking isometry argument. The map
[TABLE]
extends linearly and in a well-defined way to an isometric map from the span of the vectors on the left to the span of the vectors on the right as varies over . We can extend this to an isometric matrix satisfying
[TABLE]
which we can compress to get a contractive matrix satisfying the equation in (2).
The bonus results follow. For (B1), is rational and bounded in operator norm by by (1) and (3). The matrix functions are rational by the proofs of and . If we assume is holomorphic and (3) only holds on an open set, then all of the proofs work on this restricted set but automatically extend holomorphically to by the matrix formulas. Bonus (B2) follows from the proof of . For bonus (B3), notice that if is an isometric matrix, then we have in the proof and if we start with we get to be isometric in the proof since no compression is necessary. Finally, is iso-inner because we can insert into condition (3) to see at least away from the zero set of which is a denominator for the and by the formula in (2) (1). ∎
The next proposition says the conditions of Theorem 2.1 are also equivalent to being a submatrix of a rational inner function possessing a finite-dimensional unitary transfer function realization. Moreover, the various sizes of the transfer function realizations stay the same. To be more precise, let be the rank of in condition (1) of Theorem 2.1. Then, will be called the size breakdown of the TFR. This terminology is endemic to this paper. The size of the TFR will refer to . Note that also equals the number of rows of in conditions (2) and (3) of Theorem 2.1.
Proposition 2.2**.**
Let be a function which has a finite contractive TFR with size breakdown . Then, there exists and a matrix rational inner function with finite unitary TFR with size breakdown such that is a submatrix of .
As a sort of converse, every submatrix of has a finite contractive TFR with same size breakdown.
Proof.
Suppose has a finite contractive TFR given via contractive . Every contractive matrix is a submatrix of a finite unitary, say . If we rearrange rows and columns we may write
[TABLE]
If
[TABLE]
then .
This same type of observation shows that every submatrix of has a finite contractive TFR. ∎
The following is referred to as the adjunction formula in [wavelet].
Proposition 2.3**.**
Let be a function with a finite contractive TFR given via a matrix as in (1),(2) of Theorem 2.1. Set . Then, has a finite contractive TFR given via .
In particular, if is isometric, then has a finite coisometric TFR.
Proof.
With we have
[TABLE]
which is exactly condition (1) of Theorem 2.1 with in place of . ∎
3. One variable version of Theorem 1.1
We now prove a detailed one variable version of the Main Theorem (Thm 1.1). If is a rational iso-inner function, then on away from zeros of , but then on all of by continuity.
Theorem 3.1**.**
Assume has no zeros in , , and on . Let be the maximum of the degrees of and the entries of . Then,
[TABLE]
where is a positive semi-definite matrix whose entries can be expressed as polynomials in the coefficients of . Furthermore, is a positive semi-definite kernel whose rank matches the rank of the matrix .
Positive semi-definite kernels are reviewed in Definition 10.5 and the rank of such a kernel is defined in Definition 10.6 in the Appendix.
The theorem allows for common zeros of and which is important in using this result in two variables. It immediately follows that possesses an isometric TFR because we can factor where is an matrix. Then, for we have
[TABLE]
By Theorem 2.1 we see that has an isometric TFR. After the proof of Theorem 3.1 we give an explicit way to find a formula for an isometry out of which a TFR for can be built. We need a standard lemma to prove Theorem 3.1. We give the short proof in the appendix; see Subsection 10.3.
Lemma 3.2**.**
Assume is analytic and in . Then, the kernel
[TABLE]
is positive semi-definite.
The swapping of , is deliberate and is discussed in the proof in the appendix.
Proof of Theorem 3.1.
By analyticity on . This implies the polynomial in
[TABLE]
is divisible by and hence we can write (3.1) where is indeed a matrix whose entries are polynomials in the coefficients of . We could solve for them but we do not need to. By Lemma 3.2, in (3.2) is positive semi-definite. Multiplying through by we have that as in (3.1) is a positive semi-definite matrix-valued polynomial function of bounded degree.
To show is positive semi-definite, take any and note that
[TABLE]
is positive semi-definite where is the block Vandermonde matrix . If the are all distinct then is invertible which implies that is positive semi-definite. The above computation also shows that the rank of equals the rank of , although we omit some details.
∎
Remark 3.3**.**
We now explain how to find an isometry out of which a TFR for can be built. This will closely parallel our approach in the two variable setting. We first factor where is with . Then, will possess a right inverse , namely . Set . To find such that
[TABLE]
we write out and extracting coefficients we equivalently need to satisfy
[TABLE]
The matrix has right inverse
[TABLE]
where so that
[TABLE]
Thus, can be computed directly from .
4. Two variables and Theorem 1.1
The basic idea of Kummert’s argument is to attempt a parametrized version of the one variable theorem above. The matrix Fejér-Riesz factorization in one variable, which we now review, then becomes crucial in attempting a parametrized version of the implication (3) (2) in the Equivalences Theorem (Thm 2.1).
Theorem 4.1** (Matrix Fejér-Riesz).**
Let be a matrix Laurent polynomial () such that for . Then, there exist a natural number , a matrix polynomial with for , and a polynomial matrix with polynomial inverse such that for we have
[TABLE]
Furthermore, has degree at most and a right rational inverse which is analytic in .
The case where is positive definite at all points of is usually attributed to Rosenblatt [Rosenblatt]. If vanishes at a finite number points, it is possible to factor out these zeros from ; see [DGK, Dj]. If is identically zero, it is possible to use operator-valued versions of this theorem which guarantee an outer factorization of . We explain how to go from the case of to the case in the appendix (subsection 10.2). The factorization above can be computed using semidefinite programming or Riccati equations (see for instance [Hachez]).
Theorem 4.1 in particular shows that has rank except at the finite number of zeros of . One nice application of Theorem 4.1 is the one variable version of Theorem 1.3.
Proposition 4.2**.**
Let be rational and for all . Then, has a finite contractive TFR.
Proof.
Write . Then, is positive semi-definite on . By Theorem 4.1, there exists a matrix polynomial such that on . Then, is iso-inner and by Theorem 3.1 possesses a finite isometric TFR. By Proposition 2.2, we see that possesses a finite contractive TFR. ∎
The following lemma lets us apply Theorem 3.1 to one variable slices.
Lemma 4.3**.**
Suppose is rational and iso-inner. Write where , has no zeros in , and , have no common factors. Then, on and for each , the one variable polynomial has no zeros in .
Proof.
As in one variable, on by continuity. For fixed notice that either has no zeros in or is identically zero by Hurwitz’s theorem (by considering as a limit of ). If is identically zero, then is identically zero because of on . Hence both polynomials are divisible by contradicting the assumption of no common factors. Thus, for every , has no zeros in . ∎
We are now ready to prove the Main Theorem (Thm 1.1).
Proof of Theorem 1.1.
Assume the setup of Theorem 1.1 and write as in Lemma 4.3. We can essentially follow a parametrized version of Remark 3.3 but we use the matrix Fejér-Riesz theorem to deal with certain matrix factorizations.
Step 1: Fix , divide by , and then extract the coefficients of to obtain
[TABLE]
where is a positive semi-definite matrix Laurent polynomial. This follows from Theorem 3.1 applied to . Here is the maximum of the degree of with respect to .
Step 2: Apply the matrix Fejér-Riesz theorem (Thm 4.1) to to get an matrix polynomial and an analytic (in ) rational matrix function such that on and in . For convenience we define
[TABLE]
Then, for and
[TABLE]
By Lemma 4.3, for each fixed the map is an iso-inner rational function and Theorem 2.1 guarantees the existence of an isometric matrix such that
[TABLE]
Step 3: In this step we find a formula for and show it extends to as a rational iso-inner function in one variable. We can rewrite (4.2) in terms of the coefficients of the powers of by writing and , defining , and . Then,
[TABLE]
using to denote the zero matrix. Since has no zeros in , the matrix has a rational matrix right inverse of the form . The exact formula for is . Then,
[TABLE]
extends to a rational function holomorphic in and isometry-valued on away from any singularities. So, not only is uniquely determined (by ) and iso-inner but both sides of (4.3) are now holomorphic, so (4.3) extends to . (We caution that the blocks in (4.4) do not line up as written. There is no need to multiply this out, so there is no real concern.)
Step 4: In this step we find an isometric matrix such that has a TFR built out of . It turns out as a one variable function has a TFR built out of the same isometry . Indeed, by Theorem 3.1 and Theorem 2.1 there exist a constant isometric matrix and matrix function such that
[TABLE]
A formula for can be found via Remark 3.3. As we now show, is the isometry we are looking for. If we multiply on the right by and define , we get
[TABLE]
By Theorem 2.1, this means has a finite-dimensional isometric transfer function realization built out of the isometry . This proves Theorem 1.1. ∎
When we prove the minimality theorem (Thm 1.4) we will pick up where this proof leaves off. We will later refer to as the dominant -term associated to , while we will refer to as the sub-dominant -term. We write instead of because the former are uniquely determined while are determined up to left multiplication by isometric matrices. By symmetry we could also construct a dominant -term with associated sub-dominant -term.
5. Detailed example
In this section we give a detailed example of the 4 steps presented in the proof of Theorem 1.1. The identity matrix is written , the zero matrix is written , and the zero matrix is written .
Consider the following simple rational inner function
[TABLE]
where is a unitary. The right expression shows is a product of inner functions and is therefore inner itself. Since is a polynomial the process below will be simpler than the general case but still illustrative. Note then that referring to the proof of Theorem 1.1 we have and .
Step 1: Set , divide by , and extract coefficients of the monomials in order to write
[TABLE]
where is the matrix Laurent polynomial
[TABLE]
Necessarily, is positive semi-definite on .
Step 2: Factor according to the one variable matrix Fejér-Riesz theorem. There exist algorithms for doing this ([Hachez]) and it can also be essentially reduced to polynomial algebra and one variable Fejér-Riesz factorizations (see [Dj] where this is done in a more general setup). We get on where
[TABLE]
has right inverse
[TABLE]
We use the equations above to define the matrix polynomials . Note that the right inverse in general could be rational.
Step 3: We find our parametrized unitary in this step. Form the “vectors” of coefficients
[TABLE]
where
[TABLE]
and then compute the one variable rational inner function as in (4.3)
[TABLE]
The fourth step is to find a TFR for . To do this we apply Remark 3.3. Let us emphasize the steps. Divide by and extract coefficients of to write
[TABLE]
where
[TABLE]
Then, we factor where
[TABLE]
Note that
[TABLE]
is a right inverse for (i.e. ). Set
[TABLE]
We need to compute the unitary (or isometry in general) such that
[TABLE]
After equating coefficients of powers of this is equivalent to
[TABLE]
where . Using the right inverse we have
[TABLE]
This is the desired unitary out of which we build our TFR. Setting
[TABLE]
[TABLE]
we have
[TABLE]
where . This is easy to verify since so that the formula reduces to
[TABLE]
which can be verified by hand.
While the above method involves several steps it is entirely systematic. Since is a product of simple inner functions, there are ad hoc ways of coming up with a TFR which might be shorter.
6. Matrix Agler decompositions in two variables
Theorem 1.1 makes it possible to prove Agler’s theorem (Thm 1.2). Cole-Wermer [CW99] showed that in the scalar case it is enough to prove Agler’s theorem for rational inner functions because holomorphic can be approximated locally uniformly by rational inner functions (Theorem 5.5.1 of Rudin [Rudin]). This approximation argument does not seem to transfer to the matrix-valued function setting, but there is a workaround.
Lemma 6.1**.**
Let be holomorphic and for all . Suppose for some . Then, there exist unitary matrices such that is a direct sum of a constant unitary matrix and a matrix valued holomorphic function on with for all .
Proof.
If , then there exists with such that . By the maximum principle, is constant and equal to one. Then, by equality in Cauchy-Schwarz, . Since has at most norm one, is reducing for meaning whenever . Thus, can be written in the form
[TABLE]
using the block decomposition . We can of course iterate this argument until we are left with the claimed decomposition. ∎
This lets us reduce to the case of with for all . The following is found in Rudin’s book [Rudin] in the scalar case (see Theorem 5.5.1 of [Rudin]). Define
[TABLE]
Lemma 6.2**.**
Suppose is holomorphic and for all . Then, for any and there exists such that and .
Consequently, every such is a local uniform limit of matrix polynomials with supremum norm strictly less than .
Proof.
Set for . For fixed there exists such that since is uniformly continuous on . Note Choose a Taylor polynomial of such that . Then, and . ∎
We need the following Fejér-Riesz type theorem of Dritschel.
Theorem 6.3** (Dritschel [mD1]).**
Let be a matrix-valued Laurent polynomial in variables; i.e. for and at most finitely many . If there is a such that on , then there exists a matrix polynomial such that on .
We sketch a simple proof with some new elements in the appendix; see Subsection 10.2.
Lemma 6.4**.**
If is a matrix polynomial such that then there exists a matrix polynomial such that is iso-inner. If , then has a finite contractive TFR.
Proof.
On , is a positive definite matrix Laurent polynomial. By Theorem 6.3 we can factor . Then, is isometry-valued on . If , then has a finite isometric TFR by Theorem 1.1 and hence possesses a finite contractive TFR by Proposition 2.2. ∎
Positive semi-definite kernels are defined in Definition 10.5. Notice that an expression of the form will always be positive semi-definite. By the above lemma and Theorem 2.1, any matrix polynomial with will satisfy a formula of the form
[TABLE]
where are positive semi-definite kernels. The term can be absorbed into since
[TABLE]
is positive semi-definite by the Schur product theorem. Thus, the following corollary holds for such strictly contractive matrix polynomials in two variables. Such formulas are called Agler decompositions.
Corollary 6.5**.**
Let be holomorphic with for . Then, there exist positive semi-definite kernels such that
[TABLE]
Sketch of Proof.
The hard work has already been done while the general outline and some technicalities are essentially in [CW99] so we only sketch the proof. We can assume that is point-wise strictly contractive by Lemma 6.1. Then, is a local uniform limit of matrix polynomials with supremum norm strictly less than one by Lemma 6.2. Each of these possesses an Agler decomposition by the discussion above.
The final part of the argument is the piece found in [CW99]. The kernels in the Agler decomposition are locally bounded because of the estimate
[TABLE]
This shows the kernels in Agler decompositions form a normal family. Subsequences converge locally uniformly to form positive semi-definite kernels in an Agler decomposition for . ∎
The above corollary proves Theorem 1.2. The proof is essentially the same as in the equivalences theorem (Thm 2.1) since positive semi-definite kernels can be factored as for some possibly operator valued function . Readers who have ventured this far (and are not in the cognoscenti of this material) may benefit from some context at this point. The fundamental contribution of Agler can perhaps be encapsulated in the following result.
Theorem 6.6** (Agler [Agler1, Agler2]).**
Let be holomorphic. Assume for . Then, the following are equivalent.
- (1)
* satisfies a von Neumann inequality:*
[TABLE]
for every -tuple of pairwise commuting strictly contractive operators (on some underlying Hilbert space); 2. (2)
* has an Agler decomposition: there exist positive semi-definite kernels such that*
[TABLE] 3. (3)
* has a contractive transfer function realization: there exists a contractive operator with block decomposition on some Hilbert space such that*
[TABLE]
where and the are pairwise orthogonal orthogonal projections which sum to the identity on the domain of .
Theorem 1.2 was originally proven via Andô’s inequality [Ando] which gives item (1) above. The approach we have given sidesteps the use of von Neumann’s inequality and the implication in Theorem 6.6. The proof of is possibly the hardest part of the theorem and is non-constructive as it uses a Hahn-Banach cone separation argument. On the other hand, is a relatively straightforward matter of “plugging” the -tuple into the Agler decomposition in item (2) in an appropriate sense. See [CW99] for details. Ball-Sadosky-Vinnikov [BSV05] have a different way to prove Theorem 1.2 directly using multi-evolution scattering systems. Theorem 1.2’s analogue for or more variables fails because the von Neumann inequality fails for 3 or more contractions [Varo]. Thus, Theorem 6.6 gives the best way of demonstrating that a function does not have a contractive TFR; namely, showing that it fails the von Neumann inequality. It is probably difficult to directly show that a function fails item (2) or (3) in Theorem 6.6.
We conclude this section by plugging Dritschel’s strong Fejér-Riesz type result (stated below) into earlier arguments in order to show rational contractive matrix-valued functions in two variables have a finite contractive TFR (Theorem 1.3).
Theorem 6.7** (Dritschel [mD2]).**
Let be a matrix-valued Laurent polynomial in two variables; i.e. for and at most finitely many . If on , then there exists a matrix polynomial such that on .
This theorem is considerably deeper than Theorem 6.3, and both theorems also apply to operator-valued functions. An earlier sums of squares theorem of Scheiderer, which applied to polynomials on a much more general class of two dimensional domains (than simply ), implies Theorem 6.7 in the scalar case [Scheiderer].
Proof of Theorem 1.3.
Apply the proof of Proposition 4.2 with Theorem 6.7 in place of Theorem 4.1. ∎
7. More on finite TFRs
We need to collect one more fact about finite-dimensional TFRs before proving the minimality theorem. If we have an Agler decomposition of an iso-inner function written in lowest terms, then the sums of squares terms are rational with denominator .
Theorem 7.1**.**
Suppose is rational and iso-inner. Write in lowest terms with and . Suppose we have an Agler decomposition
[TABLE]
where the are matrix functions. Then, for , is a matrix polynomial.
The significance of this theorem is that although has a TFR with denominator , this polynomial may not be the lowest degree denominator of .
Proof.
By Theorem 2.1 we already see that each is rational and holomorphic in . To prove that is a matrix polynomial consider
[TABLE]
Fix and set for . Then
[TABLE]
Because on , the left hand side above is divisible by and therefore
[TABLE]
is a polynomial in of degree in each less than the total degree of and . For simplicity we can regroup where now is a polynomial in for every . If we write out the homogeneous expansion of ,
[TABLE]
we see that
[TABLE]
In particular, for greater than the total degrees of and , the coefficient of vanishes for every ; namely, we have for all . Since is a matrix polynomial, this implies for greater than the total degrees of and . Therefore, is a polynomial. ∎
We conclude this short section with a few asides. The Agler norm (sometimes Schur-Agler norm) for holomorphic is
[TABLE]
where the supremum is taken over all -tuples of strictly contractive pairwise commuting operators on some Hilbert space. The Agler class consists of functions satisfying .
The argument in the proof above is related to the argument used to prove the following automatic finite-dimensionality result.
Theorem 7.2**.**
Suppose is rational, iso-inner or coiso-inner ( on ), and belongs to the Agler class . Then, has a finite-dimensional isometric (resp. coisometric) TFR as in Theorem 2.1.
The essence of this theorem was first proved in Cole-Wermer [CW99]. Although it was only stated and proved in the scalar case for , the proof goes through easily to all and for iso-inner functions. We gave a proof with some bounds on degrees and the numbers of squares involved in the scalar case in [KneseRIFITSAC]. A proof of the square matrix-valued case is in [BallKal]. Extending to the iso-inner (non-square) case causes no difficulties. The coisometric case follows from Proposition 2.3. A proof where is assumed to be a polynomial is also given in [wavelet]. The next theorem also produces a family of functions with finite TFRs.
Theorem 7.3** (Grinshpan et al [Getal]).**
Suppose is rational, analytic on a neighborhood of , and . Then, has a finite-dimensional contractive TFR as in Theorem 2.1.
The following question asks about what is still left open.
Question 7.4**.**
For , if is rational, , and is neither iso-inner nor coiso-inner, then does have a finite-dimensional contractive TFR?
We also do not know how essential analyticity on is for Theorem 7.3. Note follows from Theorem 1.3.
8. Kummert’s minimality theorem
In this section we discuss minimality of size breakdowns for finite TFRs, namely Theorem 1.4. Minimality in one variable follows directly from Theorem 2.1.
Proposition 8.1**.**
Let be rational and iso-inner. Then, the minimal size of an isometric TFR for is the rank of the positive semi-definite kernel
[TABLE]
The definition of the rank of a positive semi-definite kernel is given in Definition 10.6 in the Appendix. In two variables, we will frequently refer to the dominant -term and sub-dominant -term associated to which were constructed in the proof of Theorem 1.1; see the end of Section 4. Note that the number of rows of matches the generic rank of the matrix as in equation (4.1). This cannot be reduced because this is the generic or maximal rank of the positive semi-definite kernels
[TABLE]
Note division of (4.1) by will not change the rank of the positive semi-definite kernel and does not introduce any poles in since has no zeros in by Lemma 4.3.
We claim that in the inner case the rank of is also as small as possible. We suspect this happens in the iso-inner case but cannot prove it.
Question 8.2**.**
If is iso-inner (and not inner), does the construction in Section 4 produce a size breakdown with equal to the generic size of a TFR for (for ) and equal to the generic size of a TFR for (for )?
This question is subtle because every iso-inner function is a submatrix of an inner function with the same size breakdown. We have built a size breakdown with minimal so must also be minimal for . We could then build a TFR with size breakdown where is minimal for . Is it minimal for the restriction to ?
The next result characterizes and .
Proposition 8.3**.**
Assume is rational and iso-inner. Write in lowest terms. Suppose we had a formula
[TABLE]
where are matrix polynomials. Then,
[TABLE]
is a positive semi-definite polynomial kernel. Here again is the dominant -term and is the sub-dominant -term.
This result characterizes as maximal and as minimal in the above sense. Indeed, if some other kernel satisfied the same property as then both
[TABLE]
would be positive semi-definite forcing .
Proof of Proposition 8.3.
If we set we get
[TABLE]
The left side has degree at most in . We claim has degree at most in . Consider ’s top degree term where is a matrix polynomial. Then, the term appears on the right hand side with coefficient for . If then on implying on and also on by analyticity. Thus, has degree at most in .
Just as we have factored we can also factor . Recall . Upon extracting coefficients of we see that
[TABLE]
for . This is related to characterizing uniqueness in the matrix Fejér-Riesz theorem. We address this in the appendix in Theorem 10.4. By Theorem 10.4, since has a left inverse, there exists a one variable iso-inner function such that .
So,
[TABLE]
which is positive semi-definite. Applying on the left and on the right we get
[TABLE]
is positive semi-definite. It is a polynomial kernel because on . ∎
We now switch to the square/inner case and show that the Kummert construction gives the best possible size breakdown . We need to show has the minimal rank possible in the sense that it matches the generic size of a TFR for for . To do this, we show that we can “reflect” an Agler decomposition of to get an Agler decomposition for and this reflection reverses the dominant and sub-dominant properties of and . This is not the original approach of Kummert; instead it more closely resembles the Hilbert space approach in [BickelKnese]. Recall .
Proposition 8.4**.**
Suppose is rational and inner. Write in lowest terms. Suppose we had a formula
[TABLE]
where are matrix polynomials. Then,
[TABLE]
are matrix polynomials and
[TABLE]
The sub-dominant -term of reflects to the dominant -term of .
When we say reflects above we mean the operations:
[TABLE]
listed in the proposition statement equation (8.3). Notice that reflection of the term is slightly different from the reflection of the term.
Proof of Proposition 8.4.
Since on (where defined) we have for where defined. (This is where gets used.) So, . Now, take equation (8.2), replace with , multiply on the right by and left by , and finally divide through by to get (8.4) after applying various simplifications. Of course, we have the caveat that the formula only holds where all of the operations are defined. Fortunately, (8.4) only needs to hold on an open set for the proof of (3) (1),(2) in Theorem 2.1 to go through (bonus (B1) of Theorem 2.1 addresses this). We automatically obtain that are polynomials by Theorem 7.1, since if is in lowest terms then is too.
If we reflect equation (8.1) in the sense of replacing with and conjugating by we obtain
[TABLE]
which rearranges into
[TABLE]
This is still a positive semi-definite polynomial kernel. Thus, dominates an arbitrary -term making it the dominant -term for . ∎
Proof of Theorem 1.4.
By Proposition 8.4 the subdominant -term of reflects to the dominant -term of , . Note that this reflection does not change the rank of a positive semi-definite kernel. The rank of is then the generic rank of
[TABLE]
for . This matches the generic size of a TFR for which matches the generic size of a TFR for by the adjunction formula, Proposition 2.3. Thus the rank of matches the generic rank of
[TABLE]
∎
9. Application to inner polynomials
Of special interest in the papers connecting wavelets to TFRs is the case of iso-inner and inner polynomials [wavelet, CCCP]. In one variable, we have the following well-known result.
Proposition 9.1**.**
Let be iso-inner. Then, every isometric TFR of minimal size for is built out of an isometric matrix where is nilpotent.
We prove this using the following also well-known characterization of minimality.
Proposition 9.2**.**
Let be rational and iso-inner with minimal isometric TFR built out of the isometric matrix . Then,
[TABLE]
Proof.
First note that if has a TFR via , meaning , then it also has a TFR via
[TABLE]
where is a unitary matrix with the same dimensions as . This is apparent from the formula We can apply a unitary change of coordinates and break up the domain/codomain of into and its orthogonal complement . In these new coordinates takes the form
[TABLE]
since maps to itself and . Since the formula for is only determined by , we see that has an isometric TFR via the matrix which has a smaller size unless or rather .
For the second identity, we break up the domain of into and its orthogonal complement . Using this orthogonal decomposition we can write in new coordinates as
[TABLE]
since maps to [math] while maps into itself. But since this is an isometry we must have a unitary which forces . This means is given by the TFR with isometry This has smaller size unless . ∎
Proof of Proposition 9.1.
If is a polynomial, then necessarily for all large enough. By Proposition 9.2, for large enough. Then,
[TABLE]
implying or rather . ∎
Minimality of TFR representations in the rational inner case in two variables makes it possible to prove an analogous result for inner matrix-valued polynomials in two variables. Our approach uses determinants to count the size of minimal TFRs. The following is a standard result in one variable. We provide a proof in Subsection 10.3.
Proposition 9.3**.**
Let be a rational inner function. Then, equals the size of a minimal TFR for .
Since is rational inner, is a scalar rational inner function in one variable which is a finite Blaschke product. So, the refers to the degree of the numerator when written in lowest terms. This immediately yields a method using determinants to calculate the optimal size breakdown for rational inner functions in two variables. (This is another place where it helps to have square matrices.)
Theorem 9.4** (Kummert).**
If is rational inner, then the minimal size breakdown of a TFR for is
[TABLE]
Similarly, for all but finitely many , the degree of
[TABLE]
is . Therefore, the generic size of a TFR for is . This shows that generic restrictions to slices of our two variable minimal TFRs yield minimal TFRs for restricted functions.
Proof of Theorem 1.5.
The above argument shows that if a polynomial inner function has a minimal TFR via the unitary and projections as in Theorem 1.1 then has minimal unitary TFR via the unitary
[TABLE]
By Proposition 9.1, is nilpotent for all but finitely many . This means for all but finitely many . Since this is a polynomial equation we have and since is homogeneous we also have . Thus, is always nilpotent. ∎
This leads to the interesting question of describing contractions such that is nilpotent for all . An easy way to produce examples would be to make strictly upper triangular and choose the projections via projections onto the span of subsets of standard basis vectors. For such examples, is triangular; however, it is possible to produce matrices such that is nilpotent for all yet is not triangularizable independent of ; see [nilpotent]. This could be an interesting source of examples.
10. Appendix: auxiliary results
10.1. Maximum principle for rational iso-inner functions
Proposition 10.1**.**
Suppose is rational, analytic in , and for where defined. Then, for all .
Rationality is a key assumption since is unimodular on and analytic on yet not bounded by in .
Proof.
We can reduce to the scalar case by considering arbitrary unit vectors and the function . Fix and consider the one variable rational function This function is bounded by on away from its potential finite number of poles. But, must be unbounded near a pole, so any singularities on the boundary are removable. Hence, is analytic on and bounded by by the maximum principle. This implies is bounded by at any point of for . Given any , we can calculate as a Poisson integral of on for to see that . ∎
10.2. Fejér-Riesz proofs
A more traditional and well-known version of the matrix Fejér-Riesz theorem is as follows. See [DGK] for a proof.
Theorem 10.2**.**
Let be a matrix Laurent polynomial () such that for and is not identically zero.
Then, there exists a matrix polynomial of degree at most such that on and for .
We think it is worthwhile to show how to go from this theorem to the degenerate version, Theorem 4.1, using ideas from [Dj]. The key tool is the Smith normal form.
Theorem 10.3** (Smith normal form).**
Let be a matrix polynomial. Then, there exist with matrix polynomial inverses (equivalently, with constant determinants) and such that . The matrix has the following form: every entry off the main diagonal of is zero and the main diagonal consists of polynomials such that divides . Here and the may be zero for large enough.
See Hoffman-Kunze [HK].
Proof of Theorem 4.1.
The function is a polynomial matrix and therefore has Smith normal form decomposition
[TABLE]
Here are matrix polynomials with matrix polynomial inverses while
[TABLE]
is an diagonal matrix with only non-zero polynomials on the diagonal. Notice that has rank whenever , . Since is self-adjoint on , we have for and so
[TABLE]
is a matrix Laurent polynomial which is positive semi-definite on and with [math] in the last columns and rows. Thus, (10.1) has the form where is an matrix Laurent polynomial which is positive semi-definite on and crucially satisfying since has rank outside of a finite set.
By Theorem 10.2, there exists an matrix polynomial such that in and on . If we set and
[TABLE]
then on . Note that holds in since both sides are analytic and agree on .
Our degree bound on follows from the fact that
[TABLE]
is analytic at [math]. A right rational inverse of is given by . ∎
The matrix Fejér-Riesz factorization described is maximal in the sense of the following theorem. One can also describe all other factorizations. There is nothing essentially new about this result, but it is probably difficult to attribute. It could be deduced from inner-outer factorizations.
Theorem 10.4**.**
Assuming the setup and notation of Theorem 4.1. For any other factorization on with a matrix polynomial , there exists a rational iso-inner function such that (necessarily, ). If has a right rational inverse holomorphic in then is a constant unitary matrix.
Proof.
Suppose on . Then, we may write where has columns. Since
[TABLE]
we see that , on . This implies . Then, is analytic on and isometry-valued on . Any poles on are necessarily removable because is rational and bounded on . We also have . If has right rational inverse then . An isometry can only have a right inverse if it is square, so must be square (hence unitary on ) and must be unitary-valued on . By the maximum principle, and are contractive in the disk; however, since they are inverses of each other they must be unitary-valued in the disk. Such analytic functions are constant. (Lemma 6.1 proves something more general than this.) ∎
We now sketch a simple proof of Dritschel’s positive definite multivariable Fejér-Riesz result (Thm 6.3). Although it borrows elements from the original proof, we think it has some nice efficiencies in exposition.
Proof of Theorem 6.3.
Let be a positive integer and define the multivariable Cesaro summation operator which we apply to matrix Laurent polynomials
[TABLE]
where
[TABLE]
[TABLE]
is the Fejér kernel and is normalized Lebesgue measure on .
Let be the vector space of Laurent polynomials of degree at most in each variable separately. We shall consider . By basic properties of Cesaro summation, uniformly on as for . Since the set of linear operators on is finite dimensional, tends to the identity as with respect to any norm on . In particular, for large enough is invertible and tends to the identity as .
We next point out that if is positive semi-definite on then is a sum of squares. The reason is that on , is a Laurent polynomial of degree at most with respect to . Then, the integral representation of can be computed via “quadrature.” Indeed, for any , if and then
[TABLE]
This can be proven by testing on monomials. This means that is a positive finite linear combination of the terms . Since is evidently a squared polynomial and each value of on is assumed positive semi-definite, we see that is a sum of squares of polynomials.
Now, let be strictly positive on , i.e. there exists such that for . For large enough, is also strictly positive. Then, is a Cesaro sum of a positive Laurent polynomial which was already shown to be a sum of squares. ∎
10.3. PSD kernels
We now discuss the proof of Lemma 3.2 which claims that for analytic and in we have that
[TABLE]
is positive semi-definite (PSD). Let us recall the abstract definition of PSD for matrix or operator-valued kernels.
Definition 10.5**.**
Let be a set, a complex Hilbert space, and a function; here is the set of bounded linear self-maps of . We say that is a PSD kernel if for any and we have
[TABLE]
Notice that if is a PSD kernel, then is not necessarily PSD except in the scalar case .
Definition 10.6**.**
The rank of is the maximum of the ranks of the block operators as we vary over and .
Proof of Lemma 3.2.
Our proof uses rudiments of vector-valued Hardy spaces on the unit disk. See Agler-McCarthy [AMbook] for details.
Let be the set of -dimensional column vectors with entries in the Hardy space on the unit disk . Left multiplication by , , is contractive. If is the Szegő kernel, then by a fundamental formula in reproducing kernel Hilbert space theory
[TABLE]
for . We see that
[TABLE]
which after a short calculation using the fact that shows We could apply the same argument to to see that Replace with their conjugates and relabel the variables to see that is PSD. ∎
Proof of Proposition 9.3.
Assuming is rational inner we need to compute the rank of the positive semi-definite kernel We shall use notation from the proof of Lemma 3.2 above. As in said proof, it is notationally easier to deal with the kernel
[TABLE]
and we can reduce to this case by replacing with .
Now, is the reproducing kernel for . This follows from the fact that is inner: is a closed subspace of and has reproducing kernel
[TABLE]
which can be verified by the following calculation
[TABLE]
for . The rank of is the dimension of .
To count this dimension we write in lowest terms. Since is bounded on it can have no poles on , and therefore has no zeros in . Let be the Smith normal form decomposition for (Theorem 10.3 above). Notice that has full rank on since is inner. Write . Then, where is a constant because have polynomial inverses. Since is inner is a finite Blaschke product. Its degree equals its number of zeros in which equals the number of zeros of in since has none.
The vector space is isomorphic to the vector space quotient
[TABLE]
The first equality holds because has no zeros in , the second holds because has a polynomial inverse, and the last isomorphism holds because has a polynomial inverse. Recalling we note the dimension of is the number of zeros of in and therefore the dimension of is the number of zeros of inside (counting multiplicities). ∎
This proof appears in [BickelKnese].
References
