Kummert's approach to realization on the bidisk

Greg Knese

arXiv:1907.13191·math.CV·March 4, 2022

TL;DR

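This paper simplifies Kummert's approach to realizing matrix-valued rational inner functions on the bidisk, extends it to rational functions that are isometric on the two-torus, proves that two-variable rational Schur functions have finite-dimensional contractive realizations, and shows that polynomial inner functions have realizations built out of special nilpotent linear combinations.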

Contribution

It provides a simplified proof of the existence of minimal unitary realizations, extends the result to rational functions that are isometric on the two-torus, and establishes finite-dimensional contractive realizations for two-variable matrix-valued rational Schur functions.

Findings

01. Every matrix-valued rational inner function in two variables has a minimal unitary transfer function realization.

02. Two-variable matrix-valued rational Schur functions have finite-dimensional contractive realizations.

03. Polynomial inner functions in two variables have transfer function realizations with nilpotent linear combinations.

Abstract

We give a simplified exposition of Kummert's approach to proving that every matrix-valued rational inner function in two variables has a minimal unitary transfer function realization. A slight modification of the approach extends to rational functions which are isometric on the two-torus and we use this to give a largely elementary new proof of the existence of Agler decompositions for every matrix-valued Schur function in two variables. We use a recent result of Dritschel to prove two variable matrix-valued rational Schur functions always have finite-dimensional contractive transfer function realizations. Finally, we prove that two variable matrix-valued polynomial inner functions have transfer function realizations built out of special nilpotent linear combinations.

Equations

$$S(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C$$

$$f(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C$$

$$T\begin{pmatrix}I\\ z_{1}F_{1}(z)\\ \vdots\\ z_{d}F_{d}(z)\end{pmatrix}=\begin{pmatrix}S(z)\\ F_{1}(z)\\ \vdots\\ F_{d}(z)\end{pmatrix}$$

$$I-S(w)^{*}S(z)=G(w)^{*}G(z)+\sum_{j}(1-\bar{w}_{j}z_{j})F_{j}(w)^{*}F_{j}(z)$$

$$\begin{pmatrix}A&B\\ C&D\end{pmatrix}\begin{pmatrix}I&0\\ 0&\Delta(z)\end{pmatrix}\begin{pmatrix}I\\ F(z)\end{pmatrix}=\begin{pmatrix}S(z)\\ F(z)\end{pmatrix}$$

$$\begin{aligned}A+B\Delta F&=S\\ C+D\Delta F&=F\end{aligned}$$

$$C+D\Delta(I-D\Delta)^{-1}C=(I-D\Delta)^{-1}C$$

$$\begin{pmatrix}I\\ \Delta(w)F(w)\end{pmatrix}^{*}T^{*}T\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}=\begin{pmatrix}S(w)\\ F(w)\end{pmatrix}^{*}\begin{pmatrix}S(z)\\ F(z)\end{pmatrix}$$

$$\begin{pmatrix}I\\ \Delta(w)F(w)\end{pmatrix}^{*}\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}=\begin{pmatrix}S(w)\\ F(w)\end{pmatrix}^{*}\begin{pmatrix}S(z)\\ F(z)\end{pmatrix}+G(w)^{*}G(z)$$

$$\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}\mapsto\begin{pmatrix}S(z)\\ F(z)\\ G(z)\end{pmatrix}$$

$$V\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}=\begin{pmatrix}S(z)\\ F(z)\\ G(z)\end{pmatrix}$$

$$U=\begin{pmatrix}A&A_{12}&B\\ A_{21}&A_{22}&B_{2}\\ C&C_{2}&D\end{pmatrix}$$

$$\Phi(z)=\begin{pmatrix}A&A_{12}\\ A_{21}&A_{22}\end{pmatrix}+\begin{pmatrix}B\\ B_{2}\end{pmatrix}\Delta(z)(I-D\Delta(z))^{-1}\begin{pmatrix}C&C_{2}\end{pmatrix}$$

$$\breve{S}(z)=A^{*}+C^{*}\Delta(z)(I-D^{*}\Delta(z))^{-1}B^{*}$$

$$K(w,z)=\frac{\overline{p(w)}p(z)I-Q(w)^{*}Q(z)}{1-\bar{w}z}=(I,\bar{w}I,\dots,\bar{w}^{n-1}I)\,T\,(I,zI,\dots,z^{n-1}I)^{t}$$

$$I-S(w)^{*}S(z)=(1-\bar{w}z)\left(\frac{F(w)}{p(w)}\right)^{*}\frac{F(z)}{p(z)}$$

$$K_{S}(w,z)=\frac{I-S(w)^{*}S(z)}{1-\bar{w}z}$$

$$\overline{p(w)}p(z)I-Q(w)^{*}Q(z)$$

$$(K(z_{i},z_{j}))_{i,j}=\begin{pmatrix}I&\bar{z}_{1}I&\dots&\bar{z}_{1}^{n-1}I\\ I&\bar{z}_{2}I&\dots&\bar{z}_{2}^{n-1}I\\ \vdots&\vdots&\ddots&\vdots\\ I&\bar{z}_{n}I&\dots&\bar{z}_{n}^{n-1}I\end{pmatrix}T\begin{pmatrix}I&I&\dots&I\\ z_{1}I&z_{2}I&\dots&z_{n}I\\ \vdots&\vdots&\ddots&\vdots\\ z_{1}^{n-1}I&z_{2}^{n-1}I&\dots&z_{n}^{n-1}I\end{pmatrix}=V^{*}TV$$

$$U\begin{pmatrix}p(z)I\\ zF(z)\end{pmatrix}=\begin{pmatrix}Q(z)\\ F(z)\end{pmatrix}$$

$$U\begin{pmatrix}p_{0}I&[p_{1}I,\dots,p_{n}I]\\ O&A\end{pmatrix}=\begin{pmatrix}[Q_{0},\dots,Q_{n-1}]&Q_{n}\\ A&O\end{pmatrix}$$

$$\begin{pmatrix}p_{0}^{-1}I&X\\ O&B\end{pmatrix}$$

$$U=\begin{pmatrix}[Q_{0},\dots,Q_{n-1}]&Q_{n}\\ A&O\end{pmatrix}\begin{pmatrix}p_{0}^{-1}I&X\\ O&B\end{pmatrix}$$

$$T=A^{*}A\quad\text{on }\mathbb{T}$$

$$\frac{\overline{p(w)}p(z)I-Q(w)^{*}Q(z)}{1-\bar{w}_{1}z_{1}}=\sum_{j,k}\bar{w}_{1}^{j}z_{1}^{k}T_{jk}(z_{2})=(I,\bar{w}_{1}I,\dots,\bar{w}_{1}^{n_{1}-1}I)\,T(z_{2})\begin{pmatrix}I\\ z_{1}I\\ \vdots\\ z_{1}^{n_{1}-1}I\end{pmatrix}$$

$$\Lambda(z_{1})=(I_{N},z_{1}I_{N},\dots,z_{1}^{n_{1}-1}I_{N})^{t}\in\mathbb{C}^{n_{1}N\times N}[z_{1}]$$

$$\overline{p(w)}p(z)I_{N}-Q(w)^{*}Q(z)=(1-\bar{w}_{1}z_{1})\Lambda(w_{1})^{*}A(w_{2})^{*}A(z_{2})\Lambda(z_{1})$$

Full text

Kummert’s approach to realization on the bidisk

Greg Knese

Washington University in St. Louis

Department of Mathematics & Statistics

St. Louis, MO 63130

[email protected]

Key words and phrases:

Inner function, transfer function realization, Schur-Agler class, Agler decomposition, Schur class, bidisk, polydisk, bidisc, polydisc, Fejér-Riesz lemma

2010 Mathematics Subject Classification:

Primary 47A57; Secondary 32A17, 30H05, 30J05

Partially supported by NSF grant DMS-1900816

1. Introduction

The goal of this paper is to give a simple proof and several applications of the following theorem.

Theorem 1.1 (Main Theorem).

Assume $S:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ is rational with no poles in $\mathbb{D}^{2}$ and satisfies $S^{*}S=I_{N}$ on $\mathbb{T}^{2}$ away from the zero set of the denominator of $S$ .

Then, there exist an integer $r$ and an $(M+r)\times(N+r)$ isometric matrix $U=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ such that

$$S(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C \tag{1.1}$$

where $\Delta(z_{1},z_{2})=z_{1}P_{1}+z_{2}P_{2}$ and $P_{1},P_{2}$ are orthogonal projections with $P_{1}+P_{2}=I_{r}$ .

Above $\mathbb{D}^{2}=\{z=(z_{1},z_{2})\in\mathbb{C}^{2}:|z_{1}|,|z_{2}|<1\}$ is the unit bidisk and $\mathbb{T}^{2}=\{(z_{1},z_{2})\in\mathbb{C}^{2}:|z_{1}|=|z_{2}|=1\}$ is the two-torus (or bitorus). We shall call functions that satisfy the hypotheses of this theorem rational iso-inner functions. Formulas in the conclusion of this theorem such as (1.1), which are built out of block operators, will be called transfer function realizations (or TFRs). If the operator is a finite matrix we will call it a finite TFR and if we have extra information about the operator involved we will incorporate it into the terminology. For example, the above theorem asserts the existence of a “finite isometric TFR” for two variable rational iso-inner functions.
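To make the shape of formula (1.1) concrete, here is a minimal numerical sketch in Python. The scalar function $S(z)=(2z_{1}z_{2}-z_{1}-z_{2})/(2-z_{1}-z_{2})$ and the $3\times 3$ unitary below are an illustrative choice of ours, not an example taken from the paper, and `tfr` and `closed_form` are hypothetical helper names. Here $M=N=1$, $r=2$, $P_{1}=\mathrm{diag}(1,0)$ and $P_{2}=\mathrm{diag}(0,1)$.

```python
import math

# Assumed example data (not from the paper): a 3x3 unitary U = [[A, B], [C, D]]
# with A scalar, B a 1x2 row, C a 2x1 column and D a 2x2 block.
R = 1 / math.sqrt(2)
A = 0.0
B = (-R, -R)
C = (R, R)
D = ((0.5, -0.5), (-0.5, 0.5))


def tfr(z1, z2):
    """Evaluate A + B Delta(z) (I - D Delta(z))^{-1} C with Delta(z) = diag(z1, z2)."""
    # M = I - D Delta(z); multiplying by Delta on the right scales the columns of D.
    m11, m12 = 1 - D[0][0] * z1, -D[0][1] * z2
    m21, m22 = -D[1][0] * z1, 1 - D[1][1] * z2
    det = m11 * m22 - m12 * m21
    # F = M^{-1} C via the 2x2 adjugate formula.
    f1 = (m22 * C[0] - m12 * C[1]) / det
    f2 = (-m21 * C[0] + m11 * C[1]) / det
    return A + B[0] * z1 * f1 + B[1] * z2 * f2


def closed_form(z1, z2):
    """The rational inner function that this particular U is expected to realize."""
    return (2 * z1 * z2 - z1 - z2) / (2 - z1 - z2)
```

Evaluating `tfr` and `closed_form` at a point of the bidisk should give the same value up to rounding, and at a point of $\mathbb{T}^{2}$ other than $(1,1)$ the modulus should be $1$.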

This theorem is due to Kummert in the square case $M=N$ [Kummert89]. Kummert’s theorem was ahead of its time and its proof was both ingenious and largely elementary. At the same time, Kummert’s argument seems complicated and the engineering terminology may obscure the underlying concepts for some, so one of our main goals is to give a simplified, conceptual, and entirely mathematical account of Kummert’s approach. We also give an algorithm for constructing the matrix $U$ . Motivation for doing so comes from recent interest in the wavelet community in transfer function formulas in one and several variables [CCCP]. We have presented generalizations of our simplified argument in a couple of papers [K11, GIK16], but the generalizations can also potentially obscure the underlying concepts. A minor adjustment allows us to treat the non-square case $M\neq N$ , which in turn allows us to give possibly the most elementary and direct proof of the following seminal theorem of Agler.

Theorem 1.2 (Agler [Agler1, Agler2]).

Let $f:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ be holomorphic and $\|f(z)\|\leq 1$ for all $z\in\mathbb{D}^{2}$ . Then, $f$ has a contractive TFR: there exists a contractive operator $T$ on some Hilbert space with block decomposition $T=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ such that

$$f(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C$$

where $\Delta(z)=z_{1}P_{1}+z_{2}P_{2}$ and $P_{1},P_{2}$ are pairwise orthogonal projections which sum to the identity on the domain of $D$.

Perhaps the most important application of this theorem is a Pick interpolation theorem for holomorphic functions on the bidisk. For this and other applications we refer the reader to the book [AMbook] and the papers [AMcrelle, AMYmonotone, AMYcara, BT98].

Dritschel has recently proven a strong Fejér-Riesz type of result in two variables (Theorem 6.7) which makes it possible to prove that every two-variable rational function bounded by one in norm on $\mathbb{D}^{2}$ (with no assumptions on boundary behavior) has a finite contractive TFR.

Theorem 1.3.

Let $S:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ be rational with no poles in $\mathbb{D}^{2}$ and assume $\|S(z)\|\leq 1$ for all $z\in\mathbb{D}^{2}$ . Then, there exists a contractive matrix $T=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ such that

$$S(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C$$

where $\Delta(z_{1},z_{2})=z_{1}P_{1}+z_{2}P_{2}$ , $P_{1},P_{2}$ are orthogonal projections with $P_{1}+P_{2}=I$ .

A very important bonus of Kummert’s approach is that it constructs the matrix $U$ in Theorem 1.1 with the minimal possible dimensions in a strong way. For a rational iso-inner function $S:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ we can always make sense of $z_{1}\mapsto S(z_{1},z_{2})$ for each fixed $z_{2}\in\mathbb{T}$ and this is a one variable rational iso-inner function (Lemma 4.3). If we have a formula as in Theorem 1.1 where the ranks of $P_{1},P_{2}$ are $r_{1},r_{2}$ then we can construct a transfer function realization for $S(\cdot,z_{2})$ with size $r_{1}$ and a transfer function realization for $S(z_{1},\cdot)$ with size $r_{2}$ . In the square case $M=N$ , this can be done optimally.

Theorem 1.4 (Kummert’s minimality theorem).

Suppose $S:\mathbb{D}^{2}\to\mathbb{C}^{N\times N}$ is rational and inner. Then, one can choose $U$ in Theorem 1.1 so that the ranks $r_{1},r_{2}$ of $P_{1},P_{2}$ are simultaneously minimal: $r_{1}$ is the maximum of the minimal size of a unitary TFR for $z_{1}\mapsto S(z_{1},z_{2})$ where $z_{2}$ varies over $\mathbb{T}$ and $r_{2}$ is the maximum of the minimal size of a unitary TFR for $z_{2}\mapsto S(z_{1},z_{2})$ where $z_{1}$ varies over $\mathbb{T}$ .

In particular, among all possible unitary TFR’s for $S$ , neither $r_{1}$ nor $r_{2}$ can be smaller than those in Kummert’s construction. We will give a conceptual proof of Kummert’s minimality theorem, and clarify why this is the best possible result. Before the mathematical community knew of Kummert’s results, this result was reproven in the scalar case using the framework of Geronimo-Woerdeman [GW04] in [GKAPDE]. Later, Theorem 1.4 was also proven using Hilbert space methods in [BickelKnese]. The scalar minimality theorem was crucial in giving a characterization of two-variable rational matrix-monotone functions in [AMYmonotone]. It is also useful in proving determinantal representations for certain families of polynomials $p\in\mathbb{C}[z_{1},z_{2}]$ with no zeros in $\mathbb{D}^{2}$ [GKdv].

We shall present a new application of the minimality theorem which has some relevance to the applications of this theory to wavelets in [wavelet, CCCP]. In these papers matrix-valued polynomial inner functions are of particular interest.

Theorem 1.5.

Let $S\in\mathbb{C}^{N\times N}[z_{1},z_{2}]$ and assume $S^{*}S=I_{N}$ on $\mathbb{T}^{2}$ . Then, $U$ in Theorem 1.1 can be chosen with $\det(I-D\Delta(z))\equiv 1$ .

Note this means $D\Delta(z)=z_{1}DP_{1}+z_{2}DP_{2}$ is nilpotent for every $z$ .
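A small sanity check of this nilpotency, as a sketch: the polynomial inner function $S(z)=z_{1}z_{2}$ (scalar, $N=1$) and the permutation-matrix unitary below are our own illustrative choice, not an example from the paper, and the helper names are hypothetical. For this $U$ one has $(D\Delta(z))^{2}=0$, so $\det(I-D\Delta(z))=1$ and $(I-D\Delta(z))^{-1}=I+D\Delta(z)$.

```python
# Assumed example data (not from the paper): the 3x3 permutation matrix
# U = [[A, B], [C, D]] = [[0, 1, 0], [0, 0, 1], [1, 0, 0]].
A = 0
B = (1, 0)
C = (0, 1)
D = ((0, 1), (0, 0))


def d_delta(z1, z2):
    """Return D Delta(z) for Delta(z) = diag(z1, z2); Delta scales the columns of D."""
    return ((D[0][0] * z1, D[0][1] * z2), (D[1][0] * z1, D[1][1] * z2))


def matmul2(X, Y):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def det_i_minus(N):
    """det(I - N) for a 2x2 matrix N."""
    return (1 - N[0][0]) * (1 - N[1][1]) - N[0][1] * N[1][0]


def tfr(z1, z2):
    """Evaluate the TFR using (I - D Delta)^{-1} = I + D Delta, valid since (D Delta)^2 = 0."""
    N = d_delta(z1, z2)
    F = tuple(C[i] + sum(N[i][k] * C[k] for k in range(2)) for i in range(2))
    return A + B[0] * z1 * F[0] + B[1] * z2 * F[1]
```

With these choices `tfr(z1, z2)` reduces to the product `z1 * z2`, and `matmul2(N, N)` is the zero matrix for `N = d_delta(z1, z2)` at every point.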

1.1. Guide to the reader

This paper is structured so that it can hopefully be read by a broad audience. We make no mention of systems theory terminology (except for “transfer function”) and we make no use of von Neumann inequalities and related operator theory originally used in the proof of Agler’s theorem. (We do discuss some of this for context in Section 6.) Our first goal is to quickly and simply prove Kummert’s Theorem 1.1 and explain how this proves Agler’s theorem. Some readers may be satisfied with this quick and mostly constructive approach to these results and can stop after Section 6. After that we introduce the technicalities necessary to prove Kummert’s minimality theorem and give an application to inner polynomials. We include an appendix with extra background.

1.2. Acknowledgments

This article overlaps with the interesting article of J. Ball [Ball] in some ways: both survey Agler decompositions on the bidisk/polydisk but Ball’s article follows Kummert’s original argument closely. Ball’s paper also discusses connections to the engineering literature and several other classes of holomorphic functions. The present article and author owe a great debt to Professor Ball for disseminating Kummert’s argument to the mathematical community.

This article was motivated by the workshop “Mathematical Challenges of Structured Function Systems” at the Erwin Schrödinger Institute. I thank ESI as well as the workshop organizers (M. Charina, K. Gröchenig, M. Putinar, and J. Stöckler). The article [wavelet] was helpful in preparing this paper. I thank M. Dritschel for reading an early draft of this paper. I also thank K. Bickel for suggesting that I write this paper. Finally, I sincerely thank the referee for several suggestions which greatly improved this paper.

1 Introduction
1.1 Guide to the reader
1.2 Acknowledgments
2 Finite-dimensional transfer function realizations
3 One variable version of Theorem 1.1
4 Two variables and Theorem 1.1
5 Detailed example
6 Matrix Agler decompositions in two variables
7 More on finite TFRs
8 Kummert’s minimality theorem
9 Application to inner polynomials
10 Appendix: auxiliary results
10.1 Maximum principle for rational iso-inner functions
10.2 Fejér-Riesz proofs
10.3 PSD kernels

2. Finite-dimensional transfer function realizations

One of the fundamental things that Agler did in his original proof of Theorem 1.2 was connect TFRs to certain formulas now called Agler decompositions which involved positive semi-definite kernels. The following theorem establishes some basic equivalences about finite TFRs and finite-dimensional Agler decompositions which hold not just on $\mathbb{D}^{2}$ but any polydisk $\mathbb{D}^{d}$ . Note that “matrix” below always refers to a finite matrix.

Theorem 2.1 (Equivalences Theorem).

Let $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ be a function.

The following are equivalent:

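(1) There exists a contractive matrix $T=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ such that

$$S(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C$$

where $\Delta(z)=\sum_{j}z_{j}P_{j}$ for some pairwise orthogonal projections $P_{j}$ with $\sum_{j}P_{j}=I$.

(2) There exist matrix functions $F_{j}$ and a constant contractive matrix $T$ such that

$$T\begin{pmatrix}I\\ z_{1}F_{1}(z)\\ \vdots\\ z_{d}F_{d}(z)\end{pmatrix}=\begin{pmatrix}S(z)\\ F_{1}(z)\\ \vdots\\ F_{d}(z)\end{pmatrix}. \tag{2.1}$$

(3) There exist matrix functions $F_{1},\dots,F_{d},G$ such that

$$I-S(w)^{*}S(z)=G(w)^{*}G(z)+\sum_{j}(1-\bar{w}_{j}z_{j})F_{j}(w)^{*}F_{j}(z).$$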

We also have the following bonuses:

**B1:** Assuming (1)-(3), $S$, $F_{1},\dots,F_{d},G$ are all rational and $\|S(z)\|\leq 1$ for all $z\in\mathbb{D}^{d}$. If we assume at the outset that $S$ is holomorphic, then item (3) need only hold initially on an open set in order for it to hold globally.

**B2:** The $T$ that works in (1) also works in (2).

**B3:** We also get equivalences if we replace “contractive” in (1) and (2) with “isometric” and $G$ with $0$ in (3). In this case, $S$ is iso-inner and analytic outside the zeros of $\det(I-D\Delta(z))$.

Proof.

$(2)\implies(1)$ . It helps to define $F(z)=\begin{pmatrix}F_{1}(z)\\ \vdots\\ F_{d}(z)\end{pmatrix}$ . Let $P_{j}$ be the projection matrix for the block corresponding to $F_{j}$ . Then, the equation in (2) can be written as

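$$\begin{pmatrix}A&B\\ C&D\end{pmatrix}\begin{pmatrix}I&0\\ 0&\Delta(z)\end{pmatrix}\begin{pmatrix}I\\ F(z)\end{pmatrix}=\begin{pmatrix}S(z)\\ F(z)\end{pmatrix}$$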

for $\Delta(z)=\sum_{j}z_{j}P_{j}$ . Block-by-block this says

$$\begin{aligned}A+B\Delta F&=S\\ C+D\Delta F&=F\end{aligned}$$

which yields $F=(I-D\Delta)^{-1}C$ and then $S=A+B\Delta(I-D\Delta)^{-1}C$ .

$(1)\implies(2)$ . We simply define $F=(I-D\Delta)^{-1}C$ . Then, (2.1) holds because

$$C+D\Delta(I-D\Delta)^{-1}C=(I-D\Delta)^{-1}C.$$

$(2)\implies(3)$ . The given equation implies

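$$\begin{pmatrix}I\\ \Delta(w)F(w)\end{pmatrix}^{*}T^{*}T\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}=\begin{pmatrix}S(w)\\ F(w)\end{pmatrix}^{*}\begin{pmatrix}S(z)\\ F(z)\end{pmatrix}.$$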

Let $A=\sqrt{I-T^{*}T}$ and $G(z)=A\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}$ . Then,

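$$\begin{pmatrix}I\\ \Delta(w)F(w)\end{pmatrix}^{*}\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}=\begin{pmatrix}S(w)\\ F(w)\end{pmatrix}^{*}\begin{pmatrix}S(z)\\ F(z)\end{pmatrix}+G(w)^{*}G(z)$$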

and this rearranges exactly into the equation in (3).

$(3)\implies(2)$ . This is known as a lurking isometry argument. The map

$$\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}\mapsto\begin{pmatrix}S(z)\\ F(z)\\ G(z)\end{pmatrix}$$

extends linearly and in a well-defined way to an isometric map from the span of the vectors on the left to the span of the vectors on the right as $z$ varies over $\mathbb{D}^{d}$ . We can extend this to an isometric matrix $V$ satisfying

$$V\begin{pmatrix}I\\ \Delta(z)F(z)\end{pmatrix}=\begin{pmatrix}S(z)\\ F(z)\\ G(z)\end{pmatrix}$$

which we can compress to get a contractive matrix satisfying the equation in (2).

The bonus results follow. For (B1), $S$ is rational and bounded in operator norm by $1$ by (1) and (3). The matrix functions $F_{j},G$ are rational by the proofs of $(2)\implies(1)$ and $(2)\implies(3)$ . If we assume $S$ is holomorphic and (3) only holds on an open set, then all of the proofs work on this restricted set but automatically extend holomorphically to $\mathbb{D}^{d}$ by the matrix formulas. Bonus (B2) follows from the proof of $(1)\iff(2)$ . For bonus (B3), notice that if $T$ is an isometric matrix, then we have $G=0$ in the proof $(2)\implies(3)$ and if we start with $G=0$ we get $T$ to be isometric in the proof $(3)\implies(2)$ since no compression is necessary. Finally, $S$ is iso-inner because we can insert $z=w\in\mathbb{T}^{d}$ into condition (3) to see $S^{*}S=I$ at least away from the zero set of $\det(I-D\Delta(z))$ which is a denominator for the $F_{j}$ and $S$ by the formula in (2) $\implies$ (1). ∎
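The chain $(1)\implies(2)\implies(3)$ in the isometric case ($G=0$) can be checked numerically. The sketch below assumes a specific $3\times 3$ unitary of our own choosing (it is not taken from the paper), sets $F=(I-D\Delta)^{-1}C$ as in the proof, and measures how far the two sides of item (3) are from each other; `F`, `S` and `decomposition_gap` are hypothetical helper names.

```python
import math

# Assumed example data (not from the paper): a unitary T = [[A, B], [C, D]]
# with scalar A, so S is scalar-valued and * is complex conjugation.
R = 1 / math.sqrt(2)
A, B, C = 0.0, (-R, -R), (R, R)
D = ((0.5, -0.5), (-0.5, 0.5))


def F(z):
    """F(z) = (I - D Delta(z))^{-1} C for Delta(z) = diag(z1, z2), as (F_1(z), F_2(z))."""
    z1, z2 = z
    m11, m12 = 1 - D[0][0] * z1, -D[0][1] * z2
    m21, m22 = -D[1][0] * z1, 1 - D[1][1] * z2
    det = m11 * m22 - m12 * m21
    return ((m22 * C[0] - m12 * C[1]) / det, (-m21 * C[0] + m11 * C[1]) / det)


def S(z):
    """S(z) = A + B Delta(z) F(z)."""
    f1, f2 = F(z)
    return A + B[0] * z[0] * f1 + B[1] * z[1] * f2


def decomposition_gap(w, z):
    """Absolute difference between the two sides of item (3) with G = 0."""
    lhs = 1 - S(w).conjugate() * S(z)
    fw, fz = F(w), F(z)
    rhs = sum(
        (1 - w[j].conjugate() * z[j]) * fw[j].conjugate() * fz[j] for j in range(2)
    )
    return abs(lhs - rhs)
```

For points `w`, `z` in the bidisk (given as pairs of complex numbers) the gap should be at the level of floating-point rounding, because the chosen matrix is unitary.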

The next proposition says the conditions of Theorem 2.1 are also equivalent to $S$ being a submatrix of a rational inner function possessing a finite-dimensional unitary transfer function realization. Moreover, the various sizes of the transfer function realizations stay the same. To be more precise, let $r_{j}$ be the rank of $P_{j}$ in condition (1) of Theorem 2.1. Then, $r=(r_{1},\dots,r_{d})$ will be called the size breakdown of the TFR. This terminology is specific to this paper. The size of the TFR will refer to $|r|=r_{1}+\cdots+r_{d}$. Note that $r_{j}$ also equals the number of rows of $F_{j}$ in conditions (2) and (3) of Theorem 2.1.

Proposition 2.2.

Let $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ be a function which has a finite contractive TFR with size breakdown $r$ . Then, there exists $n\geq N,M$ and a matrix rational inner function $\Phi:\mathbb{D}^{d}\to\mathbb{C}^{n\times n}$ with finite unitary TFR with size breakdown $r$ such that $S$ is a submatrix of $\Phi$ .

As a sort of converse, every submatrix of $S$ has a finite contractive TFR with the same size breakdown.

Proof.

Suppose $S$ has a finite contractive TFR given via contractive $T=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ . Every contractive matrix is a submatrix of a finite unitary, say $U$ . If we rearrange rows and columns we may write

$$U=\begin{pmatrix}A&A_{12}&B\\ A_{21}&A_{22}&B_{2}\\ C&C_{2}&D\end{pmatrix}.$$

If

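$$\Phi(z)=\begin{pmatrix}A&A_{12}\\ A_{21}&A_{22}\end{pmatrix}+\begin{pmatrix}B\\ B_{2}\end{pmatrix}\Delta(z)(I-D\Delta(z))^{-1}\begin{pmatrix}C&C_{2}\end{pmatrix}$$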

then $S(z)=\begin{pmatrix}I&O\end{pmatrix}\Phi(z)\begin{pmatrix}I\\ O\end{pmatrix}$ .

This same type of observation shows that every submatrix of $S$ has a finite contractive TFR. ∎

The following is referred to as the adjunction formula in [wavelet].

Proposition 2.3.

Let $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ be a function with a finite contractive TFR given via a matrix $T$ as in (1),(2) of Theorem 2.1. Set $\breve{S}(z)=S(\bar{z})^{*}$ . Then, $\breve{S}$ has a finite contractive TFR given via $T^{*}$ .

In particular, if $T$ is isometric, then $\breve{S}$ has a finite coisometric TFR.

Proof.

With $S(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C$ we have

$$\breve{S}(z)=S(\bar{z})^{*}=A^{*}+C^{*}\Delta(z)(I-D^{*}\Delta(z))^{-1}B^{*}$$

which is exactly condition (1) of Theorem 2.1 with $T^{*}$ in place of $T$ . ∎
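The adjunction formula can be illustrated in one variable with a Blaschke factor. The sketch below assumes the scalar function $S(z)=(z-a)/(1-\bar{a}z)$ with $|a|<1$ and a $2\times 2$ unitary realizing it; both are our own illustrative choices rather than examples from the paper, and all function names are hypothetical.

```python
import math


def blaschke_unitary(a):
    """A 2x2 unitary [[A, B], [C, D]] realizing S(z) = (z - a) / (1 - conj(a) z)."""
    s = math.sqrt(1 - abs(a) ** 2)
    return ((-a, s), (s, a.conjugate()))


def adjoint(T):
    """Conjugate transpose of a 2x2 matrix stored as nested tuples."""
    return tuple(
        tuple(complex(T[j][i]).conjugate() for j in range(2)) for i in range(2)
    )


def tfr(T, z):
    """One-variable scalar TFR: A + B z (1 - D z)^{-1} C."""
    (A, B), (C, D) = T
    return A + B * z * C / (1 - D * z)


def S(a, z):
    """The Blaschke factor itself."""
    return (z - a) / (1 - a.conjugate() * z)
```

With `U = blaschke_unitary(a)`, the value `tfr(adjoint(U), z)` should agree with `S(a, z.conjugate()).conjugate()`, which is $\breve{S}(z)$ for this choice of $S$.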

3. One variable version of Theorem 1.1

We now prove a detailed one variable version of the Main Theorem (Thm 1.1). If $S=Q/p:\mathbb{D}\to\mathbb{C}^{M\times N}$ is a rational iso-inner function, then $S^{*}S=I$ on $\mathbb{T}$ away from zeros of $p$ , but then $|p|^{2}I=Q^{*}Q$ on all of $\mathbb{T}$ by continuity.

Theorem 3.1.

Assume $p\in\mathbb{C}[z]$ has no zeros in $\mathbb{D}$ , $Q\in\mathbb{C}^{M\times N}[z]$ , and $|p|^{2}I=Q^{*}Q$ on $\mathbb{T}$ . Let $n$ be the maximum of the degrees of $p$ and the entries of $Q$ . Then,

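$$K(w,z)=\frac{\overline{p(w)}p(z)I-Q(w)^{*}Q(z)}{1-\bar{w}z}=(I,\bar{w}I,\dots,\bar{w}^{n-1}I)\,T\,(I,zI,\dots,z^{n-1}I)^{t} \tag{3.1}$$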

where $T$ is a positive semi-definite matrix whose entries can be expressed as polynomials in the coefficients of $p,\bar{p},Q,Q^{*}$ . Furthermore, $K(w,z)$ is a positive semi-definite kernel whose rank matches the rank of the matrix $T$ .

Positive semi-definite kernels are reviewed in Definition 10.5 and the rank of such a kernel is defined in Definition 10.6 in the Appendix.

The theorem allows for common zeros of $Q$ and $p$ which is important in using this result in two variables. It immediately follows that $S=Q/p$ possesses an isometric TFR because we can factor $T=\mathbf{F}^{*}\mathbf{F}$ where $\mathbf{F}$ is an $r\times nN$ matrix. Then, for $F(z)=\mathbf{F}(I,zI,\dots,z^{n-1}I)^{t}$ we have

$$I-S(w)^{*}S(z)=(1-\bar{w}z)\left(\frac{F(w)}{p(w)}\right)^{*}\frac{F(z)}{p(z)}.$$

By Theorem 2.1 we see that $S$ has an isometric TFR. After the proof of Theorem 3.1 we give an explicit way to find a formula for an isometry $U$ out of which a TFR for $S$ can be built. We need a standard lemma to prove Theorem 3.1. We give the short proof in the appendix; see Subsection 10.3.

Lemma 3.2.

Assume $S:\mathbb{D}\to\mathbb{C}^{M\times N}$ is analytic and $\|S(z)\|\leq 1$ in $\mathbb{D}$ . Then, the kernel

$$K_{S}(w,z)=\frac{I-S(w)^{*}S(z)}{1-\bar{w}z} \tag{3.2}$$

is positive semi-definite.

The swapping of $z$ , $w$ is deliberate and is discussed in the proof in the appendix.

Proof of Theorem 3.1.

By analyticity $\overline{p(1/\bar{z})}p(z)I=Q(1/\bar{z})^{*}Q(z)$ on $\mathbb{C}\setminus\{0\}$ . This implies the polynomial in $z,\bar{w}$

$$\overline{p(w)}p(z)I-Q(w)^{*}Q(z)$$

is divisible by $(1-\bar{w}z)$ and hence we can write (3.1) where $T$ is indeed an $nN\times nN$ matrix whose entries are polynomials in the coefficients of $p,\bar{p},Q,Q^{*}$. We could solve for them but we do not need to. By Lemma 3.2, $K_{S}(w,z)$ in (3.2) is positive semi-definite. Multiplying through by $\overline{p(w)}p(z)$ we have that $K(w,z)$ as in (3.1) is a positive semi-definite matrix-valued polynomial function of bounded degree.

To show $T$ is positive semi-definite, take any $z_{1},\dots,z_{n}\in\mathbb{D}$ and note that

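$$(K(z_{i},z_{j}))_{i,j}=\begin{pmatrix}I&\bar{z}_{1}I&\dots&\bar{z}_{1}^{n-1}I\\ I&\bar{z}_{2}I&\dots&\bar{z}_{2}^{n-1}I\\ \vdots&\vdots&\ddots&\vdots\\ I&\bar{z}_{n}I&\dots&\bar{z}_{n}^{n-1}I\end{pmatrix}T\begin{pmatrix}I&I&\dots&I\\ z_{1}I&z_{2}I&\dots&z_{n}I\\ \vdots&\vdots&\ddots&\vdots\\ z_{1}^{n-1}I&z_{2}^{n-1}I&\dots&z_{n}^{n-1}I\end{pmatrix}=V^{*}TV$$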

is positive semi-definite where $V=(V_{i,j})$ is the block Vandermonde matrix $V_{i,j}=z_{j}^{i-1}I$ . If the $z_{j}$ are all distinct then $V$ is invertible which implies that $T$ is positive semi-definite. The above computation also shows that the rank of $K$ equals the rank of $T$ , although we omit some details.

∎
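The proof notes that the entries of $T$ could be solved for explicitly. As a sketch of one way to do so in the scalar case ($M=N=1$): dividing by $(1-\bar{w}z)$ means the coefficient $c_{jk}$ of $\bar{w}^{j}z^{k}$ in the numerator satisfies $c_{jk}=T_{jk}-T_{j-1,k-1}$, which can be unwound recursively. The function names and the coefficient-list convention below are our own assumptions.

```python
def kernel_matrix(p, q):
    """Coefficient matrix T of K(w, z) for scalar polynomials p and q.

    p and q are coefficient lists [c_0, c_1, ...], lowest degree first, and are
    assumed to satisfy |p|^2 = |q|^2 on the unit circle.
    """
    n = max(len(p), len(q)) - 1
    p = list(p) + [0] * (n + 1 - len(p))
    q = list(q) + [0] * (n + 1 - len(q))

    def num(j, k):
        # Coefficient of conj(w)^j z^k in conj(p(w)) p(z) - conj(q(w)) q(z).
        return complex(p[j]).conjugate() * p[k] - complex(q[j]).conjugate() * q[k]

    # Division by (1 - conj(w) z): num(j, k) = T[j][k] - T[j-1][k-1].
    T = [[0j] * n for _ in range(n)]
    for j in range(n):
        for k in range(n):
            prev = T[j - 1][k - 1] if j > 0 and k > 0 else 0
            T[j][k] = num(j, k) + prev
    return T


def evaluate_kernel(T, w, z):
    """Evaluate (1, conj(w), ..., conj(w)^(n-1)) T (1, z, ..., z^(n-1))^t."""
    n = len(T)
    return sum(
        w.conjugate() ** j * T[j][k] * z ** k for j in range(n) for k in range(n)
    )
```

For instance, `kernel_matrix([1], [0, 0, 1])` corresponds to $p=1$, $Q=z^{2}$, where $K(w,z)=(1-\bar{w}^{2}z^{2})/(1-\bar{w}z)=1+\bar{w}z$ and $T$ is the $2\times 2$ identity.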

Remark 3.3.

We now explain how to find an isometry $U$ out of which a TFR for $S=Q/p$ can be built. This will closely parallel our approach in the two variable setting. We first factor $T=A^{*}A$ where $A$ is $r\times nN$ with $r=\text{rank}(T)$ . Then, $A$ will possess a right inverse $B$ , namely $AB=I$ . Set $F(z)=A(I,zI,\dots,z^{n-1}I)^{t}$ . To find $U$ such that

$$U\begin{pmatrix}p(z)I\\ zF(z)\end{pmatrix}=\begin{pmatrix}Q(z)\\ F(z)\end{pmatrix}$$

we write out $p(z)=\sum_{j=0}^{n}p_{j}z^{j}$ and $Q(z)=\sum_{j=0}^{n}Q_{j}z^{j}$. Extracting coefficients, we equivalently need $U$ to satisfy

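$$U\begin{pmatrix}p_{0}I&[p_{1}I,\dots,p_{n}I]\\ O&A\end{pmatrix}=\begin{pmatrix}[Q_{0},\dots,Q_{n-1}]&Q_{n}\\ A&O\end{pmatrix}.$$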

The matrix $\begin{pmatrix}p_{0}I&\left[p_{1}I,\dots,p_{n}I\right]\\ O&A\end{pmatrix}$ has right inverse

$$\begin{pmatrix}p_{0}^{-1}I&X\\ O&B\end{pmatrix}$$

where $X=-p_{0}^{-1}\left[p_{1}I,\dots,p_{n}I\right]B$ so that

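$$U=\begin{pmatrix}[Q_{0},\dots,Q_{n-1}]&Q_{n}\\ A&O\end{pmatrix}\begin{pmatrix}p_{0}^{-1}I&X\\ O&B\end{pmatrix}.$$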

Thus, $U$ can be computed directly from $p,Q,A,B$ .
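As a sketch of this recipe in the simplest setting, take $N=M=n=1$, so that $p(z)=p_{0}+p_{1}z$, $Q(z)=q_{0}+q_{1}z$ and $T$ is the $1\times 1$ matrix $|p_{0}|^{2}-|q_{0}|^{2}$. The function names below are hypothetical, and the code assumes $T>0$ (that is, $S$ is not constant).

```python
import math


def remark_3_3_degree_one(p0, p1, q0, q1):
    """Build the 2x2 matrix U for scalar S = Q/p with deg p, deg Q <= 1."""
    # T is 1x1: the constant coefficient of conj(p(w)) p(z) - conj(Q(w)) Q(z).
    T = abs(p0) ** 2 - abs(q0) ** 2
    A = math.sqrt(T)   # factor T = A* A (assumes T > 0)
    B = 1 / A          # right inverse of A
    X = -p1 * B / p0   # X = -p0^{-1} [p1] B
    # U = [[q0, q1], [A, 0]] times [[1/p0, X], [0, B]], multiplied out by hand.
    return ((q0 / p0, q0 * X + q1 * B), (A / p0, A * X))


def tfr_one_variable(U, z):
    """Scalar one-variable TFR: A + B z (1 - D z)^{-1} C for U = [[A, B], [C, D]]."""
    (a, b), (c, d) = U
    return a + b * z * c / (1 - d * z)
```

For a Blaschke factor $S(z)=(z-a)/(1-\bar{a}z)$ one would call `remark_3_3_degree_one(1, -a.conjugate(), -a, 1)`; the resulting matrix should be unitary and `tfr_one_variable` should reproduce $S$.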

4. Two variables and Theorem 1.1

The basic idea of Kummert’s argument is to attempt a parametrized version of the one variable theorem above. The matrix Fejér-Riesz factorization in one variable, which we now review, then becomes crucial in attempting a parametrized version of the implication (3) $\implies$ (2) in the Equivalences Theorem (Thm 2.1).

Theorem 4.1 (Matrix Fejér-Riesz).

Let $T(z)=\sum_{j=-n}^{n}T_{j}z^{j}$ be a matrix Laurent polynomial ( $T_{j}\in\mathbb{C}^{N\times N}$ ) such that $T(z)\geq 0$ for $z\in\mathbb{T}$ . Then, there exist a natural number $r\leq N$ , a matrix polynomial $A_{0}\in\mathbb{C}^{r\times r}[z]$ with $\det A_{0}(z)\neq 0$ for $z\in\mathbb{D}$ , and a polynomial matrix $V\in\mathbb{C}^{N\times N}[z]$ with polynomial inverse such that for $A=\begin{pmatrix}A_{0}&0_{r\times N-r}\end{pmatrix}V$ we have

$$T=A^{*}A\quad\text{on }\mathbb{T}.$$

Furthermore, $A$ has degree at most $n$ and a right rational inverse $B$ which is analytic in $\mathbb{D}$ .

The case where $T(z)$ is positive definite at all points of $\mathbb{T}$ is usually attributed to Rosenblatt [Rosenblatt]. If $\det T(z)$ vanishes at a finite number of points, it is possible to factor out these zeros from $T$; see [DGK, Dj]. If $\det T(z)$ is identically zero, it is possible to use operator-valued versions of this theorem which guarantee an outer factorization of $T$. We explain how to go from the case of $\det T\not\equiv 0$ to the case $\det T\equiv 0$ in the appendix (subsection 10.2). The factorization above can be computed using semidefinite programming or Riccati equations (see for instance [Hachez]).
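In the scalar, degree-one case ($N=n=1$) the factorization can be written down by hand, which gives a quick way to see what the theorem asserts. The sketch below is not the semidefinite programming or Riccati approach mentioned above; it simply matches coefficients in $|a_{0}+a_{1}z|^{2}=\bar{t}_{1}z^{-1}+t_{0}+t_{1}z$ on $\mathbb{T}$ and solves the resulting quadratic. The function name is hypothetical.

```python
import math


def fejer_riesz_degree_one(t0, t1):
    """Factor T(z) = conj(t1)/z + t0 + t1 z >= 0 on the unit circle as |a0 + a1 z|^2.

    Returns (a0, a1) with a0 > 0 and |a1| <= a0, so a0 + a1 z has no zeros in the
    open unit disk.
    """
    disc = t0 * t0 - 4 * abs(t1) ** 2
    if t0 <= 0 or disc < 0:
        raise ValueError("T must be nonnegative on the circle and not identically 0")
    # Matching coefficients: a0^2 + |a1|^2 = t0 and conj(a0) a1 = t1, with a0 > 0.
    a0 = math.sqrt((t0 + math.sqrt(disc)) / 2)
    a1 = t1 / a0
    return a0, a1
```

For example, $T(z)=2z^{-1}+5+2z$ factors as $|2+z|^{2}$ on $\mathbb{T}$, and $2+z$ has its only zero at $z=-2$, outside the closed disk.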

Theorem 4.1 in particular shows that $T(z)$ has rank $r$ except at the finite number of zeros of $\det A_{0}$ . One nice application of Theorem 4.1 is the one variable version of Theorem 1.3.

Proposition 4.2.

Let $S:\mathbb{D}\to\mathbb{C}^{M\times N}$ be rational and $\|S(z)\|\leq 1$ for all $z\in\mathbb{D}$ . Then, $S$ has a finite contractive TFR.

Proof.

Write $S=Q/p$. Then, $|p|^{2}I-Q^{*}Q$ is positive semi-definite on $\mathbb{T}$. By Theorem 4.1, there exists a matrix polynomial $A$ such that $|p|^{2}I-Q^{*}Q=A^{*}A$ on $\mathbb{T}$. Then, $\Phi=\begin{pmatrix}S\\ A/p\end{pmatrix}$ is iso-inner and by Theorem 3.1 possesses a finite isometric TFR. By Proposition 2.2, we see that $S$ possesses a finite contractive TFR. ∎

The following lemma lets us apply Theorem 3.1 to one variable slices.

Lemma 4.3.

Suppose $S:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ is rational and iso-inner. Write $S=Q/p$ where $Q\in\mathbb{C}^{M\times N}[z_{1},z_{2}]$ , $p\in\mathbb{C}[z_{1},z_{2}]$ has no zeros in $\mathbb{D}^{2}$ , and $Q$ , $p$ have no common factors. Then, $|p|^{2}I=Q^{*}Q$ on $\mathbb{T}^{2}$ and for each $z_{2}\in\mathbb{T}$ , the one variable polynomial $z_{1}\mapsto p(z_{1},z_{2})$ has no zeros in $\mathbb{D}$ .

Proof.

As in one variable, $|p|^{2}I=Q^{*}Q$ on $\mathbb{T}^{2}$ by continuity. For fixed $\tau\in\mathbb{T}$ notice that $z_{1}\mapsto p(z_{1},\tau)$ either has no zeros in $\mathbb{D}$ or is identically zero by Hurwitz’s theorem (by considering $\tau$ as a limit of $t\in\mathbb{D}$ ). If $p(\cdot,\tau)$ is identically zero, then $Q(\cdot,\tau)$ is identically zero because of $|p|^{2}I=Q^{*}Q$ on $\mathbb{T}^{2}$ . Hence both polynomials are divisible by $z_{2}-\tau$ contradicting the assumption of no common factors. Thus, for every $z_{2}\in\mathbb{T}$ , $z_{1}\mapsto p(z_{1},z_{2})$ has no zeros in $\mathbb{D}$ . ∎

We are now ready to prove the Main Theorem (Thm 1.1).

Proof of Theorem 1.1.

Assume the setup of Theorem 1.1 and write $S=Q/p$ as in Lemma 4.3. We can essentially follow a parametrized version of Remark 3.3 but we use the matrix Fejér-Riesz theorem to deal with certain matrix factorizations.

Step 1: Fix $z_{2}=w_{2}\in\mathbb{T}$ , divide $\overline{p(w)}p(z)I-Q(w)^{*}Q(z)$ by $(1-\bar{w}_{1}z_{1})$ , and then extract the coefficients of $\bar{w}_{1}^{j}z_{1}^{k}$ to obtain

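$$\frac{\overline{p(w)}p(z)I-Q(w)^{*}Q(z)}{1-\bar{w}_{1}z_{1}}=\sum_{j,k}\bar{w}_{1}^{j}z_{1}^{k}T_{jk}(z_{2})=(I,\bar{w}_{1}I,\dots,\bar{w}_{1}^{n_{1}-1}I)\,T(z_{2})\begin{pmatrix}I\\ z_{1}I\\ \vdots\\ z_{1}^{n_{1}-1}I\end{pmatrix}$$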

where $T(z_{2})=(T_{jk}(z_{2}))_{jk}$ is a positive semi-definite $(n_{1}N\times n_{1}N)$ matrix Laurent polynomial. This follows from Theorem 3.1 applied to $p(\cdot,z_{2}),Q(\cdot,z_{2})$. Here $n_{1}$ is the maximum of the degrees of $p$ and $Q$ with respect to $z_{1}$.

Step 2: Apply the matrix Fejér-Riesz theorem (Thm 4.1) to $T(z_{2})$ to get an $r\times n_{1}N$ matrix polynomial $A(z_{2})$ and an analytic (in $\mathbb{D}$ ) rational matrix function $B(z_{2})$ such that $A^{*}A=T$ on $\mathbb{T}$ and $AB=I$ in $\mathbb{D}$ . For convenience we define

$$\Lambda(z_{1})=(I_{N},z_{1}I_{N},\dots,z_{1}^{n_{1}-1}I_{N})^{t}\in\mathbb{C}^{n_{1}N\times N}[z_{1}].$$

Then, for $z_{2}=w_{2}\in\mathbb{T}$ and $z_{1},w_{1}\in\mathbb{C}$

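$$\overline{p(w)}p(z)I_{N}-Q(w)^{*}Q(z)=(1-\bar{w}_{1}z_{1})\Lambda(w_{1})^{*}A(w_{2})^{*}A(z_{2})\Lambda(z_{1}).$$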

By Lemma 4.3, for each fixed $z_{2}\in\mathbb{T}$ the map $z_{1}\mapsto\frac{Q(z_{1},z_{2})}{p(z_{1},z_{2})}$ is an iso-inner rational function and Theorem 2.1 guarantees the existence of an isometric matrix $U(z_{2})$ such that

[TABLE]

Step 3: In this step we find a formula for $U(z_{2})$ and show it extends to $\overline{\mathbb{D}}$ as a rational iso-inner function in one variable. We can rewrite (4.2) in terms of the coefficients of the powers of $z_{1}$ by writing $p(z)=\sum_{j}p_{j}(z_{2})z_{1}^{j}$ and $Q(z)=\sum_{j}Q_{j}(z_{2})z_{1}^{j}$, defining $\vec{p}(z_{2})=(p_{0}(z_{2})I_{N},p_{1}(z_{2})I_{N},\dots,p_{n_{1}}(z_{2})I_{N})$, and $\vec{Q}(z_{2})=(Q_{0}(z_{2}),\dots,Q_{n_{1}}(z_{2}))$. Then,

[TABLE]

using $O_{r\times N}$ to denote the $r\times N$ zero matrix. Since $p(0,z_{2})=p_{0}(z_{2})$ has no zeros in $\mathbb{D}$ , the matrix $\begin{pmatrix}\vec{p}(z_{2})\\ \begin{matrix}0&A(z_{2})\end{matrix}\end{pmatrix}$ has a rational matrix right inverse of the form $\begin{pmatrix}p_{0}(z_{2})^{-1}I&X(z_{2})\\ 0&B(z_{2})\end{pmatrix}$ . The exact formula for $X(z_{2})$ is $-\frac{1}{p_{0}}(p_{1}I,\dots,p_{n_{1}}I)B$ . Then,

[TABLE]

extends to a rational function holomorphic in $\mathbb{D}$ and isometry-valued on $\mathbb{T}$ away from any singularities. So, not only is $U$ uniquely determined (by $A,B$ ) and iso-inner but both sides of (4.3) are now holomorphic, so (4.3) extends to $\mathbb{D}$ . (We caution that the blocks in (4.4) do not line up as written. There is no need to multiply this out, so there is no real concern.)

Step 4: In this step we find an isometric matrix $V$ such that $S$ has a TFR built out of $V$ . It turns out $U(z_{2})$ as a one variable function has a TFR built out of the same isometry $V$ . Indeed, by Theorem 3.1 and Theorem 2.1 there exist a constant isometric matrix $V$ and matrix function $F(z_{2})$ such that

[TABLE]

A formula for $V$ can be found via Remark 3.3. As we now show, $V$ is the isometry we are looking for. If we multiply on the right by $\begin{pmatrix}p(z)I\\ z_{1}A(z_{2})\Lambda(z_{1})\end{pmatrix}$ and define $H(z):=F(z_{2})\begin{pmatrix}p(z)I\\ z_{1}A(z_{2})\Lambda(z_{1})\end{pmatrix}$ , $G(z):=A(z_{2})\Lambda(z_{1})$ we get

[TABLE]

By Theorem 2.1, this means $S$ has a finite-dimensional isometric transfer function realization built out of the isometry $V$ . This proves Theorem 1.1. ∎

When we prove the minimality theorem (Thm 1.4) we will pick up where this proof leaves off. We will later refer to $G^{*}G$ as the dominant $z_{1}$ -term associated to $S$ , while we will refer to $H^{*}H$ as the sub-dominant $z_{2}$ -term. We write $G^{*}G:=G(w)^{*}G(z),H^{*}H:=H(w)^{*}H(z)$ instead of $G,H$ because the former are uniquely determined while $G,H$ are determined up to left multiplication by isometric matrices. By symmetry we could also construct a dominant $z_{2}$ -term with associated sub-dominant $z_{1}$ -term.

5. Detailed example

In this section we give a detailed example of the 4 steps presented in the proof of Theorem 1.1. The $N\times N$ identity matrix is written $I_{N}$ , the $N\times N$ zero matrix is written $O_{N}$ , and the $N\times M$ zero matrix is written $O_{N\times M}$ .

Consider the following simple rational inner function

[TABLE]

where $X=\frac{1}{\sqrt{2}}\begin{pmatrix}1&1\\ 1&-1\end{pmatrix}$ is a unitary matrix. The expression on the right shows that $S$ is a product of inner functions and is therefore inner itself. Since $S$ is a polynomial, the process below is simpler than in the general case but still illustrative. In the notation of the proof of Theorem 1.1, we have $p=1$ and $Q=S$ .

Step 1: Set $|z_{2}|=1$ , divide $I-S(w_{1},z_{2})^{*}S(z_{1},z_{2})$ by $1-\bar{w}_{1}z_{1}$ , and extract coefficients of the monomials $\bar{w}_{1}^{j}z_{1}^{k}$ in order to write

[TABLE]

where $T(z_{2})$ is the matrix Laurent polynomial

[TABLE]

Necessarily, $T$ is positive semi-definite on $\mathbb{T}$ .

Step 2: Factor $T$ according to the one variable matrix Fejér-Riesz theorem. There exist algorithms for doing this ([Hachez]) and it can also be essentially reduced to polynomial algebra and one variable Fejér-Riesz factorizations (see [Dj] where this is done in a more general setup). We get $T(z_{2})=A(z_{2})^{*}A(z_{2})$ on $\mathbb{T}$ where

[TABLE]

has right inverse

[TABLE]

We use the equations above to define the $2\times 2$ matrix polynomials $A_{0}(z_{2}),A_{1}(z_{2}),B_{0}(z_{2}),B_{1}(z_{2})$ . Note that in general the right inverse could be rational rather than polynomial.

Step 3: We find our parametrized unitary $U(z_{2})$ in this step. Form the “vectors” of coefficients

[TABLE]

where

[TABLE]

and then compute the one variable rational inner function $U(z_{2})$ as in (4.3)

[TABLE]

Step 4: We find a TFR for $U(z_{2})$ by applying Remark 3.3. Let us emphasize the steps. Divide $I_{4}-U(w_{2})^{*}U(z_{2})$ by $1-\bar{w}_{2}z_{2}$ and extract coefficients of $\bar{w}_{2}^{j}z_{2}^{k}$ to write

[TABLE]

where

[TABLE]

Then, we factor $Y=C^{*}C$ where

[TABLE]

Note that

[TABLE]

is a right inverse for $C$ (i.e. $CD=I_{2}$ ). Set

[TABLE]

We need to compute the unitary (or isometry in general) $V$ such that

[TABLE]

After equating coefficients of powers of $z_{2}$ this is equivalent to

[TABLE]

where $U(z_{2})=U_{0}+z_{2}U_{1}+z_{2}^{2}U_{2}$ . Using the right inverse $D$ we have

[TABLE]

This is the desired unitary out of which we build our TFR. Setting $V_{11}=O_{2}$

[TABLE]

we have

[TABLE]

where $\Delta(z_{1},z_{2})=\begin{pmatrix}z_{1}I_{2}&O_{2}\\ O_{2}&z_{2}I_{2}\end{pmatrix}$ . This is easy to verify since $(V_{22}\Delta(z))^{3}=O$ so that the formula reduces to

[TABLE]

which can be verified by hand.

While the above method involves several steps it is entirely systematic. Since $S$ is a product of simple inner functions, there are ad hoc ways of coming up with a TFR which might be shorter.
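For readers who want to experiment numerically, the following is a minimal sketch of how a two-variable transfer function $A+B\Delta(z)(I-D\Delta(z))^{-1}C$ can be evaluated. It does not use the function $S$ of this section. Instead it assumes a hypothetical $3\times 3$ permutation matrix as the unitary, with $\Delta(z)=\mathrm{diag}(z_{1},z_{2})$, which should realize the scalar inner function $z_{1}z_{2}$. The helper names `matmul`, `solve` and `transfer` are our own.

```python
# Sketch only: the unitary [[A, B], [C, D]] below is a hypothetical permutation
# matrix chosen for illustration; it is not the matrix V computed in Section 5.

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def solve(M, R):
    """Solve M X = R by Gauss-Jordan elimination with partial pivoting."""
    n = len(M)
    aug = [list(map(complex, M[i])) + list(map(complex, R[i])) for i in range(n)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        aug[col] = [x / aug[col][col] for x in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

# Blocks of the permutation matrix [[0, 0, 1], [1, 0, 0], [0, 1, 0]].
A = [[0]]
B = [[0, 1]]
C = [[1], [0]]
D = [[0, 0], [1, 0]]

def transfer(z1, z2):
    """Evaluate A + B Delta(z) (I - D Delta(z))^{-1} C with Delta = diag(z1, z2)."""
    delta = [[z1, 0], [0, z2]]
    d_delta = matmul(D, delta)
    i_minus = [[(1 if i == j else 0) - d_delta[i][j] for j in range(2)]
               for i in range(2)]
    x = solve(i_minus, C)                 # x = (I - D Delta)^{-1} C
    return A[0][0] + matmul(matmul(B, delta), x)[0][0]
```

In this toy case $D\Delta(z)$ squares to zero, so the inverse is a finite Neumann sum, in the same spirit as the nilpotency used in the verification above.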

6. Matrix Agler decompositions in two variables

Theorem 1.1 makes it possible to prove Agler’s theorem (Thm 1.2). Cole-Wermer [CW99] showed that in the scalar case it is enough to prove Agler’s theorem for rational inner functions because holomorphic $f:\mathbb{D}^{2}\to\mathbb{D}$ can be approximated locally uniformly by rational inner functions (Theorem 5.5.1 of Rudin [Rudin]). This approximation argument does not seem to transfer to the matrix-valued function setting, but there is a workaround.

Lemma 6.1.

Let $f:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ be holomorphic and $\|f(z)\|\leq 1$ for all $z\in\mathbb{D}^{d}$ . Suppose $\|f(z_{0})\|=1$ for some $z_{0}\in\mathbb{D}^{d}$ . Then, there exist unitary matrices $U_{1},U_{2}$ such that $U_{1}fU_{2}$ is a direct sum of a constant unitary matrix and a matrix valued holomorphic function $g$ on $\mathbb{D}^{d}$ with $\|g(z)\|<1$ for all $z\in\mathbb{D}^{d}$ .

Proof.

If $\|f(z_{0})\|=1$ , then there exists $v\in\mathbb{C}^{N}$ with $|v|=1$ such that $|f(z_{0})v|=1$ . By the maximum principle, $\langle f(z)v,f(z_{0})v\rangle$ is constant and equal to one. Then, by equality in Cauchy-Schwarz, $f(z)v\equiv f(z_{0})v$ . Since $f(z)$ has at most norm one, $v$ is reducing for $f(z)$ meaning $f(z)w\perp f(z)v$ whenever $v\perp w$ . Thus, $f(z)$ can be written in the form

[TABLE]

with respect to the orthogonal decompositions $\mathbb{C}f(z_{0})v\oplus(f(z_{0})v)^{\perp}$ of the codomain and $\mathbb{C}v\oplus v^{\perp}$ of the domain. We can of course iterate this argument until we are left with the claimed decomposition. ∎

This lets us reduce to the case of $f$ with $\|f(z)\|<1$ for all $z\in\mathbb{D}^{d}$ . The following is found in Rudin’s book [Rudin] in the scalar case (see Theorem 5.5.1 of [Rudin]). Define

[TABLE]

Lemma 6.2.

Suppose $f:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is holomorphic and $\|f(z)\|<1$ for all $z\in\mathbb{D}^{d}$ . Then, for any $r\in(0,1)$ and $\epsilon>0$ there exists $P\in\mathbb{C}^{M\times N}[z_{1},\dots,z_{d}]$ such that $\|P\|_{\mathbb{D}^{d}}<1$ and $\|f-P\|_{r\mathbb{D}^{d}}<\epsilon$ .

Consequently, every such $f$ is a local uniform limit of matrix polynomials with supremum norm strictly less than $1$ .

Proof.

Set $f_{r}(z)=f(rz)$ for $r\in(0,1)$ . For fixed $r\in(0,1)$ there exists $s\in(0,1)$ such that $\|f_{r}-f_{rs}\|_{\mathbb{D}^{d}}<\epsilon/2$ since $f_{r}$ is uniformly continuous on $\overline{\mathbb{D}}^{d}$ . Note that $\|f_{s}\|_{\mathbb{D}^{d}}<1$ . Choose a Taylor polynomial $P$ of $f_{s}$ such that $\|f_{s}-P\|_{\mathbb{D}^{d}}<\min(1-\|f_{s}\|_{\mathbb{D}^{d}},\epsilon/2)$ . Then, $\|P\|_{\mathbb{D}^{d}}<1$ and $\|f_{r}-P_{r}\|_{\mathbb{D}^{d}}\leq\|f_{r}-f_{rs}\|_{\mathbb{D}^{d}}+\|f_{rs}-P_{r}\|_{\mathbb{D}^{d}}<\epsilon$ , where $P_{r}(z)=P(rz)$ . Since $\|f-P\|_{r\mathbb{D}^{d}}=\|f_{r}-P_{r}\|_{\mathbb{D}^{d}}$ , this is the claimed estimate. ∎
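The two-step approximation in this proof (first dilate, then truncate a Taylor series) can be tried out numerically. The sketch below is only an illustration under our own assumptions: it takes $d=1$, the scalar function $f(z)=(z+a)/(1+az)$ with $a=1/2$, and replaces each supremum by a maximum over 720 sample points of a circle, so it is a sanity check and not a rigorous bound. All function names are hypothetical.

```python
import cmath

a = 0.5  # hypothetical parameter; f maps the unit disk into itself

def f(z):
    return (z + a) / (1 + a * z)

def taylor_coeffs(n):
    """Taylor coefficients c_0..c_n of f at 0: c_0 = a, c_k = (1 - a^2)(-a)^(k-1)."""
    return [a] + [(1 - a * a) * (-a) ** (k - 1) for k in range(1, n + 1)]

CIRCLE = [cmath.exp(2j * cmath.pi * k / 720) for k in range(720)]

def sup_on_circle(g, radius=1.0):
    """Sampled maximum of |g| on the circle of the given radius."""
    return max(abs(g(radius * w)) for w in CIRCLE)

def approximate(r, eps):
    # Step 1: find s in (0,1) with ||f_r - f_{rs}|| < eps/2 (sampled).
    s = 0.5
    while sup_on_circle(lambda z: f(r * z) - f(r * s * z)) >= eps / 2:
        s = (1 + s) / 2
    # Step 2: find a Taylor polynomial P of f_s with
    # ||f_s - P|| < min(1 - ||f_s||, eps/2) (sampled).
    norm_fs = sup_on_circle(lambda z: f(s * z))
    n = 0
    while True:
        coeffs = [c * s ** k for k, c in enumerate(taylor_coeffs(n))]
        P = lambda z, cs=coeffs: sum(c * z ** k for k, c in enumerate(cs))
        if sup_on_circle(lambda z: f(s * z) - P(z)) < min(1 - norm_fs, eps / 2):
            return s, coeffs, P
        n += 1

s, coeffs, P = approximate(r=0.9, eps=0.01)
```

The returned polynomial `P` should then have sampled norm below $1$ on the unit circle and differ from `f` by less than `eps` on the circle of radius `r`, mirroring the two conclusions of the lemma.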

We need the following Fejér-Riesz type theorem of Dritschel.

Theorem 6.3 (Dritschel [mD1]).

Let $T(z)=\sum_{j\in\mathbb{Z}^{d}}T_{j}z^{j}$ be a matrix-valued Laurent polynomial in $d$ variables; i.e. $T_{j}\in\mathbb{C}^{N\times N}$ for $j\in\mathbb{Z}^{d}$ and at most finitely many $T_{j}\neq 0$ . If there is a $\delta>0$ such that $T(z)\geq\delta I$ on $\mathbb{T}^{d}$ , then there exists a matrix polynomial $A\in\mathbb{C}^{M\times N}[z_{1},\dots,z_{d}]$ such that $T=A^{*}A$ on $\mathbb{T}^{d}$ .

We sketch a simple proof with some new elements in the appendix; see Subsection 10.2.

Lemma 6.4.

If $P:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is a matrix polynomial such that $\|P\|_{\mathbb{D}^{d}}<1$ then there exists a matrix polynomial $A$ such that $\begin{pmatrix}P\\ A\end{pmatrix}$ is iso-inner. If $d=1,2$ , then $P$ has a finite contractive TFR.

Proof.

On $\mathbb{T}^{d}$ , $I-P^{*}P$ is a positive definite matrix Laurent polynomial. By Theorem 6.3 we can factor $I-P^{*}P=A^{*}A$ . Then, $S=\begin{pmatrix}P\\ A\end{pmatrix}$ is isometry-valued on $\mathbb{T}^{d}$ . If $d=1,2$ , then $S$ has a finite isometric TFR by Theorem 1.1 and hence $P$ possesses a finite contractive TFR by Proposition 2.2. ∎
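As a small sanity check of this construction, consider a toy example of our own (not taken from the text): the scalar polynomial $P(z)=\tfrac{1}{2}z_{1}z_{2}$ on $\mathbb{D}^{2}$. Here the factor $A$ can be taken constant, and the completed column is isometric on $\mathbb{T}^{2}$:

```latex
\[
I-P^{*}P \;=\; 1-\tfrac{1}{4}|z_{1}z_{2}|^{2} \;=\; \tfrac{3}{4}
\quad (z\in\mathbb{T}^{2}),
\qquad
A=\tfrac{\sqrt{3}}{2},
\qquad
S=\begin{pmatrix}\tfrac{1}{2}z_{1}z_{2}\\[2pt] \tfrac{\sqrt{3}}{2}\end{pmatrix},
\qquad
S^{*}S=\tfrac{1}{4}+\tfrac{3}{4}=1 \ \text{ on } \mathbb{T}^{2}.
\]
```

In general $A$ is a genuinely non-constant matrix polynomial; the constant case arises here only because $|P|$ is constant on the torus.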

Positive semi-definite kernels are defined in Definition 10.5. Notice that an expression of the form $F(w)^{*}F(z)$ will always be positive semi-definite. By the above lemma and Theorem 2.1, any matrix polynomial $P\in\mathbb{C}^{M\times N}[z_{1},z_{2}]$ with $\|P\|_{\mathbb{D}^{2}}<1$ will satisfy a formula of the form

[TABLE]

where $k_{0},k_{1},k_{2}$ are positive semi-definite kernels. The term $k_{0}$ can be absorbed into $k_{1}$ since

[TABLE]

is positive semi-definite by the Schur product theorem. Thus, the following corollary holds for such strictly contractive matrix polynomials in two variables. Such formulas are called Agler decompositions.

Corollary 6.5.

Let $f:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ be holomorphic with $\|f(z)\|\leq 1$ for $z\in\mathbb{D}^{2}$ . Then, there exist positive semi-definite kernels $k_{1},k_{2}:\mathbb{D}^{2}\times\mathbb{D}^{2}\to\mathbb{C}^{N\times N}$ such that

[TABLE]

Sketch of Proof.

The hard work has already been done while the general outline and some technicalities are essentially in [CW99] so we only sketch the proof. We can assume that $f$ is point-wise strictly contractive by Lemma 6.1. Then, $f$ is a local uniform limit of matrix polynomials with supremum norm strictly less than one by Lemma 6.2. Each of these possesses an Agler decomposition by the discussion above.

The final part of the argument is the piece found in [CW99]. The kernels in the Agler decomposition are locally bounded because of the estimate

[TABLE]

This shows the kernels in Agler decompositions form a normal family. Subsequences converge locally uniformly to form positive semi-definite kernels in an Agler decomposition for $f$ . ∎

The above corollary proves Theorem 1.2. The proof is essentially the same as $(3)\implies(1)$ in the equivalences theorem (Thm 2.1) since positive semi-definite kernels can be factored as $F(w)^{*}F(z)$ for some possibly operator valued function $F$ . Readers who have ventured this far (and are not in the cognoscenti of this material) may benefit from some context at this point. The fundamental contribution of Agler can perhaps be encapsulated in the following result.

Theorem 6.6 (Agler [Agler1, Agler2]).

Let $f:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ be holomorphic. Assume $\|f(z)\|\leq 1$ for $z\in\mathbb{D}^{d}$ . Then, the following are equivalent.

(1) $f$ satisfies a von Neumann inequality:

$$\|f(T)\|\leq 1$$

for every $d$ -tuple $T=(T_{1},\dots,T_{d})$ of pairwise commuting strictly contractive operators (on some underlying Hilbert space);

(2) $f$ has an Agler decomposition: there exist positive semi-definite kernels $k_{1},\dots,k_{d}:\mathbb{D}^{d}\times\mathbb{D}^{d}\to\mathbb{C}^{N\times N}$ such that

$$I-f(w)^{*}f(z)=\sum_{j=1}^{d}(1-\bar{w}_{j}z_{j})k_{j}(z,w);$$

(3) $f$ has a contractive transfer function realization: there exists a contractive operator with block decomposition $T=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ on some Hilbert space such that

$$f(z)=A+B\Delta(z)(I-D\Delta(z))^{-1}C$$

where $\Delta(z)=\sum_{j=1}^{d}z_{j}P_{j}$ and the $P_{j}$ are orthogonal projections with pairwise orthogonal ranges which sum to the identity on the domain of $D$ .

Theorem 1.2 was originally proven via Andô’s inequality [Ando] which gives item (1) above. The approach we have given sidesteps the use of von Neumann’s inequality and the implication $(1)\implies(2)$ in Theorem 6.6. The proof of $(1)\implies(2)$ is possibly the hardest part of the theorem and is non-constructive as it uses a Hahn-Banach cone separation argument. On the other hand, $(2)\implies(1)$ is a relatively straightforward matter of “plugging” the $d$ -tuple $T$ into the Agler decomposition in item (2) in an appropriate sense. See [CW99] for details. Ball-Sadosky-Vinnikov [BSV05] have a different way to prove Theorem 1.2 directly using multi-evolution scattering systems. Theorem 1.2’s analogue for $3$ or more variables fails because the von Neumann inequality fails for 3 or more contractions [Varo]. Thus, Theorem 6.6 gives the best way of demonstrating that a function does not have a contractive TFR; namely, showing that it fails the von Neumann inequality. It is probably difficult to directly show that a function fails item (2) or (3) in Theorem 6.6.

We conclude this section by plugging Dritschel’s strong Fejér-Riesz type result (stated below) into earlier arguments in order to show rational contractive matrix-valued functions in two variables have a finite contractive TFR (Theorem 1.3).

Theorem 6.7 (Dritschel [mD2]).

Let $T(z)=\sum_{j\in\mathbb{Z}^{2}}T_{j}z^{j}$ be a matrix-valued Laurent polynomial in two variables; i.e. $T_{j}\in\mathbb{C}^{N\times N}$ for $j\in\mathbb{Z}^{2}$ and at most finitely many $T_{j}\neq 0$ . If $T(z)\geq 0$ on $\mathbb{T}^{2}$ , then there exists a matrix polynomial $A\in\mathbb{C}^{M\times N}[z_{1},z_{2}]$ such that $T=A^{*}A$ on $\mathbb{T}^{2}$ .

This theorem is considerably deeper than Theorem 6.3, and both theorems also apply to operator-valued functions. An earlier sums of squares theorem of Scheiderer, which applied to polynomials on a much more general class of two dimensional domains (than simply $\mathbb{T}^{2}$ ), implies Theorem 6.7 in the scalar case [Scheiderer].

Proof of Theorem 1.3.

Apply the proof of Proposition 4.2 with Theorem 6.7 in place of Theorem 4.1. ∎

7. More on finite TFRs

We need to collect one more fact about finite-dimensional TFRs before proving the minimality theorem. If we have an Agler decomposition of an iso-inner function $S=Q/p$ written in lowest terms, then the sums of squares terms are rational with denominator $p$ .

Theorem 7.1.

Suppose $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is rational and iso-inner. Write $S=Q/p$ in lowest terms with $Q\in\mathbb{C}^{M\times N}[z_{1},\dots,z_{d}]$ and $p\in\mathbb{C}[z_{1},\dots,z_{d}]$ . Suppose we have an Agler decomposition

[TABLE]

where the $F_{j}$ are matrix functions. Then, for $j=1,\dots,d$ , $p(z)F_{j}(z)$ is a matrix polynomial.

The significance of this theorem is that although $S$ has a TFR with denominator $\det(I-D\Delta(z))$ , this polynomial may not be the lowest degree denominator of $S$ .

Proof.

By Theorem 2.1 we already see that each $F_{j}$ is rational and holomorphic in $\mathbb{D}^{d}$ . To prove that $H_{j}:=pF_{j}$ is a matrix polynomial consider

[TABLE]

Fix $\tau\in\mathbb{T}^{d}$ and set $z=\zeta\tau,w=\eta\tau$ for $\zeta,\eta\in\mathbb{D}$ . Then

[TABLE]

Because $S^{*}S=I_{N}$ on $\mathbb{T}^{d}$ , the left hand side above is divisible by $(1-\bar{\eta}\zeta)$ and therefore

[TABLE]

is a polynomial in $\zeta,\bar{\eta}$ of degree in each less than the total degree of $p$ and $Q$ . For simplicity we can regroup $\sum_{j=1}^{d}H_{j}(w)^{*}H_{j}(z)=H(w)^{*}H(z)$ where now $H(\eta\tau)^{*}H(\zeta\tau)$ is a polynomial in $\zeta,\bar{\eta}$ for every $\tau\in\mathbb{T}^{d}$ . If we write out the homogeneous expansion of $H$ ,

[TABLE]

we see that

[TABLE]

In particular, for $j$ greater than the total degrees of $p$ and $Q$ , the coefficient of $\bar{\eta}^{j}\zeta^{j}$ vanishes for every $\tau$ ; namely, we have $P_{j}(\tau)^{*}P_{j}(\tau)\equiv 0$ for all $\tau\in\mathbb{T}^{d}$ . Since $P_{j}$ is a matrix polynomial, this implies $P_{j}\equiv 0$ for $j$ greater than the total degrees of $p$ and $Q$ . Therefore, $H$ is a polynomial. ∎

We conclude this short section with a few asides. The Agler norm (sometimes Schur-Agler norm) for holomorphic $f:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is

$$\|f\|_{\mathcal{A}_{d}}=\sup_{T}\|f(T)\|$$

where the supremum is taken over all $d$ -tuples $T=(T_{1},\dots,T_{d})$ of strictly contractive pairwise commuting operators on some Hilbert space. The Agler class $\mathcal{A}_{d}$ consists of functions satisfying $\|f\|_{\mathcal{A}_{d}}\leq 1$ .

The argument in the proof above is related to the argument used to prove the following automatic finite-dimensionality result.

Theorem 7.2.

Suppose $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is rational, iso-inner or coiso-inner ( $SS^{*}=I$ on $\mathbb{T}^{d}$ ), and belongs to the Agler class $\mathcal{A}_{d}$ . Then, $S$ has a finite-dimensional isometric (resp. coisometric) TFR as in Theorem 2.1.

The essence of this theorem was first proved in Cole-Wermer [CW99]. Although it was only stated and proved in the scalar case for $d=2$ , the proof goes through easily to all $d$ and for iso-inner functions. We gave a proof with some bounds on degrees and the numbers of squares involved in the scalar case in [KneseRIFITSAC]. A proof of the square matrix-valued case is in [BallKal]. Extending to the iso-inner (non-square) case causes no difficulties. The coisometric case follows from Proposition 2.3. A proof where $S$ is assumed to be a polynomial is also given in [wavelet]. The next theorem also produces a family of functions with finite TFRs.

Theorem 7.3 (Grinshpan et al [Getal]).

Suppose $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is rational, analytic on a neighborhood of $\overline{\mathbb{D}}^{d}$ , and $\|S\|_{\mathcal{A}_{d}}<1$ . Then, $S$ has a finite-dimensional contractive TFR as in Theorem 2.1.

The following question asks about what is still left open.

Question 7.4.

For $d>2$ , if $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is rational, $\|S\|_{\mathcal{A}_{d}}=1$ , and is neither iso-inner nor coiso-inner, then does $S$ have a finite-dimensional contractive TFR?

We also do not know how essential analyticity on $\overline{\mathbb{D}}^{d}$ is for Theorem 7.3. Note that for $d=1,2$ the existence of a finite-dimensional contractive TFR follows from Theorem 1.3.

8. Kummert’s minimality theorem

In this section we discuss minimality of size breakdowns for finite TFRs, namely Theorem 1.4. Minimality in one variable follows directly from Theorem 2.1.

Proposition 8.1.

Let $S:\mathbb{D}\to\mathbb{C}^{M\times N}$ be rational and iso-inner. Then, the minimal size of an isometric TFR for $S$ is the rank of the positive semi-definite kernel

[TABLE]

The definition of the rank of a positive semi-definite kernel is given in Definition 10.6 in the Appendix. In two variables, we will frequently refer to the dominant $z_{1}$ -term $G^{*}G$ and sub-dominant $z_{2}$ -term $H^{*}H$ associated to $S$ which were constructed in the proof of Theorem 1.1; see the end of Section 4. Note that the number of rows of $G$ matches the generic rank of the matrix $T(z_{2})$ as in equation (4.1). This cannot be reduced because this is the generic or maximal rank of the positive semi-definite kernels

[TABLE]

Note division of (4.1) by $\overline{p(w_{1},z_{2})}p(z_{1},z_{2})$ will not change the rank of the positive semi-definite kernel and does not introduce any poles in $\mathbb{D}$ since $p(\cdot,z_{2})$ has no zeros in $\mathbb{D}$ by Lemma 4.3.

We claim that in the inner case the rank of $H^{*}H$ is also as small as possible. We suspect this happens in the iso-inner case but cannot prove it.

Question 8.2.

If $S:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ is iso-inner (and not inner), does the construction in Section 4 produce a size breakdown $(r_{1},r_{2})$ with $r_{1}$ equal to the generic size of a TFR for $S(\cdot,z_{2})$ (for $z_{2}\in\mathbb{T}$ ) and $r_{2}$ equal to the generic size of a TFR for $S(z_{1},\cdot)$ (for $z_{1}\in\mathbb{T}$ )?

This question is subtle because every iso-inner function $S$ is a submatrix of an inner function $\Phi$ with the same size breakdown. We have built a size breakdown with $r_{1}$ minimal so $r_{1}$ must also be minimal for $\Phi$ . We could then build a TFR with size breakdown $(r_{1},r_{2}^{*})$ where $r_{2}^{*}$ is minimal for $\Phi$ . Is it minimal for the restriction to $S$ ?

The next result characterizes $G^{*}G$ and $H^{*}H$ .

Proposition 8.3.

Assume $S:\mathbb{D}^{2}\to\mathbb{C}^{M\times N}$ is rational and iso-inner. Write $S=Q/p$ in lowest terms. Suppose we had a formula

[TABLE]

where $\Gamma_{1},\Gamma_{2}$ are matrix polynomials. Then,

[TABLE]

is a positive semi-definite polynomial kernel. Here again $G^{*}G$ is the dominant $z_{1}$ -term and $H^{*}H$ is the sub-dominant $z_{2}$ -term.

This result characterizes $G^{*}G$ as maximal and $H^{*}H$ as minimal in the above sense. Indeed, if some other kernel $L^{*}L$ satisfied the same property as $G^{*}G$ then both

[TABLE]

would be positive semi-definite forcing $G^{*}G=L^{*}L$ .

Proof of Proposition 8.3.

If we set $z_{2}=w_{2}\in\mathbb{T}$ we get

[TABLE]

The left side has degree at most $n_{1}-1$ in $z_{1}$ . We claim $\Gamma_{1}(z)$ has degree at most $n_{1}-1$ in $z_{1}$ . Consider $\Gamma_{1}$ ’s top degree term $\gamma(z_{2})z_{1}^{k}$ where $\gamma(z_{2})$ is a matrix polynomial. Then, the term $\bar{w}_{1}^{k}z_{1}^{k}$ appears on the right hand side with coefficient $\gamma(z_{2})^{*}\gamma(z_{2})$ for $z_{2}\in\mathbb{T}$ . If $k>n_{1}-1$ then $\gamma(z_{2})^{*}\gamma(z_{2})\equiv 0$ on $\mathbb{T}$ implying $\gamma(z_{2})\equiv 0$ on $\mathbb{T}$ and also on $\mathbb{C}$ by analyticity. Thus, $\Gamma_{1}$ has degree at most $n_{1}-1$ in $z_{1}$ .

Just as we have factored $G(z)=A(z_{2})\Lambda(z_{1})$ we can also factor $\Gamma_{1}(z)=C(z_{2})\Lambda(z_{1})$ . Recall $\Lambda(z_{1})=(I,z_{1}I,\cdots,z_{1}^{n_{1}-1}I)^{t}$ . Upon extracting coefficients of $\bar{w}_{1}^{j}z_{1}^{k}$ we see that

[TABLE]

for $z_{2}\in\mathbb{T}$ . This is related to characterizing uniqueness in the matrix Fejér-Riesz theorem. We address this in the appendix in Theorem 10.4. By Theorem 10.4, since $A$ has a right inverse (namely $B$ ), there exists a one variable iso-inner function $\Phi$ such that $C=\Phi A$ .

So,

[TABLE]

which is positive semi-definite. Applying $\Lambda(w_{1})^{*}$ on the left and $\Lambda(z_{1})$ on the right we get

[TABLE]

is positive semi-definite. It is a polynomial kernel because $A^{*}A=C^{*}C$ on $\mathbb{T}$ . ∎

We now switch to the square/inner case and show that the Kummert construction gives the best possible size breakdown $r=(r_{1},r_{2})$ . We need to show $H(w)^{*}H(z)$ has the minimal rank possible in the sense that it matches the generic size of a TFR for $S(z_{1},\cdot)$ for $z_{1}\in\mathbb{T}$ . To do this, we show that we can “reflect” an Agler decomposition of $S$ to get an Agler decomposition for $\breve{S}$ and this reflection reverses the dominant and sub-dominant properties of $G^{*}G$ and $H^{*}H$ . This is not the original approach of Kummert; instead it more closely resembles the Hilbert space approach in [BickelKnese]. Recall $\breve{S}(z)=S(\bar{z})^{*}$ .

Proposition 8.4.

Suppose $S:\mathbb{D}^{2}\to\mathbb{C}^{N\times N}$ is rational and inner. Write $S=Q/p$ in lowest terms. Suppose we had a formula

[TABLE]

where $\Gamma_{1},\Gamma_{2}$ are matrix polynomials. Then,

[TABLE]

are matrix polynomials and

[TABLE]

The sub-dominant $z_{2}$ -term of $S$ reflects to the dominant $z_{2}$ -term of $\breve{S}$ .

When we say reflects above we mean the operations:

[TABLE]

listed in the proposition statement equation (8.3). Notice that reflection of the $\Gamma_{1}$ term is slightly different from the reflection of the $\Gamma_{2}$ term.

Proof of Proposition 8.4.

Since $S(z)^{*}S(z)=I$ on $\mathbb{T}^{2}$ (where defined) we have $I=S(1/\bar{z})^{*}S(z)=S(z)S(1/\bar{z})^{*}$ for $z\in\mathbb{C}^{2}$ where defined. (This is where $M=N$ gets used.) So, $Q(1/z)\breve{Q}(z)=p(1/z)\breve{p}(z)I$ . Now, take equation (8.2), replace $z,w$ with $1/z,1/w$ , multiply on the right by $\breve{Q}(z)$ and left by $\breve{Q}(w)^{*}$ , and finally divide through by $-\overline{p(1/w)}p(1/z)$ to get (8.4) after applying various simplifications. Of course, we have the caveat that the formula only holds where all of the operations are defined. Fortunately, (8.4) only needs to hold on an open set for the proof of (3) $\implies$ (1),(2) in Theorem 2.1 to go through (bonus (B1) of Theorem 2.1 addresses this). We automatically obtain that $\tilde{\Gamma}_{1},\tilde{\Gamma}_{2}$ are polynomials by Theorem 7.1, since if $Q/p$ is in lowest terms then $\breve{Q}/\breve{p}$ is too.

If we reflect equation (8.1) in the sense of replacing $z,w$ with $1/z,1/w$ and conjugating by $\breve{Q}$ we obtain

[TABLE]

which rearranges into

[TABLE]

This is still a positive semi-definite polynomial kernel. Thus, $\tilde{H}^{*}\tilde{H}$ dominates an arbitrary $z_{2}$ -term making it the dominant $z_{2}$ -term for $\breve{S}$ . ∎

Proof of Theorem 1.4.

By Proposition 8.4 the sub-dominant $z_{2}$ -term $H^{*}H$ of $S$ reflects to the dominant $z_{2}$ -term $\tilde{H}^{*}\tilde{H}$ of $\breve{S}$ . Note that this reflection does not change the rank of a positive semi-definite kernel. The rank of $\tilde{H}^{*}\tilde{H}$ is then the generic rank of

[TABLE]

for $z_{1}\in\mathbb{T}$ . This matches the generic size of a TFR for $\breve{S}(z_{1},\cdot)$ which matches the generic size of a TFR for $S(z_{1},\cdot)$ by the adjunction formula, Proposition 2.3. Thus the rank of $H^{*}H$ matches the generic rank of

[TABLE]

∎

9. Application to inner polynomials

Of special interest in the papers connecting wavelets to TFRs is the case of iso-inner and inner polynomials [wavelet, CCCP]. In one variable, we have the following well-known result.

Proposition 9.1.

Let $S\in\mathbb{C}^{M\times N}[z]$ be iso-inner. Then, every isometric TFR of minimal size for $S$ is built out of an isometric matrix $T=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ where $D$ is nilpotent.

We prove this using the following also well-known characterization of minimality.

Proposition 9.2.

Let $S:\mathbb{D}\to\mathbb{C}^{M\times N}$ be rational and iso-inner with minimal isometric TFR built out of the isometric matrix $T=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ . Then,

[TABLE]

Proof.

First note that if $S$ has a TFR via $T$ , meaning $S(z)=A+zB(I-zD)^{-1}C$ , then it also has a TFR via

[TABLE]

where $U$ is a unitary matrix with the same dimensions as $D$ . This is apparent from the formula $A+zBU(I-zU^{*}DU)^{-1}U^{*}C=S(z).$ We can apply a unitary change of coordinates and break up the domain/codomain of $D$ into $\mathcal{H}=\text{span}\{D^{j}C:j=0,1,\dots\}$ and its orthogonal complement $\mathcal{H}^{\perp}$ . In these new coordinates $T$ takes the form

[TABLE]

since $D$ maps $\mathcal{H}$ to itself and $\text{range}(C)\subset\mathcal{H}$ . Since the formula for $S$ is only determined by $D|_{\mathcal{H}}$ , we see that $S$ has an isometric TFR via the matrix $\begin{pmatrix}A&B_{1}\\ C&D|_{\mathcal{H}}\end{pmatrix}$ which has a smaller size unless $\mathcal{H}^{\perp}=\{0\}$ or rather $\mathcal{H}=\text{domain}(D)$ .

For the second identity, we break up the domain of $D$ into $\mathcal{L}=\bigcap_{j\geq 0}\text{kernel}(BD^{j})$ and its orthogonal complement $\mathcal{L}^{\perp}$ . Using this orthogonal decomposition we can write $T$ in new coordinates as

[TABLE]

since $B$ maps $\mathcal{L}$ to $0$ while $D$ maps $\mathcal{L}$ into itself. But since this is an isometry we must have $D|_{\mathcal{L}}$ a unitary which forces $C_{2},D_{21}=0$ . This means $S$ is given by the TFR with isometry $\begin{pmatrix}A&B\\ C_{1}&D_{11}\end{pmatrix}$ . This has smaller size unless $\mathcal{L}=\{0\}$ . ∎

Proof of Proposition 9.1.

If $S(z)=A+zB(I-zD)^{-1}C$ is a polynomial, then necessarily $BD^{j}C=0$ for all $j$ large enough. By Proposition 9.2, $BD^{n}=0$ for $n$ large enough. Then,

[TABLE]

implying $\text{range}(D^{n})=0$ or rather $D^{n}=0$ . ∎
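To make Propositions 9.1 and 9.2 concrete, here is a minimal sketch for the scalar inner polynomial $S(z)=z^{2}$. The realization below, built from a $3\times 3$ permutation matrix, is our own hypothetical choice. The code checks the two minimality conditions in the form used in the proof above (the vectors $D^{j}C$ span the whole space, and the kernels of $BD^{j}$ intersect trivially) and confirms that $D$ is nilpotent.

```python
# Sketch only: T = [[A, B], [C, D]] is a hypothetical permutation matrix
# [[0, 0, 1], [1, 0, 0], [0, 1, 0]] realizing S(z) = z^2.

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def det2(M):
    """Determinant of a 2x2 matrix."""
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]

A = [[0]]
B = [[0, 1]]
C = [[1], [0]]
D = [[0, 0], [1, 0]]

DC = matmul(D, C)   # second column of the matrix with columns C, DC
BD = matmul(B, D)   # second row of the matrix with rows B, BD

# Columns C, DC span C^2 exactly when this determinant is nonzero.
reach = [[C[0][0], DC[0][0]], [C[1][0], DC[1][0]]]
# Rows B, BD have trivial joint kernel exactly when this determinant is nonzero.
observe = [B[0], BD[0]]

D_squared = matmul(D, D)   # expected to vanish, so D is nilpotent

def S(z):
    """A + zB(I - zD)^{-1}C, using (I - zD)^{-1} = I + zD since D^2 = 0."""
    return A[0][0] + z * matmul(B, C)[0][0] + z ** 2 * matmul(B, DC)[0][0]
```

Because $D^{2}=0$, the Neumann series for $(I-zD)^{-1}$ stops after two terms, which is why `S` is a polynomial of degree two.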

Minimality of TFR representations in the rational inner case in two variables makes it possible to prove an analogous result for inner matrix-valued polynomials in two variables. Our approach uses determinants to count the size of minimal TFRs. The following is a standard result in one variable. We provide a proof in Subsection 10.3.

Proposition 9.3.

Let $S:\mathbb{D}\to\mathbb{C}^{N\times N}$ be a rational inner function. Then, $\deg\det S$ equals the size of a minimal TFR for $S$ .

Since $S$ is rational inner, $\det S$ is a scalar rational inner function in one variable, that is, a finite Blaschke product. So, $\deg\det S$ refers to the degree of the numerator when written in lowest terms. This immediately yields a method using determinants to calculate the optimal size breakdown for rational inner functions in two variables. (This is another place where it helps to have square matrices.)

Theorem 9.4 (Kummert).

If $S:\mathbb{D}^{2}\to\mathbb{C}^{N\times N}$ is rational inner, then the minimal size breakdown $r=(r_{1},r_{2})$ of a TFR for $S$ is

[TABLE]

Similarly, for all but finitely many $\zeta\in\mathbb{T}$ , the degree of

[TABLE]

is $r_{1}+r_{2}$ . Therefore, the generic size of a TFR for $z\mapsto S(z,\zeta z)$ is $r_{1}+r_{2}$ . This shows that generic restrictions to slices of our two variable minimal TFRs yield minimal TFRs for restricted functions.

Proof of Theorem 1.5.

The above argument shows that if a polynomial inner function $S$ has a minimal TFR via the unitary $U=\begin{pmatrix}A&B\\ C&D\end{pmatrix}$ and projections $P_{1},P_{2}$ as in Theorem 1.1 then $z\mapsto S(z,\zeta z)$ has minimal unitary TFR via the unitary

[TABLE]

By Proposition 9.1, $D\Delta(1,\zeta)$ is nilpotent for all but finitely many $\zeta\in\mathbb{T}$ . Writing $n$ for the size of the square matrix $D$ , this means $(D\Delta(1,\zeta))^{n}=0$ for all but finitely many $\zeta\in\mathbb{T}$ . Since this is a polynomial equation in $\zeta$ we have $(D\Delta(1,\zeta))^{n}\equiv 0$ and since $D\Delta(z)$ is homogeneous we also have $(D\Delta(z_{1},z_{2}))^{n}\equiv 0$ . Thus, $D\Delta(z)$ is always nilpotent. ∎

This leads to the interesting question of describing contractions $D$ such that $D\Delta(z)$ is nilpotent for all $z$ . An easy way to produce examples would be to make $D$ strictly upper triangular and choose the projections $P_{1},P_{2}$ via projections onto the span of subsets of standard basis vectors. For such examples, $D\Delta(z)$ is triangular; however, it is possible to produce matrices $D_{1},D_{2}$ such that $z_{1}D_{1}+z_{2}D_{2}$ is nilpotent for all $z$ yet is not triangularizable independent of $z$ ; see [nilpotent]. This could be an interesting source of examples.
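The easy family of examples mentioned above can be checked directly. The sketch below assumes a hypothetical strictly upper triangular $3\times 3$ matrix $D$ (a contraction, since the sum of the squares of its entries is $0.5<1$) and coordinate projections $P_{1}=\mathrm{diag}(1,0,1)$, $P_{2}=\mathrm{diag}(0,1,0)$. These specific entries are illustrative choices of ours.

```python
# Sketch only: D and the projections are hypothetical illustrative choices.

def matmul(X, Y):
    """Product of two square matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matpow(M, power):
    """M raised to a nonnegative integer power."""
    n = len(M)
    result = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(power):
        result = matmul(result, M)
    return result

D = [[0, 0.5, 0.3],
     [0, 0, 0.4],
     [0, 0, 0]]
P1_DIAG = [1, 0, 1]   # diagonal of the projection P_1
P2_DIAG = [0, 1, 0]   # diagonal of the projection P_2

def d_delta(z1, z2):
    """The matrix D * Delta(z) with Delta(z) = z1*P_1 + z2*P_2 (diagonal)."""
    delta_diag = [z1 * p + z2 * q for p, q in zip(P1_DIAG, P2_DIAG)]
    return [[D[i][j] * delta_diag[j] for j in range(3)] for i in range(3)]

def max_entry(M):
    """Largest modulus among the entries of M."""
    return max(abs(x) for row in M for x in row)

frobenius_sq = sum(x * x for row in D for x in row)
```

For any sample point, `matpow(d_delta(z1, z2), 3)` should vanish while the square generally does not, because multiplying a strictly upper triangular matrix by a diagonal one keeps it strictly upper triangular.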

10. Appendix: auxiliary results

10.1. Maximum principle for rational iso-inner functions

Proposition 10.1.

Suppose $S:\mathbb{D}^{d}\to\mathbb{C}^{M\times N}$ is rational, analytic in $\mathbb{D}^{d}$ , and $\|S(z)\|\leq 1$ for $z\in\mathbb{T}^{d}$ where defined. Then, $\|S(z)\|\leq 1$ for all $z\in\mathbb{D}^{d}$ .

Rationality is a key assumption since $f(z)=\exp\left(\frac{1+z}{1-z}\right)$ is unimodular on $\mathbb{T}\setminus\{1\}$ and analytic on $\mathbb{C}\setminus\{1\}$ yet not bounded by $1$ in $\mathbb{D}$ .
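The behaviour of this counterexample is easy to observe numerically. The following sketch (function and variable names are our own) evaluates $f$ at a few boundary points away from $1$ and at the interior point $z=0.9$, where $f(0.9)=e^{19}$.

```python
import cmath

def f(z):
    """f(z) = exp((1 + z) / (1 - z)), defined for z != 1."""
    return cmath.exp((1 + z) / (1 - z))

# Moduli at boundary points e^{it}, t away from 0: these should all be 1
# up to rounding, since (1 + z)/(1 - z) is purely imaginary on the circle.
boundary_moduli = [abs(f(cmath.exp(1j * t))) for t in (0.5, 1.0, 2.0, 3.0)]

# Modulus at an interior point close to the singularity z = 1.
interior_modulus = abs(f(0.9))
```

The interior value is of order $10^{8}$, so no maximum principle of the form in Proposition 10.1 can hold for $f$.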

Proof.

We can reduce to the scalar case by considering arbitrary unit vectors $v,w$ and the function $F(z)=w^{*}S(z)v$ . Fix $\omega\in\mathbb{T}^{d}$ and consider the one variable rational function $f(\zeta)=F(\zeta\omega).$ This function is bounded by $1$ on $\mathbb{T}$ away from its potential finite number of poles. But, $f$ must be unbounded near a pole, so any singularities on the boundary are removable. Hence, $f$ is analytic on $\overline{\mathbb{D}}$ and bounded by $1$ by the maximum principle. This implies $F$ is bounded by $1$ at any point of $r\mathbb{T}^{d}$ for $r<1$ . Given any $z\in\mathbb{D}^{d}$ , we can calculate $F(z)$ as a Poisson integral of $F$ on $r\mathbb{T}^{d}$ for $\|z\|_{\infty}<r<1$ to see that $|F(z)|\leq 1$ . ∎

10.2. Fejér-Riesz proofs

A more traditional and well-known version of the matrix Fejér-Riesz theorem is as follows. See [DGK] for a proof.

Theorem 10.2.

Let $T(z)=\sum_{j=-n}^{n}T_{j}z^{j}$ be a matrix Laurent polynomial ( $T_{j}\in\mathbb{C}^{N\times N}$ ) such that $T(z)\geq 0$ for $z\in\mathbb{T}$ and $\det T(z)$ is not identically zero.

Then, there exists a matrix polynomial $A\in\mathbb{C}^{N\times N}[z]$ of degree at most $n$ such that $T=A^{*}A$ on $\mathbb{T}$ and $\det A(z)\neq 0$ for $z\in\mathbb{D}$ .
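In the simplest scalar case $N=1$, $n=1$ the factorization can be written down by hand. The sketch below is our own illustration, not an algorithm from the text. It assumes $T(z)=\bar{b}z^{-1}+c+bz$ with $c>2|b|>0$, so that $T>0$ on $\mathbb{T}$, and returns $A(z)=\beta(z-\zeta_{0})$ with $|\zeta_{0}|>1$, so that $A$ has no zeros in $\mathbb{D}$. The function name `fejer_riesz_degree_one` is hypothetical.

```python
import cmath

def fejer_riesz_degree_one(c, b):
    """Factor T(z) = conj(b)/z + c + b*z (assumed c > 2|b| > 0) as |A|^2 on
    the unit circle, where A(z) = beta * (z - zeta0) and |zeta0| > 1."""
    # z*T(z) = b z^2 + c z + conj(b) has the two roots zeta0 and 1/conj(zeta0).
    disc = cmath.sqrt(c * c - 4 * abs(b) ** 2)
    roots = [(-c + disc) / (2 * b), (-c - disc) / (2 * b)]
    zeta0 = max(roots, key=abs)           # keep the root outside the disk
    beta = (abs(b) / abs(zeta0)) ** 0.5   # normalisation so that |A|^2 = T
    return beta, zeta0

def T(z, c, b):
    """The Laurent polynomial being factored."""
    return b.conjugate() / z + c + b * z

# Example: T(z) = 1/z + 5/2 + z, expected to give A(z) = (z + 2)/sqrt(2).
beta, zeta0 = fejer_riesz_degree_one(2.5, 1 + 0j)
```

Choosing the root outside the disk is what yields the condition $\det A(z)\neq 0$ in $\mathbb{D}$; the other root would give a factor of the same modulus on $\mathbb{T}$ with a zero inside the disk.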

We think it is worthwhile to show how to go from this theorem to the degenerate version, Theorem 4.1, using ideas from [Dj]. The key tool is the Smith normal form.

Theorem 10.3 (Smith normal form).

Let $P\in\mathbb{C}^{M\times N}[z]$ be a matrix polynomial. Then, there exist $T_{1}\in\mathbb{C}^{M\times M}[z],T_{2}\in\mathbb{C}^{N\times N}[z]$ with matrix polynomial inverses (equivalently, with constant determinants) and $D\in\mathbb{C}^{M\times N}[z]$ such that $P=T_{1}DT_{2}$ . The matrix $D$ has the following form: every entry off the main diagonal of $D$ is zero and the main diagonal consists of polynomials $d_{1},\dots,d_{k}$ such that $d_{j}$ divides $d_{j+1}$ . Here $k=\min\{N,M\}$ and the $d_{j}$ may be zero for $j$ large enough.

See Hoffman-Kunze [HK].
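For a concrete instance, the matrix polynomial $P(z)=\begin{pmatrix}z&1\\ 0&z\end{pmatrix}$ has Smith normal form $D=\text{diag}(1,z^{2})$ . The Python sketch below verifies one possible choice of $T_{1},T_{2}$ by exact polynomial arithmetic; the example and the helper names are illustrative and not taken from [HK]. Polynomials are stored as coefficient lists, lowest degree first.

```python
def padd(p, q):
    """Sum of two polynomials given as coefficient lists."""
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [a + b for a, b in zip(p, q)]


def pmul(p, q):
    """Product of two polynomials given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def trim(p):
    """Drop trailing zero coefficients, keeping at least one entry."""
    while len(p) > 1 and p[-1] == 0:
        p = p[:-1]
    return p


def mat_mul(A, B):
    """Product of matrices whose entries are polynomials."""
    out = []
    for i in range(len(A)):
        row = []
        for j in range(len(B[0])):
            entry = [0]
            for k in range(len(B)):
                entry = padd(entry, pmul(A[i][k], B[k][j]))
            row.append(trim(entry))
        out.append(row)
    return out


zero, one, z = [0], [1], [0, 1]
P = [[z, one], [zero, z]]
T1 = [[one, zero], [z, [-1]]]          # det T1 = -1
Dmat = [[one, zero], [zero, [0, 0, 1]]]  # diag(1, z^2)
T2 = [[z, one], [one, zero]]           # det T2 = -1
T2_inv = [[zero, one], [one, [0, -1]]]
identity = [[one, zero], [zero, one]]

print(mat_mul(mat_mul(T1, Dmat), T2) == P)
```

Here $d_{1}=1$ divides $d_{2}=z^{2}$ , and both $T_{1}$ and $T_{2}$ have constant determinant $-1$ , so their inverses are again matrix polynomials.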

Proof of Theorem 4.1.

The function $G(z)=z^{n}T(z)$ is a polynomial matrix and therefore has Smith normal form decomposition

$$G(z)=T_{1}(z)\begin{pmatrix}D(z)&0\\ 0&0\end{pmatrix}T_{2}(z).$$

Here $T_{1},T_{2}$ are matrix polynomials with matrix polynomial inverses while

$$D(z)=\text{diag}(d_{1}(z),\dots,d_{r}(z))$$

is an $r\times r$ diagonal matrix with only non-zero polynomials on the diagonal. Notice that $T(z)$ has rank $r$ whenever $\det D(z)\neq 0$ , $z\neq 0$ . Since $T$ is self-adjoint on $\mathbb{T}$ , we have $T(z)=T(1/\bar{z})^{*}$ for $z\neq 0$ and so

$$(T_{2}(1/\bar{z})^{*})^{-1}\,T(z)\,T_{2}(z)^{-1}\tag{10.1}$$

is a matrix Laurent polynomial which is positive semi-definite on $\mathbb{T}$ and with $0$ in the last $N-r$ columns and rows. Thus, (10.1) has the form $\begin{pmatrix}T_{0}(z)&0\\ 0&0\end{pmatrix}$ where $T_{0}$ is an $r\times r$ matrix Laurent polynomial which is positive semi-definite on $\mathbb{T}$ and crucially satisfying $\det T_{0}\not\equiv 0$ since $T$ has rank $r$ outside of a finite set.

By Theorem 10.2, there exists an $r\times r$ matrix polynomial $A_{0}$ such that $\det A_{0}(z)\neq 0$ in $\mathbb{D}$ and $A_{0}(z)^{*}A_{0}(z)=T_{0}(z)$ on $\mathbb{T}$ . If we set $V=T_{2}$ and

$$A(z)=\begin{pmatrix}A_{0}(z)&0\end{pmatrix}V(z)$$

then $A(z)^{*}A(z)=T(z)$ on $\mathbb{T}$ . Note that $A(1/\bar{z})^{*}A(z)=T(z)$ holds in $\mathbb{C}\setminus\{0\}$ since both sides are analytic and agree on $\mathbb{T}$ .

Our degree bound on $A$ follows from the fact that

$$z^{n}A(1/\bar{z})^{*}=z^{n}T(z)V(z)^{-1}\begin{pmatrix}A_{0}(z)^{-1}\\ 0\end{pmatrix}$$

is analytic at $0$ . A right rational inverse of $A$ is given by $V^{-1}\begin{pmatrix}A_{0}^{-1}\\ 0\end{pmatrix}$ . ∎

The matrix Fejér-Riesz factorization described is maximal in the sense of the following theorem. One can also describe all other factorizations. There is nothing essentially new about this result, but it is probably difficult to attribute. It could be deduced from inner-outer factorizations.

Theorem 10.4.

Assume the setup and notation of Theorem 4.1. For any other factorization $T=C^{*}C$ on $\mathbb{T}$ with a matrix polynomial $C$ , there exists a rational iso-inner function $\Phi$ such that $C=\Phi A$ (necessarily, $\Phi=CB$ ). If $C$ has a right rational inverse holomorphic in $\mathbb{D}$ then $\Phi$ is a constant unitary matrix.

Proof.

Suppose $T=C^{*}C$ on $\mathbb{T}$ . Then, we may write $CV^{-1}=\begin{pmatrix}C_{0}&C_{1}\end{pmatrix}$ where $C_{0}$ has $r$ columns. Since

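$$(V^{*})^{-1}TV^{-1}=\begin{pmatrix}C_{0}^{*}C_{0}&C_{0}^{*}C_{1}\\ C_{1}^{*}C_{0}&C_{1}^{*}C_{1}\end{pmatrix}=\begin{pmatrix}A_{0}^{*}A_{0}&0\\ 0&0\end{pmatrix}\quad\text{on }\mathbb{T},$$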

we see that $C_{0}^{*}C_{0}=A_{0}^{*}A_{0}$ , $C_{1}^{*}C_{1}=0$ on $\mathbb{T}$ . This implies $C_{1}\equiv 0$ . Then, $\Phi:=C_{0}A_{0}^{-1}$ is analytic on $\mathbb{D}$ and isometry-valued on $\mathbb{T}$ . Any poles on $\mathbb{T}$ are necessarily removable because $\Phi$ is rational and bounded on $\mathbb{T}$ . We also have $\Phi A=C$ . If $C$ has right rational inverse $C^{\prime}$ then $\Phi AC^{\prime}=I$ . An isometry can only have a right inverse if it is square, so $\Phi$ must be square (hence unitary on $\mathbb{T}$ ) and $AC^{\prime}$ must be unitary-valued on $\mathbb{T}$ . By the maximum principle, $\Phi$ and $AC^{\prime}$ are contractive in the disk; however, since they are inverses of each other they must be unitary-valued in the disk. Such analytic functions are constant. (Lemma 6.1 proves something more general than this.) ∎
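A scalar example makes the maximality statement concrete. Take $T(z)=5+2z+2z^{-1}$ , which factors on $\mathbb{T}$ both as $|2+z|^{2}$ and as $|1+2z|^{2}$ . The factor $A(z)=2+z$ has no zeros in $\mathbb{D}$ , while $C(z)=1+2z$ vanishes at $-1/2$ , and $\Phi=C/A$ is a Blaschke factor. The Python sketch below checks these identities at sample points; the example is an illustration chosen here, not one from the paper.

```python
import cmath


def T(z):
    """Scalar Laurent polynomial 5 + 2z + 2/z, nonnegative on the circle."""
    return 5 + 2 * z + 2 / z


def A(z):
    """Factor with no zeros in the unit disk (its zero is at -2)."""
    return 2 + z


def C(z):
    """Another factor of T on the circle, with a zero at -1/2."""
    return 1 + 2 * z


def Phi(z):
    """Phi = C / A, analytic in the disk since A has no zeros there."""
    return C(z) / A(z)


points = [cmath.exp(2j * cmath.pi * k / 7) for k in range(7)]
err_A = max(abs(T(z) - abs(A(z)) ** 2) for z in points)
err_C = max(abs(T(z) - abs(C(z)) ** 2) for z in points)
err_Phi = max(abs(abs(Phi(z)) - 1) for z in points)

print(err_A, err_C, err_Phi, abs(Phi(0)))
```

Since $C$ has a zero in $\mathbb{D}$ , it has no right rational inverse holomorphic in $\mathbb{D}$ , which is consistent with $\Phi$ being non-constant here.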

We now sketch a simple proof of Dritschel’s positive definite multivariable Fejér-Riesz result (Thm 6.3). Although it borrows elements from the original proof, we think it has some nice efficiencies in exposition.

Proof of Theorem 6.3.

Let $n$ be a positive integer and define the multivariable Cesaro summation operator $C_{n}$ which we apply to $N\times N$ matrix Laurent polynomials $L(z)=\sum_{k\in\mathbb{Z}^{d}}L_{k}z^{k}$

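$$C_{n}L(z):=\int_{\mathbb{T}^{d}}F_{n}(z,\zeta)L(\zeta)\,d\sigma(\zeta)$$

where

$$F_{n}(z,\zeta)=\prod_{j=1}^{d}\frac{1}{n+1}\left|\sum_{k=0}^{n}z_{j}^{k}\bar{\zeta}_{j}^{k}\right|^{2}$$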

is the Fejér kernel and $d\sigma$ is normalized Lebesgue measure on $\mathbb{T}^{d}$ .

Let $\mathcal{L}_{m}$ be the vector space of $N\times N$ Laurent polynomials of degree at most $m$ in each variable separately. We shall consider $C^{m}_{n}:=C_{n}|_{\mathcal{L}_{m}}:\mathcal{L}_{m}\to\mathcal{L}_{m}$ . By basic properties of Cesaro summation, $C^{m}_{n}L\to L$ uniformly on $\mathbb{T}^{d}$ as $n\to\infty$ for $L\in\mathcal{L}_{m}$ . Since the set of linear operators $B(\mathcal{L}_{m})$ on $\mathcal{L}_{m}$ is finite dimensional, $C^{m}_{n}$ tends to the identity as $n\to\infty$ with respect to any norm on $B(\mathcal{L}_{m})$ . In particular, for $n$ large enough $C^{m}_{n}$ is invertible and $(C^{m}_{n})^{-1}$ tends to the identity as $n\to\infty$ .
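To see the convergence concretely in one variable, note that Cesaro summation acts diagonally on monomials. Assuming the standard normalization in which $C_{n}$ multiplies $z^{k}$ by $(1-|k|/(n+1))_{+}$ , the following Python sketch computes these multipliers exactly; the function names are illustrative, and the normalization is an assumption about the convention rather than a quotation from the paper.

```python
from fractions import Fraction


def cesaro_multiplier(n, k):
    """Factor by which C_n scales z^k (one variable, assumed normalization)."""
    return max(Fraction(0), 1 - Fraction(abs(k), n + 1))


def is_invertible(n, m):
    """C_n restricted to degree <= m is invertible iff no multiplier is 0."""
    return all(cesaro_multiplier(n, k) != 0 for k in range(-m, m + 1))


def distance_to_identity(n, m):
    """Largest deviation of a multiplier from 1 over |k| <= m."""
    return max(abs(1 - cesaro_multiplier(n, k)) for k in range(-m, m + 1))


for n in (3, 9, 99):
    print(n, is_invertible(n, 3), distance_to_identity(n, 3))
```

In $d$ variables the multiplier of $z^{k}$ would be the product of the one variable factors, so the same conclusion holds coordinatewise.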

We next point out that if $L\in\mathcal{L}_{m}$ is positive semi-definite on $\mathbb{T}^{d}$ then $C^{m}_{n}L$ is a sum of squares. The reason is that on $\mathbb{T}^{d}$ , $F_{n}(z,\zeta)L(\zeta)$ is a Laurent polynomial of degree at most $n+m$ with respect to $\zeta$ . Then, the integral representation of $C_{n}L$ can be computed via “quadrature.” Indeed, for any $M$ , if $H\in\mathcal{L}_{M}$ and $\mu=e^{2\pi i/(M+1)}$ then

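$$\int_{\mathbb{T}^{d}}H\,d\sigma=\frac{1}{(M+1)^{d}}\sum_{j_{1},\dots,j_{d}=0}^{M}H(\mu^{j_{1}},\dots,\mu^{j_{d}}).$$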

This can be proven by testing on monomials. This means that $C_{n}L(z)$ is a positive finite linear combination of the terms $F_{n}(z,(\mu^{j_{1}},\dots,\mu^{j_{d}}))L(\mu^{j_{1}},\dots,\mu^{j_{d}})$ . Since $F_{n}$ is evidently a squared polynomial and each value of $L$ on $\mathbb{T}^{d}$ is assumed positive semi-definite, we see that $C_{n}L$ is a sum of squares of polynomials.
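The quadrature identity is easy to test in one variable, where it says that the average of $H$ over the $(M+1)$ -th roots of unity recovers the constant coefficient of $H$ whenever $H$ has degree at most $M$ . The Python sketch below checks this for a hypothetical Laurent polynomial and also shows the identity failing once the degree exceeds $M$ ; the coefficients and helper names are illustrative.

```python
import cmath


def laurent(coeffs):
    """Return z -> sum_k coeffs[k] * z**k for a dict {k: coefficient}."""
    return lambda z: sum(c * z ** k for k, c in coeffs.items())


def quadrature(H, M):
    """Average of H over the (M+1)-th roots of unity."""
    mu = cmath.exp(2j * cmath.pi / (M + 1))
    return sum(H(mu ** j) for j in range(M + 1)) / (M + 1)


# Degree at most 3, so 4 nodes integrate it exactly; the integral over
# the circle against normalized Lebesgue measure is the k = 0 coefficient.
H = laurent({-3: 2, -1: 1j, 0: 4.5, 2: -1, 3: 0.5})
exact = quadrature(H, 3)

# z^4 has degree 4 > 3: its integral is 0 but the 4-node rule returns 1.
aliased = quadrature(lambda z: z ** 4, 3)

print(exact, aliased)
```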

Now, let $T\in\mathcal{L}_{m}$ be strictly positive on $\mathbb{T}^{d}$ , i.e. there exists $\delta>0$ such that $T(z)\geq\delta I$ for $z\in\mathbb{T}^{d}$ . Since $(C^{m}_{n})^{-1}$ tends to the identity, $T_{n}:=(C^{m}_{n})^{-1}T$ converges to $T$ uniformly on $\mathbb{T}^{d}$ , so for $n$ large enough $T_{n}$ is also strictly positive. Then, $T=C_{n}T_{n}$ is a Cesaro sum of a positive Laurent polynomial, which was already shown to be a sum of squares. ∎

10.3. PSD kernels

We now discuss the proof of Lemma 3.2 which claims that for $S:\mathbb{D}\to\mathbb{C}^{M\times N}$ analytic and $\|S(z)\|\leq 1$ in $\mathbb{D}$ we have that

$$K_{S}(w,z):=\frac{I-S(w)^{*}S(z)}{1-\bar{w}z}$$

is positive semi-definite (PSD). Let us recall the abstract definition of PSD for matrix or operator-valued kernels.

Definition 10.5.

Let $X$ be a set, $\mathcal{L}$ a complex Hilbert space, and $K:X\times X\to B(\mathcal{L})$ a function; here $B(\mathcal{L})$ is the set of bounded linear self-maps of $\mathcal{L}$ . We say that $K$ is a PSD kernel if for any $x_{1},\dots,x_{n}\in X$ and $v_{1},\dots,v_{n}\in\mathcal{L}$ we have

$$\sum_{i,j=1}^{n}\langle K(x_{i},x_{j})v_{j},v_{i}\rangle_{\mathcal{L}}\geq 0.$$

Notice that if $(x,y)\mapsto K(x,y)$ is a PSD kernel, then $(x,y)\mapsto K(y,x)$ is not necessarily PSD except in the scalar case $\mathcal{L}=\mathbb{C}$ .

Definition 10.6.

The rank of $K$ is the maximum of the ranks of the block operators $(K(x_{i},x_{j}))_{i,j}$ as we vary over $n$ and $x_{1},\dots,x_{n}\in X$ .

Proof of Lemma 3.2.

Our proof uses rudiments of vector-valued Hardy spaces on the unit disk. See Agler-McCarthy [AMbook] for details.

Let $H_{M}=H^{2}(\mathbb{D})\otimes\mathbb{C}^{M}$ be the set of $M$ -dimensional column vectors with entries in the Hardy space on the unit disk $H^{2}(\mathbb{D})$ . Left multiplication by $S$ , $M_{S}:H_{N}\to H_{M}$ , is contractive. If $k_{w}(z)=k(z,w):=\frac{1}{1-\bar{w}z}$ is the Szegő kernel, then by a fundamental formula in reproducing kernel Hilbert space theory

$$M_{S}^{*}(k_{w}v)=k_{w}S(w)^{*}v$$

for $v\in\mathbb{C}^{M}$ . We see that

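$$\langle(I-M_{S}M_{S}^{*})k_{w}v,k_{z}u\rangle_{H_{M}}=u^{*}\,\frac{I-S(z)S(w)^{*}}{1-z\bar{w}}\,v\qquad(u,v\in\mathbb{C}^{M})$$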

which after a short calculation using the fact that $I-M_{S}M_{S}^{*}\geq 0$ shows $(z,w)\mapsto\frac{I-S(z)S(w)^{*}}{1-z\bar{w}}\text{ is PSD}.$ We could apply the same argument to $\breve{S}(z):=S(\bar{z})^{*}$ to see that $(z,w)\mapsto\frac{I-S(\bar{z})^{*}S(\bar{w})}{1-z\bar{w}}\text{ is PSD.}$ Replace $z,w$ with their conjugates and relabel the variables to see that $K_{S}(w,z)$ is PSD. ∎

Proof of Proposition 9.3.

Assuming $S:\mathbb{D}\to\mathbb{C}^{N\times N}$ is rational inner we need to compute the rank of the positive semi-definite kernel $(w,z)\mapsto\frac{I-S(w)^{*}S(z)}{1-\bar{w}z}.$ We shall use notation from the proof of Lemma 3.2 above. As in said proof, it is notationally easier to deal with the kernel

$$K(z,w)=\frac{I-S(z)S(w)^{*}}{1-z\bar{w}}$$

and we can reduce to this case by replacing $S$ with $S(\bar{z})^{*}$ .

Now, $K$ is the reproducing kernel for $H_{N}\ominus SH_{N}$ . This follows from the fact that $S$ is inner: $SH_{N}$ is a closed subspace of $H_{N}$ and has reproducing kernel

$$\frac{S(z)S(w)^{*}}{1-z\bar{w}}$$

which can be verified by the following calculation

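$$\langle Sf,\,S\,k_{w}S(w)^{*}v\rangle_{H_{N}}=\langle f,\,k_{w}S(w)^{*}v\rangle_{H_{N}}=v^{*}S(w)f(w)=\langle(Sf)(w),v\rangle_{\mathbb{C}^{N}}$$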

for $f\in H_{N}$ . The rank of $K$ is the dimension of $H_{N}\ominus SH_{N}$ .

To count this dimension we write $S=Q/p$ in lowest terms. Since $S$ is bounded on $\mathbb{T}$ it can have no poles on $\mathbb{T}$ , and therefore $p$ has no zeros in $\overline{\mathbb{D}}$ . Let $Q(z)=T_{1}(z)D(z)T_{2}(z)$ be the Smith normal form decomposition for $Q$ (Theorem 10.3 above). Notice that $D$ has full rank on $\mathbb{T}$ since $S$ is inner. Write $D=\text{diag}(d_{1},\dots,d_{N})$ . Then, $\det Q=c\det D=c\prod_{j}d_{j}$ where $c=\det T_{1}\det T_{2}$ is a constant because $T_{1},T_{2}$ have polynomial inverses. Since $S$ is inner $\det S=\frac{\det Q}{p^{N}}$ is a finite Blaschke product. Its degree equals its number of zeros in $\mathbb{D}$ which equals the number of zeros of $\det Q$ in $\mathbb{D}$ since $p$ has none.

The vector space $H_{N}\ominus SH_{N}$ is isomorphic to the vector space quotient

$$H_{N}/SH_{N}=H_{N}/QH_{N}=H_{N}/T_{1}DH_{N}\cong H_{N}/DH_{N}.$$

The first equality holds because $p$ has no zeros in $\overline{\mathbb{D}}$ , the second holds because $T_{2}$ has a polynomial inverse, and the last isomorphism holds because $T_{1}$ has a polynomial inverse. Recalling $D=\text{diag}(d_{1},\dots,d_{N})$ we note the dimension of $H^{2}/d_{j}H^{2}$ is the number of zeros of $d_{j}$ in $\mathbb{D}$ and therefore the dimension of $H_{N}/DH_{N}$ is the number of zeros of $\prod_{j=1}^{N}d_{j}$ inside $\mathbb{D}$ (counting multiplicities). ∎

This proof appears in [BickelKnese].
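The rank count can be observed numerically in the scalar case. For the degree two Blaschke product $S(z)=z\,\frac{z-a}{1-\bar{a}z}$ , the argument above gives rank $2$ for the kernel $\frac{1-S(z)\overline{S(w)}}{1-z\bar{w}}$ , so every $3\times 3$ Gram matrix should be singular while some $2\times 2$ minor is non-zero. The Python sketch below checks this at three sample points; the value $a=1/2$ , the points, and the helper names are arbitrary illustrative choices.

```python
def blaschke(z, a=0.5):
    """Degree-2 Blaschke product z * (z - a) / (1 - conj(a) * z)."""
    a = complex(a)
    return z * (z - a) / (1 - a.conjugate() * z)


def kernel(z, w):
    """Scalar kernel (1 - S(z) conj(S(w))) / (1 - z conj(w))."""
    z, w = complex(z), complex(w)
    return (1 - blaschke(z) * blaschke(w).conjugate()) / (1 - z * w.conjugate())


def det3(G):
    """Determinant of a 3 x 3 matrix by cofactor expansion."""
    return (G[0][0] * (G[1][1] * G[2][2] - G[1][2] * G[2][1])
            - G[0][1] * (G[1][0] * G[2][2] - G[1][2] * G[2][0])
            + G[0][2] * (G[1][0] * G[2][1] - G[1][1] * G[2][0]))


points = [0, 0.3, -0.4 + 0.2j]
gram = [[kernel(z, w) for w in points] for z in points]

# Leading 2 x 2 minor: non-zero, so the rank is at least 2.
minor = gram[0][0] * gram[1][1] - gram[0][1] * gram[1][0]

print(abs(minor), abs(det3(gram)))
```

A vanishing determinant at one set of points is only a sanity check on the rank, not a proof of it.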
