Maximum Principles for Matrix-Valued Analytic Functions

Alberto A. Condori

arXiv:1901.07171·math.CV·January 23, 2019·Am. Math. Mon.

TL;DR

This paper explores maximum modulus principles for matrix-valued analytic functions, extending classical scalar results to matrices, and discusses their implications for singular values, resolvents, and matrix exponentials.

Contribution

It introduces new maximum norm principles for matrix-valued analytic functions and derives maximum and minimum principles for their singular values.

Findings

01 Maximum norm principles for matrix-valued functions are established.

02 Maximum and minimum principles for singular values are deduced.

03 Observations on resolvents and matrix exponentials are provided.

Abstract

To what extent is the maximum modulus principle for scalar-valued analytic functions valid for matrix-valued analytic functions? In response, we discuss some maximum norm principles for such functions that do not appear to be widely known, deduce maximum and minimum principles for their singular values, and make some observations concerning resolvents and matrix exponentials.

Equations

$$\|T\|_{\mathcal{F}}^{2}=\operatorname{trace}(T^{*}T)\quad\text{for }T\in\mathbb{M}_{n},\tag{1}$$

$$F(z)=\left[\begin{array}{cc}1&0\\ 0&g(z)\end{array}\right].\tag{2}$$

$$\|F(z)\|^{2}=\max\{1,|g(z)|^{2}\}=1\quad\text{for all }z\in\mathbb{D},$$

$$|\Lambda(F(z))|\leq\|F(z)\|\leq\|F(z_{0})\|=|\Lambda(F(z_{0}))|.$$

$$\|F(z)\|\geq\Lambda(F(z))=\Lambda(F(z_{0}))=\|F(z_{0})\|\geq\|F(z)\|\quad\text{for all }z\in\Omega.$$

$$F(z)=\sum_{k=0}^{\infty}C_{k}(z-z_{0})^{k},\tag{3}$$

$$\|F(z)x\|^{2}=\sum_{j,k\geq 0}(z-z_{0})^{j}(\overline{z-z_{0}})^{k}\langle C_{j}x,C_{k}x\rangle$$

$$\frac{1}{2\pi}\int_{0}^{2\pi}\|F(z_{0}+re^{it})x\|^{2}\,dt=\sum_{k=0}^{\infty}\|C_{k}x\|^{2}r^{2k}\tag{4}$$

$$\sum_{k=0}^{\infty}\|C_{k}x\|^{2}r^{2k}=\frac{1}{2\pi}\int_{0}^{2\pi}\|F(z_{0}+re^{it})x\|^{2}\,dt\leq\|C_{0}\|^{2}\|x\|^{2}\tag{5}$$

$$\|C_{0}\|^{2}+\sum_{k=1}^{\infty}\|C_{k}x_{0}\|^{2}r^{2k}\leq\|C_{0}\|^{2}\|x_{0}\|^{2}=\|C_{0}\|^{2},$$

$$\|F(z_{0})\|=\|F(z_{0})x_{0}\|=\|F(z)x_{0}\|\leq\|F(z)\|\quad\text{for all }z\in\Omega.$$

$$F(z)=U\left[\begin{array}{cc}\|F(z_{0})\|&0\\ 0&G(z)\end{array}\right]V.$$

$$F(z)=\left[\begin{array}{cc}1&0\\ 0&g(z)\end{array}\right]$$

$$\|y_{0}\|^{2}=\|x_{0}\|^{2}=1\quad\text{and}\quad y_{0}^{*}F(z)x_{0}=y_{0}^{*}F(z_{0})x_{0}=1\quad\text{for all }z\in\Omega.$$

$$Y_{0}^{*}F(z)X_{0}=\left[\begin{array}{cc}a_{1,1}(z)&a_{1,2}(z)\\ a_{2,1}(z)&a_{2,2}(z)\end{array}\right],$$

$$a_{1,2}(z)=0\quad\text{and}\quad a_{2,1}(z)=0\quad\text{for all }z\in\Omega$$

$$F(z)=Y_{0}\left[\begin{array}{cc}1&0\\ 0&a_{2,2}(z)\end{array}\right]X_{0}^{*}$$

$$\|Ax\|=\|A\|\cdot\|x\|\quad\text{if and only if}\quad A^{*}Ax=\|A\|^{2}x.\tag{7}$$

$$s_{1}(A)\geq s_{2}(A)\geq\dots\geq s_{n}(A).$$

$$F(z)=U_{1}\left[\begin{array}{cc}s_{1}(F(z_{1}))&0\\ 0&F_{1}(z)\end{array}\right]V_{1}.$$

$$\left[\begin{array}{cccc}s_{1}(F(z_{1}))&\ldots&0&0\\ \vdots&\ddots&\vdots&\vdots\\ 0&\ldots&s_{r}(F(z_{r}))&0\\ 0&\ldots&0&F_{r}(z)\end{array}\right]$$

$$F(z)=\left[\begin{array}{cc}1&0\\ 0&z^{-1}\end{array}\right],$$

$$d=\dim\{x\in\mathbb{C}^{n}:\|F(z_{0})x\|=\|F(z_{0})\|\cdot\|x\|\}.\tag{9}$$

$$F(z)=\|F(z_{0})\|\,U\cdot V\quad\text{when }d=n,$$

$$F(z)=U\left[\begin{array}{cc}\|F(z_{0})\|\cdot I_{d}&0\\ 0&R(z)\end{array}\right]V\quad\text{when }d<n.\tag{10}$$

$$s_{1}(F(z_{1}))=s_{d_{1}}(F(z_{1})),$$

$$s_{d_{1}+1}(F(z_{2}))=s_{d_{2}}(F(z_{2})),$$

$$\left[\begin{array}{ccc}s_{d_{1}}(F(z_{1}))\cdot I_{d_{1}}&0&0\\ 0&s_{d_{2}}(F(z_{2}))\cdot I_{d_{2}}&0\\ 0&0&*\end{array}\right].$$

$$\left[\begin{array}{cccc}s_{d_{1}}(F(z_{1}))\cdot I_{d_{1}}&0&\ldots&0\\ 0&s_{d_{2}}(F(z_{2}))\cdot I_{d_{2}}&\ldots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\ldots&s_{d_{\kappa}}(F(z_{\kappa}))\cdot I_{d_{\kappa}}\end{array}\right],$$

$$\|G(z)\|\leq\|F(z_{0})\|\quad\text{for all }z\in\Omega.$$

Full text

1. Introduction.

The maximum modulus principle (MMP) is a fundamental result in complex analysis. It is often used to deduce other important results such as the fundamental theorem of algebra, the open mapping theorem (i.e., analytic functions map open sets to open sets), Schwarz’s lemma, the Phragmén–Lindelöf principle, etc. One of its various formulations states that if $f$ is a scalar-valued function, analytic on a region $\Omega$ (i.e., a nonempty open connected subset) of the complex plane $\mathbb{C}$, whose modulus attains a local maximum in $\Omega$, then $f$ is constant on $\Omega$. For a proof of the MMP, we refer the reader to [7, Chapter 10].

Many differential equations encountered in science and engineering lead to the consideration of matrix-valued functions, that is, functions with range in the set $\mathbb{M}_{n}$ of $n\times n$ matrices, $n>1$ , with entries in $\mathbb{C}$ . For instance, the standard model of an RLC circuit in electrical engineering admits the formulation $x^{\prime}(t)=A\cdot x(t)$ , where $A\in\mathbb{M}_{n}$ and $x$ is a function with values in $\mathbb{C}^{n}$ . The vector-valued solutions $x(t)=\exp(tA)x_{0}$ to such an equation (with $x_{0}\in\mathbb{C}^{n}$ ) depend on the matrix-valued function $t\mapsto\exp(tA)=\sum_{k=0}^{\infty}A^{k}t^{k}/k!$ , and the decay of these solutions is controlled by its operator norm $\|\exp(tA)\|$ . As usual, $\|T\|=\sup\{\|Tv\|_{\mathbb{C}^{n}}:\|v\|_{\mathbb{C}^{n}}=1\}$ is the operator norm of $T\in\mathbb{M}_{n}$ induced by the Euclidean norm on $\mathbb{C}^{n}$ , namely $\|v\|_{\mathbb{C}^{n}}=\left(|v_{1}|^{2}+\cdots+|v_{n}|^{2}\right)^{1/2}$ when $v=(v_{1},\ldots,v_{n})$ .

In linear algebra, too, matrix-valued functions arise (implicitly) in the study of eigenvalues, i.e., the spectrum $\sigma(A)$ of $A\in\mathbb{M}_{n}$. After all, $\lambda\in\mathbb{C}$ satisfies $Av=\lambda v$ for some nonzero vector $v\in\mathbb{C}^{n}$ if and only if the resolvent function $z\mapsto(A-zI)^{-1}$ has a singularity at $z=\lambda$, i.e., $A-\lambda I$ is not invertible. (Throughout, $I=I_{n}$ denotes the identity in $\mathbb{M}_{n}$.) Since the spectrum is often insufficient for the analysis of non-normal matrices (see [8]), focus has shifted to the study of the norm of the resolvent $\|(A-zI)^{-1}\|$. (Equivalently, one may study the so-called “pseudospectra” of $A$; for an overview of that subject, see [9].) For instance, the norm of the resolvent alone characterizes when $A$ is a normal matrix [1].

Thus, it is of interest to study the (operator) norms of the matrix-valued functions $\exp(zA)$ and $(A-zI)^{-1}$. As can be expected, these functions are analytic on regions of $\mathbb{C}$, namely the entire plane $\mathbb{C}$ and $\mathbb{C}\backslash\sigma(A)$, respectively. (Recall that a function $F:\Omega\to\mathbb{M}_{n}$ is analytic if, for each $z_{0}\in\Omega$, there is a member of $\mathbb{M}_{n}$, denoted by $F^{\prime}(z_{0})$, such that $\|(z-z_{0})^{-1}\{F(z)-F(z_{0})\}-F^{\prime}(z_{0})\|\to 0$ as $z\to z_{0}$. It can be shown that $F:\Omega\to\mathbb{M}_{n}$ is analytic if and only if $F$ is “entry-wise analytic,” i.e., every entry of $F(z)$ is an analytic function on $\Omega$.) The fact that these functions are analytic leads one to question the extent to which the MMP for scalar-valued functions is valid for $\|F(z)\|$, where $F$ is any matrix-valued analytic function. The purpose of this article is to find sufficient conditions, say involving the norm of a matrix-valued analytic function, that guarantee that the function is constant.

In Section 2, we state and discuss some maximum norm principles for matrix-valued analytic functions. Although it has long been known that a direct analog of the MMP fails in the context of matrix-valued functions in which the operator norm plays the role of the modulus, we find a suitable analog. Stated roughly, if $F:\Omega\to\mathbb{M}_{n}$ is analytic and $\|F(z)\|$ attains a maximum at some $z_{0}\in\Omega$, then there is a direction in which $F(z)$ is constant (although $F(z)$ itself need not be), namely that of any maximizing vector of $F(z_{0})$ (see Theorem 3). We rediscovered this result, originally noted by Brown and Douglas in [2], and use it to describe the structure of the function $F(z)$ (see Theorem 4). Since the result lends itself to iteration, we make natural assumptions on the function’s singular values and explore the consequences further in Section 3. One of the section’s main results (see Corollary 6) shows that two apparently distinct statements are each equivalent to the single statement that the matrix function $F(z)$ is constant: the Frobenius norm of $F(z)$ attains a maximum, and every singular value of $F(z)$ attains a maximum (at possibly distinct points).

Once the maximum singular-values principle is established in Section 3, we proceed to prove a minimum singular-values principle in Section 4. That result (Theorem 9) is, in a sense, an analog of the well-known minimum modulus principle of complex analysis in the context of matrix-valued functions. Finally, in Section 5, we discuss the implications of our results in the context of the resolvent and the matrix exponential which involve their largest and smallest singular values.

It is worth mentioning that analytic matrix-valued functions appear in many other areas such as the harmonic analysis of operators on a Hilbert space (e.g., finite-rank perturbations of self-adjoint and unitary operators), and consequently in mathematical physics (e.g., Schrödinger operators); roughly, problems concerning spectral properties of an operator are often solved through the consideration of an analytic matrix-valued function defined on the upper-half plane, i.e. the so-called “characteristic function.” Due to the scope of the paper, the reader is referred to the survey [6] and all references therein for further details.

We also remark that the results of this article could be written in the more general framework of operator-valued functions $F:\Omega\to\mathcal{B}(H)$ , where $H$ is a complex Hilbert space, or that of vector-valued functions $F:\Omega\to B$ , where $B$ is a complex Banach space. However, all statements in this article are kept in the context of matrix-valued functions so that the results are easier to read and appeal to a wider audience.

2. Maximum norm principles.

To find a suitable analog of the MMP for matrix-valued functions, it is reasonable to first test whether known proofs of the MMP can be easily adapted when replacing modulus with operator norm. One such proof of the MMP appears in [7, Chapter 10]. In it, the identity $|w|^{2}=w\bar{w}$ ($w\in\mathbb{C}$) appears, and although the operator norm of $T\in\mathbb{M}_{n}$ does not readily provide a direct analog for $\|T\|^{2}$, the Frobenius norm does. In fact,

$$\|T\|_{\mathcal{F}}^{2}=\operatorname{trace}(T^{*}T)\quad\text{for }T\in\mathbb{M}_{n},\tag{1}$$

where $\|T\|_{\mathcal{F}}$ is the Frobenius (Hilbert–Schmidt) norm of $T$ and, as usual, $T^{*}$ denotes the conjugate transpose of the matrix $T$. An argument analogous to the proof of the MMP in [7] (that relies on (1)) gives the following result.

Theorem 1 (Maximum Frobenius Norm Principle).

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be analytic. If $\|F(z)\|_{\mathcal{F}}$ assumes its maximum at some $z_{0}\in\Omega$ , then $F(z)=F(z_{0})$ for all $z\in\Omega$ .

Although Theorem 1 provides a direct analog of the MMP for matrix-valued functions, in applications it is the operator norm that is of interest, not the Frobenius norm. Unfortunately, the conclusion of Theorem 1 need not hold when the Frobenius norm is replaced by another matrix norm. For example, let $\mathbb{D}$ denote the open unit disk centered at the origin, let $g:\mathbb{D}\to\mathbb{D}$ be a nonconstant analytic function (e.g., $g(z)=z$), and consider the $2\times 2$ matrix-valued function

$$F(z)=\left[\begin{array}{cc}1&0\\ 0&g(z)\end{array}\right].\tag{2}$$

Notice that the operator norm of $F(z)$ satisfies

$$\|F(z)\|^{2}=\max\{1,|g(z)|^{2}\}=1\quad\text{for all }z\in\mathbb{D},$$

even though $F(z)$ is not a constant function. Nevertheless, one can prove a weakened version for any norm.

Theorem 2 (Maximum Norm Principle).

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be analytic. If $\|F(z)\|$ attains its maximum in $\Omega$ , then $\|F(z)\|$ is constant on $\Omega$ .
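The example in (2) is easy to check numerically. The following minimal sketch assumes the particular choice $g(z)=z$ and a few arbitrarily chosen sample points in $\mathbb{D}$; the helper names are hypothetical and only the Python standard library is used. It evaluates both the operator norm and the Frobenius norm of $F(z)$.

```python
import math

def F(z):
    """Example (2) with the assumed choice g(z) = z."""
    return [[1, 0], [0, z]]

def operator_norm_diagonal(M):
    """Operator norm of a diagonal matrix: the largest modulus on the diagonal."""
    return max(abs(M[i][i]) for i in range(len(M)))

def frobenius_norm(M):
    """Square root of the sum of the squared moduli of all entries."""
    return math.sqrt(sum(abs(entry) ** 2 for row in M for entry in row))

# Arbitrary sample points in the open unit disk (illustrative only).
points = [0, 0.5, -0.3 + 0.4j, 0.9j]

operator_norms = [operator_norm_diagonal(F(z)) for z in points]
frobenius_norms = [frobenius_norm(F(z)) for z in points]

for z, op, fro in zip(points, operator_norms, frobenius_norms):
    print(f"z = {z}: operator norm = {op}, Frobenius norm = {fro:.4f}")
```

At every sample point the operator norm equals $1$, as Theorem 2 predicts for a norm that attains its maximum, even though $F$ is not constant. The Frobenius norm $\sqrt{1+|z|^{2}}$ grows with $|z|$, so it attains no maximum in $\mathbb{D}$ and Theorem 1 is not contradicted.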

Theorem 2 is well known and a proof can be found in [4, Section III.14]; we provide a different short proof based on a well-known consequence of the Hahn–Banach theorem on linear functionals, namely if $X$ is any normed space and $x\in X$ is nonzero, then there is a bounded linear functional $\Lambda$ on $X$ such that $\|\Lambda\|=1$ and $\Lambda(x)=\|x\|$ . For further details and a simple proof of this fact, see [7, Chapter 5].

Proof of Theorem 2.

Assume there is a $z_{0}\in\Omega$ such that $\|F(z)\|\leq\|F(z_{0})\|$ for all $z\in\Omega$ and, without loss of generality, that $\|F(z_{0})\|\neq 0$ . Then we can choose a bounded linear functional $\Lambda:\mathbb{M}_{n}\to\mathbb{C}$ of norm $1$ so that $\|F(z_{0})\|=\Lambda(F(z_{0}))$ . By continuity of $\Lambda$ and analyticity of $F$ , $\Lambda(F(z))$ defines an analytic function on $\Omega$ , and

$$|\Lambda(F(z))|\leq\|F(z)\|\leq\|F(z_{0})\|=|\Lambda(F(z_{0}))|.$$

It follows now from the usual MMP that $\Lambda(F(z))$ must be constant throughout $\Omega$ and

$$\|F(z)\|\geq\Lambda(F(z))=\Lambda(F(z_{0}))=\|F(z_{0})\|\geq\|F(z)\|\quad\text{for all }z\in\Omega.$$

Thus, $\|F(z)\|=\|F(z_{0})\|$ for all $z\in\Omega$ . ∎

The conclusion of the maximum norm principle above may be seen as unsatisfactory because it gives limited information about the structure of $F(z)$ itself. This is not at all surprising; after all, the theorem holds for any norm. So, from now on we use the operator norm exclusively in an effort to gain more information about the function $F$ .

A useful property of the operator norm of a matrix is that given any $A\in\mathbb{M}_{n}$ , there is a unit vector $x_{0}\in\mathbb{C}^{n}$ , called a maximizing vector for $A$ , so that $\|Ax_{0}\|=\|A\|$ ; in other words, matrices attain their operator norm at some vector in the unit ball of $\mathbb{C}^{n}$ . This is a consequence of the compactness of the closed unit ball of $\mathbb{C}^{n}$ .

Recently, we rediscovered a maximum operator norm principle due to Brown and Douglas. In [2, Theorem 4], the authors proved that if $F(z)$ is a nonconstant matrix-valued analytic function whose operator norm attains its maximum, then there is a direction $x_{0}$ in which $F(z)x_{0}$ is constant. Our version reads as follows.

Theorem 3 (Maximum Operator Norm Principle, cf. [2]).

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be analytic. If there is a $z_{0}\in\Omega$ so that $\|F(z)\|\leq\|F(z_{0})\|$ for all $z\in\Omega$ and $x_{0}$ is a maximizing vector for $F(z_{0})$ , then $F^{(k)}(z_{0})x_{0}=0$ for every $k\geq 1$ . In particular, $F(z)x_{0}$ is constant on $\Omega$ .

The conclusion of Theorem 3 here is, at first sight, a slight improvement to that in Theorem 4 (part (1)) of [2]; after all, using a series expansion of $F(z)$, the condition $F^{(k)}(z_{0})x_{0}=0$ for $k\geq 1$ easily implies that $F(z)x_{0}$ is constant on $\Omega$. In fact, the reverse implication is also true and a justification can be made using series, too. On the other hand, although our series proof of Theorem 3 below is not as short as that of Brown and Douglas, it elucidates the consideration of maximizing vectors $x_{0}$ (see (5) below). It is worth mentioning that our version of Theorem 3 also complements a result due to Daniluk in [3].

Proof of Theorem 3.

Let $R>0$ be such that $D(z_{0};R)\subseteq\Omega$ . Then $F(z)$ admits a power series representation on $D(z_{0};R)$ , say

$$F(z)=\sum_{k=0}^{\infty}C_{k}(z-z_{0})^{k},\tag{3}$$

where $C_{k}\in\mathbb{M}_{n}$ for $k\geq 0$ . For any vector $x$ ,

$$\|F(z)x\|^{2}=\sum_{j,k\geq 0}(z-z_{0})^{j}(\overline{z-z_{0}})^{k}\langle C_{j}x,C_{k}x\rangle$$

by continuity of the inner product and so

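$$\frac{1}{2\pi}\int_{0}^{2\pi}\|F(z_{0}+re^{it})x\|^{2}\,dt=\sum_{k=0}^{\infty}\|C_{k}x\|^{2}r^{2k}\tag{4}$$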

for any $r\in(0,R)$ .

Now, since $\|F(z)\|\leq\|F(z_{0})\|=\|C_{0}\|$ for all $z\in\Omega$ , it follows from (4) that

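$$\sum_{k=0}^{\infty}\|C_{k}x\|^{2}r^{2k}=\frac{1}{2\pi}\int_{0}^{2\pi}\|F(z_{0}+re^{it})x\|^{2}\,dt\leq\|C_{0}\|^{2}\|x\|^{2}\tag{5}$$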

for any vector $x$ and $r\in(0,R)$. Let $x_{0}$ be a maximizing vector for $C_{0}$. Replacing $x$ by $x_{0}$ in (5) and using $\|C_{0}x_{0}\|=\|C_{0}\|$, we conclude

$$\|C_{0}\|^{2}+\sum_{k=1}^{\infty}\|C_{k}x_{0}\|^{2}r^{2k}\leq\|C_{0}\|^{2}\|x_{0}\|^{2}=\|C_{0}\|^{2},$$

so $C_{k}x_{0}=0$, and hence $F^{(k)}(z_{0})x_{0}=k!\,C_{k}x_{0}=0$, for every $k\geq 1$. In particular, by (3), $F(z)x_{0}=C_{0}x_{0}=F(z_{0})x_{0}$ for all $z\in D(z_{0};R)$ and so, by the identity theorem (e.g., [7, Theorem 10.18]), $F(z)x_{0}=F(z_{0})x_{0}$ for all $z\in\Omega$. ∎
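The mean-value identity (4) that drives this proof can be sanity-checked numerically. The sketch below is only an illustration under stated assumptions: $F$ is taken to be a quadratic matrix polynomial about $z_{0}=0$ with made-up coefficients `C`, and the vector `x`, the radius `r`, and the sample count `N` are arbitrary. The integral over the circle is replaced by an average over `N` equally spaced points, which is exact (up to rounding) for polynomials of degree less than `N`.

```python
import cmath
import math

def mat_vec(M, x):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(M[i][j] * x[j] for j in range(len(x))) for i in range(len(M))]

def norm_sq(v):
    """Squared Euclidean norm of a complex vector."""
    return sum(abs(c) ** 2 for c in v)

# Hypothetical Taylor coefficients C_0, C_1, C_2 of F about z_0 = 0.
C = [
    [[1, 0], [0, 0.5]],
    [[0, 1j], [0.25, 0]],
    [[0.5, 0], [0, -1]],
]
x = [0.6, 0.8j]   # arbitrary test vector
r = 0.7           # radius of the circle
N = 64            # number of equally spaced sample points

# Precompute C_k x so that F(z) x = sum_k (C_k x) z^k.
Cx = [mat_vec(Ck, x) for Ck in C]

def F_times_x(z):
    """Evaluate the vector F(z) x from the precomputed vectors C_k x."""
    return [sum(v[i] * z ** k for k, v in enumerate(Cx)) for i in range(len(x))]

# Left side of (4): mean of ||F(r e^{it}) x||^2 over the circle (discretized).
circle_mean = sum(
    norm_sq(F_times_x(r * cmath.exp(2j * math.pi * m / N))) for m in range(N)
) / N

# Right side of (4): sum over k of ||C_k x||^2 r^{2k}.
series_sum = sum(norm_sq(v) * r ** (2 * k) for k, v in enumerate(Cx))

print(circle_mean, series_sum)
```

The two printed numbers should agree to within floating-point rounding, reflecting the fact that the cross terms with $j\neq k$ average to zero over the circle.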

Remark.

Note that the conclusion of Theorem 3 alone implies that $\|F(z)\|$ has a minimum at $z_{0}$ ; after all, if $z\mapsto F(z)x_{0}$ is constant on $\Omega$ for some maximizing vector $x_{0}$ of $F(z_{0})$ , then

$$\|F(z_{0})\|=\|F(z_{0})x_{0}\|=\|F(z)x_{0}\|\leq\|F(z)\|\quad\text{for all }z\in\Omega.$$

Hence, the conclusion of Theorem 3 is stronger than that of the maximum norm principle (when using the operator norm) because it implies that any maximizing vector $x_{0}$ for $F(z_{0})$ is also a maximizing vector for $F(z)$ , and $F(z)$ has constant norm equal to that of $F(z_{0})$ for all $z\in\Omega$ .

The observation made in the remark leads one to the following factorization.

Theorem 4.

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be analytic. If there is a $z_{0}\in\Omega$ so that $\|F(z)\|\leq\|F(z_{0})\|$ for all $z\in\Omega$, then there are $n\times n$ (constant) unitary matrices $U$ and $V$, and an analytic function $G:\Omega\to\mathbb{M}_{n-1}$, such that

$$F(z)=U\left[\begin{array}{cc}\|F(z_{0})\|&0\\ 0&G(z)\end{array}\right]V.$$

(Recall that $A\in\mathbb{M}_{n}$ is said to be unitary if $A^{*}A=AA^{*}=I$.)

Roughly, in the case of $2\times 2$ matrices, Theorem 4 states that when $F(z)$ is nonconstant, analytic, and achieves its maximum operator norm, say equal to $1$ , at a point of a region, then there is a nonconstant analytic function $g:\Omega\to\mathbb{D}$ such that

$$F(z)=\left[\begin{array}{cc}1&0\\ 0&g(z)\end{array}\right]$$

up to multiplication by (constant) unitary matrices on the right and the left. Hence, in a sense, the example given in (2) is essentially the only example of a nonconstant $2\times 2$ matrix function whose operator norm achieves a maximum value of $1$ .

Proof of Theorem 4.

Without loss of generality, we assume $\|F(z_{0})\|=1$ . By Theorem 3, if $x_{0}$ is a maximizing vector for $F(z_{0})$ , then the vector function $z\mapsto F(z)x_{0}$ is constant on $\Omega$ . Recalling that $\|v\|_{\mathbb{C}^{n}}^{2}=v^{*}v$ for any $v\in\mathbb{C}^{n}$ and choosing $y_{0}=F(z_{0})x_{0}$ , we obtain

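$$\|y_{0}\|^{2}=\|x_{0}\|^{2}=1\quad\text{and}\quad y_{0}^{*}F(z)x_{0}=y_{0}^{*}F(z_{0})x_{0}=1\quad\text{for all }z\in\Omega.$$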

Let $X_{0}$ and $Y_{0}$ be (constant) $n\times n$ unitary matrices whose first columns are $x_{0}$ and $y_{0}$ , respectively. Then, in matrix blocks,

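$$Y_{0}^{*}F(z)X_{0}=\left[\begin{array}{cc}a_{1,1}(z)&a_{1,2}(z)\\ a_{2,1}(z)&a_{2,2}(z)\end{array}\right],$$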

where $a_{1,1}(z)=y_{0}^{*}F(z)x_{0}=1$ . Furthermore, as $X_{0}$ and $Y_{0}$ are unitary, $\|Y_{0}^{*}F(z)X_{0}\|=\|F(z)\|=1$ (or, alternatively, this follows by the remark following the proof of Theorem 3). This implies that

$$a_{1,2}(z)=0\quad\text{and}\quad a_{2,1}(z)=0\quad\text{for all }z\in\Omega$$

because the operator norm of an $n\times n$ matrix is an upper bound on the Euclidean (vector) norm of its columns and rows. In other words, the assumptions on $F(z)$ imply the existence of $n\times n$ constant unitary matrices $X_{0}$ and $Y_{0}$ so that

$$F(z)=Y_{0}\left[\begin{array}{cc}1&0\\ 0&a_{2,2}(z)\end{array}\right]X_{0}^{*}$$

where $a_{2,2}(z)$ is an analytic $(n-1)\times(n-1)$ matrix-valued function. Thus, the desired conclusion follows with $U=Y_{0}$ , $V=X_{0}^{*}$ , and $G(z)=a_{2,2}(z)$ . ∎

3. Maximum singular value principles.

An attractive feature of Theorem 4 is that it lends itself to iteration. Indeed, the lower right block $G(z)$ in the factorization of Theorem 4 may very well satisfy the assumptions of Theorem 4 just as $F(z)$ did. In this section, we explore this situation and its consequences, but first review some basic terminology and results concerning singular values.

We begin with the observation that the maximizing vectors for a matrix $A$ admit the characterization that $x_{0}$ is a maximizing vector for $A\in\mathbb{M}_{n}$ if and only if $x_{0}$ has norm $1$ and $A^{*}Ax_{0}=\|A\|^{2}x_{0}$ . More generally, for a vector $x$ (whether it has norm $1$ or not),

$$\|Ax\|=\|A\|\cdot\|x\|\quad\text{if and only if}\quad A^{*}Ax=\|A\|^{2}x.\tag{7}$$

A proof of (7) can be based on the fact that every positive semi-definite matrix has a unique positive semi-definite square root (e.g., see [5, Theorem 7.2.6]). To that end, first note that the inequality $\|Av\|\leq\|A\|\cdot\|v\|$, valid for all vectors $v$, is equivalent to stating that the matrix $\|A\|^{2}I-A^{*}A$ is positive semi-definite. So, $\|Ax\|=\|A\|\cdot\|x\|$ holds if and only if $\|(\|A\|^{2}I-A^{*}A)^{1/2}x\|=0$, or equivalently, $(\|A\|^{2}I-A^{*}A)x=0$. Hence, $x_{0}$ is a maximizing vector of $A$ if and only if it is an eigenvector of $A^{*}A$ of norm $1$ associated with the eigenvalue $\|A\|^{2}$, i.e., (7) holds.

The role played in Theorem 3 by maximizing vectors for a matrix and their alternative characterization as eigenvectors lead directly to the consideration of singular values.

Recall that the singular values $s_{k}(A)$ , $1\leq k\leq n$ , of an $n\times n$ matrix $A$ are the nonnegative square roots of the eigenvalues of $A^{*}A$ ordered in the nonincreasing order, that is,

$$s_{1}(A)\geq s_{2}(A)\geq\dots\geq s_{n}(A).$$

In particular, $s_{1}(A)=\|A\|$ (see (7)) and $s_{1}^{2}(A)+s_{2}^{2}(A)+\cdots+s_{n}^{2}(A)=\|A\|_{\mathcal{F}}^{2}$ . The latter can be deduced using any singular value decomposition (SVD) of $A$ (e.g., [5, Theorem 7.35]) and (1).
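For $2\times 2$ matrices these facts can be checked by hand or with a few lines of code. The sketch below is an illustration, not part of the article: it uses the standard closed form for the singular values of a $2\times 2$ matrix (from $s_{1}^{2}+s_{2}^{2}=\|A\|_{\mathcal{F}}^{2}$ and $s_{1}s_{2}=|\det A|$), an arbitrarily chosen matrix `A`, and hypothetical helper names. It builds an eigenvector `v` of $A^{*}A$ for the eigenvalue $s_{1}^{2}$ and confirms, as in (7), that $\|Av\|=s_{1}\|v\|$, whereas a generic vector `w` gives a strictly smaller ratio.

```python
import math

def singular_values_2x2(A):
    """Singular values s1 >= s2 of a 2x2 complex matrix, in closed form.

    Uses s1^2 + s2^2 = ||A||_F^2 and s1 * s2 = |det A|.
    """
    (a, b), (c, d) = A
    frob_sq = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2
    det_abs = abs(a * d - b * c)
    disc = math.sqrt(max(frob_sq ** 2 - 4 * det_abs ** 2, 0.0))
    s1 = math.sqrt((frob_sq + disc) / 2)
    s2 = math.sqrt(max((frob_sq - disc) / 2, 0.0))
    return s1, s2

def mat_vec(M, x):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(M[i][j] * x[j] for j in range(len(x))) for i in range(len(M))]

def norm(v):
    """Euclidean norm of a complex vector."""
    return math.sqrt(sum(abs(c) ** 2 for c in v))

A = [[1, 2j], [0.5, -1]]   # arbitrary illustrative matrix
s1, s2 = singular_values_2x2(A)

# First row [p, q] of the Hermitian matrix A*A.
(a, b), (c, d) = A
p = abs(a) ** 2 + abs(c) ** 2
q = complex(a).conjugate() * b + complex(c).conjugate() * d

# Eigenvector of A*A for the eigenvalue s1^2 (valid here because q != 0).
v = [q, s1 ** 2 - p]
w = [1, 0]                 # a vector that is not maximizing for this A

ratio_v = norm(mat_vec(A, v)) / norm(v)   # should equal s1 = ||A||
ratio_w = norm(mat_vec(A, w)) / norm(w)   # should be strictly smaller

print(s1, s2, ratio_v, ratio_w)
```

The eigenvector formula `v = [q, s1**2 - p]` relies on the off-diagonal entry `q` of $A^{*}A$ being nonzero; a diagonal $A^{*}A$ would need a separate case.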

The following result is a simple consequence of Theorem 4.

Theorem 5.

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be analytic. Suppose that, for each $k=1,\ldots,n$ , the function $z\mapsto s_{k}(F(z))$ attains its maximum value on $\Omega$ . Then $F(z)$ is constant on $\Omega$ .

In Theorem 5, the assumption does not require that the functions $s_{1}(F(z))$, $\ldots$, $s_{n}(F(z))$ attain their maximum values at the same point of $\Omega$; they may assume their respective maxima at distinct points $z_{1}$, $\ldots$, $z_{n}\in\Omega$. In fact, if the functions $s_{1}(F(z))$, $\ldots$, $s_{n}(F(z))$ attain their maximum values at the same point $z_{0}\in\Omega$, it follows already from Theorem 1 that $F(z)$ must be constant on $\Omega$.

Proof of Theorem 5.

Our proof is by induction on $n$ . When $n=1$ , the desired conclusion holds by the MMP. So, suppose that the result holds for $n=1,\ldots,m-1$ with $m\in\mathbb{N}$ . We now show that it also holds for $n=m$ .

Suppose $F:\Omega\to\mathbb{M}_{m}$ is analytic, and the function $z\mapsto s_{k}(F(z))$ attains its maximum value on $\Omega$ for each $k=1,\ldots,m$. Let $z_{1}\in\Omega$ be such that $s_{1}(F(z))\leq s_{1}(F(z_{1}))$ for all $z\in\Omega$. By Theorem 4, there are $m\times m$ (constant) unitary matrices $U_{1}$ and $V_{1}$, and an analytic function $F_{1}:\Omega\to\mathbb{M}_{m-1}$, such that

$$F(z)=U_{1}\left[\begin{array}{cc}s_{1}(F(z_{1}))&0\\ 0&F_{1}(z)\end{array}\right]V_{1}.$$

Since $\|F_{1}(z)\|\leq\|F(z)\|\leq s_{1}(F(z_{1}))$, the singular values of $F(z)$ are $s_{1}(F(z_{1}))$ followed by those of $F_{1}(z)$. In particular, $s_{k}(F_{1}(z))=s_{k+1}(F(z))$ attains its maximum value on $\Omega$ for each $k=1,\ldots,m-1$. By the inductive hypothesis, $F_{1}(z)$ must be constant on $\Omega$ and, consequently, $F(z)$ is also constant. ∎

At first sight, the assumption in Theorem 5 that every function $z\mapsto s_{k}(F(z))$ attains its maximum value on $\Omega$ for $k=1,\ldots,n$ appears to be different from saying that $\|F(z)\|_{\mathcal{F}}$ attains its maximum value on $\Omega$ in the maximum Frobenius norm principle above. Based upon the results above one may conclude that they are in fact equivalent!

Corollary 6.

Let $\Omega$ be a region of $\mathbb{C}$ . The following statements are equivalent for an analytic function $F:\Omega\to\mathbb{M}_{n}$ .

(1) $F(z)$ is constant on $\Omega$.

(2) For every $k=1,\ldots,n$, $s_{k}(F(z))$ is constant on $\Omega$.

(3) For every $k=1,\ldots,n$, $s_{k}(F(z))$ attains its maximum value at some $z_{k}\in\Omega$.

(4) $\|F(z)\|_{\mathcal{F}}$ is constant on $\Omega$.

(5) $\|F(z)\|_{\mathcal{F}}$ attains its maximum value at some $z_{0}\in\Omega$.

Proof.

It is evident that $(1)\implies(2)$, $(2)\implies(3)$, $(1)\implies(4)$, and $(4)\implies(5)$. The only nontrivial implications $(5)\implies(1)$ and $(3)\implies(1)$ are consequences of the maximum Frobenius norm principle and Theorem 5, respectively. ∎

In view of Corollary 6 (or Theorem 5), if $\Omega$ is a region of $\mathbb{C}$ and $F:\Omega\to\mathbb{M}_{n}$ is a nonconstant analytic function such that $s_{1}(F(z))$ attains its maximum on $\Omega$, then there is a largest integer $r<n$ such that the functions $s_{1}(F(z))$, $\ldots$, $s_{r}(F(z))$ attain their maximum values on $\Omega$. In this case, up to multiplication by (constant) unitary matrices on the right and the left, $F(z)$ has the block form

$$\left[\begin{array}{cccc}s_{1}(F(z_{1}))&\ldots&0&0\\ \vdots&\ddots&\vdots&\vdots\\ 0&\ldots&s_{r}(F(z_{r}))&0\\ 0&\ldots&0&F_{r}(z)\end{array}\right]$$

for some (necessarily nonconstant) analytic function $F_{r}:\Omega\to\mathbb{M}_{n-r}$ .

A closer look at the proof of Theorem 5 also reveals the following refinement of the maximum norm principle. We omit the details.

Corollary 7.

Let $1\leq m\leq n$ , let $\Omega$ be a region of $\mathbb{C}$ , and let $F:\Omega\to\mathbb{M}_{n}$ be analytic. Suppose that, for each $k=1,\ldots,m$ , the function $s_{k}(F(z))$ attains its maximum value on $\Omega$ . Then $s_{k}(F(z))$ is constant on $\Omega$ for each $k=1,\ldots,m$ .

Note that for an arbitrary $F(z)$ , it may happen that $s_{n}(F(z))$ is constant while $s_{k}(F(z))$ is not when $1\leq k<n$ . For example, the function $F:\mathbb{D}\setminus\{0\}\to\mathbb{M}_{2}$ defined by

$$F(z)=\left[\begin{array}{cc}1&0\\ 0&z^{-1}\end{array}\right],$$

has $s_{1}(F(z))=|z|^{-1}$ and $s_{2}(F(z))=1$ for all $z\in\mathbb{D}\setminus\{0\}$ .
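This example can be confirmed numerically. The sketch below (an illustration with hypothetical helper names and arbitrarily chosen sample points in the punctured disk) computes both singular values of $F(z)=\operatorname{diag}(1,z^{-1})$ from the standard closed form for $2\times 2$ matrices.

```python
import math

def singular_values_2x2(A):
    """Singular values s1 >= s2 of a 2x2 complex matrix, in closed form.

    Uses s1^2 + s2^2 = ||A||_F^2 and s1 * s2 = |det A|.
    """
    (a, b), (c, d) = A
    frob_sq = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2
    det_abs = abs(a * d - b * c)
    disc = math.sqrt(max(frob_sq ** 2 - 4 * det_abs ** 2, 0.0))
    s1 = math.sqrt((frob_sq + disc) / 2)
    s2 = math.sqrt(max((frob_sq - disc) / 2, 0.0))
    return s1, s2

def F(z):
    """The example F(z) = diag(1, 1/z) on the punctured unit disk."""
    return [[1, 0], [0, 1 / z]]

# Arbitrary sample points with 0 < |z| < 1 (illustrative only).
points = [0.5, -0.25j, 0.3 + 0.4j]
values = [singular_values_2x2(F(z)) for z in points]

for z, (s1, s2) in zip(points, values):
    print(f"z = {z}: s1 = {s1:.6f} (1/|z| = {1 / abs(z):.6f}), s2 = {s2:.6f}")
```

In each case the smallest singular value stays at $1$ while the largest equals $|z|^{-1}$, which is unbounded near the origin and therefore attains no maximum on $\mathbb{D}\setminus\{0\}$.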

As seen in its proof, the key to obtaining the conclusion of Theorem 4 relies on choosing a maximizing vector for $F(z_{0})$ . The following theorem is a refinement of Theorem 4 that relies on choosing instead “all maximizing vectors” for $F(z_{0})$ .

Theorem 8.

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be analytic. Suppose there is a $z_{0}\in\Omega$ so that $\|F(z)\|\leq\|F(z_{0})\|$ for all $z\in\Omega$ and set

$$d=\dim\{x\in\mathbb{C}^{n}:\|F(z_{0})x\|=\|F(z_{0})\|\cdot\|x\|\}.\tag{9}$$

(Equivalently, $d$ is the dimension of the subspace spanned by the “right-singular vectors” associated with the largest singular value of $F(z_{0})$.) Then there are $n\times n$ unitary matrices $U$ and $V$ such that

$$F(z)=\|F(z_{0})\|\,U\cdot V\quad\text{when }d=n,$$

or, for some analytic function $R:\Omega\to\mathbb{M}_{n-d}$,

$$F(z)=U\left[\begin{array}{cc}\|F(z_{0})\|\cdot I_{d}&0\\ 0&R(z)\end{array}\right]V\quad\text{when }d<n.\tag{10}$$

In particular, $z\mapsto F(z)x$ is constant and $F^{(k)}(z_{0})x=0$ for all $k\geq 1$ when $x$ satisfies $\|F(z_{0})x\|=\|F(z_{0})\|\cdot\|x\|$ .

Note that one could apply Theorem 8 again to the lower-right matrix-block function $R(z)$ appearing in (10). More precisely, if $s_{1}(F(z))$ attains its maximum at $z_{1}\in\Omega$, $d_{1}$ is the largest integer such that

$$s_{1}(F(z_{1}))=s_{d_{1}}(F(z_{1})),$$

$s_{d_{1}+1}(F(z))$ attains its maximum at $z_{2}\in\Omega$, and $d_{2}$ is the largest integer such that

$$s_{d_{1}+1}(F(z_{2}))=s_{d_{2}}(F(z_{2})),$$

then, up to multiplication by (constant) unitary matrices on the right and the left, $F(z)$ has the block form

$$\left[\begin{array}{ccc}s_{d_{1}}(F(z_{1}))\cdot I_{d_{1}}&0&0\\ 0&s_{d_{2}}(F(z_{2}))\cdot I_{d_{2}}&0\\ 0&0&*\end{array}\right].$$

Hence, if every function $z\mapsto s_{k}(F(z))$ attains its maximum at some point of $\Omega$ then, up to multiplication by (constant) unitary matrices on the right and the left, $F(z)$ admits the block form

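$$\left[\begin{array}{cccc}s_{d_{1}}(F(z_{1}))\cdot I_{d_{1}}&0&\ldots&0\\ 0&s_{d_{2}}(F(z_{2}))\cdot I_{d_{2}}&\ldots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\ldots&s_{d_{\kappa}}(F(z_{\kappa}))\cdot I_{d_{\kappa}}\end{array}\right],$$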

and is hence a constant matrix, as expected by Theorem 5.

Likewise, a completely analogous argument reveals that the refinement of the maximum norm principle in Corollary 7 is also a consequence of Theorem 8, because $s_{j}(F(z))=\|F(z_{0})\|$ for $j=1,\ldots,d$ and $s_{\ell+d}(F(z))=s_{\ell}(R(z))$ for $\ell=1,\ldots,(n-d)$ . We leave the details to the reader.

Proof of Theorem 8.

Let $D_{z_{0}}$ be the diagonal matrix whose main diagonal entries are the singular values of $F(z_{0})$ listed in nonincreasing order. Then we may let $U_{z_{0}}$ and $V_{z_{0}}$ be unitary matrices such that $F(z_{0})=U_{z_{0}}D_{z_{0}}V_{z_{0}}$ (i.e., an SVD for $F(z_{0})$). Let $r$ denote the largest positive integer such that $s_{r}(F(z_{0}))=\|F(z_{0})\|$. Note that, by (7), a vector $x$ satisfies $\|F(z_{0})x\|=\|F(z_{0})\|\cdot\|x\|$ if and only if $V_{z_{0}}^{*}(\|F(z_{0})\|^{2}I-D_{z_{0}}^{2})V_{z_{0}}x=0$, or equivalently, $x$ belongs to the linear span of the first $r$ columns of $V^{*}_{z_{0}}$, because $V^{*}_{z_{0}}$ is unitary. Thus, $r=d$ with $d$ as in (9).

Now, consider the function $G(z)=U_{z_{0}}^{*}F(z)V_{z_{0}}^{*}$ . Clearly, $G$ is analytic on $\Omega$ and satisfies

$$\|G(z)\|\leq\|F(z_{0})\|\quad\text{for all }z\in\Omega.$$

Since the $\mathbb{C}^{n}$ norm of every column (and row) of a matrix is bounded by its operator norm, the modulus of every (analytic) entry $G_{i,j}(z)$ is also bounded by $\|F(z_{0})\|$. Moreover, if $1\leq i\leq r$, then $G_{i,i}(z_{0})=\|F(z_{0})\|$ and so $G_{i,i}(z)=\|F(z_{0})\|$ for all $z\in\Omega$ by the (usual) MMP. In particular, each of the first $r$ columns and first $r$ rows of $G(z)$ has $\mathbb{C}^{n}$ norm at least $\|F(z_{0})\|$, and hence exactly $\|F(z_{0})\|$. Therefore, $G_{i,j}(z)=0$ whenever $i\neq j$ and at least one of $i$, $j$ is at most $r$. In other words, using matrix blocks, this shows that

$$G(z)=\left[\begin{array}{cc}\|F(z_{0})\|\cdot I_{r}&0\\ 0&R(z)\end{array}\right]$$

for some analytic function $R:\Omega\to\mathbb{M}_{n-r}$ when $r<n$, while $F(z)=\|F(z_{0})\|U_{z_{0}}V_{z_{0}}$ when $r=n$. This completes the proof of (10).

Finally, if $e_{1},\ldots,e_{n}$ denotes the standard basis for $\mathbb{C}^{n}$ and $k\geq 1$ , then the $j$ th column $V^{*}_{z_{0}}e_{j}$ of $V^{*}_{z_{0}}$ satisfies $F(z)V^{*}_{z_{0}}e_{j}=\|F(z_{0})\|U_{z_{0}}e_{j}$ and

[TABLE]

for $j=1,\ldots,r$. Thus, $z\mapsto F(z)x$ is constant and $F^{(k)}(z)x=0$ whenever $x$ belongs to the linear span of the first $r$ columns of $V^{*}_{z_{0}}$, or equivalently, when $x$ satisfies $\|F(z_{0})x\|=\|F(z_{0})\|\cdot\|x\|$. ∎

4. Minimum singular value principles.

In the case of nonconstant scalar-valued functions, the MMP tells us that the minimum modulus (of an analytic function on a region) can only be attained at a zero of the function. This conclusion is often called the minimum modulus principle in complex analysis. As a consequence of Theorem 5, we state and prove an analog of that minimum principle in the context of matrix-valued functions.

Theorem 9.

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be a nonconstant analytic function. Then no point $z_{0}\in\Omega$ can be a minimum point for all of the functions $s_{k}(F(z))$, $1\leq k\leq n$, unless $F(z_{0})$ is not invertible.

Proof.

We prove that if there is a $z_{0}\in\Omega$ such that $F(z_{0})$ is invertible and the functions $z\mapsto s_{k}(F(z))$ attain their minimum at $z_{0}$ for $k=1,\ldots,n$ , then $F(z)$ must be a constant function.

To begin, recall that the collection of invertible matrices is open. This implies $F(z)$ must be invertible for all $z$ sufficiently close to $z_{0}$. So, $G(z)\stackrel{\mathrm{def}}{=}F(z)^{-1}$ exists in some neighborhood $\Omega_{0}$ of $z_{0}$, $\det F(z)$ is nonzero and analytic on $\Omega_{0}$, and the adjugate (or transpose of the cofactor matrix) $\operatorname{adj}(F(z))$ of $F(z)$ is analytic on $\Omega_{0}$. Thus, $G(z)=F(z)^{-1}=(\det F(z))^{-1}\operatorname{adj}(F(z))$ is analytic on $\Omega_{0}$ as well.

By the singular value decomposition, at each $z\in\Omega_{0}$ , the singular values of $G(z)$ are the reciprocals of those of $F(z)$ ; more specifically,

$$s_{k}(G(z))=\frac{1}{s_{n-k+1}(F(z))}\quad\text{for }k=1,\ldots,n.$$

Therefore, the assumption of the theorem is equivalent to stating that the functions $z\mapsto s_{k}(G(z))$ attain a maximum on $\Omega_{0}$ at $z_{0}$. By Theorem 5, $G(z)$, and hence $F(z)=G(z)^{-1}$, must be constant on $\Omega_{0}$. Finally, applying the identity theorem (e.g., [7, Theorem 10.18]) to each entry of $F(z)$ implies that $F(z)$ is constant throughout $\Omega$, as desired. ∎
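The two ingredients of this proof, the adjugate formula for the inverse and the reciprocal relation between the singular values of a matrix and those of its inverse, can be checked numerically. The following is only a sketch for the case $n=2$: the matrix `F0` is an arbitrary invertible example (not taken from the paper), and the helper `singular_values_2x2` relies on the standard $2\times 2$ identities $s_{1}^{2}+s_{2}^{2}=\|A\|_{F}^{2}$ and $s_{1}s_{2}=|\det A|$.

```python
import math


def singular_values_2x2(m):
    """Return (s1, s2) with s1 >= s2 >= 0 for a 2x2 complex matrix."""
    (a, b), (c, d) = m
    frob_sq = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2  # s1^2 + s2^2
    det_abs = abs(a * d - b * c)                                     # s1 * s2
    disc = math.sqrt(max(frob_sq ** 2 - 4 * det_abs ** 2, 0.0))
    s1 = math.sqrt((frob_sq + disc) / 2)
    s2 = det_abs / s1 if s1 > 0 else 0.0
    return s1, s2


def inverse_2x2(m):
    """Invert a 2x2 matrix as adj(m) / det(m)."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


# Hypothetical invertible sample value F(z0); any invertible matrix would do.
F0 = [[2 + 1j, 1], [0.5j, 3]]
G0 = inverse_2x2(F0)

s1_F, s2_F = singular_values_2x2(F0)
s1_G, s2_G = singular_values_2x2(G0)

# The largest singular value of the inverse is the reciprocal of the
# smallest singular value of the original matrix, and vice versa.
print(s1_G, 1 / s2_F)
print(s2_G, 1 / s1_F)
```

In each printed pair the two numbers should agree up to rounding, which is the $n=2$ instance of the reciprocal relation used in the proof.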

Corollary 10.

Let $\Omega$ be a region of $\mathbb{C}$ and let $F:\Omega\to\mathbb{M}_{n}$ be a nonconstant analytic function. If every function $s_{k}(F(z))$ , $1\leq k\leq n$ , attains a minimum value at $z_{0}\in\Omega$ , then $\det(F(z_{0}))=0$ .

Remark.

Notice that $s_{n}(F(z_{0}))=0$ if and only if $z_{0}$ is a zero of $\det F(z)$ ; indeed, with an SVD of $A\in\mathbb{M}_{n}$ , we see that

$$|\det A|=\prod_{k=1}^{n}s_{k}(A).$$

Thus, Corollary 10 states that if every function $s_{k}(F(z))$ , $1\leq k\leq n$ , attains a minimum value at $z_{0}\in\Omega$ , then $s_{n}(F(z_{0}))=0$ .

To illustrate Theorem 9, it suffices to take $F:\mathbb{D}\to\mathbb{M}_{2}$ as in (2); indeed, the functions $s_{1}(F(z))=1$ and $s_{2}(F(z))=|g(z)|$ attain their respective minimum values at any zero $z_{0}$ of $g$ and $F(z_{0})$ is certainly not invertible.
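A concrete instance of this illustration can be computed directly. The sketch below assumes the hypothetical choice $g(z)=z$ on the unit disk (the text only requires $g$ to be as in (2)), and computes the two singular values of $F(z)=\operatorname{diag}(1,g(z))$ from the $2\times 2$ identities $s_{1}^{2}+s_{2}^{2}=\|A\|_{F}^{2}$ and $s_{1}s_{2}=|\det A|$.

```python
import math


def singular_values_2x2(m):
    """Return (s1, s2) with s1 >= s2 >= 0 for a 2x2 complex matrix."""
    (a, b), (c, d) = m
    frob_sq = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2  # s1^2 + s2^2
    det_abs = abs(a * d - b * c)                                     # s1 * s2
    disc = math.sqrt(max(frob_sq ** 2 - 4 * det_abs ** 2, 0.0))
    s1 = math.sqrt((frob_sq + disc) / 2)
    s2 = det_abs / s1 if s1 > 0 else 0.0
    return s1, s2


def g(z):
    """Hypothetical scalar function with |g(z)| <= 1 on the unit disk."""
    return z


def F(z):
    """The diagonal function diag(1, g(z)) from the illustration."""
    return [[1.0, 0.0], [0.0, g(z)]]


at_zero = singular_values_2x2(F(0))            # z0 = 0 is the zero of g
elsewhere = singular_values_2x2(F(0.3 + 0.4j))

print(at_zero)    # s1 stays at 1, s2 drops to 0 at the zero of g
print(elsewhere)  # s1 is still 1, s2 equals |g(z)| = |z|
```

Both singular values are smallest at $z_{0}=0$, and $F(0)=\operatorname{diag}(1,0)$ is singular, in agreement with Theorem 9.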

In light of Theorems 5 and 9, one may ask whether the singular values of a matrix-valued analytic function could attain minimum values at distinct points. The following result gives an affirmative answer.

Theorem 11.

If $F:\mathbb{C}\to\mathbb{M}_{2}$ denotes the function defined by

[TABLE]

then $s_{1}(F(z))$ has a minimum at $z_{1}=0$ and $s_{2}(F(z))$ has a minimum at $z_{2}=1$ .

Proof.

The remark following the proof of Theorem 3 shows that $s_{1}(F(z))$ has a minimum at $z_{1}=0$ ; indeed, $z\mapsto F(z)x_{0}$ is constant when $x_{0}=[1,0]^{T}$ . On the other hand, if $z_{2}=1$ , then

[TABLE]

satisfies $s_{2}(F(z_{2}))=0$ because $\det F(z_{2})=0$ . In particular, $s_{2}(F(z))$ has a minimum at $z_{2}=1$ . ∎

Finally, it is worth mentioning that a singular value of a matrix function may attain its minimum value at specified locations. For instance, when $g$ and $h$ are analytic, the function defined by

[TABLE]

satisfies $s_{2}(K(z))=0$ at every zero of $g$ and $h$ .

5. Return to the resolvent and matrix exponential.

With the wisdom acquired about the norms and singular values of analytic matrix-valued functions, we now return to the resolvent and matrix exponential of a given matrix. To simplify our notation, let $R_{A}(z)$ denote the resolvent of $A\in\mathbb{M}_{n}$ at $z$ , i.e.,

$$R_{A}(z)=(A-zI)^{-1}\quad\text{for }z\in\mathbb{C}\setminus\sigma(A).$$

Also, set $L_{A}(z)=A-zI$ .

Let $\Omega$ be a region of $\mathbb{C}\setminus\sigma(A)$ . By Theorem 5, the singular values $s_{k}(L_{A}(z))$ and likewise $s_{k}(R_{A}(z))$ cannot all attain a maximum value on $\Omega$ as functions. Recalling that

$$s_{k}(R_{A}(z))=\frac{1}{s_{n-k+1}(L_{A}(z))}\quad\text{for }k=1,\ldots,n,$$

it follows that the functions $s_{k}(R_{A}(z))$ can neither all attain a maximum nor all attain a minimum on $\Omega$; in fact, already $s_{1}(R_{A}(z))$ attains no maximum and $s_{n}(R_{A}(z))$ attains no minimum on $\Omega$, as shown below. (Footnote 8: The first inequality in Theorem 12 was observed by Daniluk [3] for resolvents of operators on a complex Hilbert space.)

Theorem 12.

If $A\in\mathbb{M}_{n}$ and $\Omega$ is any region of $\mathbb{C}\setminus\sigma(A)$ , then

[TABLE]

In particular, the functions $s_{1}(R_{A}(z))$ and $s_{n}(R_{A}(z))$ are nonconstant on $\Omega$ .

Proof.

To obtain a contradiction, assume instead there are points $z_{0},w_{0}\in\Omega$ such that

[TABLE]

(see (14)). By Theorem 3,

[TABLE]

when $x_{0}$ and $y_{0}$ are maximizing vectors for $R_{A}(z_{0})$ and $L_{A}(w_{0})$ , respectively. On the other hand, as

[TABLE]

for any $z\in\Omega$ , we have

[TABLE]

and clearly $L_{A}^{\prime}(w_{0})=-I$ . However, these equations, together with (15), imply that

[TABLE]

or $y_{0}=-L_{A}^{\prime}(w_{0})y_{0}=0$ , which are impossible because $\|x_{0}\|=\|y_{0}\|=1$ . ∎
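The conclusion that $s_{1}(R_{A}(z))$ has no interior maximum and $s_{n}(R_{A}(z))$ has no interior minimum can be observed numerically. The sketch below is an illustration under assumptions that are not from the paper: the matrix `A` is a $2\times 2$ nilpotent Jordan block (so $\sigma(A)=\{0\}$), the center `z0` and the radius are arbitrary, and the circle is sampled at only 16 points. Singular values are computed from the $2\times 2$ identities $s_{1}^{2}+s_{2}^{2}=\|M\|_{F}^{2}$ and $s_{1}s_{2}=|\det M|$.

```python
import cmath
import math


def singular_values_2x2(m):
    """Return (s1, s2) with s1 >= s2 >= 0 for a 2x2 complex matrix."""
    (a, b), (c, d) = m
    frob_sq = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2  # s1^2 + s2^2
    det_abs = abs(a * d - b * c)                                     # s1 * s2
    disc = math.sqrt(max(frob_sq ** 2 - 4 * det_abs ** 2, 0.0))
    s1 = math.sqrt((frob_sq + disc) / 2)
    s2 = det_abs / s1 if s1 > 0 else 0.0
    return s1, s2


def inverse_2x2(m):
    """Invert a 2x2 matrix as adj(m) / det(m)."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


def resolvent_singular_values(a_matrix, z):
    """Singular values (s1, s2) of (A - zI)^{-1} for a 2x2 matrix A."""
    (a, b), (c, d) = a_matrix
    shifted = [[a - z, b], [c, d - z]]  # L_A(z) = A - zI
    return singular_values_2x2(inverse_2x2(shifted))


A = [[0, 1], [0, 0]]   # hypothetical example with spectrum {0}
z0 = 1.0               # arbitrary point outside the spectrum
radius = 0.1           # small enough that the circle avoids the spectrum
circle = [z0 + radius * cmath.exp(2j * math.pi * k / 16) for k in range(16)]

center_s1, center_s2 = resolvent_singular_values(A, z0)
largest_s1_on_circle = max(resolvent_singular_values(A, z)[0] for z in circle)
smallest_s2_on_circle = min(resolvent_singular_values(A, z)[1] for z in circle)

# Some nearby point beats the center in both directions.
print(center_s1, largest_s1_on_circle)
print(center_s2, smallest_s2_on_circle)
```

For this example, a nearby point with a larger resolvent norm and a nearby point with a smaller least singular value both exist, so $z_{0}$ is neither a maximum of $s_{1}(R_{A}(z))$ nor a minimum of $s_{2}(R_{A}(z))$. A finite sample of this kind only illustrates the theorem; it does not prove it.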

We now turn to the matrix exponential. Recall that given $T\in\mathbb{M}_{n}$ , the matrix exponential of $T$ is the $n\times n$ matrix defined by

$$\exp(T)=\sum_{k=0}^{\infty}\frac{T^{k}}{k!}.$$

It is not difficult to verify that the series above converges for any $T\in\mathbb{M}_{n}$ (say, under the operator norm), and $\exp(T)$ is invertible in $\mathbb{M}_{n}$ with inverse $\exp(-T)$ .
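Both claims, convergence of the series and invertibility with inverse $\exp(-T)$, are easy to check numerically in a small case. The sketch below is a minimal illustration rather than a robust algorithm: it truncates the power series after a fixed number of terms (an assumption that is adequate only for matrices of modest norm), and the $2\times 2$ matrix `T` is a hypothetical example for which $\exp(T)$ is a rotation matrix.

```python
import math


def mat_mul(x, y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mat_exp(t, terms=30):
    """Approximate exp(t) by the first `terms` terms of its power series."""
    result = [[1.0, 0.0], [0.0, 1.0]]  # k = 0 term: the identity
    term = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        # Update term from t^(k-1)/(k-1)! to t^k/k!.
        term = [[entry / k for entry in row] for row in mat_mul(term, t)]
        result = [[result[i][j] + term[i][j] for j in range(2)]
                  for i in range(2)]
    return result


T = [[0.0, 1.0], [-1.0, 0.0]]        # hypothetical example; T^2 = -I
minus_T = [[0.0, -1.0], [1.0, 0.0]]

exp_T = mat_exp(T)
exp_minus_T = mat_exp(minus_T)
product = mat_mul(exp_T, exp_minus_T)

print(exp_T)    # close to [[cos 1, sin 1], [-sin 1, cos 1]]
print(product)  # close to the identity matrix
```

Since $T^{2}=-I$ here, the series sums to $\cos(1)I+\sin(1)T$, which gives an independent check on the truncated sum.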

For $A\in\mathbb{M}_{n}$ , we see that the map $z\mapsto\exp(zA)$ is a well-defined matrix-valued function, analytic on the entire complex plane $\mathbb{C}$ , and

[TABLE]

Furthermore, a straightforward verification (footnote 9: indeed, when $\operatorname{Re}z>\|A\|$, the function $t\mapsto\exp(t(A-zI))$ has operator norm equal to $e^{-t\operatorname{Re}z}\|\exp(tA)\|$, which tends to zero as $t\to\infty$, and so the integral over $[0,\infty)$ of its derivative equals $-I$) reveals that

[TABLE]

while term-by-term integration of the power series representations for the exponential and the resolvent gives

[TABLE]

where $\Gamma_{r}$ denotes any circle of radius $r>\|A\|$ centered at the origin.

In addition to the intimate relationship between the resolvent and the matrix exponential (as described in (16) and (17)), intuition from the case of scalar-valued functions may suggest that, in analogy to Theorem 12, the functions $s_{1}(\exp(zA))$ and $s_{n}(\exp(zA))$ should not attain their maximum and minimum values, respectively, over any region $\Omega$ of $\mathbb{C}$. (Footnote 10: After all, for fixed nonzero $a\in\mathbb{C}$, $|e^{za}|$ attains neither a maximum nor a minimum over any region $\Omega$.) This is in fact false. Notice that

[TABLE]

provides a counterexample; indeed, computation reveals that

[TABLE]

Thus, $s_{1}(\exp(zA))$ and $s_{2}(\exp(zA))$ are constant when $\operatorname{Re}z<0$ and $\operatorname{Re}z>0$ , respectively.
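The behavior just described can be reproduced with a very simple matrix. The sketch below uses $A=\operatorname{diag}(1,0)$, which is an assumption made here for illustration and is not necessarily the matrix used in the paper; for this choice $\exp(zA)=\operatorname{diag}(e^{z},1)$, so the singular values are $\max\{e^{\operatorname{Re}z},1\}$ and $\min\{e^{\operatorname{Re}z},1\}$. They are computed from the $2\times 2$ identities $s_{1}^{2}+s_{2}^{2}=\|M\|_{F}^{2}$ and $s_{1}s_{2}=|\det M|$.

```python
import cmath
import math


def singular_values_2x2(m):
    """Return (s1, s2) with s1 >= s2 >= 0 for a 2x2 complex matrix."""
    (a, b), (c, d) = m
    frob_sq = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2  # s1^2 + s2^2
    det_abs = abs(a * d - b * c)                                     # s1 * s2
    disc = math.sqrt(max(frob_sq ** 2 - 4 * det_abs ** 2, 0.0))
    s1 = math.sqrt((frob_sq + disc) / 2)
    s2 = det_abs / s1 if s1 > 0 else 0.0
    return s1, s2


def exp_zA(z):
    """exp(zA) for the hypothetical choice A = diag(1, 0)."""
    # A is diagonal, so exp(zA) is diag(e^z, e^0) = diag(e^z, 1).
    return [[cmath.exp(z), 0.0], [0.0, 1.0]]


left_points = [-1 + 2j, -0.25 - 3j, -5 + 0j]   # Re z < 0
right_points = [2 - 1j, 0.5 + 4j, 3 + 0j]      # Re z > 0

s1_left = [singular_values_2x2(exp_zA(z))[0] for z in left_points]
s2_right = [singular_values_2x2(exp_zA(z))[1] for z in right_points]

print(s1_left)   # all close to 1: s1 is constant on the left half-plane
print(s2_right)  # all close to 1: s2 is constant on the right half-plane
```

So, for this example, $s_{1}(\exp(zA))$ attains its maximum at every point of the left half-plane and $s_{2}(\exp(zA))$ attains its minimum at every point of the right half-plane.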

Finally, we would like to propose a question for further investigation. Given an analytic function $F:\Omega\to\mathbb{M}_{n}$ such that $\|F(z)\|$ attains its maximum at some point $z_{0}\in\Omega$, Theorem 8 not only describes the structure of $F$, it also implies that $\|F(z)\|=\|F(z_{0})\|$ for all $z\in\Omega$. So, in a sense, it is rare for $\|F(z)\|$ to attain its maximum. Instead, what may be less rare is for $\|F(z)\|$ to attain a minimum value (see Theorem 11).

In fact, the remark made after the proof of Theorem 3 already gives a sufficient condition for $\|F(z)\|$ to have a minimum at $z_{0}$, namely that $z\mapsto F(z)x_{0}$ is constant for some maximizing vector $x_{0}$ for $F(z_{0})$. Furthermore, by completely analogous reasoning, a sufficient condition for $s_{n}(F(z))$ to attain a maximum at $z_{0}$ is that $z\mapsto F(z)x_{0}$ is constant for some minimizing vector $x_{0}$ for $F(z_{0})$. (Footnote 11: A vector $x_{0}$ is said to be a minimizing vector for $A\in\mathbb{M}_{n}$ if $\|x_{0}\|=1$ and $\|Ax_{0}\|=\min\{\|Ax\|:\|x\|=1\}$.) For example, in light of this, it may be verified that $s_{1}(F(z))$ has a minimum at $z=0$ and $s_{2}(F(z))$ has a maximum at $z=0$ when $F(z)$ is the function in (12). This leads one to wonder: what necessary and sufficient conditions permit $s_{1}(F(z))$ to attain a minimum and $s_{n}(F(z))$ to attain a maximum over a region $\Omega$? Is the question more tractable in the special case $F(z)=(A-zI)^{-1}$? How about when $F(z)=\exp(zA)$?

Acknowledgments. I wish to thank Cara D. Brooks whose valuable comments helped improve the exposition of the paper tremendously. Also, I am grateful to Nicholas Seguin whose sustained interest in this project helped me see it to completion. Finally, I thank the referees and editor for their helpful suggestions.

Bibliography


[1] Brooks, C. D., Condori, A. A. (2018). A resolvent criterion for normality. Amer. Math. Monthly. 125(2): 149–156.
[2] Brown, A., Douglas, R. G. (1965). On maximum theorems for analytic operator functions. Acta Sci. Math. (Szeged). 26: 325–327.
[3] Daniluk, A. (2011). The maximum principle for holomorphic operator functions. Integral Equations Operator Theory. 69(3): 365–372.
[4] Dunford, N., Schwartz, J. T. (1958). Linear Operators. I. General Theory. New York–London: Interscience Publishers.
[5] Horn, R. A., Johnson, C. R. (1985). Matrix Analysis. Cambridge, UK: Cambridge Univ. Press.
[6] Nikolski, N., Vasyunin, V. (1998). Elements of spectral theory in terms of the free function model. I. Basic constructions. In: Axler, S., McCarthy, J. E., Sarason, D., eds. Holomorphic Spaces (Berkeley, CA, 1995). Math. Sci. Res. Inst. Publ., vol. 33. Cambridge, UK: Cambridge Univ. Press, pp. 211–302.
[7] Rudin, W. (1987). Real and Complex Analysis, 3rd ed. New York, NY: McGraw-Hill Book Co.
[8] Trefethen, L. N. (1992). Pseudospectra of matrices. In: Griffiths, D. F., Watson, G. A., eds. Numerical Analysis 1991 (Dundee, 1991). Pitman Res. Notes Math. Ser., 260. Harlow: Longman Scientific & Technical and New York: John Wiley & Sons, Inc., pp. 234–266.
