Maximum Principles for Matrix-Valued Analytic Functions
Alberto A. Condori

TL;DR
This paper explores maximum modulus principles for matrix-valued analytic functions, extending classical scalar results to matrices, and discusses their implications for singular values, resolvents, and matrix exponentials.
Contribution
It introduces new maximum norm principles for matrix-valued analytic functions and derives maximum and minimum principles for their singular values.
Findings
Maximum norm principles for matrix-valued functions are established.
Maximum and minimum principles for singular values are deduced.
Observations on resolvents and matrix exponentials are provided.
Abstract
To what extent is the maximum modulus principle for scalar-valued analytic functions valid for matrix-valued analytic functions? In response, we discuss some maximum norm principles for such functions that do not appear to be widely known, deduce maximum and minimum principles for their singular values, and make some observations concerning resolvents and matrix exponentials.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Maximum Principles for Matrix-Valued Analytic Functions
Alberto A. Condori
Abstract.
To what extent is the maximum modulus principle for scalar-valued analytic functions valid for matrix-valued analytic functions? In response, we discuss some maximum norm principles for such functions that do not appear to be widely known, deduce maximum and minimum principles for their singular values, and make some observations concerning resolvents and matrix exponentials.
1. Introduction.
The maximum modulus principle (MMP) is a fundamental result in complex analysis. It is often used to deduce other important results such as the fundamental theorem of algebra, the open mapping theorem (i.e., analytic functions map open sets to open sets), Schwarz’s lemma, the Phragmén–Lindelöff principle, etc. One of its various formulations states that if is a scalar-valued function, analytic on a region (i.e., a nonempty open connected subset) of the complex plane , whose modulus attains a local maximum in , then is constant on . For a proof of the MMP, we refer the reader to [7, Chapter 10].
Many differential equations encountered in science and engineering lead to the consideration of matrix-valued functions, that is, functions with range in the set of matrices, , with entries in . For instance, the standard model of an RLC circuit in electrical engineering admits the formulation , where and is a function with values in . The vector-valued solutions to such an equation (with ) depend on the matrix-valued function , and the decay of these solutions is controlled by its operator norm . As usual, is the operator norm of induced by the Euclidean norm on , namely when .
In linear algebra, too, matrix-valued functions arise (implicitly) in the study of eigenvalues, i.e., the spectrum of . After all, satisfies for some nonzero vector if and only if the resolvent function has a singularity at , i.e., is not invertible. (Throughout, denotes the identity in .) Since the spectrum is often insufficient for the analysis of non-normal matrices (see [8]), focus has shifted to the study111Equivalently, one may study the so-called “pseudospectra” of . For an overview of that subject, see [9]. of the norm of the resolvent . For instance, the norm of the resolvent alone characterizes when is a normal matrix [1].
Thus, it is of interest to study the (operator) norms of the matrix-valued functions and . As can be expected, these functions are analytic222Recall that a function is analytic if, for each , there is a member of , denoted by , such that as . It can be shown that is analytic if and only if is “entry-wise analytic,” i.e., every entry of is an analytic function on . in regions of , the entire plane , and , respectively. The fact these functions are analytic leads one to question the extent to which the MMP for scalar-valued functions is valid for , where is any matrix-valued analytic function. The purpose of this article is to find sufficient conditions, say involving the norm of a matrix-valued analytic function, that guarantee that the function is constant.
In Section 2, we state and discuss some maximum norm principles for matrix-valued analytic functions. Although it has been long known that a direct analog of the MMP fails in the context of matrix-valued functions in which the operator norm plays the role of the modulus, we find a suitable analog. Stated roughly, if is such that attains a maximum at some , then there is a direction in which is constant (although need not be) namely that of any maximizing vector of (see Theorem 3). We rediscovered this result originally noted by Brown and Douglas in [2] and use it to describe the structure of the function (see Theorem 4). Since the result lends itself to iteration, we make natural assumptions on the function’s singular values and explore the consequences further in Section 3. One of the section’s main results (see Corollary 6) illuminates the equivalence of two apparently distinct statements to the single statement that the matrix function is constant: the Frobenius norm of attains a maximum, and every singular value of attains a maximum (at possibly distinct points).
Once the maximum singular-values principle is established in Section 3, we proceed to prove a minimum singular-values principle in Section 4. That result (Theorem 9) is, in a sense, an analog of the well-known minimum modulus principle of complex analysis in the context of matrix-valued functions. Finally, in Section 5, we discuss the implications of our results in the context of the resolvent and the matrix exponential which involve their largest and smallest singular values.
It is worth mentioning that analytic matrix-valued functions appear in many other areas such as the harmonic analysis of operators on a Hilbert space (e.g., finite-rank perturbations of self-adjoint and unitary operators), and consequently in mathematical physics (e.g., Schrödinger operators); roughly, problems concerning spectral properties of an operator are often solved through the consideration of an analytic matrix-valued function defined on the upper-half plane, i.e. the so-called “characteristic function.” Due to the scope of the paper, the reader is referred to the survey [6] and all references therein for further details.
We also remark that the results of this article could be written in the more general framework of operator-valued functions , where is a complex Hilbert space, or that of vector-valued functions , where is a complex Banach space. However, all statements in this article are kept in the context of matrix-valued functions so that the results are easier to read and appeal to a wider audience.
2. Maximum norm principles.
To find a suitable analog of the MMP for matrix-valued functions, it is reasonable to first test whether known proofs of the MMP can be easily adapted when replacing modulus with operator norm. One such proof of the MMP appears in [7, Chapter 10]. In it, the identity () appears, and although the operator norm of does not readily provide a direct analog for , the Frobenius norm does. In fact,333As usual, denotes the conjugate transpose of the matrix .
[TABLE]
when is the Frobenius (Hilbert–Schmidt) norm of , and an argument analogous to the proof of the MMP in [7] (that relies on (1)) gives the following result.
Theorem 1** (Maximum Frobenius Norm Principle).**
Let be a region of and let be analytic. If assumes its maximum at some , then for all .
Despite its provision of a direct analog of the MMP for matrix-valued functions, in applications, it is the operator norm that is of interest, not the Frobenius norm. Unfortunately, the conclusion of Theorem 1 need not hold when the Frobenius norm is replaced by another matrix norm. For example, let denote the open unit disk centered at the origin, let be analytic (e.g., ), and consider the matrix-valued function
[TABLE]
Notice that the operator norm of satisfies
[TABLE]
even though is not a constant function. Nevertheless, one can prove a weakened version for any norm.
Theorem 2** (Maximum Norm Principle).**
Let be a region of and let be analytic. If attains its maximum in , then is constant on .
Theorem 2 is well known and a proof can be found in [4, Section III.14]; we provide a different short proof based on a well-known consequence of the Hahn–Banach theorem on linear functionals, namely if is any normed space and is nonzero, then there is a bounded linear functional on such that and . For further details and a simple proof of this fact, see [7, Chapter 5].
Proof of Theorem 2.
Assume there is a such that for all and, without loss of generality, that . Then we can choose a bounded linear functional of norm so that . By continuity of and analyticity of , defines an analytic function on , and
[TABLE]
It follows now from the usual MMP that must be constant throughout and
[TABLE]
Thus, for all . ∎
The conclusion of the maximum norm principle above may be seen as unsatisfactory because it gives limited information about the structure of itself. This is not at all surprising; after all, the theorem holds for any norm. So, from now on we use the operator norm exclusively in an effort to gain more information about the function .
A useful property of the operator norm of a matrix is that given any , there is a unit vector , called a maximizing vector for , so that ; in other words, matrices attain their operator norm at some vector in the unit ball of . This is a consequence of the compactness of the closed unit ball of .
Recently, we rediscovered a maximum operator norm principle due to Brown and Douglas. In [2, Theorem 4], the authors proved that if is a nonconstant matrix-valued analytic function whose operator norm attains its maximum, then there is a direction in which is constant. Our version reads as follows.
Theorem 3** (Maximum Operator Norm Principle, cf. [2]).**
Let be a region of and let be analytic. If there is a so that for all and is a maximizing vector for , then for every . In particular, is constant on .
The conclusion444It is worth mentioning that our version of Theorem 3 also complements a result due to Daniluk in [3]. of Theorem 3 here is, at first sight, a slight improvement to that in Theorem 4 (part (1)) of [2]; after all, using a series expansion of , the condition for easily implies that is constant on . In fact, the reverse implication is also true and a justification can be made using series, too. On the other hand, although our series proof of Theorem 3 below is not as short as that of Brown and Douglas, it elucidates the consideration of maximizing vectors (see (5) below).
Proof of Theorem 3.
Let be such that . Then admits a power series representation on , say
[TABLE]
where for . For any vector ,
[TABLE]
by continuity of the inner product and so
[TABLE]
for any .
Now, since for all , it follows from (4) that
[TABLE]
for any vector and . Let be a maximizing vector for . Replace by in (5), and conclude
[TABLE]
and for every . In particular, by (3), for all and so, by the identity theorem (e.g., [7, Theorem 10.18]), for all . ∎
Remark**.**
Note that the conclusion of Theorem 3 alone implies that has a minimum at ; after all, if is constant on for some maximizing vector of , then
[TABLE]
Hence, the conclusion of Theorem 3 is stronger than that of the maximum norm principle (when using the operator norm) because it implies that any maximizing vector for is also a maximizing vector for , and has constant norm equal to that of for all .
The observation made in the remark leads one to the following factorization.
Theorem 4**.**
Let be a region of and let be analytic. If there is a so that for all , then there are (constant) unitary555Recall that is said to be unitary if . matrices and , and an analytic function , such that
[TABLE]
Roughly, in the case of matrices, Theorem 4 states that when is nonconstant, analytic, and achieves its maximum operator norm, say equal to , at a point of a region, then there is a nonconstant analytic function such that
[TABLE]
up to multiplication by (constant) unitary matrices on the right and the left. Hence, in a sense, the example given in (2) is essentially the only example of a nonconstant matrix function whose operator norm achieves a maximum value of .
Proof of Theorem 4.
Without loss of generality, we assume . By Theorem 3, if is a maximizing vector for , then the vector function is constant on . Recalling that for any and choosing , we obtain
[TABLE]
Let and be (constant) unitary matrices whose first columns are and , respectively. Then, in matrix blocks,
[TABLE]
where . Furthermore, as and are unitary, (or, alternatively, this follows by the remark following the proof of Theorem 3). This implies that
[TABLE]
because the operator norm of an matrix is an upper bound on the Euclidean (vector) norm of its columns and rows. In other words, the assumptions on imply the existence of constant unitary matrices and so that
[TABLE]
where is an analytic matrix-valued function. Thus, the desired conclusion follows with , , and . ∎
3. Maximum singular value principles.
An attractive feature of Theorem 4 is that it lends itself to iteration. Indeed, the lower right block in (8) may very well satisfy the assumptions of Theorem 4 just as did. In this section, we explore this situation and its consequences, but first review some basic terminology and results concerning singular values.
We begin with the observation that the maximizing vectors for a matrix admit the characterization that is a maximizing vector for if and only if has norm and . More generally, for a vector (whether it has norm or not),
[TABLE]
A proof of (7) can be based on the fact that every positive semi-definite matrix has a unique positive semi-definite square root (e.g., see [5, Theorem 7.2.6]). To that end, first note that the inequality valid for all vectors is equivalent to stating that the matrix is positive semi-definite. So, holds if and only if , or equivalently, . Hence, is a maximizing vector of if and only if it is an eigenvector of of norm , i.e., (7) holds.
The role played in Theorem 3 by maximizing vectors for a matrix and their alternative characterization as eigenvectors lead directly to the consideration of singular values.
Recall that the singular values , , of an matrix are the nonnegative square roots of the eigenvalues of ordered in the nonincreasing order, that is,
[TABLE]
In particular, (see (7)) and . The latter can be deduced using any singular value decomposition (SVD) of (e.g., [5, Theorem 7.35]) and (1).
The following result is a simple consequence of Theorem 4.
Theorem 5**.**
Let be a region of and let be analytic. Suppose that, for each , the function attains its maximum value on . Then is constant on .
In Theorem 5, the assumption does not require that the functions ,, attain their maximum values at the same point666In fact, if the functions ,, attain their maximum values at the same point , it follows already from Theorem 1 that must be constant on . of ; they may assume their respective maxima at distinct points ,,.
Proof of Theorem 5.
Our proof is by induction on . When , the desired conclusion holds by the MMP. So, suppose that the result holds for with . We now show that it also holds for .
Suppose is analytic, and the function attains its maximum value on for each . Let be such that for all . By Theorem 4, there are (constant) unitary matrices and , and an analytic function , such that
[TABLE]
In particular, attains its maximum value on for each . By the inductive hypothesis, must be constant on and, consequently, is also constant. ∎
At first sight, the assumption in Theorem 5 that every function attains its maximum value on for appears to be different from saying that attains its maximum value on in the maximum Frobenius norm principle above. Based upon the results above one may conclude that they are in fact equivalent!
Corollary 6**.**
Let be a region of . The following statements are equivalent for an analytic function .
- (1)
* is constant on .* 2. (2)
For every , is constant on . 3. (3)
For every , attains its maximum value at some . 4. (4)
* is constant on .* 5. (5)
* attains its maximum value at some .*
Proof.
It is evident that , , , and . The only nontrivial implications and are consequences of the maximum Frobenius norm principle and Theorem 5, respectively. ∎
In view of Corollary 6 (or Theorem 5), if is region of and is a nonconstant analytic function such that attains its maximum on , then there is a largest integer such that the functions , , attain their maximum values on . In this case, up to multiplication by (constant) unitary matrices on the right and the left, has the block form
[TABLE]
for some (necessarily nonconstant) analytic function .
A closer look at the proof of Theorem 5 also reveals the following refinement of the maximum norm principle. We omit the details.
Corollary 7**.**
Let , let be a region of , and let be analytic. Suppose that, for each , the function attains its maximum value on . Then is constant on for each .
Note that for an arbitrary , it may happen that is constant while is not when . For example, the function defined by
[TABLE]
has and for all .
As seen in its proof, the key to obtaining the conclusion of Theorem 4 relies on choosing a maximizing vector for . The following theorem is a refinement of Theorem 4 that relies on choosing instead “all maximizing vectors” for .
Theorem 8**.**
Let be a region of and let be analytic. Suppose there is a so that for all and set777Equivalently, is the dimension of the subspace spanned by the “right-singular vectors” associated with the largest singular value of .
[TABLE]
Then there are unitary matrices and such that
[TABLE]
or, for some analytic function ,
[TABLE]
In particular, is constant and for all when satisfies .
Note that one could apply Theorem 8 again to the lower-right matrix-block function appearing in (10). More definitively, if attains its maximum at , then is the largest integer such that
[TABLE]
attains its maximum at , and is the largest integer such that
[TABLE]
then up to multiplication by (constant) unitary matrices on the right and the left, has the block form
[TABLE]
Hence, if every function attains its maximum at some point of then, up to multiplication by (constant) unitary matrices on the right and the left, admits the block form
[TABLE]
and is hence a constant matrix, as expected by Theorem 5.
Likewise, a completely analogous argument reveals that the refinement of the maximum norm principle in Corollary 7 is also a consequence of Theorem 8, because for and for . We leave the details to the reader.
Proof of Theorem 8.
Let be the diagonal matrix whose main diagonal entries are the singular values of listed in nonincreasing order. Then we may let and be unitary matrices such that (i.e., an SVD for ). Let denote the largest positive integer such that . Note that, by (7), a vector satisfies if and only if , or equivalently, belongs to the linear span of first columns of because is unitary. Thus, with as in (9).
Now, consider the function . Clearly, is analytic on and satisfies
[TABLE]
Since the norm of every column (and row) of a matrix is bounded by its operator norm, the modulus of every (analytic) entry is also bounded by . Moreover, if , then and so for all by the (usual) MMP. In particular, the first columns and rows of have norm at least . Therefore, when and . In other words, using matrix blocks, this shows that
[TABLE]
for some analytic function when , while when . This completes the proof of (10).
Finally, if denotes the standard basis for and , then the th column of satisfies and
[TABLE]
for . Thus, is constant and whenever belongs to the linear span of first columns of , or equivalently, when satisfies . ∎
4. Minimum singular value principles.
In the case of nonconstant scalar-valued functions, the MMP tells us that the minimum modulus (of an analytic function on a region) can only be attained at a zero of the function. This conclusion is often called the minimum modulus principle in complex analysis. As a consequence of Theorem 5, we state and prove an analog of that minimum principle in the context of matrix-valued functions.
Theorem 9**.**
Let be a region of and let be a nonconstant analytic function. Then no point can be a minimum value for all of the functions , , unless is not invertible.
Proof.
We prove that if there is a such that is invertible and the functions attain their minimum at for , then must be a constant function.
To begin, recall that the collection of invertible matrices is open. This implies must be invertible for all sufficiently close to . So, exists in some neighborhood of , is nonzero and analytic on , and the adjugate (or transpose of the cofactor matrix) of is analytic on . Thus, is analytic on as well.
By the singular value decomposition, at each , the singular values of are the reciprocals of those of ; more specifically,
[TABLE]
Therefore, the assumption of the theorem is equivalent to stating that the functions attain a maximum on at . By Theorem 5, and must be constant on . Finally, applying the identity theorem (e.g., [7, Theorem 10.18]) to each entry of implies that is constant throughout , as desired. ∎
Corollary 10**.**
Let be a region of and let be a nonconstant analytic function. If every function , , attains a minimum value at , then .
Remark**.**
Notice that if and only if is a zero of ; indeed, with an SVD of , we see that
[TABLE]
Thus, Corollary 10 states that if every function , , attains a minimum value at , then .
To illustrate Theorem 9, it suffices to take as in (2); indeed, the functions and attain their respective minimum values at any zero of and is certainly not invertible.
In light of Theorems 5 and 9, one may ask whether the singular values of a matrix-valued analytic function could attain minimum values at distinct points. The following result gives an affirmative answer.
Theorem 11**.**
If denotes the function defined by
[TABLE]
then has a minimum at and has a minimum at .
Proof.
The remark following the proof of Theorem 3 shows that has a minimum at ; indeed, is constant when . On the other hand, if , then
[TABLE]
satisfies because . In particular, has a minimum at . ∎
Finally, it is worth mentioning that a singular value of a matrix function may attain its minimum value at specified locations. For instance, when and are analytic, the function defined by
[TABLE]
satisfies at every zero of and .
5. Return to the resolvent and matrix exponential.
With the wisdom acquired about the norms and singular values of analytic matrix-valued functions, we now return to the resolvent and matrix exponential of a given matrix. To simplify our notation, let denote the resolvent of at , i.e.,
[TABLE]
Also, set .
Let be a region of . By Theorem 5, the singular values and likewise cannot all attain a maximum value on as functions. Recalling that
[TABLE]
it follows that the functions cannot all attain a maximum nor a minimum on ; in fact, this holds for and , respectively, as shown below.888The first inequality in Theorem 12 was observed by Daniluk [3] for resolvents of operators on a complex Hilbert space.
Theorem 12**.**
If and is any region of , then
[TABLE]
In particular, the functions and are nonconstant on .
Proof.
To obtain a contradiction, assume instead there are points such that
[TABLE]
[TABLE]
when and are maximizing vectors for and , respectively. On the other hand, as
[TABLE]
for any , we have
[TABLE]
and clearly . However, these equations, together with (15), imply that
[TABLE]
or , which are impossible because . ∎
We now turn to the matrix exponential. Recall that given , the matrix exponential of is the matrix defined by
[TABLE]
It is not difficult to verify that the series above converges for any (say, under the operator norm), and is invertible in with inverse .
For , we see that the map is a well-defined matrix-valued function, analytic on the entire complex plane , and
[TABLE]
Furthermore, a straightforward verification999Indeed, when , the function has operator norm equal to , which tends to zero as , and so the integral over of its derivative equals the identity matrix. reveals that
[TABLE]
while term-by-term integration of the power series representations for the exponential and the resolvent gives
[TABLE]
where denotes any circle of radius centered at the origin.
In addition to the intimate relationship between the resolvent and the matrix exponential (as described in (16) and (17)), intuition from the case of scalar-valued functions may suggest that, in analogy to Theorem 12, the functions and should not attain their maximum and minimum values, respectively,101010After all, for fixed , cannot attain its maximum nor minimum values over any region . over any region of . This is in fact false. Notice that
[TABLE]
provides a counterexample; indeed, computation reveals that
[TABLE]
Thus, and are constant when and , respectively.
Finally, we would like to propose a question for further investigation. Given an analytic function such that attains its maximum in , Theorem 8 not only describes the structure of , it also implies that for all . So, in a sense, it is rare for to attain its maximum. Instead, what may be less rare is for to attain a minimum value (see Theorem 11).
In fact, the remark made after the proof of Theorem 3 already gives a sufficient condition for to have a minimum at , namely when is constant for some maximizing vector for . Furthermore, by completely analogous reasoning, a sufficient condition for to attain a maximum at is that is constant for some minimizing111111A vector is said to be a minimizing vector for if . vector for . For example, in light of this, it may be verified that has a minimum at and has a maximum at when is the function in (12). This leads one to wonder what necessary and sufficient conditions permit to attain a minimum and to attain a maximum over a region ? Is it more attainable to consider the special case ? How about when ?
Acknowledgments. I wish to thank Cara D. Brooks whose valuable comments helped improve the exposition of the paper tremendously. Also, I am grateful to Nicholas Seguin whose sustained interest in this project helped me see it to completion. Finally, I thank the referees and editor for their helpful suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Brooks, C. D., Condori, A. A. (2018). A resolvent criterion for normality. Amer. Math. Monthly. 125(2): 149–156.
- 2[2] Brown, A., Douglas, R. G. (1965). On maximum theorems for analytic operator functions. Acta Sci. Math. (Szeged). 26: 325–327.
- 3[3] Daniluk, A. (2011). The maximum principle for holomorphic operator functions. Integral Equations Operator Theory. 69(3): 365–-372.
- 4[4] Dunford, N., Schwartz, J. T. (1958). Linear Operators. I. General Theory. New York–London: Interscience Publishers.
- 5[5] Horn, R. A., Johnson, C. R. (1985). Matrix Analysis. Cambridge, UK: Cambridge Univ. Press.
- 6[6] Nikolski, N., Vasyunin, V. (1998). Elements of spectral theory in terms of the free function model. I. Basic constructions. In: Sheldon, A., Mc Carthy, J. E., Sarason, D., eds. Holomorphic Spaces (Berkeley, CA, 1995). Math. Sci. Res. Inst. Publ., vol. 33. Cambridge, UK: Cambridge Univ. Press, pp. 211–302.
- 7[7] Rudin, W. (1987). Real and Complex Analysis, 3rd ed. New York, NY: Mc Graw-Hill Book Co.
- 8[8] Trefethen, L. N. (1992). Pseudospectra of matrices. In: Griffiths, D. F., Watson, G. A., eds. Numerical Analysis 1991 (Dundee, 1991). Pitman Res. Notes Math. Ser., 260. Harlow: Longman Scientific & Technical and New York: John Wiley & Sons, Inc., pp. 234–266.
