The star-center of the quaternionic numerical range
Lu\'is Carvalho, Cristina Diogo, S\'ergio Mendes

TL;DR
This paper proves that the quaternionic numerical range is always star-shaped, identifies its star-center, and describes its geometric structure using tangents to the lower bild, advancing understanding of quaternionic matrix analysis.
Contribution
It establishes the star-shaped property of the quaternionic numerical range and characterizes its star-center through geometric analysis, a novel contribution in quaternionic matrix theory.
Findings
Quaternionic numerical range is always star-shaped.
Star-center determined by equivalence classes of the bild's star-center.
Geometric shape of the star-center's upper part is defined by two tangent lines.
Abstract
In this paper we prove that the quaternionic numerical range is always star-shaped and its star-center is given by the equivalence classes of the star-center of the bild. We determine the star-center of the bild, and consequently of the numerical range, by showing that the geometrical shape of the upper part of the center is defined by two lines, tangents to the lower bild.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The star-center of the quaternionic numerical range
Luís Carvalho
Luís Carvalho, ISCTE - Lisbon University Institute
Av. das Forças Armadas
1649-026, Lisbon
Portugal
,
Cristina Diogo
Cristina Diogo, ISCTE - Lisbon University Institute
Av. das Forças Armadas
1649-026, Lisbon
Portugal
and
Center for Mathematical Analysis, Geometry, and Dynamical Systems
Mathematics Department,
Instituto Superior Técnico, Universidade de Lisboa
Av. Rovisco Pais, 1049-001 Lisboa, Portugal
and
Sérgio Mendes
Sérgio Mendes, ISCTE - Lisbon University Institute
Av. das Forças Armadas
1649-026, Lisbon
Portugal
and Centro de Matemática e Aplicações
Universidade da Beira Interior
Rua Marquês d’Ávila e Bolama
6201-001, Covilhã
Abstract.
In this paper we prove that the quaternionic numerical range is always star-shaped and its star-center is given by the equivalence classes of the star-center of the bild. We determine the star-center of the bild, and consequently of the numerical range, by showing that the geometrical shape of the upper part of the center is defined by two lines, tangents to the lower bild.
Key words and phrases:
quaternions, numerical range, star-shapedness
2010 Mathematics Subject Classification:
15B33, 47A12
The second author was partially supported by FCT through project UID/MAT/04459/2013 and the third author was partially supported by FCT through CMA-UBI, project PEst-OE/MAT/UI0212/2013.
1. Introduction
Let denote the skew-field of Hamilton quaternions. Let be a matrix with quaternionic entries. It is well known that the numerical range is a connected but not necessarily convex subset of the quaternions. The group of unitary quaternions acts on by automorphisms. Since every class , , has a representative in and each class of is contained in , it became clear from the early studies of the quaternionic numerical range that it is enough to study the bild of , or the upper-bild . The latter has the advantage of being always convex whereas is convex if, and only if, is convex, see [Zh, page 53] and theorem 3.1. The convexity of the numerical range, the bild and upper bild has been studied by several authors, see [AY1, AY2, R, ST, STZ].
In the complex setting the numerical range is convex thanks to the celebrated Toeplitz-Hausdorff Theorem [GR]. Over the time, several generalizations of the numerical range have been proposed, namely the C-numerical range, the joint numerical range, among others, and in these cases convexity may fail. It then becomes natural to look for convexity-like geometric properties. For instance, the property of star-shapedness has been studied in [CT, LLPS18, LLPS19, LNT, LP]. We recall that star-shapedness of a set only requires that there is an element such that every segment connecting and any other element of must be contained in , see definition 2.1. Accordingly, we say that is in the star-center of .
For some generalizations of the numerical range, the star-shapedness of the (complex) numerical range holds under certain conditions. In the article we tackle the question of the star-shapedness in the quaternionic setting. We prove that the quaternionic numerical range is always star-shaped. In addition, we characterize the shape of the star-center for quaternionic matrices.
The star-shapedness of the numerical range is a consequence of two simple facts (see theorem 3.4). Firstly, the convexity of the upper and lower bilds imply that the segments whose end is a real element of the bild is contained in the bild. Therefore the bild is star-shaped and the reals therein are part of its center. And secondly, the equality, up to isomorphism, of all two dimensional real subalgebras of the quaternions that include the reals (as a real subspace), leads us to the conclusion that the reals in are in fact part of the (star) center of the numerical range.
As mentioned before, the general reason to focus on the bild is that the whole numerical range can be reconstructed from it by using similarity classes. Our result is in line with the elements of the bild being the building blocks of the numerical range. In fact, we prove in theorem 3.8 that the center of the numerical range is given by the similarity classes of the center of the bild. Therefore, we only need to know the center of the bild, and then to build the similarity classes to obtain the center of the numerical range. When the matrix is non hermitian the upper center (likewise for the lower center) is the region of the upper bild limited by two lines. These two lines are the tangents to the curve defining the boundary of the lower bild at the reals, see theorems 4.1, 4.3 and corollary 4.5. As a consequence of these results we establish a new proof of the important theorem by Au-Yeung [AY1, theorem 3], which establish a necessary and sufficient condition for convexity of the numerical range, see corollary 4.4. We conclude with an example where we explicitly compute the center.
2. Preliminaries
The quaternionic skew-field is an algebra of rank over with basis , where the product is given by . For any we denote by and , the real and imaginary parts of , respectively. Let the pure quaternions be . The conjugate of is given by and the norm is defined by . Two quaternions are called similar, if there exists a unitary quaternion such that . Similarity is an equivalence relation and we denote by the equivalence class containing . A necessary and sufficient condition for the similarity of and is given by , see [R, theorem 2.2.6]. We will denote the set of all equivalence classes of the elements of a set by . Then,
[TABLE]
Let be the -dimensional -space. The norm of is . The disk with center and radius is the set and its boundary is the sphere . In particular, if and , we simply write and . With this notation, the group of unitary quaternions is whereas denotes the unit sphere over the pure quaternions.
Let be the set of all matrices with entries over . The set
[TABLE]
is called the quaternionic numerical range of in . From the above definition we see that the quaternionic numerical range of is the subset of containing the images of the quadratic function over the quaternionic unitary sphere, . The numerical range is invariant under unitary equivalence, i.e.
[TABLE]
for every unitary [R, theorem 3.5.4].
It is well known that if then , see [R, page 38]. This means that if and then . For simplicity we just say that belongs to by similarity. Therefore, it is enough to study the subset of complex elements in each similarity class. This set is known as , the bild of :
[TABLE]
We will freely use both notations and for the bild of . Although the bild may not be convex, the upper bild is always convex, see [ST]. Analogously, the lower bild is also always convex. Note that , and .
For , let . For any and , let be the representative of the class in , that is,
[TABLE]
In particular,
[TABLE]
and we can write .
Let be a real subspace of . We denote by the canonical -linear projection .
For we will denote by the set of convex linear combinations of and :
[TABLE]
Definition 2.1**.**
Let be a subset of a vector space. We say the set is star-shaped if there is a vector such that . The star-center of a set is defined to be
[TABLE]
For simplicity, we refer to the star-center of a set as the center.
3. Star-shapedness of the bild and numerical range
The upper bild and the bild fully specify the numerical range, but the first is considered better suited to represent the quaternionic numerical range. This is not only because it is convex but also because it has the advantage of containing one single element from each similarity class. In a sense, the upper bild can be interpreted as the set of equivalence classes for the similarity relation , that is, the quotient set . However, from the convexity of the upper bild we cannot infer about the convexity of the numerical range, as the first is always convex and the latter is not.
The first result of this paper relates the convexity of the bild with the convexity of the numerical range. This is a known result (see [Zh, page 53]), however we present a different proof based on elementary properties of the numerical range.
Theorem 3.1**.**
Let . Then is convex if and only if is convex.
Proof.
It is enough to prove that, if is convex then is convex. Let and . We need to show that . The quaternion has and
[TABLE]
We will prove that , thus proving by similarity that . Since the upper bild is convex,
[TABLE]
By similarity, . Note that . From (3.2), and from (3.1), . Therefore,
[TABLE]
and so there is such that . Hence, . By hypothesis, is convex and so . ∎
Any quaternionic matrix can be written as , with hermitian and skew-hermitian. Let be the unitary matrix that diagonalize , i.e., . Since the numerical range is invariant under unitary equivalence, we can work with , that can be written in the form . Since is hermitian and since is skew-hermitian the real part of is zero, see [R, corollary 3.5.3].
We claim that . To prove this we will find a vector such that . Let , then take and in such that and . The quaternions and are either zero or the representatives of in and in , respectively. Thus they are pure complex. Finally, choose such that . Take and . Then, the vector is in the stated conditions. It is now clear that . In fact, take vector and compute . We have proved the following result. 111This result is apparently known for some time, as it appears in the thesis of [Siu], supervised by Au-Yeung, however it has never been published before, (to the best of our knowledge). In spite of this, Au-Yeung in [AY1, corollary 1] apropos of the connectedness of , and citing a result from [J], states the possibility of . This possibility is also stated by [Zh, theorem 9.2] and [K, corollary 2.10], repeating again the same result by [J], (although [K] doesn’t cite it).
Proposition 3.2**.**
For any , .
From now on, we fix a matrix with quaternionic entries, , and we denote the quaternionic numerical range of simply by .
Let . We say an element is -similar to , if and only if, for some ,
[TABLE]
in which case we write We say that and are -similar, and denote it by , if and only if, for any there is an such that , and vice versa. When two sets are -similar they share some properties, namely convexity. In fact, if is convex we can conclude that is convex. Take any . Then, there are , such that and . For any it is a matter of simple calculations to note that
[TABLE]
Now, since and is convex we conclude that . Therefore is also convex. A similar argument proves that the centers are -similar for any two -similar sets and , since whenever a segment is in the -similar segment must be in . That is, whenever .
Define, for , and
Lemma 3.3**.**
For any we have:
- (i)
* are convex,* 2. (ii)
\mathscr{C}\Big{(}W^{(q_{1})}\Big{)}\dot{\sim}\mathscr{C}\Big{(}W^{(q_{2})}\Big{)}.**
Proof.
The numerical range is such that, by similarity, , for any . It is also an immediate conclusion of numerical range’s closedness to similarity that , for any . It is known that the upper bild is convex, thus from the previous discussion, we have that is also convex for any . Moreover from we know that \mathscr{C}\Big{(}W^{(q_{1})}\Big{)}\dot{\sim}\mathscr{C}\Big{(}W^{(q_{2})}\Big{)}. ∎
As a consequence of this lemma we only need to study the center of one of the ’s and the natural choice is to take , that is, we only need to study the center of the bild .
Theorem 3.4**.**
The quaternionic numerical range is star-shaped and .
Proof.
By proposition 3.2, there is . For every , there is such that . Since and using lemma 3.3 we have that . Hence, the numerical range is star-shaped. Moreover, . ∎
The numerical range is contained in if, and only if, is hermitian, see [R, corollary 3.5.3].. The next result follows trivially from theorem 3.4.
Corollary 3.5**.**
If is hermitian then .
Lemma 3.6**.**
The center of the bild is closed under conjugation, i.e.
[TABLE]
Proof.
Assume . Let be any element of the bild . Since the bild is closed for conjugation, . Then being in the center implies that , for any . And again using the bild’s closedness to conjugation we conclude that . Since this is true for any , . The converse inclusion follows similar steps. ∎
We now establish the equality between the center of the bild and the complex part of the center of the numerical range.
Proposition 3.7**.**
We have:
[TABLE]
Proof.
The inclusion is obvious since a complex element in the center of must be in the center of .
For the converse inclusion, starting with , we will prove that for any and .
We can assume, without loss of generality, that . Since any quaternion can be written as the sum of a real with a pure quaternion, we may write , with . We have:
[TABLE]
By similarity, it is enough to prove that . With this purpose, we will find two elements such that
[TABLE]
In this case, by convexity of the upper bild, since for some . Let
[TABLE]
The conclusion that follows from the fact that , which is a convex set.
If we take , else we take (clearly, ).
We now need to check that and are in and satisfy conditions (3.3). It is trivial to conclude that the real parts are all equal. On the other hand,
[TABLE]
To conclude that we will use Cauchy-Schwartz inequality. If we look a quaternion as a vector in , its norm is given by , where is the usual inner product in real vector spaces. Then we have:
[TABLE]
Since and, we have:
[TABLE]
Using the equality (\alpha c_{(i)}+(1-\alpha)w_{(i)}^{*}\Big{)}_{v}=\alpha|c_{(i),v}|i-(1-\alpha)|w_{(i),v}|i, it follows that
[TABLE]
Therefore,
It remains to prove that . If , by hypothesis and , then any convex combination of them is also in . If then , because by lemma 3.6, and . ∎
Next result establish the relation between the center of the numerical range and the center of the bild .
Theorem 3.8**.**
The center of the numerical range is such that
[TABLE]
Proof.
Let . For some , we have . Using a similar reasoning of the proof of proposition 3.7, we can show that
[TABLE]
Now, if and only if , for some , that is,
[TABLE]
By lemma 3.3, . We conclude that
[TABLE]
∎
If we use the fact that is the set of all elements similar to those in , that is, W=\Big{[}W\cap\mathbb{C}\Big{]}, the above result can be written in the following way:
[TABLE]
In other words, the operations of taking the center and of taking the equivalence classes of a numerical range commute.
4. Characterization of the center of the bild
We now know that it is possible to characterize the center of the numerical range from the center of the bild. On the other hand, lemma 3.6 guarantees that the lower part of the center of the bild is the conjugate of the upper part,
[TABLE]
and we conclude that to determine we only need to know . From corollary 3.5, we may focus only on non-hermitian matrices.
By the convexity of the upper bild, the segment joining any two elements in the upper bild is contained in it. Therefore an element of the upper bild is not in the center if and only if a convex combination with an element in the lower bild is not in the bild. That is, an element is not in the center of the bild, , if and only if, there is such that the segment connecting the two is not contained in the bild, i.e. . The argument we will use is build upon the fact that a segment, joining two elements of the bild, is not totally contained in the bild, if and only if it crosses the reals outside of it. Thus, either an element of the upper bild has all its segments , for , crossing the real line inside the bild, that is, , in which case is in the center, or there is one of these segments that crosses the real line outside the bild, and the element is not in the center.
For the rest of this section we will slightly change notation and write as . Let and be the minimum and maximum of the real elements in the bild. Using the previous reasoning, but on a dual perspective, to find out if an element in the upper bild is in the center, we only need to see if the segments joining to and to intersects the interior of the lower bild or not. In the case where it does the element is not in the center. For instance, if the segment joining to intersects at in the lower part of the interior of the bild, then there is an element to the left of such that the segment will cross the reals to the left of , and therefore outside of the bild.
The next results formalize this intuitive argument. To reach this we will need to define for each two lines, one denoted connecting to , and the other denoted connecting to . Since the real points of the numerical range belongs to the center (see theorem 3.4), it is enough to consider points , with . The lines are given by
[TABLE]
with and .
Let and . By symmetry of the bild, . Since the matrix is non-hermitian, .
We may define, for , two functions:
[TABLE]
Notice that and . According to [Roc, theorem 5.3], is convex and is concave. The lower bild may be written using and :
[TABLE]
The interior of the lower bild is given by:
[TABLE]
The next result gives a characterization of , when . For the lines and do not cross over the interior of the lower bild, if and only if, .
Theorem 4.1**.**
Let and let . Then, if, and only if,
[TABLE]
Proof.
We begin by observing the following. Let with . The line passing through and can be written as:
[TABLE]
and define two half planes:
[TABLE]
To prove that if , then we proceed by contrapositive.
Fix an element as before, i.e., with , and suppose there is an element (if the proof is analogous). Since , the line may also be written as
[TABLE]
Let be a neighborhood of . Then, there is such that and .
The line passing through and is
[TABLE]
Define the affine function by:
[TABLE]
Clearly, and . Since and , there is such that . Moreover, since is affine,
[TABLE]
and so, . Hence, the line passing through and does not intersect , which implies that and .
Now we prove the converse, that is, for if then Since , there is a point such that the line segment is not contained in the bild.
Assume that the line containing , call it , intersects the real line () at . Since then . Otherwise, if , convexity of the upper bild implies that and convexity of the lower bild implies that . Thus,
[TABLE]
which contradicts our hypothesis. We will assume that (when the proof is analogous). We claim that we can take . In fact, if is on the boundary of , take , with small enough such that and , close enough to in order to satisfy . In this way, there is a point such that .
Since then and must be in different half-planes, that is, and .
We now show that and are in different half-planes using the same reasoning of the first part of the proof, but now with being the line that passes through the points , and , and being the line that contains . It follows that . Since and , there is such that . Hence, and . It follows that and therefore
Let . Since and , then the line that joins to , by continuity of , passes through a point with , that is, . Taking into account , it only remains to prove that . Since , by convexity of we have
[TABLE]
for some . Note that if , and if , , that cannot happen because and . We know that , and so,
[TABLE]
From (4.4) and since , we have:
[TABLE]
With a similar reasoning and using that we see that . It follows that
[TABLE]
Now we need to check that . Since we know that , and so,
[TABLE]
We conclude that . ∎
Relying on our previous results, we will now prove the existence of two lines containing and that define the upper boundary of the center. Such lines are denoted respectively by and .
Any concave function has lateral derivatives [Roc, theorem 23.1], therefore let and , the left derivative at [math] of and , respectively.
Let the left tangent line to and at [math] be given by the sets
[TABLE]
respectively. Since is convex and is concave we have, [Roc, theorem 25.1], and , for every .
Proposition 4.2**.**
Let and let . Then,
- (i)
* if, and only if, ,* 2. (ii)
* if, and only if, .*
Proof.
We will prove (i). A similar reasoning proves (ii). Let with and be the line passing through and . We can write , with .
Now we will prove that if the line does not intersect . Since , it is clear that , i.e., . Since we have . For , and so
[TABLE]
For we have , since . From the convexity of and using [Roc, theorem 25.1] we have for any . Then, Therefore, with and from (4.3) we see that .
To prove the converse, we want to show that if then , that is, the line intersects . Again, we have , and therefore . Since , necessarily . Define, for ,
[TABLE]
By the first order Taylor’s approximation of , for small we get
[TABLE]
Since , it follows that , for small . In other words, for small enough. Taking into account that and that and are continuous [Roc, corollary 10.1.1], for small enough we have . Therefore, we can choose an such that and . Then . ∎
We can now present a general way to determine the center. Let and .
Theorem 4.3**.**
Let . Then, if, and only if, .
Proof.
When , proposition 4.2 and theorem 4.1 prove the stated equivalence. For the case we will first find out the and then prove the equality with the set .
When and the bild is a vertical segment then, clearly, and, in this case, . If but the bild is not a vertical line () then we claim the center is . To see this, first consider that with . Then and . Therefore . It remains to consider the case where , for some . There is with and . Assume, without loss of generality that has opposite sign of . Then there is a , such that . Clearly, , thus
[TABLE]
We concluded that and therefore that , for .
In the case where , for and , thus for any . Then
[TABLE]
When and , we know that and . Then . In the case where we have
[TABLE]
Therefore, and this is, in fact, the upper center of .
We now consider (the case where is the one where ). Since is convex and is concave we know that, using again [Roc, theorem 25.1], . As a consequence of we have that and thus
[TABLE]
that is, the lower bild is a line, and we can write it as the set
[TABLE]
Since the upper bild is the conjugate of the lower bild,
[TABLE]
Then the intersection of and , when is just . That is
[TABLE]
∎
A simple observation on the slope of the lines and allows us to give a different proof of the known result of Au-Yeung (see, [AY1, theorem 3]), which establishes an equivalent condition for the convexity of the quaternionic numerical range.
It is well known [Roc, theorem 23.1] that for any convex function of real variable and any fixed element in the domain of the function defined by
[TABLE]
is increasing with . Then any line that joins and in the graph of with has slope smaller than . Notice now that there is an element in the lower bild. Using the previous conclusion when the convex function is , the reference point is and , we conclude that
[TABLE]
[Roc, theorem 23.1], that is, has positive slope. For the case when we have
[TABLE]
since and . Thus has nonpositive slope.
Analogously, it can be shown that when , has negative slope and when , has nonnegative slope.
Corollary 4.4**.**
The numerical range is convex if and only if and .
Proof.
We begin by proving that if or , then the numerical range is non-convex. Suppose (the case is analogous). Let be the left tangent line to at [math] as in (4.5).
For , we have, by (4.6), . Notice that . Therefore, . Hence, we have found such that . From theorem 4.3 we have and so is not convex since . By theorem 3.1 we conclude that is not convex.
Now we prove that if and then the numerical range is convex. Recall that , with , see (4.7). For , we have . For every , we have .
Analogously, we can show that . From theorem 4.3 we have that . Since is arbitrary, we have that is convex and from theorem 3.1, is convex. ∎
An interesting case, where the center is a kite, is when and . The next corollary proves this result.
Corollary 4.5**.**
Let . Suppose there is such that . Then,
[TABLE]
Proof.
When , as we have noticed in (4.6), has positive slope. Similarly, we can show that has negative slope. Since passes through and through , and must cross at a point in . Let this point be . The result follows from theorem 4.3. ∎
The follow example illustrates a case where the center is a kite.
Example*.*
Following [ST, page 318], let A=\left[\begin{array}[]{cc}k_{1}i&\alpha\\ -\alpha&1+k_{2}i\end{array}\right], with and . In this case, the boundary of the lower bild consists of an ellipse and the segment , where and are the points where intersects the real axis (the notation in [ST] is and ). Our aim is to describe the center of the bild of .
From [ST, lemma 6.4], case , the ellipse contains the points , , and , where
[TABLE]
Moreover, we know that the vertical lines and are tangent to the ellipse at and , respectively. These data fully characterize the ellipse . Therefore, if we substitute those points in the general equation
[TABLE]
we obtain a homogeneous system of six linear equations with six unknowns. From formulas (4.8) one concludes that the linear system’s matrix has rank . Solving the linear system leads to the following characterization of :
[TABLE]
Taking the derivative in (4.9) with (recall that the left derivatives and exist), we get
[TABLE]
It is now possible, albeit a tedious computation, to define the lines and as in theorem 4.3 and characterize .
Let us consider a more specific example. Take A=\left[\begin{array}[]{cc}\frac{1}{8}i&\frac{1}{4}\\ -\frac{1}{4}&1+\frac{1}{8}i\end{array}\right], i.e. and . Then, the ellipse becomes
[TABLE]
or, in the reduced form,
[TABLE]
We have:
[TABLE]
The lines and are given by
[TABLE]
They intersect at \Big{(}\dfrac{1}{2},\dfrac{3}{8}\Big{)}, a point on the boundary of the bild of .
We conclude that the center of the bild of A=\left[\begin{array}[]{cc}\frac{1}{8}i&\frac{1}{4}\\ -\frac{1}{4}&1+\frac{1}{8}i\end{array}\right] is:
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AY 1] Y. Au-Yeung, On the convexity of the numerical range in quaternionic Hilbert space , Linear and Multilinear Algebra, 16 (1984), 93–100.
- 2[AY 2] Y. Au-Yeung, A short proof of a theorem on the numerical range of a normal quaternionic matrix , Linear and Multilinear Algebra, 39:3 (1995), 279–284.
- 3[AYS] Y. Au-Yeung, L. Siu, Quaternionic numerical range and real subspaces , Linear and Multilinear Algebra, 45 (1999), 317–327.
- 4[CT] W. Cheung, N.-K. Tsing, The C 𝐶 C -numerical range of matrices is star-shaped , Linear and Multilinear Algebra, 41:3 (1996), 245–250.
- 5[GR] K. Gustafson, D. Rao, Numerical Range , Springer-Verlag, New York, 1997.
- 6[J] J. Jamison, Numerical range and numerical radius in quaternionic Hilbert space , Doctoral Dissertation, Univ. of Missouri, 1972.
- 7[Ki] R. Kippenhahn, On the numerical range of a matrix , Translated from the German by Paul F. Zachlin and Michiel E. Hochstenbach. Linear Multilinear Algebra, 56:1-2 (2008), 185-225.
- 8[K] P. Kumar, A note on convexity of sections of quaternionic numerical range , Linear Algebra and its Applications, 572 (2019), 92-116.
