Generalized subdifferentials of spectral functions over Euclidean Jordan algebras
Bruno F. Louren\c{c}o, Akiko Takeda

TL;DR
This paper develops formulas for various generalized subdifferentials of spectral functions on Euclidean Jordan algebras, extending previous results and applying them to eigenvalue functions, with implications for nonsmooth optimization.
Contribution
It provides new formulas for regular, approximate, horizon, and Clarke subdifferentials of spectral functions, extending existing theory and analyzing the KL property in this context.
Findings
Formulas for regular, approximate, and horizon subdifferentials of spectral functions.
Extension of Clarke subdifferential formula under local lower semicontinuity.
Analysis of the Kurdyka-Lojasiewicz property and transfer of KL-exponent for spectral functions.
Abstract
This paper is devoted to the study of generalized subdifferentials of spectral functions over Euclidean Jordan algebras. Spectral functions appear often in optimization problems playing the role of "regularizer", "barrier", "penalty function" and many others. We provide formulae for the regular, approximate and horizon subdifferentials of spectral functions. In addition, under local lower semicontinuity, we also furnish a formula for the Clarke subdifferential, thus extending an earlier result by Baes. As application, we compute the generalized subdifferentials of the function that maps an element to its k-th largest eigenvalue. Furthermore, in connection with recent approaches for nonsmooth optimization, we present a study of the Kurdyka-Lojasiewicz (KL) property for spectral functions and prove a transfer principle for the KL-exponent. In our proofs, we make extensive use of recent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Generalized subdifferentials of spectral functions over Euclidean Jordan algebras
Bruno F. Lourenço Department of Statistical Inference and Mathematics, Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo 190-8562, Japan. ([email protected])
Akiko Takeda
Department of Creative Informatics, Graduate School of Information Science and Technology, University of Tokyo, Tokyo, Japan and RIKEN Center for Advanced Intelligence Project, 1-4-1, Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan. ([email protected])
Abstract
This paper is devoted to the study of generalized subdifferentials of spectral functions over Euclidean Jordan algebras. Spectral functions appear often in optimization problems playing the role of “regularizer”, “barrier”, “penalty function” and many others. We provide formulae for the regular, approximate and horizon subdifferentials of spectral functions. In addition, under local lower semicontinuity, we also furnish a formula for the Clarke subdifferential, thus extending an earlier result by Baes. As application, we compute the generalized subdifferentials of the function that maps an element to its -th largest eigenvalue. Furthermore, in connection with recent approaches for nonsmooth optimization, we present a study of the Kurdyka-Łojasiewicz (KL) property for spectral functions and prove a transfer principle for the KL-exponent. In our proofs, we make extensive use of recent tools such as the commutation principle of Ramírez, Seeger and Sossa and majorization principles developed by Gowda.
Keywords: spectral functions, generalized subdifferential, approximating subdifferential, Euclidean Jordan algebra, Kurdyka-Łojasiewicz inequality.
1 Introduction
Let be a function that is symmetric, i.e., does not change if we permute the coordinates of . Here, denotes the extended line . Now, let us consider a Euclidean Jordan algebra of rank , for example, the symmetric matrices. Then, can be extended in a natural fashion to a function over by defining for all
[TABLE]
where is the vector containing the eigenvalues of in nonincreasing order, i.e.,
[TABLE]
We call the spectral function induced by . Because is symmetric, it is known from the works of Baes [3], Sun and Sun [26], Jeong and Gowda [15] and others that several properties of are transferred to . For example, is convex if and only if is convex. The same goes for differentiability. Results of this type are sometimes called transfer results or transfer principles, e.g., [15].
Spectral functions are ubiquitous throughout optimization and recognizing that is a spectral function can make computing derivatives/subdifferentials of significantly simpler than if one tries to do so by scratch. This is because transfer principles usually come with formulae that relate the derivatives/subdifferentials of and .
Motivated by the needs of nonsmooth optimization, our goal in this paper is to obtain formulae for the regular, approximate and horizon subdifferentials of spectral functions without any extra assumptions such as local Lipschitzness. In nonsmooth optimization, the regular and approximate subdifferential are often used to express optimality conditions and in the analysis of algorithms. Also, conditions involving the horizon subdifferential are quite common to ensure that the function satisfies some desirable property. We will also obtain a formula for the Clarke subgradient with the assumption of local lower semicontinuity, which extends an earlier result by Baes [2]. We will use these formulae to compute the generalized subdifferentials of the eigenvalue functions in the context of Euclidean Jordan algebras, see Section 4.6.
Another motivation comes from the so-called composite optimization, where we wish to solve the problem
[TABLE]
and only is assumed to be smooth. It is common for the function to play the role of a “regularizer”, “penalty” or “barrier”. In those cases, is often a spectral function. Here are a few examples. In what follows, for , we denote its -norm by and the sum of the components with largest absolute value by .
[TABLE]
where is a positive parameter. When , is the regularizer. is a multiple of the classical self-concordant barrier for the symmetric cone associated to . The function maps to the sum of the eigenvalues of with smallest absolute value, which is an important function for dealing with rank constrained problems, see [9] and Section 4 in [10]. Here, we are expressing as a DC (difference of convex) function. We observe that are all spectral functions, while and are nonsmooth and nonconvex. In any case, under appropriate regularity conditions, a necessary condition for to be a local optimal solution to (OPT) is that
[TABLE]
where is the approximate subdifferential of at , see Exercise 8.8 and Theorem 8.15 in [24].
Yet another motivation for this work is that the approximate subdifferential is necessary in order to compute the so-called Kurdyka-Łojasiewicz (KL) exponent, which has been shown to control the convergence properties of many first-order methods as can be seen, for instance, in the classical work by Attouch, Bolte, Redont and Soubeyran [1]. For a recent discussion on this topic, see the work by Li and Pong [21].
While there are many criteria that can be used to show that a function satisfies the so-called KL-property, it is often highly nontrivial to compute the KL-exponent [21]. For instance, if we wish to compute the -exponent of , we have to analyze the approximate subdifferentials of , because , as can be seen in Exercise 8.8 of [24]. In this paper, although we will not compute the KL-exponent of itself, as an application of our results, we will show that if is a symmetric function and is the corresponding spectral function, then and share the same KL-exponent. Admittedly, this is not a very powerful result, but it seems to be beyond what can be proved directly with the results of [21] (see Remark 29) and we believe it is a first step towards a more comprehensive study of the KL-exponent of composite functions where one of the functions is spectral.
1.1 Previous works
Lewis [17, 18, 19] has discussed extensively the case of spectral functions over symmetric real matrices and Hermitian complex matrices. In particular, in [19], Lewis gave expressions for the regular, approximate and horizon subdifferentials of spectral functions over symmetric real matrices. A formula for Clarke subdifferentials was also given for the locally Lipschitz case.
Spectral functions over the algebra associated to the second order cone were initially studied by Fukushima, Luo and Tseng [8] and by Chen, Chen and Tseng [5]. In [5], there is a discussion of the Clarke subdifferential of locally Lipschitz spectral functions and Sendov [25] gave formulae for regular, approximate and horizon subdifferentials. Sendov also proved a formula for the Clarke subdifferential under the hypothesis of local lower semicontinuity.
In the general framework of Euclidean Jordan algebras, Baes [2, 3], Sun and Sun [26] and Jeong and Gowda [14, 15] proved several key results regarding spectral functions and the related notion of spectral sets. However, as far as we know, until now there were no results for the regular, approximate and horizon subdifferentials of spectral functions. Furthermore, results for the Clarke subgradient were only known in the locally Lipschitz case. Related to Clarke subgradients, we mention in passing that Kong, Tunçel and Xiu proved an expression for the Clarke subgradient of the orthogonal projection of the symmetric cone associated to a Euclidean Jordan algebra [16].
1.2 Contributions of this work
In this work, we have three contributions. The first is a meta-formula for the generalized subdifferentials of a spectral function. We will show that if is a spectral function induced by , then there is a formula that relate the generalized subdifferentials of and , see Theorems 17, 19 and 21.
A feature of our results is that we will never assume that the algebra is simple, which makes some results more general, but a bit harder to prove. Every Jordan algebra can be decomposed as a direct sum of simple algebras and simplicity is, in many cases, a harmless hypothesis. Previous work by Lewis [19] and Sendov [25] can be seen as containing results for specific cases of simple Euclidean Jordan algebras. However, because the generalized subdifferentials do not behave nicely with respect to partial subdifferentiation, there are cases where we cannot extend results from simple to general Euclidean Jordan algebras in a straightforward way. We emphasize that our results are directly applicable to a situation where, for example, is a direct product , where denotes the space of real symmetric matrices.
Our second contribution is providing formulae for the generalized subdifferentials of the function , which maps an element to its -th largest eigenvalue, see Theorem 25. We believe this is the first time such formulae are given in the context of Euclidean Jordan algebras.
Last, we will show a transfer principle of the KL-property for spectral functions and show that and must share the same KL-exponent, see Theorem 28.
This work is divided as follows. In Section 2, we review generalized subdifferentials. In Section 3, we overview the necessary concepts from the theory of Euclidean Jordan algebras. In Section 4, we develop and present our main results regarding generalized subdifferentials of spectral functions. Finally, in Section 5 we discuss the KL-property and KL-exponent of spectral functions.
2 Preliminaries
2.1 Notation
Given an element , we will denote its -th component by . We write for the cone of elements satisfying . We write for the nonnegative orthant, i.e., the elements such that for every . We will write for the group of permutation matrices. Given , we write for the stabilizer subgroup of , i.e.,
[TABLE]
The convex hull, the interior and the closure of a set will be denoted by , and , respectively. If is a function, the domain of (i.e., the elements for which is finite) will be denoted by . We assume that is furnished with the usual Euclidean inner product and the usual Euclidean norm .
2.2 Generalized subdifferentials
In this subsection, we recall a few notions of generalized subdifferentials. However, the discussion on the Clarke subdifferential will be postponed until Section 4.5. Let be a function and . We say that is a regular subgradient of at if
[TABLE]
The set of regular subgradients of at is denoted by and is called the regular subdifferential of at . From (1) it follows that if and only if for every there exists some such that implies
[TABLE]
We say that is an approximate subgradient (also called limiting subgradient) of at if there are sequences , such that every satisfies and the following limits hold:
[TABLE]
The set of approximate subgradients of at is denoted by and is called the approximate subdifferential of at .
We say that is an horizon subgradient of at if there are sequences , , such that every satisfies and the following limits hold:
[TABLE]
Here, indicates that all the are nonzero and that is a monotone nonincreasing sequence converging to zero. The set of horizon subgradients, called the horizon subdifferential, will be denoted by . In variational analysis, conditions involving the horizon subdifferential are quite common, e.g., see Corollary 10.9 in [24]. See also Section 8.B in [24] for examples of the subdifferentials discussed so far.
We will also make use of the following characterization of regular subgradients.
Proposition 1** (Rockafellar and Wets, Proposition 8.5 in [24]).**
Let . Then, if and only if, on some neighborhood of there exists a function such that
[TABLE]
In this paper, sometimes we will prove results that are valid for several different notions of subdifferential. In that case, we use the symbol as a placeholder for some unspecified subdifferential, e.g., see Theorem 17.
3 Euclidean Jordan algebras
Here, we give a brief overview of Jordan algebras and review the necessary tools to prove our results. More details can be found in Faraut and Korányi’s book [6] or in the survey by Faybusovich [7]. First of all, a Euclidean Jordan algebra is a finite dimensional real vector space equipped with a bilinear product and an inner product satisfying the following properties:
, 2.
, where , 3.
,
for all . We can always assume that a Euclidean Jordan algebra has an element that satisfies , for all . Such an element is called the identity element. An element satisfying is called an idempotent. A nonzero idempotent that cannot be written as the sum of two nonzero idempotents satisfying is called a primitive idempotent.
In a Euclidean Jordan algebra the following spectral theorem holds.
Theorem 2** (Spectral Theorem, see Theorem III.1.2 in [6]).**
Let be a Euclidean Jordan algebra and let . Then there are primitive idempotents satisfying and
[TABLE]
and unique real numbers satisfying
[TABLE]
The that appears in Theorem 2 only depends on the algebra and is called the rank of . The in Theorem 2 are called the eigenvalues of . Although unique, the eigenvalues of might be repeated and they are not necessarily in nonincreasing/nondecreasing order. We define the rank of as the number of nonzero ’s appearing in (3). The ordered set in Theorem 2 is called a Jordan frame for .
Here, we are using the notation instead of to emphasize that the order of the elements is taken into account, so, for example, and are different ordered sets. Although might have many different Jordan frames, the sum of primitive idempotents associated to some eigenvalue must be unique.
Proposition 3** (Unique sum of primitive idempotents, see Theorems III.1.1 and III.1.2 in [6]).**
Let and be two Jordan frames for . Suppose that
[TABLE]
Then, for every , we have
[TABLE]
We define the eigenvalue map as the map satisfying
[TABLE]
where . Here, denotes the -th largest eigenvalue of .
The trace map is defined as
[TABLE]
In fact, the trace map is a linear function. Furthermore, it can be shown that the function that maps to is an inner product satisfying Property of the definition Euclidean Jordan algebras. Henceforth, we shall assume that the inner product is given by
[TABLE]
Under this inner product, for all and elements of any Jordan frame are mutually orthogonal. That is, if is Jordan frame, then if .
The norm induced by is given by
[TABLE]
With that, any primitive idempotent satisfies . Furthermore, the map becomes a Lipschitz continuous function with Lipschitz constant , when is equipped with the usual Euclidean norm. We now summarize some important properties of .
Lemma 4** (Properties of the eigenvalue map).**
Let be a Euclidean Jordan algebra of rank and let be the eigenvalue map. The following properties hold.
* holds, for all .* 2.
For every , has directional derivatives along all directions. Furthermore, letting denote the directional derivative of at along , the following limit holds
[TABLE]
where .
Proof.
This was proved by Baes, see Corollary 24 in [3]. 2.
Baes showed that for every , the function that maps to its -th largest eigenvalue is directionally differentiable, see Theorem 36 in [3]. Therefore, all components of are directionally differentiable, so must also be directionally differentiable. Then, it is a general fact that a Lipschitz continuous function that is directionally differentiable everywhere must also satisfy the limit above, see Lemma 2.1.1 and Remark 2.1.2 in [13].
∎
3.1 Simultaneous diagonalization
Let be a Euclidean Jordan algebra of rank . Given , we denote by the Lyapunov operator associated to , which is the linear map satisfying
[TABLE]
Given another element , we say that and operator commute if
[TABLE]
holds. It is known that and operator commute if and only if they share a common Jordan frame , see Lemma X.2.2 in [6]. This means that there are mutually orthogonal primitive idempotents such that and
[TABLE]
where the and are the eigenvalues of and , respectively. More generally if is a Jordan frame for which can be expressed as linear combination of the , we say that diagonalizes . Therefore, the existence of a common Jordan frame for and means that and are simultaneously diagonalizable.
Here, the and that appear in the decomposition of and are not necessarily sorted in nondecreasing/nonincreasing order. However, reordering the , we may suppose that the are sorted in an nonincreasing order, i.e., , for all . With respect to this new ordering, we can write
[TABLE]
where is some permutation of . Because the idempotents in are orthogonal amongst themselves, we have for every
[TABLE]
With that in mind, we are going to introduce the function , which maps an element to its “diagonal” with respect the Jordan frame . That is, we have
[TABLE]
If is a frame that diagonalizes , then is, in fact, the eigenvalue vector of . Of course, might not be sorted in any particular way. However, for the specific and we have discussed so far, we have
[TABLE]
We are now going to introduce two more extra notations. We will denote by the set of common Jordan frames for for which . In other words, not only must be a common Jordan for and , but it must also be such that the eigenvalues of appear in nonincreasing order. Here, we emphasize that the eigenvalues of might appear in no particular order. By convention, if and do not operator commute, we will define . We observe that since , we have
[TABLE]
Furthermore, we will define . That is, is the set of Jordan frames of for which the eigenvalues of appear in nonincreasing order. We have for every .
We also need a map that plays the opposite role of . Let be the map that takes a vector in and constructs a “diagonal element” in according to , i.e.,
[TABLE]
We have , for every . We observe that, since is a Jordan frame, the eigenvalues of are precisely the .
3.2 The directional derivative of the -th largest eigenvalue
In this section, we will describe an expression proved by Baes [3] to compute the directional derivative of the -th largest eigenvalue. For that, we need to review the Peirce decomposition, the properties of quadratic maps in Euclidean Jordan algebras and, most regrettably, introduce more notation.
Let be an idempotent and . We define
[TABLE]
Now, let be an arbitrary element (not necessarily an idempotent), the quadratic map of is the linear map such that
[TABLE]
is always self-adjoint. With that, we have the following result.
Theorem 5** (Peirce Decomposition, see Proposition IV.1.1 and page 64 in [6]).**
Let be an Euclidean Jordan algebra of rank and let be an idempotent of rank . Then is decomposed as the orthogonal direct sum
[TABLE]
In addition, and are Euclidean Jordan algebras of rank and , respectively. The orthogonal projections on and are given by and , respectively.
Next, we move on to the necessary notation. The eigenvalues of might be repeated so, for instance, it could be the case that . The next notation corresponds to a way of assigning the indices to . That is, we need to map an index to its “relative position” with respect to the eigenvalues of that are equal to . Here, we will mostly follow the notation proposed by Baes in [3] and define for every , the integer which is such that
[TABLE]
Furthermore, if we will denote by the sum of the satisfying , i.e.,
[TABLE]
We remark that was used instead of in [3].
Example 6**.**
Suppose that the rank of is and the eigenvalues of are as follows.
[TABLE]
Then , because and are unique eigenvalues. We have , and , since are, respectively, the “first”, “second” and “third” eigenvalues of a group of three equal eigenvalues. Similarly, we have and .
We have , ,
[TABLE]
Finally, let be an Euclidean Jordan algebra and let . Then, the eigenvalues of as an element of might be different from the eigenvalues of seen as an element of . When it is necessary to make this distinction, we will denote the -th eigenvalue of seen as element of by
[TABLE]
The eigenvalue map of the algebra will be similarly denoted by . We have now all pieces in place to state the following theorem.
Theorem 7** (Baes, Theorem 36 in [3]).**
Let and consider the spectral decomposition of :
[TABLE]
Then the directional derivative of the -th largest eigenvalue of along the direction is given by
[TABLE]
where .
From Theorem 5, is the projection of in the algebra . Therefore, to compute we need to project on , and then compute the -th eigenvalue of the projection with respect the algebra , where is the “relative position” of the index with respect to the eigenvalues of that are equal to .
3.3 Spectral functions and sets
Let be a Euclidean Jordan algebra of rank and let be a function. We say that is a symmetric function if holds for every and every permutation matrix . Symmetric functions satisfy the following key relation between subdifferentials:
[TABLE]
whenever is or , e.g., Proposition 2 in [19]. We remark that (6) will be used often in this paper.
We denote by the spectral map induced by , which is the function defined as
[TABLE]
The function is well-defined, even if is not symmetric. However, if is indeed symmetric, many properties of are transferred to .
There is also a notion of spectral set. We say that is a symmetric set if for every . Then the spectral set induced by is defined as
[TABLE]
To conclude this subsection, we now move on to the notion of weakly spectral sets/maps, which was introduced by Gowda and Jeong in [12]. We say that a linear bijection is a Jordan algebra automorphism if
[TABLE]
The group of Jordan algebra automorphisms is denoted by . Then, a function is said to be weakly spectral if
[TABLE]
A set is said to be weakly spectral if holds for every . A spectral map/set must also be weakly spectral, but the converse is not true in general, see remarks in Section 3 of [12].
4 Transfer principles for generalized subdifferentials
We start with a description of our setting and a few conventions. Throughout Sections 4 and 5, denotes a Euclidean Jordan algebra of rank , the inner product of two elements of is given by (4) and the norm is the one induced by . Although we are using the same symbol to denote the Euclidean inner product on and the trace inner product on , there will be no confusion. The letters will always be reserved for elements of and for elements of .
Let be a spectral function induced by some symmetric function . Our first goal is to prove the following meta-formula:
[TABLE]
where is either , or .
Remark 8**.**
For the sake of dispelling any possible confusion, should be interpreted as , i.e., is the generalized subdifferential of at .
Proving (Transfer) will require several tools old and new, such as commutation principles [23, 12], majorization principles [11] and the formulae for the directional derivatives of the eigenvalue functions [3].
4.1 Commutation principles and generalized subdifferentials
The first step towards (Transfer) is proving that if is a spectral function and is any generalized subgradient of , then and must operator commute. For that, we will use a commuting principle proved by Ramírez, Seeger and Sossa [23].
Theorem 9** **(Ramírez, Seeger and Sossa111
Here, we are quoting the theorem as it appears in Gowda and Jeong’s paper [12] (Theorem 1.1 therein), since it is more suited to our purposes. [23]).
Suppose that is a spectral set and is a spectral function. Let be Fréchet differentiable. If is a local minimizer/maximizer of
[TABLE]
then and operator commute222We recall that is a local minimum if there exists a neighbourhood of such that holds for every ..
Recently, Gowda and Jeong showed that it is possible to weaken the hypothesis of Theorem 9 and consider weakly spectral sets/functions instead [12].
Theorem 10** (Gowda and Jeong [12]).**
The conclusion of Theorem 9 holds if is a weakly spectral set and is a weakly spectral function.
Using the variational characterization of the regular subdifferential, we can prove the following new result, which is more general than what is strictly necessary for proving (Transfer), but we believe it is still useful.
Proposition 11** (Operator commutativity for weakly spectral functions).**
Let be a weakly spectral function. Suppose
[TABLE]
where is either or . Then, and operator commute.
Proof.
First, we prove the result for the case . By Proposition 1, there exists a function such that , and for all near . We invoke Theorem 10 using , and . By the properties of , we have that is a local minimum of . Therefore, commutes with , so it must commute with too. In reality, there are some minor technical details we have overlooked, see the footnote333The functions in Theorem 10 are finite functions defined everywhere, whereas is an extended value function and is defined only in a neighbourhood of . To sidestep this, we define such that if and if . With that, is still a weakly spectral function. Next we need to extend to a function defined over which coincides with in some neighbourhood of . It is a classical fact that this can always be done and here we show briefly why. Suppose that is defined over some open set . Let be an open ball such that and over which is a local minimizer of . Next, pick any function that is smooth and such that is on the compact set and [math] outside . Then, we define by letting if and if . With that, we have that and is a local minimum of restricted to . Then, as before, we can invoke Theorem 10 with , and . below.
Next, suppose instead that or . Then, there are sequences , such that every satisfies and the following limits hold.
[TABLE]
Here, there are two cases for . If , then for every . If , then .
Either way, because , from what we have proved so far, we have that and operator commute for every . That is,
[TABLE]
By taking limits, we conclude that must also hold. Therefore, and operator commute too. ∎
4.2 The easy inclusion
Next, we prove the inclusion “” in (Transfer), when .
Proposition 12** (The easy inclusion).**
Let be the spectral function induced by a symmetric function . Let . Then, and operator commute and for any we have
[TABLE]
Proof.
Let . By Proposition 1 there exists a neighborhood of and a function such that for all and , . In addition, by Proposition 11, and operator commute. Therefore, must be nonempty, i.e., and have at least one common Jordan frame.
Let and consider the linear map . Since is continuous, is an open set of containing . Now, let be such that
[TABLE]
Let . Using the symmetry of and the properties of , we obtain
[TABLE]
That is, holds for every . Also . By the chain rule, we also have . Therefore, by Proposition 1, we conclude that . ∎
4.3 The hard inclusion
The hard part of proving (Transfer) is establishing the inclusion “”, when . From Lewis’ discussion in [19], it seems that one of the key steps for proving (Transfer) in the case of symmetric matrices is a result relating the diagonal of a matrix with the directional derivative , see Theorem 5 in [19]. We will prove an analogous result by following an original approach making use of a recent majorization principle proved by Gowda in [11].
Let , we denote by the element in corresponding to a reordering of the coordinates of in such a way that
[TABLE]
Now, let be another element. Then, we say that is majorized by and write if
[TABLE]
and the sum of components of both and coincide, i.e., . It is a classical fact following from Birkhoff’s theorem that is majorized by if and only if lies in the convex hull of all permutations of , i.e.,
[TABLE]
see Section B in Chapter 2 of [22]. If we say that is majorized by and write if is majorized by . Whenever majorization principles are used, it is safer to mention the standard disclaimers that, throughout the literature, there seems to be no consensus on the direction of the inequalities appearing in the definition of majorization. In some texts, “” is used instead of “”. Here, we are following the convention in [11], which by its turn follows the notation in [4].
Let be a symmetric matrix. It is known that the diagonal entries of are majorized by the eigenvalues of . Gowda recently extended this fact to Euclidean Jordan algebras.
Proposition 13** (Gowda, Example 7 and Theorem 6 in [11]).**
Let be a Jordan frame and let . Then, is majorized by . In particular,
[TABLE]
Proof.
Consider the map defined by
[TABLE]
In [11], the map is denoted by “” and it has a different meaning from the map we are using in this paper. In any case, in Example 7 and Theorem 6 in [11], Gowda showed that holds for every . Accordingly, we have
[TABLE]
Now, we observe that the components of are precisely the eigenvalues of . Furthermore, the fact that a vector is majorized by does not change if we permute the entries of or . We conclude that and that ∎
We are now able to prove an analogous of Theorem 4 of [19] for Euclidean Jordan algebras.
Theorem 14** (The diagonal map and directional derivatives of the eigenvalue map).**
Let and let . Then
[TABLE]
First, we sketch the general proof strategy for Theorem 14. The idea is to separate the vector in blocks of equal eigenvalues and apply the formula in Theorem 7 for each block. Then, for each block, we associate a Euclidean Jordan algebra and invoke Proposition 13. Since Proposition 13 is invoked in a blockwise fashion according to the blocks of equal eigenvalues of , the resulting pieces can be glued together to obtain a convex combination of matrices in .
Proof.
To start, let us consider the spectral decomposition of ,
[TABLE]
where and is a Jordan frame. Now, we use the notation described in Section 3.2 and denote by the “relative position” of the index with respect the eigenvalues of that are equal to .
Next, let be such that
[TABLE]
Here, is the number of distinct eigenvalues of . For convenience, we define and for . Then, we divide in parts according to the blocks of equal eigenvalues of :
[TABLE]
where
[TABLE]
We do the same for the map and divide in maps such that
[TABLE]
Here, each is a map such that
[TABLE]
Applying Theorem 7 to each , we obtain
[TABLE]
where is the sum of the idempotents associated to the eigenvalues equal to and is the Jordan algebra of rank .
Let , for every . From Theorem 5, is the orthogonal projection of onto . The indices from to all correspond to equal eigenvalues of . Therefore, from (7) and the definition of the relative index , we conclude that
[TABLE]
where we recall that is the eigenvalue map of the algebra . Next, let . Since is a Jordan frame and the sum of the elements of is (the identity element of ), we have that is a Jordan frame in the algebra . We will now prove that . Let be an integer such that , we have
[TABLE]
where the second equality follows from the fact is self-adjoint and the third equality follows from the fact that since is the identity element in and is an idempotent contained in . Since this holds for every satisfying , we conclude that . From (8) and Proposition 13 applied to and , we conclude that for every , we have
[TABLE]
That is, there are nonnegative coefficients and permutation matrices such that
[TABLE]
We are now almost done. First, we define as the following matrix
[TABLE]
Next, we define as the matrix satisfying
[TABLE]
Because of (10), we have
[TABLE]
which together with (9) implies that
[TABLE]
Now, we consider an arbitrary matrix appearing in (11) which is of the form
[TABLE]
is a block diagonal matrix and since each block is a permutation matrix, is a permutation matrix too. Furthermore, by construction, the block structure of follows the pattern of equal eigenvalues of . So, for instance, has size , which corresponds to the first block of equal eigenvalues of . For this reason, we obtain
[TABLE]
Accordingly, belongs to and from (11) and (12), we conclude that
[TABLE]
∎
Next, we will prove the inclusion “” in (Transfer), when . With all the preliminary results in place, we can proceed analogously to Theorem 5 of [19].
Proposition 15** (The hard inclusion).**
Let be the spectral function induced by a symmetric function . Then
[TABLE]
Proof.
Let and be such that . Our goal is to show that In view of (2), will be established if we show that for every , there exists such that implies
[TABLE]
However, since diagonalizes , we have
[TABLE]
Therefore, our goal is to show that for every , there exists such that implies
[TABLE]
Now, we will set up a few objects that will help us towards proving (Goal). First, we observe that and (6) implies that
[TABLE]
Next, we define to be the convex hull of the with and denote by the corresponding support function. Since is generated by a finite number of elements, we have
[TABLE]
Now that the pieces are in place, we move on to proving (Goal). Let . From the definition of regular subgradients (see (1)) and from (2), for every , there exists such that implies
[TABLE]
In particular, if we let , we conclude that
[TABLE]
whenever . From item of Lemma 4 and decreasing if necessary, we have that if satisfies , it holds that
[TABLE]
By item of Lemma 4, . Therefore, if satisfies , we obtain from (13) that
[TABLE]
Since is the pointwise maximum of linear functions, is a Lipschitz continuous sublinear function with Lipschitz constant given by
[TABLE]
Therefore, for every , we have
[TABLE]
Now, we let and in (16) and use the resulting inequality back in (15), to obtain
[TABLE]
where the last inequality follows from (14).
By Theorem 14, we have
[TABLE]
Therefore, there are nonnegative numbers such that their sum is and
[TABLE]
where each belongs to . We recall that, by definition, for every and . Using the convexity of , we obtain
[TABLE]
Using inequality (18) in (17), we obtain that for every with , we have
[TABLE]
Since was arbitrary, this shows that (Goal) holds. ∎
4.4 Main results
From Propositions 12 and 15, we conclude that (Transfer) holds for the case . Next, will prove transfer results for the approximate and horizon subdifferentials which will conclude the proof of (Transfer).
Proposition 16** (The approximate and horizon subdifferentials of spectral functions).**
Let be the spectral function induced by a symmetric function . Then, for , we have
[TABLE]
Proof.
First, we prove the inclusion “” in (19) and (20). Let or . By definition, there are sequences such that holds for every and
[TABLE]
Here, there are two cases for . If , then for every . If , then . Since holds for every , Proposition 12 implies the existence of such that
[TABLE]
Let . Since for every and , passing to a subsequence if necessary, we may assume that for every , converges to some . Elementary properties of limits show that if and . Therefore is a Jordan frame in .
Now, we need to examine whether . We have
[TABLE]
Since each is a continuous function and , we conclude that
[TABLE]
An analogous argument shows that diagonalizes . Gathering all we have shown, we obtain that holds for every and
[TABLE]
That is, together with either (if ) or (if ).
We will now prove the inclusion “”. Let be such that there are sequences satisfying for every and
[TABLE]
where . Here, either for every or . Let .
For every , let be a permutation matrix such that . Since holds for every and is a symmetric function, we have from (6) that
[TABLE]
Let
[TABLE]
Let be the permutation on the set induced by , i.e., , if and only if, permutes the -th and the -th entries of a vector. We have and , where is defined as
[TABLE]
Therefore, from (21) we have
[TABLE]
which combined with Proposition 15 shows that
[TABLE]
Next, since , it follows that . Again, recalling that is a symmetric function and that
[TABLE]
we have , since . Similarly, we have , since . This shows that (if ) or (if ). ∎
We can now state our main result.
Theorem 17** (Generalized subdifferentials of spectral functions).**
Let be a Euclidean Jordan algebra of rank and let be the spectral function induced by a symmetric function . Then, for , we have
[TABLE]
whenever is or .
Proof.
Follows from Propositions 12, 15, 16. ∎
4.5 Convex hull of generalized subdifferentials and the Clarke subdifferential
In this subsection, we will prove the following meta-formula
[TABLE]
whenever is a subdifferential which behaves nicely with respect to permutations and for which (Transfer) holds. One of the motivations for this formula is, of course, the study of the Clarke subdifferential, which we will discuss next. First, we recall that is locally Lipschitz continuous at if there exists some neighbourhood of and a constant such that
[TABLE]
Using the construction of the Clarke subdifferential through the Bouligand derivative, Baes proved in his PhD thesis that, if is locally Lipschitz, then the meta-formula (Transfer) holds when is either the Bouligand or the Clarke subdifferential, see Proposition 4.5.1 and Theorems 4.5.4 and 4.5.5 in [2]. However, denoting by the Clarke subdifferential, it turns out that, under local Lipschitzness, we have
[TABLE]
see Theorem 9.61 in [24]. Therefore, with some effort, Theorem 17 can be used to give another proof that (Transfer) holds when is and is locally Lipschitz continuous. The first step towards this idea is the following result, which is a variant of Theorem 14.
Proposition 18**.**
Let be such that and operator commute. Then, for every and every we have
[TABLE]
Proof.
By Theorem 14, we already have
[TABLE]
All we need to do now is to relate and . For that, we will proceed as in the proof of Theorem 14.
Let us consider the spectral decomposition of according to ,
[TABLE]
Then, we use the notation described in Section 3.2 and denote by the “relative position” of the index with respect the eigenvalues of that are equal to . Furthermore, we let be the sum of the idempotents associated to the eigenvalues equal to . We also let be such that
[TABLE]
Here, is the number of distinct eigenvalues of . For convenience, we define and for . Then, we divide and in parts according to the blocks of equal eigenvalues of :
[TABLE]
First, we observe that if , then we have . Then, from the formula for the directional derivatives (Theorem 7) and the fact that diagonalizes , we obtain
[TABLE]
where . We recall that is the orthogonal projection of onto . And, again, because diagonalizes , we obtain
[TABLE]
which is the spectral decomposition of in the algebra . In particular, the eigenvalues of in the algebra are precisely the components of . We also need to recall that is, in fact, the -th largest eigenvalue of in the algebra .
Piecing everything together, we conclude that is just the result of sorting in nonincreasing order. Therefore, there exists a permutation matrix such that , for every . Then, if we let
[TABLE]
we have and since the block structure of follows the blocks of equal eigenvalues of , we have . From (22), we have
[TABLE]
since is a group. ∎
For what follows, we say that a subdifferential is permutation compatible if
[TABLE]
whenever is a symmetric function and . We note that all subdifferentials that have appeared so far in this paper are permutation compatible. With that, we are ready to prove the following meta-theorem which might be applicable to other subdifferentials not discussed in this paper.
Theorem 19** (Convex hull of generalized subdifferentials).**
Let be the spectral function induced by a symmetric function . Then, for , we have
[TABLE]
where is any permutation compatible subdifferential for which (Transfer) holds. In particular, if and is locally Lipschitz continuous at , then (Transfer) holds when .
Proof.
First we prove the “” inclusion. Suppose and are such that and is the convex combination of . Then, since (Transfer) holds, we have
[TABLE]
Because is a convex combination of the , we obtain .
Next, we prove the “” inclusion. Let . Since (Transfer) holds, there are and such that
[TABLE]
Let be a convex combination of , so that
[TABLE]
for some . Since and are pairs of simultaneously diagonalizable elements, the same must be true of the pair , see (5). We conclude that there exists . Now, we invoke Proposition 18 with and , to conclude that
[TABLE]
Because is permutation compatible, (23) implies that belongs to for every . Therefore, . A completely analogous argument for shows that
[TABLE]
Since is a convex combination of and , we conclude that, indeed,
[TABLE]
which proves the inclusion “”.
Finally, if is locally Lipschitz continuous at , the fact that the eigenvalue map is Lipschitz continuous (Lemma 4) shows that must be locally Lipschitz continuous at . Therefore,
[TABLE]
This shows that (Transfer) holds with . ∎
Next, we will take a look at the Clarke subdifferential of spectral functions without assuming local Lipschitzness, in order to extend Baes’ results. First, we will briefly explain some technical issues related to this task. In Theorem 8.9 of [24], we see that each of the generalized subdifferentials is associated to a corresponding notion of normal cone. In this context, the Clarke subdifferential is defined using the convexified version of the normal cone associated to , see Section J in chapter 8 of [24]. The problem is that, by doing so, the Clarke subdifferential can be larger than the convex hull of the approximate subdifferential. Therefore, in general, we have .
Nevertheless, under local lower semicontinuity, we have the following, see Lemma 4.1 in [20]. We recall that is said to be locally lower semicontinuous at , if is finite and there exists such that is closed for every satisfying , see Definition 1.33 in [24].
Lemma 20** (Lemma 4.1 in [20]).**
Suppose is locally lower semicontinuous at . Then,
[TABLE]
With the aid of Lemma 20, we are now in position to extend Baes’ results on the Clarke subdifferential.
Theorem 21** (Clarke subgradients of spectral functions under local lower semicontinuity).**
Let be the spectral function induced by a symmetric function . The following hold:
* is locally lower semicontinuous at if and only if is locally lower semicontinuous at .* 2.
If is locally lower semicontinuous at , then (Transfer) is valid when .
Proof.
Item follows from the continuity of the eigenvalue map and elementary properties of the maps and when . We will omit its proof.
Now, we move on to item . Under Lemma 20, we have
[TABLE]
First, suppose that , so there is a sequence such that and for each we have
[TABLE]
where and . By Theorem 19, there are and such that
[TABLE]
Because and both operator commute with , we conclude that operator commutes with as well, see (5). Therefore, there exists a Jordan frame such that . Next, we apply Proposition 18 two times. First with , , and then with , , in order to obtain that
[TABLE]
Since (6) holds for the approximate and horizon subdifferentials, we have
[TABLE]
for every when is or . Therefore, (26) together with (27) and (28) implies that
[TABLE]
and
[TABLE]
We now proceed as in the proof of Proposition 16. Since the idempotents in have norm 1, passing to a converging subsequence if necessary, the Jordan frame converges to some Jordan frame and we have
[TABLE]
Together with (24) and (29), we conclude that the inclusion “” holds in (Transfer) when is .
Now, for the “” inclusion, suppose that is such that with . By (24), there is a sequence with such that
[TABLE]
where and . Therefore, . In addition, by Theorem 19, we have
[TABLE]
Using (25), we conclude that . ∎
4.6 Subdifferentials of the -th largest eigenvalue function
In this subsection, as an application of Theorems 17, 19 and 21, we will compute the generalized subdifferentials of the function that maps an element to its -th largest eigenvalue, for .
Let be the function that maps to its -th largest component. Then, is a symmetric function and is the spectral function generated by . We note that, since the eigenvalue map is Lipschitz continuous, each must be Lipschitz continuous as well. In what follows, denotes the -th unit vector and we recall that denotes the -th component of . We also define
[TABLE]
For a finite set , we denote its cardinality by . The generalized subdifferentials of are described by the following proposition, see Proposition 6 and Theorem 9 in [19].
Proposition 22**.**
The following hold.
[TABLE]
*where . *
Let denote the set of primitive idempotents of . We recall that if and only if is nonzero, and cannot be written as the sum of two nonzero orthogonal idempotents.
Lemma 23** (Frame extension lemma).**
Let and . If for some , then is an eigenvalue of and there is a Jordan frame such that . In particular, .
Proof.
By the Peirce decomposition (Theorem 5), we have
[TABLE]
Then, since , we have . Therefore,
is a Euclidean Jordan algebra (see Theorem 5). Furthermore, since has rank , the algebra has rank . Therefore, we can find a Jordan frame that diagonalizes in . It follows that
[TABLE]
where for every . We now need to check that is a Jordan frame. All elements of are primitive idempotents. Furthermore, if . Since , we also have for every . Since the identity element of is and is a Jordan frame in , we have
[TABLE]
This shows that . Therefore, is indeed a Jordan frame of the algebra and (30) shows that diagonalizes . Since eigenvalues are unique, must be one of the eigenvalues of . Reordering if necessary, we obtain . ∎
Lemma 24** (Convex hull of primitive idempotents).**
Let and be an eigenvalue of . Let
[TABLE]
Let .
The eigenvalues of are nonnegative and sum to . 2.
*There is such that for every not belonging to . *
Proof.
Primitive idempotents have trace equal to and the trace function is linear, so elements in must have trace too. Then, we recall that any idempotent must be belong to , which is a symmetric cone (see Theorem III.2.1 in [6]). In particular, is a convex cone and, since is a convex combination of elements of , belongs to which implies that its eigenvalues are nonnegative.
Next, we move on to item . Pick any Jordan frame for and let denote the sum of the primitive idempotents associated to the eigenvalue . By Proposition 3, does not depend on the choice of Jordan frame. Since , we have
[TABLE]
where for every and the are nonnegative and sum to . First, we will show that .
By Lemma 23, each can be extended to a Jordan frame with . Then, the idempotents in associated to the eigenvalue must sum to by Proposition 3 and, at the same time, holds whenever and . We conclude that
[TABLE]
Therefore, each belongs to , which shows that . Since and are Euclidean Jordan algebras, there is a Jordan Frame that diagonalizes . Next, since , there is a Jordan frame that diagonalizes .
Let . First, because and are Jordan frames, we have (the well-known fact) that is a Jordan frame in the algebra .
Then, since diagonalizes , diagonalizes and the sum of the elements of is (the unit element of ), we conclude that diagonalizes and . We also observe that , which can be seen by expressing as a linear combination of the elements in and recalling that the idempotents of sum to .
Finally, if but , then and , because and are orthogonal spaces. Reordering if necessary, we obtain with the required properties. ∎
We are now equipped to prove the following result.
Theorem 25** (Generalized subdifferentials of ).**
Let be a Euclidean Jordan algebra of rank and let denote the function that maps an element to its -largest eigenvalue. The following hold.
[TABLE]
where .
Proof.
The equality follows from Theorem 17 and Proposition 22.
We will now prove the formula for . Let . By Theorem 19 and Proposition 22, there exists such that
[TABLE]
Because is written as a linear combination of elements of , (35) implies that is a convex combination of the idempotents of associated to . Observing that those idempotents satisfy , we obtain
[TABLE]
which shows that “” holds in (31).
Conversely, suppose that . By item of Lemma 24 applied to and , the eigenvalues of are nonnegative and sum to . Furthermore, by item of Lemma 24, there exists such that , whenever and is not associated to . This, together with Proposition 22, shows that
[TABLE]
because the nonzero components of are nonnegative, sum to and are located only at indices associated to idempotents in . By Theorem 19, we have , which shows that (31) holds.
The expressions for are consequences of Theorem 17, Proposition 22, the formula for and the fact that . ∎
5 The KL-exponent of spectral functions
We recall the definitions of the KL property and KL-exponent, see Definitions 2.2 and 2.3 in [21]. In what follows, we define . If is a subset of , we define . If is a subset of , we define analogously using the norm induced by (4).
Definition 26** (KL-property and KL-exponent).**
A lower semicontinuous function is said to satisfy the KL property at if there exists a neighbourhood of , and a continuous concave function with such that
* is continuously differentiable on with (its derivative) positive over ;* 2.
for all with , we have
[TABLE]
In particular, is said to satisfy the KL property with exponent at , if can be taken to be for some positive constant .
First, we need the following lemma.
Lemma 27**.**
Let be a symmetric function and let be the corresponding spectral function. Then, for every and for every Jordan frame which diagonalizes (see Section 3.1) we have
[TABLE]
Proof.
Let and let be a Jordan frame which diagonalizes . From (6) and since permutation matrices are orthogonal matrices, we obtain
[TABLE]
In particular,
[TABLE]
Therefore, it suffices to show that From Theorem 17, we have
[TABLE]
Therefore, . To show the opposite inequality, let By Theorem 17, is such that . Furthermore, we have . This shows that . ∎
Theorem 28** (Transfer principle for the KL property and KL exponent).**
Let be a symmetric function and let be the corresponding spectral function. Then,
* satisfies the KL property if and only if satisfies the KL property at . In addition, the and in Definition 26 can be taken to be the same for both and .* 2.
* satisfies the KL property with exponent at if and only if satisfies the KL property with exponent at .*
Proof.
First we prove item . By Theorem 17 we have if and only if . Next, suppose that satisfies the KL property at and let and be as in Definition 26.
Since is continuous, is a neighbourhood of . Therefore, if is such that , we have
[TABLE]
By Lemma 27 and item of Definition 26 applied to and , we have
[TABLE]
This shows that satisfies the KL property at with the same and .
Now, we prove the converse. Suppose that satisfies the KL property at and let be a neighbourhood of together with and such that Definition 26 is satisfied.
Let and . Then, whenever is such that , we have
[TABLE]
By item of Definition 26, we have
[TABLE]
By Lemma 27, we have
[TABLE]
This shows that satisfies the KL property at with the same and , which concludes the proof of item .
Next, we observe that item is a particular case of the previous item, when can be taken to be . ∎
Remark 29**.**
*In Theorem 3.2 of [21] there is a result about the KL-exponent of function compositions of the form . However, the result requires that be continuously differentiable, so it cannot be used to prove Theorem 28. *
Acknowledgments
We thank the referees for their comments, which helped to improve the paper. This work was partially supported by the Grant-in-Aid for Scientific Research (B) (19H04069) and the Grant-in-Aid for Young Scientists (19K20217) from Japan Society for the Promotion of Science.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Attouch, J. Bolte, P. Redont, and A. Soubeyran. Proximal alternating minimization and projection methods for nonconvex problems: An approach based on the Kurdyka-Łojasiewicz inequality. Mathematics of Operations Research , 35(2):438–457, 2010.
- 2[2] M. Baes. Spectral functions and smoothing techniques on Jordan algebras . Ph D thesis, Université catholique de Louvain, 2006.
- 3[3] M. Baes. Convexity and differentiability properties of spectral functions and spectral mappings on Euclidean Jordan algebras. Linear Algebra and its Applications , 422(2):664 – 700, 2007.
- 4[4] R. Bhatia. Matrix Analysis . Graduate Texts in Mathematics. Springer New York, 1997.
- 5[5] J.-S. Chen, X. Chen, and P. Tseng. Analysis of nonsmooth vector-valued functions associated with second-order cones. Mathematical Programming , 101(1):95–117, Sep 2004.
- 6[6] J. Faraut and A. Korányi. Analysis on symmetric cones . Oxford mathematical monographs. Clarendon Press, Oxford, 1994.
- 7[7] L. Faybusovich. Several Jordan-algebraic aspects of optimization. Optimization , 57(3):379–393, 2008.
- 8[8] M. Fukushima, Z. Luo, and P. Tseng. Smoothing functions for second-order-cone complementarity problems. SIAM Journal on Optimization , 12(2):436–460, 2002.
