Generalized subdifferentials of spectral functions over Euclidean Jordan   algebras

Bruno F. Louren\c{c}o; Akiko Takeda

arXiv:1902.05270·math.OC·September 27, 2021·SIAM J. Optim.

Generalized subdifferentials of spectral functions over Euclidean Jordan algebras

Bruno F. Louren\c{c}o, Akiko Takeda

PDF

TL;DR

This paper develops formulas for various generalized subdifferentials of spectral functions on Euclidean Jordan algebras, extending previous results and applying them to eigenvalue functions, with implications for nonsmooth optimization.

Contribution

It provides new formulas for regular, approximate, horizon, and Clarke subdifferentials of spectral functions, extending existing theory and analyzing the KL property in this context.

Findings

01

Formulas for regular, approximate, and horizon subdifferentials of spectral functions.

02

Extension of Clarke subdifferential formula under local lower semicontinuity.

03

Analysis of the Kurdyka-Lojasiewicz property and transfer of KL-exponent for spectral functions.

Abstract

This paper is devoted to the study of generalized subdifferentials of spectral functions over Euclidean Jordan algebras. Spectral functions appear often in optimization problems playing the role of "regularizer", "barrier", "penalty function" and many others. We provide formulae for the regular, approximate and horizon subdifferentials of spectral functions. In addition, under local lower semicontinuity, we also furnish a formula for the Clarke subdifferential, thus extending an earlier result by Baes. As application, we compute the generalized subdifferentials of the function that maps an element to its k-th largest eigenvalue. Furthermore, in connection with recent approaches for nonsmooth optimization, we present a study of the Kurdyka-Lojasiewicz (KL) property for spectral functions and prove a transfer principle for the KL-exponent. In our proofs, we make extensive use of recent…

Equations351

F (x) : = f (λ (x)),

F (x) : = f (λ (x)),

λ_{1} (x) \geq \dots \geq λ_{r} (x) .

λ_{1} (x) \geq \dots \geq λ_{r} (x) .

x \in E min Φ (x) = ψ (x) + F (x),

x \in E min Φ (x) = ψ (x) + F (x),

F_{1} (x) = μ ∥ λ (x) ∥_{p},

F_{1} (x) = μ ∥ λ (x) ∥_{p},

F_{3} (x) = μ (∥ λ (x) ∥_{1} - ∣ ∥ λ (x) ∥ ∣_{ℓ}),

- \nabla ψ (x^{*}) \in \partial F (x^{*}),

- \nabla ψ (x^{*}) \in \partial F (x^{*}),

P^{r} (u) : = {P \in P^{r} ∣ P (u) = u} .

P^{r} (u) : = {P \in P^{r} ∣ P (u) = u} .

v \to 0 v \neq = 0 lim inf \frac{f ( u + v ) - f ( u ) - ⟨ d , v ⟩}{∥ v ∥} \geq 0.

v \to 0 v \neq = 0 lim inf \frac{f ( u + v ) - f ( u ) - ⟨ d , v ⟩}{∥ v ∥} \geq 0.

f (u + v) - f (u) - ⟨ d, v ⟩ \geq - ϵ ∥ v ∥

f (u + v) - f (u) - ⟨ d, v ⟩ \geq - ϵ ∥ v ∥

u^{k} \to u, f (u^{k}) \to f (u), d^{k} \to d .

u^{k} \to u, f (u^{k}) \to f (u), d^{k} \to d .

u^{k} \to u, f (u^{k}) \to f (u), t^{k} d^{k} \to d, t^{k} ↓ 0.

u^{k} \to u, f (u^{k}) \to f (u), t^{k} d^{k} \to d, t^{k} ↓ 0.

h (u)

h (u)

h (v)

c_{i} \circ c_{j} = 0 for i \neq = j,

c_{i} \circ c_{j} = 0 for i \neq = j,

x = i = 1 \sum r α_{i} c_{i} .

x = i = 1 \sum r α_{i} c_{i} .

x = i = 1 \sum r α_{i} c_{i} = i = 1 \sum r α_{i}^{'} c_{i}^{'} .

x = i = 1 \sum r α_{i} c_{i} = i = 1 \sum r α_{i}^{'} c_{i}^{'} .

i with α_{i} = α \sum c_{i} = i with α_{i}^{'} = α \sum c_{i}^{'} .

i with α_{i} = α \sum c_{i} = i with α_{i}^{'} = α \sum c_{i}^{'} .

λ (x) : = (λ_{1} (x), \dots, λ_{r} (x)),

λ (x) : = (λ_{1} (x), \dots, λ_{r} (x)),

tr (x) : = λ_{1} (x) + \dots + λ_{r} (x) .

tr (x) : = λ_{1} (x) + \dots + λ_{r} (x) .

⟨ x, y ⟩ = tr (x \circ y), \forall x, y \in E .

⟨ x, y ⟩ = tr (x \circ y), \forall x, y \in E .

∥ x ∥ = tr (x^{2}) .

∥ x ∥ = tr (x^{2}) .

z \to 0 lim \frac{λ ( x + z ) - λ ( x ) - λ ^{'} ( x ; z )}{∥ z ∥} = 0,

z \to 0 lim \frac{λ ( x + z ) - λ ( x ) - λ ^{'} ( x ; z )}{∥ z ∥} = 0,

L_{x} (z) = x \circ z, \forall z \in E .

L_{x} (z) = x \circ z, \forall z \in E .

L_{x} L_{y} = L_{y} L_{x}

L_{x} L_{y} = L_{y} L_{x}

x = i = 1 \sum r a_{i} c_{i}, y = i = 1 \sum r b_{i} c_{i},

x = i = 1 \sum r a_{i} c_{i}, y = i = 1 \sum r b_{i} c_{i},

x = i = 1 \sum r λ_{i} (x) c_{i}, y = i = 1 \sum r \tilde{b}_{i} c_{i},

x = i = 1 \sum r λ_{i} (x) c_{i}, y = i = 1 \sum r \tilde{b}_{i} c_{i},

⟨ c_{i}, x ⟩ = λ_{i} (x), ⟨ c_{i}, y ⟩ = \tilde{b}_{i} .

⟨ c_{i}, x ⟩ = λ_{i} (x), ⟨ c_{i}, y ⟩ = \tilde{b}_{i} .

diag (z, J) = (⟨ c_{1}, z ⟩, \dots, ⟨ c_{r}, z ⟩), \forall z \in E .

diag (z, J) = (⟨ c_{1}, z ⟩, \dots, ⟨ c_{r}, z ⟩), \forall z \in E .

diag (x, J) = λ (x), diag (y, J) = (\tilde{b}_{1}, \dots, \tilde{b}_{r}) .

diag (x, J) = λ (x), diag (y, J) = (\tilde{b}_{1}, \dots, \tilde{b}_{r}) .

J (x, y) \neq = \emptyset and J (x, z) \neq = \emptyset \Rightarrow J (x, α y + β z) \neq = \emptyset, \forall α, β \in R .

J (x, y) \neq = \emptyset and J (x, z) \neq = \emptyset \Rightarrow J (x, α y + β z) \neq = \emptyset, \forall α, β \in R .

Diag (u, J) : = i = 1 \sum r u_{i} c_{i} .

Diag (u, J) : = i = 1 \sum r u_{i} c_{i} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Generalized subdifferentials of spectral functions over Euclidean Jordan algebras

Bruno F. Lourenço Department of Statistical Inference and Mathematics, Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo 190-8562, Japan. ([email protected])

Akiko Takeda

Department of Creative Informatics, Graduate School of Information Science and Technology, University of Tokyo, Tokyo, Japan and RIKEN Center for Advanced Intelligence Project, 1-4-1, Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan. ([email protected])

Abstract

This paper is devoted to the study of generalized subdifferentials of spectral functions over Euclidean Jordan algebras. Spectral functions appear often in optimization problems playing the role of “regularizer”, “barrier”, “penalty function” and many others. We provide formulae for the regular, approximate and horizon subdifferentials of spectral functions. In addition, under local lower semicontinuity, we also furnish a formula for the Clarke subdifferential, thus extending an earlier result by Baes. As application, we compute the generalized subdifferentials of the function that maps an element to its $k$ -th largest eigenvalue. Furthermore, in connection with recent approaches for nonsmooth optimization, we present a study of the Kurdyka-Łojasiewicz (KL) property for spectral functions and prove a transfer principle for the KL-exponent. In our proofs, we make extensive use of recent tools such as the commutation principle of Ramírez, Seeger and Sossa and majorization principles developed by Gowda.

Keywords: spectral functions, generalized subdifferential, approximating subdifferential, Euclidean Jordan algebra, Kurdyka-Łojasiewicz inequality.

1 Introduction

Let $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ be a function that is symmetric, i.e., $f(u)$ does not change if we permute the coordinates of $u\in\mathbb{R}^{r}$ . Here, $\overline{\mathbb{R}}$ denotes the extended line $[-\infty,+\infty]$ . Now, let us consider a Euclidean Jordan algebra $\mathcal{E}$ of rank $r$ , for example, the $r\times r$ symmetric matrices. Then, $f$ can be extended in a natural fashion to a function $F$ over $\mathcal{E}$ by defining for all $x\in\mathcal{E}$

[TABLE]

where $\lambda(x)\in\mathbb{R}^{r}$ is the vector containing the eigenvalues of $x$ in nonincreasing order, i.e.,

[TABLE]

We call $F$ the spectral function induced by $f$ . Because $f$ is symmetric, it is known from the works of Baes [3], Sun and Sun [26], Jeong and Gowda [15] and others that several properties of $f$ are transferred to $F$ . For example, $f$ is convex if and only if $F$ is convex. The same goes for differentiability. Results of this type are sometimes called transfer results or transfer principles, e.g., [15].

Spectral functions are ubiquitous throughout optimization and recognizing that $F$ is a spectral function can make computing derivatives/subdifferentials of $F$ significantly simpler than if one tries to do so by scratch. This is because transfer principles usually come with formulae that relate the derivatives/subdifferentials of $F$ and $f$ .

Motivated by the needs of nonsmooth optimization, our goal in this paper is to obtain formulae for the regular, approximate and horizon subdifferentials of spectral functions without any extra assumptions such as local Lipschitzness. In nonsmooth optimization, the regular and approximate subdifferential are often used to express optimality conditions and in the analysis of algorithms. Also, conditions involving the horizon subdifferential are quite common to ensure that the function satisfies some desirable property. We will also obtain a formula for the Clarke subgradient with the assumption of local lower semicontinuity, which extends an earlier result by Baes [2]. We will use these formulae to compute the generalized subdifferentials of the eigenvalue functions in the context of Euclidean Jordan algebras, see Section 4.6.

Another motivation comes from the so-called composite optimization, where we wish to solve the problem

[TABLE]

and only $\psi:\mathcal{E}\to\mathbb{R}$ is assumed to be smooth. It is common for the function $F$ to play the role of a “regularizer”, “penalty” or “barrier”. In those cases, $F$ is often a spectral function. Here are a few examples. In what follows, for $u\in\mathbb{R}^{r}$ , we denote its $p$ -norm by $\left\|u\right\|_{p}$ and the sum of the $\ell$ components with largest absolute value by $\lvert\left\|u\right\|\rvert_{\ell}$ .

[TABLE]

where $\mu$ is a positive parameter. When $p=1$ , $F_{1}$ is the $l_{1}$ regularizer. $F_{2}$ is a multiple of the classical self-concordant barrier for the symmetric cone associated to $\mathcal{E}$ . The function $F_{3}$ maps $x$ to the sum of the $r-\ell$ eigenvalues of $x$ with smallest absolute value, which is an important function for dealing with rank constrained problems, see [9] and Section 4 in [10]. Here, we are expressing $F_{3}$ as a DC (difference of convex) function. We observe that $F_{1},F_{2},F_{3},F_{4}$ are all spectral functions, while $F_{3}$ and $F_{4}$ are nonsmooth and nonconvex. In any case, under appropriate regularity conditions, a necessary condition for $x^{*}$ to be a local optimal solution to (OPT) is that

[TABLE]

where $\partial F(x^{*})$ is the approximate subdifferential of $F$ at $x^{*}$ , see Exercise 8.8 and Theorem 8.15 in [24].

Yet another motivation for this work is that the approximate subdifferential is necessary in order to compute the so-called Kurdyka-Łojasiewicz (KL) exponent, which has been shown to control the convergence properties of many first-order methods as can be seen, for instance, in the classical work by Attouch, Bolte, Redont and Soubeyran [1]. For a recent discussion on this topic, see the work by Li and Pong [21].

While there are many criteria that can be used to show that a function satisfies the so-called KL-property, it is often highly nontrivial to compute the KL-exponent [21]. For instance, if we wish to compute the $KL$ -exponent of $\Phi$ , we have to analyze the approximate subdifferentials of $F$ , because $\partial\Phi(x)=\nabla\psi(x)+\partial F(x)$ , as can be seen in Exercise 8.8 of [24]. In this paper, although we will not compute the KL-exponent of $\Phi$ itself, as an application of our results, we will show that if $f$ is a symmetric function and $F$ is the corresponding spectral function, then $f$ and $F$ share the same KL-exponent. Admittedly, this is not a very powerful result, but it seems to be beyond what can be proved directly with the results of [21] (see Remark 29) and we believe it is a first step towards a more comprehensive study of the KL-exponent of composite functions where one of the functions is spectral.

1.1 Previous works

Lewis [17, 18, 19] has discussed extensively the case of spectral functions over symmetric real matrices and Hermitian complex matrices. In particular, in [19], Lewis gave expressions for the regular, approximate and horizon subdifferentials of spectral functions over symmetric real matrices. A formula for Clarke subdifferentials was also given for the locally Lipschitz case.

Spectral functions over the algebra associated to the second order cone were initially studied by Fukushima, Luo and Tseng [8] and by Chen, Chen and Tseng [5]. In [5], there is a discussion of the Clarke subdifferential of locally Lipschitz spectral functions and Sendov [25] gave formulae for regular, approximate and horizon subdifferentials. Sendov also proved a formula for the Clarke subdifferential under the hypothesis of local lower semicontinuity.

In the general framework of Euclidean Jordan algebras, Baes [2, 3], Sun and Sun [26] and Jeong and Gowda [14, 15] proved several key results regarding spectral functions and the related notion of spectral sets. However, as far as we know, until now there were no results for the regular, approximate and horizon subdifferentials of spectral functions. Furthermore, results for the Clarke subgradient were only known in the locally Lipschitz case. Related to Clarke subgradients, we mention in passing that Kong, Tunçel and Xiu proved an expression for the Clarke subgradient of the orthogonal projection of the symmetric cone associated to a Euclidean Jordan algebra [16].

1.2 Contributions of this work

In this work, we have three contributions. The first is a meta-formula for the generalized subdifferentials of a spectral function. We will show that if $F:\mathcal{E}\to\overline{\mathbb{R}}$ is a spectral function induced by $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ , then there is a formula that relate the generalized subdifferentials of $F$ and $f$ , see Theorems 17, 19 and 21.

A feature of our results is that we will never assume that the algebra $\mathcal{E}$ is simple, which makes some results more general, but a bit harder to prove. Every Jordan algebra can be decomposed as a direct sum of simple algebras and simplicity is, in many cases, a harmless hypothesis. Previous work by Lewis [19] and Sendov [25] can be seen as containing results for specific cases of simple Euclidean Jordan algebras. However, because the generalized subdifferentials do not behave nicely with respect to partial subdifferentiation, there are cases where we cannot extend results from simple to general Euclidean Jordan algebras in a straightforward way. We emphasize that our results are directly applicable to a situation where, for example, $\mathcal{E}$ is a direct product $\mathcal{S}^{r_{1}}\times\cdots\times\mathcal{S}^{r_{\ell}}$ , where $\mathcal{S}^{r}$ denotes the space of $r\times r$ real symmetric matrices.

Our second contribution is providing formulae for the generalized subdifferentials of the function $\lambda_{k}:\mathcal{E}\to\mathbb{R}$ , which maps an element $x\in\mathcal{E}$ to its $k$ -th largest eigenvalue, see Theorem 25. We believe this is the first time such formulae are given in the context of Euclidean Jordan algebras.

Last, we will show a transfer principle of the KL-property for spectral functions and show that $F$ and $f$ must share the same KL-exponent, see Theorem 28.

This work is divided as follows. In Section 2, we review generalized subdifferentials. In Section 3, we overview the necessary concepts from the theory of Euclidean Jordan algebras. In Section 4, we develop and present our main results regarding generalized subdifferentials of spectral functions. Finally, in Section 5 we discuss the KL-property and KL-exponent of spectral functions.

2 Preliminaries

2.1 Notation

Given an element $u\in\mathbb{R}^{r}$ , we will denote its $i$ -th component by $u_{i}$ . We write $\mathbb{R}^{r}_{\geq}$ for the cone of elements $u$ satisfying $u_{1}\geq\cdots\geq u_{r}$ . We write $\mathbb{R}^{r}_{+}$ for the nonnegative orthant, i.e., the elements $u\in\mathbb{R}^{r}$ such that $u_{i}\geq 0$ for every $i$ . We will write $\mathcal{P}^{r}$ for the group of $r\times r$ permutation matrices. Given $u\in\mathbb{R}^{r}$ , we write $\mathcal{P}^{r}(u)$ for the stabilizer subgroup of $u$ , i.e.,

[TABLE]

The convex hull, the interior and the closure of a set $C$ will be denoted by ${\mathrm{conv}\,}C$ , $\mathrm{int}\,C$ and $\mathrm{cl}\,C$ , respectively. If $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ is a function, the domain of $f$ (i.e., the elements for which $f$ is finite) will be denoted by ${\rm dom}\,f$ . We assume that $\mathbb{R}^{r}$ is furnished with the usual Euclidean inner product $\langle\cdot,\cdot\rangle$ and the usual Euclidean norm $\left\|\cdot\right\|$ .

2.2 Generalized subdifferentials

In this subsection, we recall a few notions of generalized subdifferentials. However, the discussion on the Clarke subdifferential will be postponed until Section 4.5. Let $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ be a function and $u\in{\rm dom}\,f$ . We say that $d$ is a regular subgradient of $f$ at $u$ if

[TABLE]

The set of regular subgradients of $f$ at $u$ is denoted by $\hat{\partial}f(u)$ and is called the regular subdifferential of $f$ at $u$ . From (1) it follows that $d\in\hat{\partial}f(u)$ if and only if for every $\epsilon>0$ there exists some $\delta>0$ such that $\left\|v\right\|\leq\delta$ implies

[TABLE]

We say that $d$ is an approximate subgradient (also called limiting subgradient) of $f$ at $u$ if there are sequences $\{u^{k}\}$ , $\{d^{k}\}$ such that every $k$ satisfies $d^{k}\in\hat{\partial}f(u^{k})$ and the following limits hold:

[TABLE]

The set of approximate subgradients of $f$ at $u$ is denoted by $\partial f(u)$ and is called the approximate subdifferential of $f$ at $u$ .

We say that $d$ is an horizon subgradient of $f$ at $u$ if there are sequences $\{u^{k}\}$ , $\{d^{k}\}$ , $\{t^{k}\}$ such that every $k$ satisfies $d^{k}\in\hat{\partial}f(u^{k})$ and the following limits hold:

[TABLE]

Here, $t^{k}\downarrow 0$ indicates that all the $t^{k}$ are nonzero and that $t^{k}$ is a monotone nonincreasing sequence converging to zero. The set of horizon subgradients, called the horizon subdifferential, will be denoted by $\partial^{\infty}f(u)$ . In variational analysis, conditions involving the horizon subdifferential are quite common, e.g., see Corollary 10.9 in [24]. See also Section 8.B in [24] for examples of the subdifferentials discussed so far.

We will also make use of the following characterization of regular subgradients.

Proposition 1 (Rockafellar and Wets, Proposition 8.5 in [24]).

Let $d\in\mathbb{R}^{r}$ . Then, $d\in\hat{\partial}f(u)$ if and only if, on some neighborhood $U$ of $u$ there exists a $C^{1}$ function $h:U\to\mathbb{R}$ such that

[TABLE]

In this paper, sometimes we will prove results that are valid for several different notions of subdifferential. In that case, we use the symbol $\Diamond$ as a placeholder for some unspecified subdifferential, e.g., see Theorem 17.

3 Euclidean Jordan algebras

Here, we give a brief overview of Jordan algebras and review the necessary tools to prove our results. More details can be found in Faraut and Korányi’s book [6] or in the survey by Faybusovich [7]. First of all, a Euclidean Jordan algebra $(\mathcal{E},\circ)$ is a finite dimensional real vector space $\mathcal{E}$ equipped with a bilinear product $\circ:\mathcal{E}\times\mathcal{E}\to\mathcal{E}$ and an inner product $\langle\cdot,\cdot\rangle$ satisfying the following properties:

$(1)$

${x\circ y}={y\circ x}$ , 2. $(2)$

${x\circ(}{{x^{2}\circ y}})={x^{2}\circ(}{{x\circ y}})$ , where $x^{2}={x\circ x}$ , 3. $(3)$

$\langle{x\circ y},z\rangle=\langle x,{y\circ z}\rangle$ ,

for all $x,y,z\in\mathcal{E}$ . We can always assume that a Euclidean Jordan algebra has an element $e$ that satisfies ${e\circ x}=x$ , for all $x\in\mathcal{E}$ . Such an element $e$ is called the identity element. An element $c\in\mathcal{E}$ satisfying $c^{2}=c$ is called an idempotent. A nonzero idempotent $c$ that cannot be written as the sum of two nonzero idempotents $\hat{c},\tilde{c}$ satisfying ${\hat{c}\circ\tilde{c}}=0$ is called a primitive idempotent.

In a Euclidean Jordan algebra the following spectral theorem holds.

Theorem 2 (Spectral Theorem, see Theorem III.1.2 in [6]).

Let $(\mathcal{E},{\circ})$ be a Euclidean Jordan algebra and let $x\in\mathcal{E}$ . Then there are primitive idempotents $[c_{1},\dots,c_{r}]$ satisfying $c_{1}+\cdots+c_{r}={e}$ and

[TABLE]

and unique real numbers $\alpha_{1},\ldots,\alpha_{r}$ satisfying

[TABLE]

The $r$ that appears in Theorem 2 only depends on the algebra $\mathcal{E}$ and is called the rank of $\mathcal{E}$ . The $\alpha_{i}$ in Theorem 2 are called the eigenvalues of $x$ . Although unique, the eigenvalues of $x$ might be repeated and they are not necessarily in nonincreasing/nondecreasing order. We define the rank of $x$ as the number of nonzero $\alpha_{i}$ ’s appearing in (3). The ordered set $[c_{1},\ldots,c_{r}]$ in Theorem 2 is called a Jordan frame for $x$ .

Here, we are using the notation $[c_{1},\ldots,c_{r}]$ instead of $\{c_{1},\ldots,c_{r}\}$ to emphasize that the order of the elements is taken into account, so, for example, $[c_{1},c_{2}]$ and $[c_{2},c_{1}]$ are different ordered sets. Although $x$ might have many different Jordan frames, the sum of primitive idempotents associated to some eigenvalue must be unique.

Proposition 3 (Unique sum of primitive idempotents, see Theorems III.1.1 and III.1.2 in [6]).

Let $x\in\mathcal{E}$ and $[c_{1},\cdots,c_{r}],[c_{1}^{\prime},\cdots,c_{r}^{\prime}]$ be two Jordan frames for $x$ . Suppose that

[TABLE]

Then, for every $\alpha\in\mathbb{R}$ , we have

[TABLE]

We define the eigenvalue map $\lambda:\mathcal{E}\to\mathbb{R}^{r}_{\geq}$ as the map satisfying

[TABLE]

where $\lambda_{1}(x)\geq\cdots\geq\lambda_{r}(x)$ . Here, $\lambda_{i}(x)$ denotes the $i$ -th largest eigenvalue of $x$ .

The trace map $\mathrm{tr}\,:\mathcal{E}\to\mathbb{R}$ is defined as

[TABLE]

In fact, the trace map is a linear function. Furthermore, it can be shown that the function that maps $x,y\in\mathcal{E}$ to $\mathrm{tr}\,({x\circ y})$ is an inner product satisfying Property $(3)$ of the definition Euclidean Jordan algebras. Henceforth, we shall assume that the inner product $\langle\cdot,\cdot\rangle$ is given by

[TABLE]

Under this inner product, $\mathrm{tr}\,(x)=\langle e,x\rangle$ for all $x\in\mathcal{E}$ and elements of any Jordan frame are mutually orthogonal. That is, if $\mathcal{J}=[c_{1},\ldots,c_{r}]$ is Jordan frame, then $\langle c_{i},c_{j}\rangle=0$ if $i\neq j$ .

The norm induced by $\langle\cdot,\cdot\rangle$ is given by

[TABLE]

With that, any primitive idempotent $c$ satisfies $\left\|c\right\|=1$ . Furthermore, the map $\lambda$ becomes a Lipschitz continuous function with Lipschitz constant $1$ , when $\mathbb{R}^{r}$ is equipped with the usual Euclidean norm. We now summarize some important properties of $\lambda$ .

Lemma 4 (Properties of the eigenvalue map).

Let $\mathcal{E}$ be a Euclidean Jordan algebra of rank $r$ and let $\lambda:\mathcal{E}\to\mathbb{R}^{r}_{\geq}$ be the eigenvalue map. The following properties hold.

$(i)$

$\left\|\lambda(x)-\lambda(y)\right\|\leq\left\|x-y\right\|$ * holds, for all $x,y\in\mathcal{E}$ .* 2. $(ii)$

For every $x\in\mathcal{E}$ , $\lambda$ has directional derivatives along all directions. Furthermore, letting $\lambda^{\prime}(x;z)$ denote the directional derivative of $\lambda$ at $x$ along $z$ , the following limit holds

[TABLE]

where $\lambda^{\prime}(x;z)\coloneqq\lim_{t\to 0}\frac{\lambda(x+tz)-\lambda(x)}{t}$ .

Proof.

$(i)$

This was proved by Baes, see Corollary 24 in [3]. 2. $(ii)$

Baes showed that for every $i$ , the function $\lambda_{i}:\mathcal{E}\to\mathbb{R}$ that maps $x\in\mathcal{E}$ to its $i$ -th largest eigenvalue is directionally differentiable, see Theorem 36 in [3]. Therefore, all components of $\lambda$ are directionally differentiable, so $\lambda$ must also be directionally differentiable. Then, it is a general fact that a Lipschitz continuous function that is directionally differentiable everywhere must also satisfy the limit above, see Lemma 2.1.1 and Remark 2.1.2 in [13].

∎

3.1 Simultaneous diagonalization

Let $\mathcal{E}$ be a Euclidean Jordan algebra of rank $r$ . Given $x\in\mathcal{E}$ , we denote by $L_{x}:\mathcal{E}\to\mathcal{E}$ the Lyapunov operator associated to $x$ , which is the linear map satisfying

[TABLE]

Given another element $y\in\mathcal{E}$ , we say that $x$ and $y$ operator commute if

[TABLE]

holds. It is known that $x$ and $y$ operator commute if and only if they share a common Jordan frame $\mathcal{J}$ , see Lemma X.2.2 in [6]. This means that there are $r$ mutually orthogonal primitive idempotents $\mathcal{J}=[c_{1},\cdots,c_{r}]$ such that $c_{1}+\cdots+c_{r}={e}$ and

[TABLE]

where the $a_{i}$ and $b_{i}$ are the eigenvalues of $x$ and $y$ , respectively. More generally if $\mathcal{J}$ is a Jordan frame for which $x\in\mathcal{E}$ can be expressed as linear combination of the $c_{i}$ , we say that $\mathcal{J}$ diagonalizes $x$ . Therefore, the existence of a common Jordan frame for $x$ and $y$ means that $x$ and $y$ are simultaneously diagonalizable.

Here, the $a_{i}$ and $b_{i}$ that appear in the decomposition of $x$ and $y$ are not necessarily sorted in nondecreasing/nonincreasing order. However, reordering the $c_{i}$ , we may suppose that the $a_{i}$ are sorted in an nonincreasing order, i.e., $a_{i}=\lambda_{i}(x)$ , for all $i$ . With respect to this new ordering, we can write

[TABLE]

where $[\tilde{b}_{1},\ldots,\tilde{b}_{r}]$ is some permutation of $[b_{1},\ldots,b_{r}]$ . Because the idempotents in $\mathcal{J}$ are orthogonal amongst themselves, we have for every $i$

[TABLE]

With that in mind, we are going to introduce the function ${\mathrm{diag}\,}(\cdot,\mathcal{J}):\mathcal{E}\to\mathbb{R}^{r}$ , which maps an element $z\in\mathcal{E}$ to its “diagonal” with respect the Jordan frame $\mathcal{J}$ . That is, we have

[TABLE]

If $\mathcal{J}$ is a frame that diagonalizes $z$ , then ${\mathrm{diag}\,}(z,\mathcal{J})$ is, in fact, the eigenvalue vector of $z$ . Of course, ${\mathrm{diag}\,}(z,\mathcal{J})$ might not be sorted in any particular way. However, for the specific $x$ and $y$ we have discussed so far, we have

[TABLE]

We are now going to introduce two more extra notations. We will denote by $\mathcal{J}(x,y)$ the set of common Jordan frames $\mathcal{J}$ for $x,y$ for which ${\mathrm{diag}\,}(x,\mathcal{J})=\lambda(x)$ . In other words, not only $\mathcal{J}$ must be a common Jordan for $x$ and $y$ , but it must also be such that the eigenvalues of $x$ appear in nonincreasing order. Here, we emphasize that the eigenvalues of $y$ might appear in no particular order. By convention, if $x$ and $y$ do not operator commute, we will define $\mathcal{J}(x,y)=\emptyset$ . We observe that since $L_{\alpha y+\beta z}=\alpha L_{y}+\beta L_{z}$ , we have

[TABLE]

Furthermore, we will define $\mathcal{J}(x)\coloneqq\mathcal{J}(x,x)$ . That is, $\mathcal{J}(x)$ is the set of Jordan frames of $x$ for which the eigenvalues of $x$ appear in nonincreasing order. We have $\mathcal{J}(x,y)\subseteq\mathcal{J}(x)$ for every $x,y\in\mathcal{E}$ .

We also need a map that plays the opposite role of ${\mathrm{diag}\,}(\cdot,\mathcal{J})$ . Let ${\mathrm{Diag}\,}(\cdot,\mathcal{J}):\mathbb{R}^{r}\to\mathcal{E}$ be the map that takes a vector in $\mathbb{R}^{r}$ and constructs a “diagonal element” in $\mathcal{E}$ according to $\mathcal{J}$ , i.e.,

[TABLE]

We have ${\mathrm{diag}\,}({\mathrm{Diag}\,}(u,\mathcal{J}),\mathcal{J})=u$ , for every $u\in\mathbb{R}^{r}$ . We observe that, since $[c_{1},\ldots,c_{r}]$ is a Jordan frame, the eigenvalues of ${\mathrm{Diag}\,}(u,\mathcal{J})$ are precisely the $u_{i}$ .

3.2 The directional derivative of the $i$ -th largest eigenvalue

In this section, we will describe an expression proved by Baes [3] to compute the directional derivative of the $i$ -th largest eigenvalue. For that, we need to review the Peirce decomposition, the properties of quadratic maps in Euclidean Jordan algebras and, most regrettably, introduce more notation.

Let $c$ be an idempotent and $\alpha\in\mathbb{R}$ . We define

[TABLE]

Now, let $x\in\mathcal{E}$ be an arbitrary element (not necessarily an idempotent), the quadratic map of $x$ is the linear map $Q_{x}:\mathcal{E}\to\mathcal{E}$ such that

[TABLE]

$Q_{x}$ is always self-adjoint. With that, we have the following result.

Theorem 5 (Peirce Decomposition, see Proposition IV.1.1 and page 64 in [6]).

Let $(\mathcal{E},{\circ})$ be an Euclidean Jordan algebra of rank $r$ and let $c\in\mathcal{E}$ be an idempotent of rank $\ell$ . Then $\mathcal{E}$ is decomposed as the orthogonal direct sum

[TABLE]

In addition, $(V(c,1),{\circ})$ and ${(V(c,0),{\circ})}$ are Euclidean Jordan algebras of rank $\ell$ and $r-\ell$ , respectively. The orthogonal projections on $V(c,1)$ and $V(c,0)$ are given by $Q_{c}$ and $Q_{{e}-c}$ , respectively.

Next, we move on to the necessary notation. The eigenvalues of $x$ might be repeated so, for instance, it could be the case that $\lambda_{3}(x)=\lambda_{4}(x)=\lambda_{5}(x)$ . The next notation corresponds to a way of assigning the indices $3,4,5$ to $1,2,3$ . That is, we need to map an index $i$ to its “relative position” with respect to the eigenvalues of $x$ that are equal to $\lambda_{i}(x)$ . Here, we will mostly follow the notation proposed by Baes in [3] and define for every $p\in\{1,\ldots,r\}$ , the integer $l_{p}(x)\geq 1$ which is such that

[TABLE]

Furthermore, if $x=\sum_{i=1}\lambda_{i}(x)c_{i}\in\mathcal{E}$ we will denote by $e_{p}(x)$ the sum of the $c_{i}$ satisfying $\lambda_{i}(x)=\lambda_{p}(x)$ , i.e.,

[TABLE]

We remark that $\mathsf{f}^{\prime}_{p}(x)$ was used instead of $e_{p}(x)$ in [3].

Example 6.

Suppose that the rank of $\mathcal{E}$ is $r=7$ and the eigenvalues of $x\in\mathcal{E}$ are as follows.

[TABLE]

Then $l_{1}=l_{7}=1$ , because $\lambda_{1}$ and $\lambda_{7}$ are unique eigenvalues. We have $l_{2}=1$ , $l_{3}=2$ and $l_{4}=3$ , since $\lambda_{2},\lambda_{3},\lambda_{4}$ are, respectively, the “first”, “second” and “third” eigenvalues of a group of three equal eigenvalues. Similarly, we have $l_{5}=1$ and $l_{6}=2$ .

We have $e_{1}(x)=c_{1}$ , $e_{7}(x)=c_{7}$ ,

[TABLE]

Finally, let $\mathcal{E}^{\prime}\subseteq\mathcal{E}$ be an Euclidean Jordan algebra and let $x\in\mathcal{E}^{\prime}$ . Then, the eigenvalues of $x$ as an element of $\mathcal{E}^{\prime}$ might be different from the eigenvalues of $x$ seen as an element of $\mathcal{E}$ . When it is necessary to make this distinction, we will denote the $i$ -th eigenvalue of $x$ seen as element of $\mathcal{E}^{\prime}$ by

[TABLE]

The eigenvalue map of the algebra $\mathcal{E}^{\prime}$ will be similarly denoted by $\lambda(\cdot,\mathcal{E}^{\prime})$ . We have now all pieces in place to state the following theorem.

Theorem 7 (Baes, Theorem 36 in [3]).

Let $x,z\in\mathcal{E}$ and consider the spectral decomposition of $x$ :

[TABLE]

Then the directional derivative of the $i$ -th largest eigenvalue of $x$ along the direction $z$ is given by

[TABLE]

where $c=e_{i}(x)$ .

From Theorem 5, $Q_{c}z$ is the projection of $z$ in the algebra $V(c,1)$ . Therefore, to compute $\lambda_{i}^{\prime}(x;z)$ we need to project $z$ on $V(c,1)$ , and then compute the $l_{i}(x)$ -th eigenvalue of the projection with respect the algebra $V(c,1)$ , where $l_{i}(x)$ is the “relative position” of the index $i$ with respect to the eigenvalues of $x$ that are equal to $\lambda_{i}(x)$ .

3.3 Spectral functions and sets

Let $\mathcal{E}$ be a Euclidean Jordan algebra of rank $r$ and let $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ be a function. We say that $f$ is a symmetric function if $f(Pu)=f(u)$ holds for every $u\in\mathbb{R}^{r}$ and every permutation matrix $P\in\mathcal{P}^{r}$ . Symmetric functions satisfy the following key relation between subdifferentials:

[TABLE]

whenever $\Diamond$ is $\hat{\partial},\partial$ or $\partial^{\infty}$ , e.g., Proposition 2 in [19]. We remark that (6) will be used often in this paper.

We denote by $F:\mathcal{E}\to\overline{\mathbb{R}}$ the spectral map induced by $f$ , which is the function defined as

[TABLE]

The function $F$ is well-defined, even if $f$ is not symmetric. However, if $f$ is indeed symmetric, many properties of $f$ are transferred to $F$ .

There is also a notion of spectral set. We say that $Q\subseteq\mathbb{R}^{r}$ is a symmetric set if $PQ=Q$ for every $P\in\mathcal{P}^{r}$ . Then the spectral set induced by $Q$ is defined as

[TABLE]

To conclude this subsection, we now move on to the notion of weakly spectral sets/maps, which was introduced by Gowda and Jeong in [12]. We say that a linear bijection $A:\mathcal{E}\to\mathcal{E}$ is a Jordan algebra automorphism if

[TABLE]

The group of Jordan algebra automorphisms is denoted by ${\mathrm{Aut}\,}\mathcal{E}$ . Then, a function $F:\mathcal{E}\to\overline{\mathbb{R}}$ is said to be weakly spectral if

[TABLE]

A set $\Omega\subseteq\mathcal{E}$ is said to be weakly spectral if $A\Omega=\Omega$ holds for every $A\in{\mathrm{Aut}\,}\mathcal{E}$ . A spectral map/set must also be weakly spectral, but the converse is not true in general, see remarks in Section 3 of [12].

4 Transfer principles for generalized subdifferentials

We start with a description of our setting and a few conventions. Throughout Sections 4 and 5, $(\mathcal{E},{\circ})$ denotes a Euclidean Jordan algebra of rank $r$ , the inner product of two elements of $\mathcal{E}$ is given by (4) and the norm is the one induced by $\eqref{eq:inner}$ . Although we are using the same symbol to denote the Euclidean inner product on $\mathbb{R}^{r}$ and the trace inner product on $\mathcal{E}$ , there will be no confusion. The letters $x,y,z,s$ will always be reserved for elements of $\mathcal{E}$ and $u,v,d$ for elements of $\mathbb{R}^{r}$ .

Let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be a spectral function induced by some symmetric function $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ . Our first goal is to prove the following meta-formula:

[TABLE]

where $\Diamond$ is either $\hat{\partial},\partial$ , or $\partial^{\infty}$ .

Remark 8.

For the sake of dispelling any possible confusion, $\Diamond f(\lambda(x))$ should be interpreted as $(\Diamond f)(\lambda(x))$ , i.e., $\Diamond f(\lambda(x))$ is the generalized subdifferential of $f$ at $\lambda(x)$ .

Proving (Transfer) will require several tools old and new, such as commutation principles [23, 12], majorization principles [11] and the formulae for the directional derivatives of the eigenvalue functions [3].

4.1 Commutation principles and generalized subdifferentials

The first step towards (Transfer) is proving that if $F$ is a spectral function and $s$ is any generalized subgradient of $x$ , then $x$ and $s$ must operator commute. For that, we will use a commuting principle proved by Ramírez, Seeger and Sossa [23].

Theorem 9 (Ramírez, Seeger and Sossa111

Here, we are quoting the theorem as it appears in Gowda and Jeong’s paper [12] (Theorem 1.1 therein), since it is more suited to our purposes. [23]).

Suppose that $\Omega\subseteq\mathcal{E}$ is a spectral set and $F:\mathcal{E}\to\mathbb{R}$ is a spectral function. Let $\Theta:\mathcal{E}\to\mathbb{R}$ be Fréchet differentiable. If $x^{*}$ is a local minimizer/maximizer of

[TABLE]

then $x^{*}$ and $\nabla\Theta(x^{*})$ operator commute222We recall that $x^{*}$ is a local minimum if there exists a neighbourhood $\mathcal{V}$ of $x^{*}$ such that $\Theta(x^{*})+F(x^{*})\leq\Theta(x)+F(x)$ holds for every $x\in\Omega\cap\mathcal{V}$ ..

Recently, Gowda and Jeong showed that it is possible to weaken the hypothesis of Theorem 9 and consider weakly spectral sets/functions instead [12].

Theorem 10 (Gowda and Jeong [12]).

The conclusion of Theorem 9 holds if $\Omega$ is a weakly spectral set and $F$ is a weakly spectral function.

Using the variational characterization of the regular subdifferential, we can prove the following new result, which is more general than what is strictly necessary for proving (Transfer), but we believe it is still useful.

Proposition 11 (Operator commutativity for weakly spectral functions).

Let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be a weakly spectral function. Suppose

[TABLE]

where $\Diamond$ is either $\hat{\partial},\partial$ or $\partial^{\infty}$ . Then, $x$ and $s$ operator commute.

Proof.

First, we prove the result for the case $s\in\hat{\partial}F(x)$ . By Proposition 1, there exists a $C^{1}$ function $H$ such that $H(x)=F(x)$ , $\nabla H(x)=s$ and $H(y)\leq F(y)$ for all $y$ near $x$ . We invoke Theorem 10 using $F$ , $\Theta=-H$ and $\Omega={\rm dom}\,F$ . By the properties of $H$ , we have that $x$ is a local minimum of $\Theta+F=F-H$ . Therefore, $x$ commutes with $\nabla\Theta(x)=-s$ , so it must commute with $s$ too. In reality, there are some minor technical details we have overlooked, see the footnote333The functions in Theorem 10 are finite functions defined everywhere, whereas $F$ is an extended value function and $H$ is defined only in a neighbourhood of $x$ . To sidestep this, we define $\hat{F}$ such that $\hat{F}(y)=F(y)$ if $y\in{\rm dom}\,F$ and $\hat{F}(y)=F(x)$ if $y\not\in{\rm dom}\,F$ . With that, $\hat{F}$ is still a weakly spectral function. Next we need to extend $H$ to a function defined over $\mathcal{E}$ which coincides with $H$ in some neighbourhood of $x$ . It is a classical fact that this can always be done and here we show briefly why. Suppose that $H$ is defined over some open set $\mathcal{U}$ . Let $\mathcal{V}\subseteq\mathcal{U}$ be an open ball such that $\mathrm{cl}\,\mathcal{V}\subseteq\mathcal{U}$ and over which $x$ is a local minimizer of $F-H$ . Next, pick any function $\psi$ that is smooth and such that $\psi$ is $1$ on the compact set $\mathrm{cl}\,\mathcal{V}$ and [math] outside $\mathcal{U}$ . Then, we define $\hat{H}$ by letting $\hat{H}(y)=\psi(y)H(y)$ if $y\in\mathcal{U}$ and $\hat{H}(y)=0$ if $y\not\in\mathcal{U}$ . With that, we have that $\nabla\hat{H}(x)=s$ and $x$ is a local minimum of $\hat{F}-\hat{H}$ restricted to ${\rm dom}\,F$ . Then, as before, we can invoke Theorem 10 with $\hat{F}$ , $\Omega={\rm dom}\,F$ and $\Theta=-\hat{H}$ . below.

Next, suppose instead that $s\in\partial F(x)$ or $s\in\partial^{\infty}F$ . Then, there are sequences $\{x^{k}\}$ , $\{s^{k}\},\{t^{k}\}$ such that every $k$ satisfies $s^{k}\in\hat{\partial}F(x^{k})$ and the following limits hold.

[TABLE]

Here, there are two cases for $\{t^{k}\}$ . If $s\in\partial F(x)$ , then $t^{k}=1$ for every $k$ . If $s\in\partial^{\infty}F(x)$ , then $t^{k}\downarrow 0$ .

Either way, because $s^{k}\in\hat{\partial}F(x^{k})$ , from what we have proved so far, we have that $s^{k}$ and $x^{k}$ operator commute for every $k$ . That is,

[TABLE]

By taking limits, we conclude that $L_{s}L_{x}=L_{x}L_{s}$ must also hold. Therefore, $s$ and $x$ operator commute too. ∎

4.2 The easy inclusion

Next, we prove the inclusion “ $\subseteq$ ” in (Transfer), when $\Diamond=\hat{\partial}$ .

Proposition 12 (The easy inclusion).

Let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be the spectral function induced by a symmetric function $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ . Let $s\in\hat{\partial}F(x)$ . Then, $x$ and $s$ operator commute and for any $\mathcal{J}\in\mathcal{J}(x,s)$ we have

[TABLE]

Proof.

Let $s\in\hat{\partial}F(x)$ . By Proposition 1 there exists a neighborhood $\mathcal{U}$ of $x$ and a $C^{1}$ function $H:\mathcal{U}\to\mathbb{R}$ such that $H(y)\leq F(y)$ for all $y\in\mathcal{U}$ and $H(x)=F(x)$ , $\nabla H(x)=s$ . In addition, by Proposition 11, $s$ and $x$ operator commute. Therefore, $\mathcal{J}(x,s)$ must be nonempty, i.e., $x$ and $s$ have at least one common Jordan frame.

Let $\mathcal{J}\in\mathcal{J}(x,s)$ and consider the linear map ${\mathrm{Diag}\,}(\cdot,\mathcal{J}):\mathbb{R}^{r}\to\mathcal{E}$ . Since ${\mathrm{Diag}\,}(\cdot,\mathcal{J})$ is continuous, $V={\mathrm{Diag}\,}(\cdot,\mathcal{J})^{-1}(\mathcal{U})$ is an open set of $\mathbb{R}^{r}$ containing $\lambda(x)$ . Now, let $h:V\to\mathbb{R}$ be such that

[TABLE]

Let $v\in V$ . Using the symmetry of $f$ and the properties of $H$ , we obtain

[TABLE]

That is, $f(v)\geq h(v)$ holds for every $v\in V$ . Also $h(\lambda(x))=H({\mathrm{Diag}\,}(\lambda(x),\mathcal{J}))=H(x)=f(\lambda(x))$ . By the chain rule, we also have $\nabla h(\lambda(x))={\mathrm{diag}\,}(s,\mathcal{J})$ . Therefore, by Proposition 1, we conclude that ${\mathrm{diag}\,}(s,\mathcal{J})\in\hat{\partial}f(\lambda(x))$ . ∎

4.3 The hard inclusion

The hard part of proving (Transfer) is establishing the inclusion “ $\supseteq$ ”, when $\Diamond=\hat{\partial}$ . From Lewis’ discussion in [19], it seems that one of the key steps for proving (Transfer) in the case of symmetric matrices is a result relating the diagonal of a matrix $Z$ with the directional derivative $\lambda^{\prime}(X;Z)$ , see Theorem 5 in [19]. We will prove an analogous result by following an original approach making use of a recent majorization principle proved by Gowda in [11].

Let $u\in\mathbb{R}^{r}$ , we denote by ${u}^{\downarrow}$ the element in $\mathbb{R}^{r}_{\geq}$ corresponding to a reordering of the coordinates of $u$ in such a way that

[TABLE]

Now, let $v\in\mathbb{R}^{r}$ be another element. Then, we say that $u$ is majorized by $v$ and write $u\prec v$ if

[TABLE]

and the sum of components of both $u$ and $v$ coincide, i.e., $u_{1}+\cdots+u_{r}=v_{1}+\cdots+v_{r}$ . It is a classical fact following from Birkhoff’s theorem that $u$ is majorized by $v$ if and only if $v$ lies in the convex hull of all permutations of $v$ , i.e.,

[TABLE]

see Section B in Chapter 2 of [22]. If $x,y\in\mathcal{E}$ we say that $x$ is majorized by $y$ and write $x\prec y$ if $\lambda(x)$ is majorized by $\lambda(y)$ . Whenever majorization principles are used, it is safer to mention the standard disclaimers that, throughout the literature, there seems to be no consensus on the direction of the inequalities appearing in the definition of majorization. In some texts, “ $\geq$ ” is used instead of “ $\leq$ ”. Here, we are following the convention in [11], which by its turn follows the notation in [4].

Let $X$ be a symmetric matrix. It is known that the diagonal entries of $X$ are majorized by the eigenvalues of $X$ . Gowda recently extended this fact to Euclidean Jordan algebras.

Proposition 13 (Gowda, Example 7 and Theorem 6 in [11]).

Let $\mathcal{J}$ be a Jordan frame and let $x\in\mathcal{E}$ . Then, ${\mathrm{diag}\,}(x,\mathcal{J})$ is majorized by $\lambda(x)$ . In particular,

[TABLE]

Proof.

Consider the map $\psi:\mathcal{E}\to\mathcal{E}$ defined by

[TABLE]

In [11], the map $\psi$ is denoted by “ ${\mathrm{Diag}\,}$ ” and it has a different meaning from the map ${\mathrm{Diag}\,}$ we are using in this paper. In any case, in Example 7 and Theorem 6 in [11], Gowda showed that $\psi(x)\prec x$ holds for every $x\in\mathcal{E}$ . Accordingly, we have

[TABLE]

Now, we observe that the components of ${\mathrm{diag}\,}(x,\mathcal{J})$ are precisely the eigenvalues of $\psi(x)$ . Furthermore, the fact that a vector $u\in\mathbb{R}^{r}$ is majorized by $v\in\mathbb{R}^{r}$ does not change if we permute the entries of $u$ or $v$ . We conclude that ${\mathrm{diag}\,}(x,\mathcal{J})\prec\lambda(x)$ and that ${\mathrm{diag}\,}(x,\mathcal{J})\in{\mathrm{conv}\,}\{P\lambda(x)\mid P\in\mathcal{P}^{r}\}.$ ∎

We are now able to prove an analogous of Theorem 4 of [19] for Euclidean Jordan algebras.

Theorem 14 (The diagonal map and directional derivatives of the eigenvalue map).

Let $x,z\in\mathcal{E}$ and let $\mathcal{J}\in\mathcal{J}(x)$ . Then

[TABLE]

First, we sketch the general proof strategy for Theorem 14. The idea is to separate the vector $\lambda(x)$ in blocks of equal eigenvalues and apply the formula in Theorem 7 for each block. Then, for each block, we associate a Euclidean Jordan algebra $\mathcal{E}^{j}$ and invoke Proposition 13. Since Proposition 13 is invoked in a blockwise fashion according to the blocks of equal eigenvalues of $x$ , the resulting pieces can be glued together to obtain a convex combination of matrices in $\mathcal{P}^{r}(\lambda(x))$ .

Proof.

To start, let us consider the spectral decomposition of $x$ ,

[TABLE]

where $\lambda_{1}(x)\geq\cdots\geq\lambda_{r}(x)$ and $\mathcal{J}=[c_{1},\ldots,c_{r}]$ is a Jordan frame. Now, we use the notation described in Section 3.2 and denote by $l_{i}(x)$ the “relative position” of the index $i$ with respect the eigenvalues of $x$ that are equal to $\lambda_{i}(x)$ .

Next, let $r_{1},\ldots,r_{\ell}$ be such that

[TABLE]

Here, $\ell$ is the number of distinct eigenvalues of $x$ . For convenience, we define $r_{0}=0$ and $n_{j}=r_{j}-r_{j-1}$ for $j\in\{1,\ldots,\ell\}$ . Then, we divide ${\mathrm{diag}\,}(z,\mathcal{J})$ in $\ell$ parts according to the blocks of equal eigenvalues of $x$ :

[TABLE]

where

[TABLE]

We do the same for the map $\lambda$ and divide $\lambda$ in $\ell$ maps such that

[TABLE]

Here, each $\lambda^{j}$ is a map $\mathcal{E}\to\mathbb{R}^{n_{j}}$ such that

[TABLE]

Applying Theorem 7 to each $\lambda^{j}$ , we obtain

[TABLE]

where $e_{r_{j}}$ is the sum of the idempotents associated to the eigenvalues equal to $\lambda_{r_{j}}(x)$ and $\mathcal{E}^{j}$ is the Jordan algebra $V(e_{r_{j}},1)$ of rank $n_{j}$ .

Let $z^{j}\coloneqq Q_{e_{r_{j}}}(z)$ , for every $j$ . From Theorem 5, $z^{j}$ is the orthogonal projection of $z$ onto $\mathcal{E}^{j}$ . The indices from $r_{j-1}+1$ to $r_{j}$ all correspond to equal eigenvalues of $x$ . Therefore, from (7) and the definition of the relative index $l_{r_{j-1}+k}$ , we conclude that

[TABLE]

where we recall that $\lambda(\cdot,\mathcal{E}^{j})$ is the eigenvalue map of the algebra $\mathcal{E}^{j}$ . Next, let $\mathcal{J}^{j}\coloneqq[c_{r_{j-1}+1},\ldots,c_{r_{j}}]$ . Since $\mathcal{J}$ is a Jordan frame and the sum of the elements of $\mathcal{J}^{j}$ is $e_{r_{j}}$ (the identity element of $\mathcal{E}^{j}$ ), we have that $\mathcal{J}^{j}$ is a Jordan frame in the algebra $\mathcal{E}^{j}$ . We will now prove that ${\mathrm{diag}\,}(z^{j},\mathcal{J}^{j})=u^{j}$ . Let $k$ be an integer such that $r_{j-1}+1\leq k\leq{r_{j}}$ , we have

[TABLE]

where the second equality follows from the fact $Q_{e_{r_{j}}}$ is self-adjoint and the third equality follows from the fact that $Q_{e_{r_{j}}}(c_{k})=c_{k}$ since $e_{r_{j}}$ is the identity element in $\mathcal{E}^{j}$ and $c_{k}$ is an idempotent contained in $\mathcal{E}^{j}$ . Since this holds for every $k$ satisfying $r_{j-1}+1\leq k\leq{r_{j}}$ , we conclude that ${\mathrm{diag}\,}(z^{j},\mathcal{J}^{j})=u^{j}$ . From (8) and Proposition 13 applied to $z^{j},\mathcal{J}^{j}$ and $\mathcal{E}^{j}$ , we conclude that for every $j$ , we have

[TABLE]

That is, there are nonnegative coefficients $\alpha_{j,k}$ and $\kappa_{j}$ permutation matrices $P^{j,k}\in\mathcal{P}^{n_{j}}$ such that

[TABLE]

We are now almost done. First, we define $A^{j}$ as the following $n_{j}\times n_{j}$ matrix

[TABLE]

Next, we define $A$ as the matrix satisfying

[TABLE]

Because of (10), we have

[TABLE]

which together with (9) implies that

[TABLE]

Now, we consider an arbitrary matrix $P$ appearing in (11) which is of the form

[TABLE]

$P$ is a block diagonal matrix and since each block is a permutation matrix, $P$ is a permutation matrix too. Furthermore, by construction, the block structure of $P$ follows the pattern of equal eigenvalues of $x$ . So, for instance, $P^{1,j_{1}}$ has size $n_{1}=r_{1}$ , which corresponds to the first block of $r_{1}$ equal eigenvalues of $x$ . For this reason, we obtain

[TABLE]

Accordingly, $P$ belongs to $\mathcal{P}^{r}(\lambda(x))$ and from (11) and (12), we conclude that

[TABLE]

∎

Next, we will prove the inclusion “ $\supseteq$ ” in (Transfer), when $\Diamond=\hat{\partial}$ . With all the preliminary results in place, we can proceed analogously to Theorem 5 of [19].

Proposition 15 (The hard inclusion).

Let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be the spectral function induced by a symmetric function $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ . Then

[TABLE]

Proof.

Let $s\in\mathcal{E}$ and $\mathcal{J}\in\mathcal{J}(x,s)$ be such that ${\mathrm{diag}\,}(s,\mathcal{J})\in\hat{\partial}f(\lambda(x))$ . Our goal is to show that $s\in\hat{\partial}F(x).$ In view of (2), $s\in\hat{\partial}F(x)$ will be established if we show that for every $\epsilon>0$ , there exists $\delta$ such that $\left\|z\right\|\leq\delta$ implies

[TABLE]

However, since $\mathcal{J}$ diagonalizes $s$ , we have

[TABLE]

Therefore, our goal is to show that for every $\epsilon>0$ , there exists $\delta$ such that $\left\|z\right\|\leq\delta$ implies

[TABLE]

Now, we will set up a few objects that will help us towards proving (Goal). First, we observe that ${\mathrm{diag}\,}(s,\mathcal{J})\in\hat{\partial}f(\lambda(x))$ and (6) implies that

[TABLE]

Next, we define $\Lambda$ to be the convex hull of the $P{\mathrm{diag}\,}(s,\mathcal{J})$ with $P\in\mathcal{P}^{r}(\lambda(x))$ and denote by $\delta^{*}_{\Lambda}$ the corresponding support function. Since $\Lambda$ is generated by a finite number of elements, we have

[TABLE]

Now that the pieces are in place, we move on to proving (Goal). Let $\epsilon>0$ . From the definition of regular subgradients (see (1)) and from (2), for every $P\in\mathcal{P}^{r}(\lambda(x))$ , there exists $\delta_{P}$ such that $\left\|v\right\|\leq\delta_{P}$ implies

[TABLE]

In particular, if we let $\hat{\delta}=\min_{P\in\mathcal{P}^{r}(\lambda(x))}\delta_{P}$ , we conclude that

[TABLE]

whenever $\left\|v\right\|\leq\hat{\delta}$ . From item $(ii)$ of Lemma 4 and decreasing $\hat{\delta}$ if necessary, we have that if $z\in\mathcal{E}$ satisfies $\left\|z\right\|\leq\hat{\delta}$ , it holds that

[TABLE]

By item $(i)$ of Lemma 4, $\left\|\lambda(x+z)-\lambda(x)\right\|\leq\left\|z\right\|$ . Therefore, if $z$ satisfies $\left\|z\right\|\leq\hat{\delta}$ , we obtain from (13) that

[TABLE]

Since $\delta^{*}_{\Lambda}$ is the pointwise maximum of linear functions, $\delta^{*}_{\Lambda}$ is a Lipschitz continuous sublinear function with Lipschitz constant $\kappa$ given by

[TABLE]

Therefore, for every $u,v\in\mathbb{R}^{r}$ , we have

[TABLE]

Now, we let $u=\lambda^{\prime}(x;z)$ and $v=\lambda(x+z)-\lambda(x)-\lambda^{\prime}(x;z)$ in (16) and use the resulting inequality back in (15), to obtain

[TABLE]

where the last inequality follows from (14).

By Theorem 14, we have

[TABLE]

Therefore, there are nonnegative numbers $\alpha_{1},\ldots,\alpha_{\ell}$ such that their sum is $1$ and

[TABLE]

where each $P_{i}$ belongs to $\mathcal{P}^{r}(\lambda(x))$ . We recall that, by definition, $\delta^{*}_{\Lambda}(Pu)=\delta^{*}_{\Lambda}(u)$ for every $P\in\mathcal{P}^{r}(\lambda(x))$ and $u\in\mathbb{R}^{r}$ . Using the convexity of $\delta^{*}_{\Lambda}$ , we obtain

[TABLE]

Using inequality (18) in (17), we obtain that for every $z\in\mathcal{E}$ with $\left\|z\right\|\leq\hat{\delta}$ , we have

[TABLE]

Since $\epsilon$ was arbitrary, this shows that (Goal) holds. ∎

4.4 Main results

From Propositions 12 and 15, we conclude that (Transfer) holds for the case $\Diamond=\hat{\partial}$ . Next, will prove transfer results for the approximate and horizon subdifferentials which will conclude the proof of (Transfer).

Proposition 16 (The approximate and horizon subdifferentials of spectral functions).

Let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be the spectral function induced by a symmetric function $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ . Then, for $x\in\mathcal{E}$ , we have

[TABLE]

Proof.

First, we prove the inclusion “ $\subseteq$ ” in (19) and (20). Let $s\in\partial F(x)$ or $s\in\partial^{\infty}F(x)$ . By definition, there are sequences $\{x^{k}\},\{s^{k}\},\{t^{k}\}$ such that $s^{k}\in\hat{\partial}F(x^{k})$ holds for every $k$ and

[TABLE]

Here, there are two cases for $\{t^{k}\}$ . If $s\in\partial F(x)$ , then $t^{k}=1$ for every $k$ . If $s\in\partial^{\infty}F(x)$ , then $t^{k}\downarrow 0$ . Since $s^{k}\in\hat{\partial}F(x^{k})$ holds for every $k$ , Proposition 12 implies the existence of $\mathcal{J}^{k}\in\mathcal{J}(x^{k},s^{k})$ such that

[TABLE]

Let $\mathcal{J}^{k}=[c_{1,k},\ldots,c_{r,k}]$ . Since $\left\|c_{i,k}\right\|=1$ for every $i$ and $k$ , passing to a subsequence if necessary, we may assume that for every $i$ , $c_{i,k}$ converges to some $\overline{c}_{i}$ . Elementary properties of limits show that ${\overline{c}_{i}\circ\overline{c}_{j}}=0$ if $i\neq j$ and ${\overline{c}_{i}\circ\overline{c}_{i}}=\overline{c}_{i}$ . Therefore $\overline{\mathcal{J}}=[\overline{c}_{1},\ldots,\overline{c}_{r}]$ is a Jordan frame in $\mathcal{E}$ .

Now, we need to examine whether $\overline{\mathcal{J}}\in\mathcal{J}(x,s)$ . We have

[TABLE]

Since each $\lambda_{i}(\cdot)$ is a continuous function and $x^{k}\to x$ , we conclude that

[TABLE]

An analogous argument shows that $\overline{\mathcal{J}}$ diagonalizes $s$ . Gathering all we have shown, we obtain that ${\mathrm{diag}\,}(s^{k},\mathcal{J}^{k})\in\hat{\partial}f(\lambda(x^{k}))$ holds for every $k$ and

[TABLE]

That is, $\overline{\mathcal{J}}\in\mathcal{J}(x,s)$ together with either ${\mathrm{diag}\,}(s,\overline{\mathcal{J}})\in\partial f(\lambda(x))$ (if $s\in\partial F(x)$ ) or ${\mathrm{diag}\,}(s,\overline{\mathcal{J}})\in\partial^{\infty}f(\lambda(x))$ (if $s\in\partial^{\infty}F(x)$ ).

We will now prove the inclusion “ $\supseteq$ ”. Let $s\in\mathcal{E}$ be such that there are sequences $\{u^{k}\},\{d^{k}\},\{t^{k}\}$ satisfying $d^{k}\in\hat{\partial}f(u^{k})$ for every $k$ and

[TABLE]

where $\mathcal{J}\in\mathcal{J}(x,s)$ . Here, either $t^{k}=1$ for every $k$ or $t^{k}\downarrow 0$ . Let $\mathcal{J}=[c_{1},\ldots,c_{r}]$ .

For every $k$ , let $P^{k}\in\mathcal{P}^{r}$ be a permutation matrix such that $P^{k}u^{k}={(u^{k})}^{\downarrow}$ . Since $d^{k}\in\hat{\partial}f(u^{k})$ holds for every $k$ and $f$ is a symmetric function, we have from (6) that

[TABLE]

Let

[TABLE]

Let $\sigma$ be the permutation on the set $\{1,\ldots,r\}$ induced by $P^{k}$ , i.e., $\sigma(i)=j$ , if and only if, $P^{k}$ permutes the $i$ -th and the $j$ -th entries of a vector. We have $\lambda(x^{k})={(u^{k})}^{\downarrow}$ and $P^{k}\mathcal{J}\in\mathcal{J}(x^{k},s^{k})$ , where $P^{k}\mathcal{J}$ is defined as

[TABLE]

Therefore, from (21) we have

[TABLE]

which combined with Proposition 15 shows that

[TABLE]

Next, since $u^{k}\to\lambda(x)$ , it follows that $x^{k}\to x$ . Again, recalling that $f$ is a symmetric function and that

[TABLE]

we have $F(x^{k})\to F(x)$ , since $f(u^{k})\to f(\lambda(x))$ . Similarly, we have $t^{k}s^{k}\to s$ , since $t^{k}d^{k}\to{\mathrm{diag}\,}(s,\mathcal{J})$ . This shows that $s\in\partial F(x)$ (if ${\mathrm{diag}\,}(s,\mathcal{J})\in\partial f(\lambda(x))$ ) or $s\in\partial^{\infty}F(x)$ (if ${\mathrm{diag}\,}(s,\mathcal{J})\in\partial^{\infty}f(\lambda(x))$ ). ∎

We can now state our main result.

Theorem 17 (Generalized subdifferentials of spectral functions).

Let $(\mathcal{E},{\circ})$ be a Euclidean Jordan algebra of rank $r$ and let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be the spectral function induced by a symmetric function $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ . Then, for $x\in\mathcal{E}$ , we have

[TABLE]

whenever $\Diamond$ is $\hat{\partial},\partial$ or $\partial^{\infty}$ .

Proof.

Follows from Propositions 12, 15, 16. ∎

4.5 Convex hull of generalized subdifferentials and the Clarke subdifferential

In this subsection, we will prove the following meta-formula

[TABLE]

whenever $\Diamond$ is a subdifferential which behaves nicely with respect to permutations and for which (Transfer) holds. One of the motivations for this formula is, of course, the study of the Clarke subdifferential, which we will discuss next. First, we recall that $f$ is locally Lipschitz continuous at $\hat{u}$ if there exists some neighbourhood $U$ of $\hat{u}$ and a constant $\kappa$ such that

[TABLE]

Using the construction of the Clarke subdifferential through the Bouligand derivative, Baes proved in his PhD thesis that, if $f$ is locally Lipschitz, then the meta-formula (Transfer) holds when $\Diamond$ is either the Bouligand or the Clarke subdifferential, see Proposition 4.5.1 and Theorems 4.5.4 and 4.5.5 in [2]. However, denoting by $\partial_{C}$ the Clarke subdifferential, it turns out that, under local Lipschitzness, we have

[TABLE]

see Theorem 9.61 in [24]. Therefore, with some effort, Theorem 17 can be used to give another proof that (Transfer) holds when $\Diamond$ is $\partial_{C}$ and $f$ is locally Lipschitz continuous. The first step towards this idea is the following result, which is a variant of Theorem 14.

Proposition 18.

Let $x,s\in\mathcal{E}$ be such that $x$ and $s$ operator commute. Then, for every $\mathcal{J}\in\mathcal{J}(x)$ and every $\hat{\mathcal{J}}\in\mathcal{J}(x,s)$ we have

[TABLE]

Proof.

By Theorem 14, we already have

[TABLE]

All we need to do now is to relate $\lambda^{\prime}(x,s)$ and ${\mathrm{diag}\,}(s,\hat{\mathcal{J}})$ . For that, we will proceed as in the proof of Theorem 14.

Let us consider the spectral decomposition of $x$ according to $\hat{\mathcal{J}}=[\hat{c}_{1},\ldots,\hat{c}_{r}]$ ,

[TABLE]

Then, we use the notation described in Section 3.2 and denote by $l_{i}(x)$ the “relative position” of the index $i$ with respect the eigenvalues of $x$ that are equal to $\lambda_{i}(x)$ . Furthermore, we let $e_{i}$ be the sum of the idempotents $\hat{c}_{i}$ associated to the eigenvalues equal to $\lambda_{i}(x)$ . We also let $r_{1},\ldots,r_{\ell}$ be such that

[TABLE]

Here, $\ell$ is the number of distinct eigenvalues of $x$ . For convenience, we define $r_{0}=0$ and $n_{j}=r_{j}-r_{j-1}$ for $j\in\{1,\ldots,\ell\}$ . Then, we divide ${\mathrm{diag}\,}(s,\hat{\mathcal{J}})$ and $\lambda^{\prime}(x;s)$ in $\ell$ parts according to the blocks of equal eigenvalues of $x$ :

[TABLE]

First, we observe that if $\lambda_{i}(x)=\lambda_{j}(x)$ , then we have $e_{i}=e_{j}$ . Then, from the formula for the directional derivatives (Theorem 7) and the fact that $\hat{\mathcal{J}}$ diagonalizes $s$ , we obtain

[TABLE]

where $\mathcal{E}^{j}=V(e_{r_{j}},1)$ . We recall that $Q_{e_{r_{j}}}(s)$ is the orthogonal projection of $s$ onto $V(e_{r_{j}},1)$ . And, again, because $\hat{\mathcal{J}}$ diagonalizes $s$ , we obtain

[TABLE]

which is the spectral decomposition of $Q_{e_{r_{j}}}(s)$ in the algebra $\mathcal{E}^{j}$ . In particular, the eigenvalues of $Q_{e_{r_{j}}}(s)$ in the algebra $\mathcal{E}^{j}$ are precisely the components of $u^{j}$ . We also need to recall that $\lambda_{l_{r_{j-1}+k}}(Q_{e_{r_{j}}}(s);\mathcal{E}^{j})$ is, in fact, the $k$ -th largest eigenvalue of $Q_{e_{r_{j}}}(s)$ in the algebra $\mathcal{E}^{j}$ .

Piecing everything together, we conclude that $v^{j}$ is just the result of sorting $u^{j}$ in nonincreasing order. Therefore, there exists a permutation matrix $P^{j}\in\mathcal{P}^{n_{j}}$ such that $v^{j}=P^{j}u^{j}$ , for every $j\in\{1,\ldots,\ell\}$ . Then, if we let

[TABLE]

we have $\lambda^{\prime}(x,s)=\hat{P}{\mathrm{diag}\,}(s,\hat{\mathcal{J}})$ and since the block structure of $P$ follows the blocks of equal eigenvalues of $\lambda(x)$ , we have $P\in\mathcal{P}^{r}(\lambda(x))$ . From (22), we have

[TABLE]

since $\mathcal{P}^{r}(\lambda(x))$ is a group. ∎

For what follows, we say that a subdifferential $\Diamond$ is permutation compatible if

[TABLE]

whenever $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ is a symmetric function and $P\in\mathcal{P}^{r}$ . We note that all subdifferentials $\hat{\partial},\partial,\partial^{\infty},\partial_{C}$ that have appeared so far in this paper are permutation compatible. With that, we are ready to prove the following meta-theorem which might be applicable to other subdifferentials not discussed in this paper.

Theorem 19 (Convex hull of generalized subdifferentials).

Let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be the spectral function induced by a symmetric function $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ . Then, for $x\in\mathcal{E}$ , we have

[TABLE]

where $\Diamond$ is any permutation compatible subdifferential for which (Transfer) holds. In particular, if $\lambda(x)\in\mathrm{int}\,({\rm dom}\,f)$ and $f$ is locally Lipschitz continuous at $\lambda(x)$ , then (Transfer) holds when $\Diamond=\partial_{C}$ .

Proof.

First we prove the “ $\supseteq$ ” inclusion. Suppose $s$ and $\mathcal{J}$ are such that $\mathcal{J}\in\mathcal{J}(x,s)$ and ${\mathrm{diag}\,}(s,\mathcal{J})$ is the convex combination of $d_{1},\ldots,d_{\ell}\in\Diamond f(\lambda(x))$ . Then, since (Transfer) holds, we have

[TABLE]

Because $s$ is a convex combination of the ${\mathrm{Diag}\,}(d_{i},\mathcal{J})$ , we obtain $s\in{\mathrm{conv}\,}\Diamond F(x)$ .

Next, we prove the “ $\subseteq$ ” inclusion. Let $s_{1},s_{2}\in\Diamond F(x)$ . Since (Transfer) holds, there are $\mathcal{J}_{1}\in\mathcal{J}(x,s_{1})$ and $\mathcal{J}_{2}\in\mathcal{J}_{2}(x,s_{2})$ such that

[TABLE]

Let $s_{3}$ be a convex combination of $s_{1},s_{2}$ , so that

[TABLE]

for some $\alpha\in[0,1]$ . Since $x,s_{1}$ and $x,s_{2}$ are pairs of simultaneously diagonalizable elements, the same must be true of the pair $x,s_{3}$ , see (5). We conclude that there exists $\mathcal{J}_{3}\in\mathcal{J}(x,s_{3})$ . Now, we invoke Proposition 18 with $\mathcal{J}=\mathcal{J}_{3}$ and $\hat{\mathcal{J}}=\mathcal{J}_{1}$ , to conclude that

[TABLE]

Because $\Diamond$ is permutation compatible, (23) implies that $P{\mathrm{diag}\,}(s_{1},\mathcal{J}_{1})$ belongs to $\Diamond f(\lambda(x))$ for every $P\in\mathcal{P}^{r}({\lambda(x)})$ . Therefore, ${\mathrm{diag}\,}(s_{1},\mathcal{J}_{3})\in{\mathrm{conv}\,}\Diamond f(\lambda(x))$ . A completely analogous argument for $s_{2}$ shows that

[TABLE]

Since ${\mathrm{diag}\,}(s_{3},\mathcal{J}_{3})$ is a convex combination of ${\mathrm{diag}\,}(s_{1},\mathcal{J}_{3})$ and ${\mathrm{diag}\,}(s_{2},\mathcal{J}_{3})$ , we conclude that, indeed,

[TABLE]

which proves the inclusion “ $\subseteq$ ”.

Finally, if $f$ is locally Lipschitz continuous at $\lambda(x)\in\mathrm{int}\,({\rm dom}\,f)$ , the fact that the eigenvalue map is Lipschitz continuous (Lemma 4) shows that $F$ must be locally Lipschitz continuous at $x$ . Therefore,

[TABLE]

This shows that (Transfer) holds with $\Diamond=\partial_{C}$ . ∎

Next, we will take a look at the Clarke subdifferential of spectral functions without assuming local Lipschitzness, in order to extend Baes’ results. First, we will briefly explain some technical issues related to this task. In Theorem 8.9 of [24], we see that each of the generalized subdifferentials $\hat{\partial},\partial,\partial^{\infty}$ is associated to a corresponding notion of normal cone. In this context, the Clarke subdifferential is defined using the convexified version of the normal cone associated to $\partial$ , see Section J in chapter 8 of [24]. The problem is that, by doing so, the Clarke subdifferential can be larger than the convex hull of the approximate subdifferential. Therefore, in general, we have $\partial_{C}F(x)\neq{\mathrm{conv}\,}\partial F(x)$ .

Nevertheless, under local lower semicontinuity, we have the following, see Lemma 4.1 in [20]. We recall that $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ is said to be locally lower semicontinuous at $u$ , if $f(u)$ is finite and there exists $\epsilon>0$ such that $\{v\in\mathbb{R}^{r}\mid\left\|u-v\right\|\leq\epsilon,f(v)\leq\alpha\}$ is closed for every $\alpha$ satisfying $\alpha\leq f(u)+\epsilon$ , see Definition 1.33 in [24].

Lemma 20 (Lemma 4.1 in [20]).

Suppose $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ is locally lower semicontinuous at $u$ . Then,

[TABLE]

With the aid of Lemma 20, we are now in position to extend Baes’ results on the Clarke subdifferential.

Theorem 21 (Clarke subgradients of spectral functions under local lower semicontinuity).

Let $F:\mathcal{E}\to\overline{\mathbb{R}}$ be the spectral function induced by a symmetric function $f:\mathbb{R}^{r}\to\overline{\mathbb{R}}$ . The following hold:

$(i)$

$F$ * is locally lower semicontinuous at $x\in\mathcal{E}$ if and only if $f$ is locally lower semicontinuous at $\lambda(x)$ .* 2. $(ii)$

If $F$ is locally lower semicontinuous at $x$ , then (Transfer) is valid when $\Diamond=\partial_{C}$ .

Proof.

Item $(i)$ follows from the continuity of the eigenvalue map $\lambda$ and elementary properties of the maps ${\mathrm{diag}\,}(\cdot,\mathcal{J})$ and ${\mathrm{Diag}\,}(\cdot,\mathcal{J})$ when $\mathcal{J}\in\mathcal{J}(x)$ . We will omit its proof.

Now, we move on to item $(ii)$ . Under Lemma 20, we have

[TABLE]

First, suppose that $s\in\partial_{C}F(x)$ , so there is a sequence $\{s^{k}\}\subseteq\mathcal{E}$ such that $s^{k}\to s$ and for each $k$ we have

[TABLE]

where ${\overline{s}}^{k}\in{\mathrm{conv}\,}\partial F(x)$ and $s^{k}_{\infty}\in{\mathrm{conv}\,}\partial^{\infty}F(x)$ . By Theorem 19, there are $\overline{\mathcal{J}}^{k}\in\mathcal{J}(x,{\overline{s}}^{k})$ and $\mathcal{J}_{\infty}^{k}\in\mathcal{J}(x,s^{k}_{\infty})$ such that

[TABLE]

Because ${\overline{s}}^{k}$ and $s^{k}_{\infty}$ both operator commute with $x$ , we conclude that $s^{k}$ operator commutes with $x$ as well, see (5). Therefore, there exists a Jordan frame $\mathcal{J}^{k}$ such that $\mathcal{J}^{k}\in\mathcal{J}(x,s^{k})$ . Next, we apply Proposition 18 two times. First with ${\overline{s}}^{k}$ , $\mathcal{J}^{k}$ , $\overline{\mathcal{J}}^{k}$ and then with $s^{k}_{\infty}$ , $\mathcal{J}^{k}$ , $\mathcal{J}^{k}_{\infty}$ in order to obtain that

[TABLE]

Since (6) holds for the approximate and horizon subdifferentials, we have

[TABLE]

for every $P\in\mathcal{P}^{r}(\lambda(x))$ when $\Diamond$ is $\partial$ or $\partial^{\infty}$ . Therefore, (26) together with (27) and (28) implies that

[TABLE]

and

[TABLE]

We now proceed as in the proof of Proposition 16. Since the idempotents in $\mathcal{J}^{k}$ have norm 1, passing to a converging subsequence if necessary, the Jordan frame $\mathcal{J}^{k}$ converges to some Jordan frame $\mathcal{J}\in\mathcal{J}(x,s)$ and we have

[TABLE]

Together with (24) and (29), we conclude that the inclusion “ $\subseteq$ ” holds in (Transfer) when $\Diamond$ is $\partial_{C}$ .

Now, for the “ $\supseteq$ ” inclusion, suppose that $s$ is such that ${\mathrm{diag}\,}(s,\mathcal{J})\in\partial_{C}f(\lambda(x))$ with $\mathcal{J}\in\mathcal{J}(x,s)$ . By (24), there is a sequence $\{u^{k}\}\subseteq\mathbb{R}^{r}$ with $u^{k}\to{\mathrm{diag}\,}(s,\mathcal{J})$ such that

[TABLE]

where $\overline{u}^{k}\in{\mathrm{conv}\,}\partial f(\lambda(x))$ and $u_{\infty}^{k}\in{\mathrm{conv}\,}\partial^{\infty}f(\lambda(x))$ . Therefore, ${\mathrm{Diag}\,}(\overline{u}^{k},\mathcal{J})+{\mathrm{Diag}\,}({u}_{\infty}^{k},\mathcal{J})\to s$ . In addition, by Theorem 19, we have

[TABLE]

Using (25), we conclude that $s\in\partial_{C}F(x)$ . ∎

4.6 Subdifferentials of the $k$ -th largest eigenvalue function

In this subsection, as an application of Theorems 17, 19 and 21, we will compute the generalized subdifferentials of the function $\lambda_{k}(\cdot):\mathcal{E}\to\mathbb{R}$ that maps an element $x\in\mathcal{E}$ to its $k$ -th largest eigenvalue, for $k\in\{1,\ldots,r\}$ .

Let $f_{k}:\mathbb{R}^{r}\to\mathbb{R}$ be the function that maps $u\in\mathbb{R}^{r}$ to its $k$ -th largest component. Then, $f_{k}$ is a symmetric function and $\lambda_{k}$ is the spectral function generated by $f_{k}$ . We note that, since the eigenvalue map is Lipschitz continuous, each $\lambda_{k}$ must be Lipschitz continuous as well. In what follows, $a^{i}\in\mathbb{R}^{r}$ denotes the $i$ -th unit vector and we recall that $u_{i}$ denotes the $i$ -th component of $u\in\mathbb{R}^{r}$ . We also define

[TABLE]

For a finite set $C$ , we denote its cardinality by $\lvert C\rvert$ . The generalized subdifferentials of $f_{k}$ are described by the following proposition, see Proposition 6 and Theorem 9 in [19].

Proposition 22.

The following hold.

[TABLE]

*where $\alpha=1-k+\lvert\{i\mid u_{i}\geq f_{k}(u)\}\rvert$ . *

Let $\mathcal{I}$ denote the set of primitive idempotents of $\mathcal{E}$ . We recall that $c\in\mathcal{I}$ if and only if $c$ is nonzero, ${c\circ c}=c$ and $c$ cannot be written as the sum of two nonzero orthogonal idempotents.

Lemma 23 (Frame extension lemma).

Let $x\in\mathcal{E}$ and $c\in\mathcal{I}$ . If ${x\circ c}=\sigma c$ for some $\sigma\in\mathbb{R}$ , then $\sigma$ is an eigenvalue of $x$ and there is a Jordan frame $\mathcal{J}\in\mathcal{J}(x)$ such that $c\in\mathcal{J}$ . In particular, $\mathcal{J}(x,c)\neq\emptyset$ .

Proof.

By the Peirce decomposition (Theorem 5), we have

[TABLE]

Then, since ${x\circ c}=\sigma c$ , we have ${c\circ(x-\sigma c)}=0$ . Therefore, $x-\sigma c\in V(c,0)$

$V(c,0)$ is a Euclidean Jordan algebra (see Theorem 5). Furthermore, since $c$ has rank $1$ , the algebra $V(c,0)$ has rank $r-1$ . Therefore, we can find a Jordan frame $\hat{\mathcal{J}}=[c_{1},\ldots,c_{r-1}]$ that diagonalizes $x-\sigma c$ in $V(c,0)$ . It follows that

[TABLE]

where $\sigma_{i}\in\mathbb{R}$ for every $i$ . We now need to check that $\mathcal{J}=[c,c_{1},\ldots,c_{r-1}]$ is a Jordan frame. All elements of $\mathcal{J}$ are primitive idempotents. Furthermore, ${c_{i}\circ c_{j}}=0$ if $i\neq j$ . Since $\hat{\mathcal{J}}\subseteq V(c,0)$ , we also have ${c\circ c_{i}}=0$ for every $i$ . Since the identity element of $V(c,0)$ is ${e}-c$ and $\hat{\mathcal{J}}$ is a Jordan frame in $V(c,0)$ , we have

[TABLE]

This shows that $c+c_{1}+\cdots+c_{r-1}={e}$ . Therefore, $\mathcal{J}$ is indeed a Jordan frame of the algebra $\mathcal{E}$ and (30) shows that $\mathcal{J}$ diagonalizes $x$ . Since eigenvalues are unique, $\sigma$ must be one of the eigenvalues of $x$ . Reordering $\mathcal{J}$ if necessary, we obtain $\mathcal{J}\in\mathcal{J}(x,c)$ . ∎

Lemma 24 (Convex hull of primitive idempotents).

Let $x\in\mathcal{E}$ and $\sigma\in\mathbb{R}$ be an eigenvalue of $x$ . Let

[TABLE]

Let $s\in{\mathrm{conv}\,}\mathcal{I}(x,\sigma)$ .

$(i)$

The eigenvalues of $s$ are nonnegative and sum to $1$ . 2. $(ii)$

*There is $\mathcal{J}\in\mathcal{J}(x,s)$ such that $\langle s,c\rangle=0$ for every $c\in\mathcal{J}$ not belonging to $\mathcal{I}(x,\sigma)$ . *

Proof.

Primitive idempotents have trace equal to $1$ and the trace function is linear, so elements in ${\mathrm{conv}\,}\mathcal{I}(x,\sigma)$ must have trace $1$ too. Then, we recall that any idempotent $c$ must be belong to ${\mathcal{K}}\coloneqq\{{x\circ x}\mid x\in\mathcal{E}\}$ , which is a symmetric cone (see Theorem III.2.1 in [6]). In particular, ${\mathcal{K}}$ is a convex cone and, since $s$ is a convex combination of elements of ${\mathcal{K}}$ , $s$ belongs to ${\mathcal{K}}$ which implies that its eigenvalues are nonnegative.

Next, we move on to item $(ii)$ . Pick any Jordan frame for $x$ and let $\hat{c}$ denote the sum of the primitive idempotents associated to the eigenvalue $\sigma$ . By Proposition 3, $\hat{c}$ does not depend on the choice of Jordan frame. Since $s\in{\mathrm{conv}\,}\mathcal{I}(x,\sigma)$ , we have

[TABLE]

where $c_{i}\in\mathcal{I}(x,\sigma)$ for every $i$ and the $\alpha_{i}$ are nonnegative and sum to $1$ . First, we will show that $s\in V(\hat{c},1)$ .

By Lemma 23, each $c_{i}$ can be extended to a Jordan frame $\mathcal{J}_{i}\in\mathcal{J}(x,c_{i})$ with $c_{i}\in\mathcal{J}_{i}$ . Then, the idempotents in $\mathcal{J}_{i}$ associated to the eigenvalue $\sigma$ must sum to $\hat{c}$ by Proposition 3 and, at the same time, ${c^{\prime}\circ c_{i}}=0$ holds whenever $c^{\prime}\in\mathcal{J}_{i}$ and $c^{\prime}\neq c_{i}$ . We conclude that

[TABLE]

Therefore, each $c_{i}$ belongs to $V(\hat{c},1)$ , which shows that $s\in V(\hat{c},1)$ . Since $V(\hat{c},1)$ and $V(\hat{c},0)$ are Euclidean Jordan algebras, there is a Jordan Frame $\widehat{\mathcal{J}}\subseteq V(\hat{c},1)$ that diagonalizes $s$ . Next, since $x-\sigma\hat{c}\in V(\hat{c},0)$ , there is a Jordan frame $\widetilde{\mathcal{J}}\subseteq V(\hat{c},0)$ that diagonalizes $x-\sigma\hat{c}$ .

Let $\mathcal{J}\coloneqq\widehat{\mathcal{J}}\cup\widetilde{\mathcal{J}}$ . First, because $\widehat{\mathcal{J}}\subseteq V(\hat{c},1)$ and $\widetilde{\mathcal{J}}\subseteq V(\hat{c},0)$ are Jordan frames, we have (the well-known fact) that $\mathcal{J}$ is a Jordan frame in the algebra $\mathcal{E}$ .

Then, since $\widetilde{\mathcal{J}}$ diagonalizes $x-\sigma\hat{c}$ , $\widehat{\mathcal{J}}$ diagonalizes $s$ and the sum of the elements of $\widehat{\mathcal{J}}$ is $\hat{c}$ (the unit element of $V(\hat{c},1)$ ), we conclude that $\mathcal{J}$ diagonalizes $x$ and $s$ . We also observe that $\widehat{\mathcal{J}}\subseteq\mathcal{I}(x,\sigma)$ , which can be seen by expressing $x$ as a linear combination of the elements in $\mathcal{J}$ and recalling that the idempotents of $\widehat{\mathcal{J}}$ sum to $\hat{c}$ .

Finally, if $c\in\mathcal{J}$ but $c\not\in\mathcal{I}(x,\sigma)$ , then $c\in\widetilde{\mathcal{J}}$ and $\langle s,c\rangle=0$ , because $V(\hat{c},1)$ and $V(\hat{c},0)$ are orthogonal spaces. Reordering $\mathcal{J}$ if necessary, we obtain $\mathcal{J}\in\mathcal{J}(x,s)$ with the required properties. ∎

We are now equipped to prove the following result.

Theorem 25 (Generalized subdifferentials of $\lambda_{k}$ ).

Let $\mathcal{E}$ be a Euclidean Jordan algebra of rank $r$ and let $\lambda_{k}(\cdot)$ denote the function that maps an element to its $k$ -largest eigenvalue. The following hold.

[TABLE]

where $\alpha=1-k+\lvert\{i\mid\lambda_{i}(x)\geq\lambda_{k}(x)\}\rvert$ .

Proof.

The equality $\partial^{\infty}\lambda_{k}(x)=\{0\}$ follows from Theorem 17 and Proposition 22.

We will now prove the formula for $\partial_{C}\lambda_{k}$ . Let $s\in\partial_{C}\lambda_{k}(x)$ . By Theorem 19 and Proposition 22, there exists $\mathcal{J}\in\mathcal{J}(x,s)$ such that

[TABLE]

Because $s$ is written as a linear combination of elements of $\mathcal{J}$ , (35) implies that $s$ is a convex combination of the idempotents of $\mathcal{J}$ associated to $\lambda_{k}(x)$ . Observing that those idempotents satisfy ${x\circ c}=\lambda_{k}(x)c$ , we obtain

[TABLE]

which shows that “ $\subseteq$ ” holds in (31).

Conversely, suppose that $s\in{\mathrm{conv}\,}\mathcal{I}(x,\lambda_{k}(x))$ . By item $(i)$ of Lemma 24 applied to $x,s$ and $\lambda_{k}(x)$ , the eigenvalues of $s$ are nonnegative and sum to $1$ . Furthermore, by item $(ii)$ of Lemma 24, there exists $\mathcal{J}\in\mathcal{J}(x,s)$ such that $\langle s,c\rangle=0$ , whenever $c\in\mathcal{J}$ and $c$ is not associated to $\lambda_{k}(x)$ . This, together with Proposition 22, shows that

[TABLE]

because the nonzero components of ${\mathrm{diag}\,}(s,\mathcal{J})$ are nonnegative, sum to $1$ and are located only at indices associated to idempotents in $\mathcal{I}(x,\lambda_{k}(x))$ . By Theorem 19, we have $s\in\partial_{C}\lambda_{k}(x)$ , which shows that (31) holds.

The expressions for $\hat{\partial}\lambda_{k}(x),\partial\lambda_{k}(x)$ are consequences of Theorem 17, Proposition 22, the formula for $\partial_{C}\lambda_{k}(x)$ and the fact that $\lvert{\mathrm{supp}\,}(\lambda(x))\rvert={\mathrm{rank}\,}(x)$ . ∎

5 The KL-exponent of spectral functions

We recall the definitions of the KL property and KL-exponent, see Definitions 2.2 and 2.3 in [21]. In what follows, we define ${\rm dom}\,\partial f\coloneqq\{u\in\mathbb{R}^{r}\mid\partial f(u)\neq\emptyset\}$ . If $C$ is a subset of $\mathbb{R}^{r}$ , we define ${\mathrm{dist}\,}(u,C)=\inf\{\left\|v-u\right\|\mid v\in C\}$ . If $\mathcal{C}$ is a subset of $\mathcal{E}$ , we define ${\mathrm{dist}\,}(x,\mathcal{C})$ analogously using the norm induced by (4).

Definition 26 (KL-property and KL-exponent).

A lower semicontinuous function $f$ is said to satisfy the KL property at $u\in{\rm dom}\,\partial f$ if there exists a neighbourhood $U$ of $u$ , $\nu\in(0,\infty]$ and a continuous concave function $\psi:[0,\nu)\to\mathbb{R}_{+}$ with $\psi(0)=0$ such that

$(i)$

$\psi$ * is continuously differentiable on $(0,\nu)$ with (its derivative) $\psi^{\prime}$ positive over $(0,\nu)$ ;* 2. $(ii)$

for all $v\in U$ with $f(u)<f(v)<f(u)+\nu$ , we have

[TABLE]

In particular, $f$ is said to satisfy the KL property with exponent $\alpha$ at $u\in{\rm dom}\,\partial f$ , if $\psi$ can be taken to be $\psi(t)=ct^{1-\alpha}$ for some positive constant $c$ .

First, we need the following lemma.

Lemma 27.

Let $f:\mathbb{R}^{r}\to\mathbb{R}$ be a symmetric function and let $F:\mathcal{E}\to\mathbb{R}$ be the corresponding spectral function. Then, for every $y\in\mathcal{E}$ and for every Jordan frame $\hat{\mathcal{J}}$ which diagonalizes $y$ (see Section 3.1) we have

[TABLE]

Proof.

Let $y\in\mathcal{E}$ and let $\hat{\mathcal{J}}$ be a Jordan frame which diagonalizes $y$ . From (6) and since permutation matrices are orthogonal matrices, we obtain

[TABLE]

In particular,

[TABLE]

Therefore, it suffices to show that ${\mathrm{dist}\,}(0,\partial F(y))={\mathrm{dist}\,}(0,\partial f(\lambda(y))).$ From Theorem 17, we have

[TABLE]

Therefore, ${\mathrm{dist}\,}(0,\partial F(y))\geq{\mathrm{dist}\,}(0,\partial f(\lambda(y)))$ . To show the opposite inequality, let $d\in\partial f(\lambda(y)),\mathcal{J}\in\mathcal{J}(y).$ By Theorem 17, $s\coloneqq{\mathrm{Diag}\,}(d,\mathcal{J})$ is such that $s\in\partial F(y)$ . Furthermore, we have $\left\|s\right\|=\left\|d\right\|$ . This shows that ${\mathrm{dist}\,}(0,\partial F(y))\leq{\mathrm{dist}\,}(0,\partial f(\lambda(y)))$ . ∎

Theorem 28 (Transfer principle for the KL property and KL exponent).

Let $f:\mathbb{R}^{r}\to\mathbb{R}$ be a symmetric function and let $F:\mathcal{E}\to\mathbb{R}$ be the corresponding spectral function. Then,

$(i)$

$F$ * satisfies the KL property $x$ if and only if $f$ satisfies the KL property at $\lambda(x)$ . In addition, the $\psi$ and $\nu$ in Definition 26 can be taken to be the same for both $f$ and $F$ .* 2. $(ii)$

$F$ * satisfies the KL property with exponent $\alpha$ at $x$ if and only if $f$ satisfies the KL property with exponent $\alpha$ at $\lambda(x)$ .*

Proof.

First we prove item $(i)$ . By Theorem 17 we have $x\in{\rm dom}\,\partial F$ if and only if $\lambda(x)\in{\rm dom}\,\partial f$ . Next, suppose that $f$ satisfies the KL property at $\lambda(x)$ and let $U,\nu$ and $\psi$ be as in Definition 26.

Since $\lambda$ is continuous, $\mathcal{U}\coloneqq\lambda^{-1}(U)$ is a neighbourhood of $x$ . Therefore, if $y\in\mathcal{U}$ is such that $F(x)<F(y)<F(x)+\nu$ , we have

[TABLE]

By Lemma 27 and item $(ii)$ of Definition 26 applied to $f$ and $\psi$ , we have

[TABLE]

This shows that $F$ satisfies the KL property at $x$ with the same $\psi$ and $\nu$ .

Now, we prove the converse. Suppose that $F$ satisfies the KL property at $x$ and let $\mathcal{U}$ be a neighbourhood of $x$ together with $\psi$ and $\nu$ such that Definition 26 is satisfied.

Let $\mathcal{J}\in\mathcal{J}(x)$ and $U\coloneqq{\mathrm{Diag}\,}(\cdot,\mathcal{J})^{-1}(\mathcal{U})$ . Then, whenever $v\in U$ is such that $f(\lambda(x))<f(v)<f(\lambda(x))+\nu$ , we have

[TABLE]

By item $(ii)$ of Definition 26, we have

[TABLE]

By Lemma 27, we have

[TABLE]

This shows that $f$ satisfies the KL property at $\lambda(x)$ with the same $\psi$ and $\nu$ , which concludes the proof of item $(i)$ .

Next, we observe that item $(ii)$ is a particular case of the previous item, when $\psi$ can be taken to be $\psi(t)=ct^{1-\alpha}$ . ∎

Remark 29.

*In Theorem 3.2 of [21] there is a result about the KL-exponent of function compositions of the form $g_{1}(g_{2}(\cdot))$ . However, the result requires that $g_{2}$ be continuously differentiable, so it cannot be used to prove Theorem 28. *

Acknowledgments

We thank the referees for their comments, which helped to improve the paper. This work was partially supported by the Grant-in-Aid for Scientific Research (B) (19H04069) and the Grant-in-Aid for Young Scientists (19K20217) from Japan Society for the Promotion of Science.

Bibliography26

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] H. Attouch, J. Bolte, P. Redont, and A. Soubeyran. Proximal alternating minimization and projection methods for nonconvex problems: An approach based on the Kurdyka-Łojasiewicz inequality. Mathematics of Operations Research , 35(2):438–457, 2010.
2[2] M. Baes. Spectral functions and smoothing techniques on Jordan algebras . Ph D thesis, Université catholique de Louvain, 2006.
3[3] M. Baes. Convexity and differentiability properties of spectral functions and spectral mappings on Euclidean Jordan algebras. Linear Algebra and its Applications , 422(2):664 – 700, 2007.
4[4] R. Bhatia. Matrix Analysis . Graduate Texts in Mathematics. Springer New York, 1997.
5[5] J.-S. Chen, X. Chen, and P. Tseng. Analysis of nonsmooth vector-valued functions associated with second-order cones. Mathematical Programming , 101(1):95–117, Sep 2004.
6[6] J. Faraut and A. Korányi. Analysis on symmetric cones . Oxford mathematical monographs. Clarendon Press, Oxford, 1994.
7[7] L. Faybusovich. Several Jordan-algebraic aspects of optimization. Optimization , 57(3):379–393, 2008.
8[8] M. Fukushima, Z. Luo, and P. Tseng. Smoothing functions for second-order-cone complementarity problems. SIAM Journal on Optimization , 12(2):436–460, 2002.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Generalized subdifferentials of spectral functions over Euclidean Jordan algebras

Abstract

1 Introduction

1.1 Previous works

1.2 Contributions of this work

2 Preliminaries

2.1 Notation

2.2 Generalized subdifferentials

Proposition 1** (Rockafellar and Wets, Proposition 8.5 in [24]).**

3 Euclidean Jordan algebras

Theorem 2** (Spectral Theorem, see Theorem III.1.2 in [6]).**

Proposition 3** (Unique sum of primitive idempotents, see Theorems III.1.1 and III.1.2 in [6]).**

Lemma 4** (Properties of the eigenvalue map).**

Proof.

3.1 Simultaneous diagonalization

3.2 The directional derivative of the iii-th largest eigenvalue

Theorem 5** (Peirce Decomposition, see Proposition IV.1.1 and page 64 in [6]).**

Example 6**.**

Theorem 7** (Baes, Theorem 36 in [3]).**

3.3 Spectral functions and sets

4 Transfer principles for generalized subdifferentials

Remark 8**.**

4.1 Commutation principles and generalized subdifferentials

Theorem 9** **(Ramírez, Seeger and Sossa111

Theorem 10** (Gowda and Jeong [12]).**

Proposition 11** (Operator commutativity for weakly spectral functions).**

Proof.

4.2 The easy inclusion

Proposition 12** (The easy inclusion).**

Proof.

4.3 The hard inclusion

Proposition 13** (Gowda, Example 7 and Theorem 6 in [11]).**

Proof.

Theorem 14** (The diagonal map and directional derivatives of the eigenvalue map).**

Proof.

Proposition 15** (The hard inclusion).**

Proof.

4.4 Main results

Proposition 16** (The approximate and horizon subdifferentials of spectral functions).**

Proof.

Theorem 17** (Generalized subdifferentials of spectral functions).**

Proof.

4.5 Convex hull of generalized subdifferentials and the Clarke subdifferential

Proposition 18**.**

Proof.

Theorem 19** (Convex hull of generalized subdifferentials).**

Proof.

Lemma 20** (Lemma 4.1 in [20]).**

Theorem 21** (Clarke subgradients of spectral functions under local lower semicontinuity).**

Proof.

4.6 Subdifferentials of the kkk-th largest eigenvalue function

Proposition 22**.**

Lemma 23** (Frame extension lemma).**

Proof.

Lemma 24** (Convex hull of primitive idempotents).**

Proof.

Theorem 25** (Generalized subdifferentials of λk\lambda_{k}λk​).**

Proof.

5 The KL-exponent of spectral functions

Definition 26** (KL-property and KL-exponent).**

Lemma 27**.**

Proof.

Theorem 28** (Transfer principle for the KL property and KL exponent).**

Proof.

Remark 29**.**

Acknowledgments

Proposition 1 (Rockafellar and Wets, Proposition 8.5 in [24]).

Theorem 2 (Spectral Theorem, see Theorem III.1.2 in [6]).

Proposition 3 (Unique sum of primitive idempotents, see Theorems III.1.1 and III.1.2 in [6]).

Lemma 4 (Properties of the eigenvalue map).

3.2 The directional derivative of the $i$ -th largest eigenvalue

Theorem 5 (Peirce Decomposition, see Proposition IV.1.1 and page 64 in [6]).

Example 6.

Theorem 7 (Baes, Theorem 36 in [3]).

Remark 8.

Theorem 9 (Ramírez, Seeger and Sossa111

Theorem 10 (Gowda and Jeong [12]).

Proposition 11 (Operator commutativity for weakly spectral functions).

Proposition 12 (The easy inclusion).

Proposition 13 (Gowda, Example 7 and Theorem 6 in [11]).

Theorem 14 (The diagonal map and directional derivatives of the eigenvalue map).

Proposition 15 (The hard inclusion).

Proposition 16 (The approximate and horizon subdifferentials of spectral functions).

Theorem 17 (Generalized subdifferentials of spectral functions).

Proposition 18.

Theorem 19 (Convex hull of generalized subdifferentials).

Lemma 20 (Lemma 4.1 in [20]).

Theorem 21 (Clarke subgradients of spectral functions under local lower semicontinuity).

4.6 Subdifferentials of the $k$ -th largest eigenvalue function

Proposition 22.

Lemma 23 (Frame extension lemma).

Lemma 24 (Convex hull of primitive idempotents).

Theorem 25 (Generalized subdifferentials of $\lambda_{k}$ ).

Definition 26 (KL-property and KL-exponent).

Lemma 27.

Theorem 28 (Transfer principle for the KL property and KL exponent).

Remark 29.