Von Neumann Type of Trace Inequalities for Schatten-Class Operators

Gunther Dirr; Frederik vom Ende

arXiv:1906.00758·math.FA·March 30, 2023

Von Neumann Type of Trace Inequalities for Schatten-Class Operators

Gunther Dirr, Frederik vom Ende

PDF

TL;DR

This paper extends von Neumann's trace inequality and eigenvalue inequalities from finite-dimensional matrices to Schatten-class operators on infinite-dimensional Hilbert spaces, utilizing recent results on the $C$-numerical range.

Contribution

It introduces a generalization of classical trace and eigenvalue inequalities to Schatten-class operators in infinite-dimensional settings, expanding their applicability.

Findings

01

Generalized von Neumann's trace inequality to Schatten-class operators.

02

Extended eigenvalue inequalities for hermitian operators.

03

Utilized recent $C$-numerical range results for the generalization.

Abstract

We generalize von Neumann's well-known trace inequality, as well as related eigenvalue inequalities for hermitian matrices, to Schatten-class operators between complex Hilbert spaces of infinite dimension. To this end, we exploit some recent results on the $C$ -numerical range of Schatten-class operators. For the readers' convenience, we sketched the proof of these results in the Appendix.

Equations137

U, V \in U_{n} max ∣ tr (A U B V) ∣ = \sum_{j = 1}^{n} s_{j} (A) s_{j} (B),

U, V \in U_{n} max ∣ tr (A U B V) ∣ = \sum_{j = 1}^{n} s_{j} (A) s_{j} (B),

{tr (A U B V) ∣ U, V \in U_{n}} = K_{r} (0)

{tr (A U B V) ∣ U, V \in U_{n}} = K_{r} (0)

\sum_{j = 1}^{n} λ_{j}^{↓} (A) λ_{j}^{↑} (B) \leq tr (A B) \leq \sum_{j = 1}^{n} λ_{j}^{↓} (A) λ_{j}^{↓} (B),

\sum_{j = 1}^{n} λ_{j}^{↓} (A) λ_{j}^{↑} (B) \leq tr (A B) \leq \sum_{j = 1}^{n} λ_{j}^{↓} (A) λ_{j}^{↓} (B),

C = \sum_{n = 1}^{\infty} s_{n} (C) ⟨ f_{n}, \cdot ⟩ g_{n},

C = \sum_{n = 1}^{\infty} s_{n} (C) ⟨ f_{n}, \cdot ⟩ g_{n},

\displaystyle\mathcal{B}^{p}(\mathcal{X},\mathcal{Y}):=\Big{\{}C\in\mathcal{K}(\mathcal{X},\mathcal{Y})\,\Big{|}\,\sum\nolimits_{n=1}^{\infty}s_{n}(C)^{p}<\infty\Big{\}}

\displaystyle\mathcal{B}^{p}(\mathcal{X},\mathcal{Y}):=\Big{\{}C\in\mathcal{K}(\mathcal{X},\mathcal{Y})\,\Big{|}\,\sum\nolimits_{n=1}^{\infty}s_{n}(C)^{p}<\infty\Big{\}}

\displaystyle\|C\|_{p}:=\Big{(}\sum\nolimits_{n=1}^{\infty}s_{n}(C)^{p}\Big{)}^{1/p}

\displaystyle\|C\|_{p}:=\Big{(}\sum\nolimits_{n=1}^{\infty}s_{n}(C)^{p}\Big{)}^{1/p}

∥ C ∥_{\infty} := n \in N sup s_{n} (C) = s_{1} (C) .

∥ C ∥_{\infty} := n \in N sup s_{n} (C) = s_{1} (C) .

∥ S C T ∥_{p} \leq ∥ S ∥∥ C ∥_{p} ∥ T ∥ .

∥ S C T ∥_{p} \leq ∥ S ∥∥ C ∥_{p} ∥ T ∥ .

tr (C) := \sum_{i \in I} ⟨ f_{i}, C f_{i} ⟩,

tr (C) := \sum_{i \in I} ⟨ f_{i}, C f_{i} ⟩,

tr (C T) = tr (T C) and ∣ tr (C T) ∣ \leq ∥ C ∥_{p} ∥ T ∥_{q} .

tr (C T) = tr (T C) and ∣ tr (C T) ∣ \leq ∥ C ∥_{p} ∥ T ∥_{q} .

T = \sum_{j = 1}^{\infty} λ_{j} (T) ⟨ e_{j}, \cdot ⟩ e_{j}

T = \sum_{j = 1}^{\infty} λ_{j} (T) ⟨ e_{j}, \cdot ⟩ e_{j}

\displaystyle\Delta(A,B):=\max\Big{\{}\max_{z\in A}d(z,B),\max_{z\in B}d(z,A)\Big{\}}\,.

\displaystyle\Delta(A,B):=\max\Big{\{}\max_{z\in A}d(z,B),\max_{z\in B}d(z,A)\Big{\}}\,.

n \to \infty lim (max A_{n}) = max A and n \to \infty lim (min A_{n}) = min A .

n \to \infty lim (max A_{n}) = max A and n \to \infty lim (min A_{n}) = min A .

W_{C} (T) := {tr (C U^{†} T U) ∣ U \in U (X)} .

W_{C} (T) := {tr (C U^{†} T U) ∣ U \in U (X)} .

S_{C} (T) := {tr (C U T V) ∣ U \in U (X), V \in U (Y)} .

S_{C} (T) := {tr (C U T V) ∣ U \in U (X), V \in U (Y)} .

\sum_{k = K + 1}^{\infty} s_{k} (C)^{p} < \frac{ε ^{p}}{( 3 κ ) ^{p}},

\sum_{k = K + 1}^{\infty} s_{k} (C)^{p} < \frac{ε ^{p}}{( 3 κ ) ^{p}},

∥ S C - S_{n} C ∥_{p}

∥ S C - S_{n} C ∥_{p}

< ∥ S C_{1} - S_{n} C_{1} ∥_{p} + \frac{2 ε}{3} .

∥ S C_{1} - S_{n} C_{1} ∥_{p} \leq k = 1 \sum K s_{k} (C) ∥ ⟨ e_{k}, \cdot ⟩ (S f_{k} - S_{n} f_{k}) ∥_{p} = k = 1 \sum K s_{k} (C) ∥ S f_{k} - S_{n} f_{k} ∥ .

∥ S C_{1} - S_{n} C_{1} ∥_{p} \leq k = 1 \sum K s_{k} (C) ∥ ⟨ e_{k}, \cdot ⟩ (S f_{k} - S_{n} f_{k}) ∥_{p} = k = 1 \sum K s_{k} (C) ∥ S f_{k} - S_{n} f_{k} ∥ .

∥ S f_{k} - S_{n} f_{k} ∥ < \frac{ε}{3 \sum _{k = 1}^{K} s _{k} ( C )}

∥ S f_{k} - S_{n} f_{k} ∥ < \frac{ε}{3 \sum _{k = 1}^{K} s _{k} ( C )}

∥ S C S^{†} - S_{n} C S_{n}^{†} ∥_{p}

∥ S C S^{†} - S_{n} C S_{n}^{†} ∥_{p}

\displaystyle\leq\kappa\big{(}\|CS^{\dagger}-CS_{n}^{\dagger}\|_{p}+\|SC-S_{n}C\|_{p}\big{)}\,.\qed

n \to \infty lim \overline{S_{C_{n}} (T_{n})} = \overline{S_{C} (T)} .

n \to \infty lim \overline{S_{C_{n}} (T_{n})} = \overline{S_{C} (T)} .

n \to \infty lim \overline{W_{C_{n}} (T_{n})} = \overline{W_{C} (T)} .

n \to \infty lim \overline{W_{C_{n}} (T_{n})} = \overline{W_{C} (T)} .

κ := sup {∥ C ∥_{p}, ∥ C_{1} ∥_{p}, ∥ C_{2} ∥_{p}, \dots} and τ := sup {∥ T ∥_{q}, ∥ T_{1} ∥_{q}, ∥ T_{2} ∥_{q}, \dots} .

κ := sup {∥ C ∥_{p}, ∥ C_{1} ∥_{p}, ∥ C_{2} ∥_{p}, \dots} and τ := sup {∥ T ∥_{q}, ∥ T_{1} ∥_{q}, ∥ T_{2} ∥_{q}, \dots} .

∥ C - C_{n} ∥_{p} < \frac{ε}{4 τ} as well as ∥ T - T_{n} ∥_{q} < \frac{ε}{4 κ}

∥ C - C_{n} ∥_{p} < \frac{ε}{4 τ} as well as ∥ T - T_{n} ∥_{q} < \frac{ε}{4 κ}

∣ w - w_{n} ∣

∣ w - w_{n} ∣

< \frac{ε}{2} + ∣ tr ((C - C_{n}) U T V) ∣ + ∣ tr (V C_{n} U (T - T_{n})) ∣

\leq \frac{ε}{2} + ∥ C - C_{n} ∥_{p} ∥ U ∥∥ T ∥_{q} ∥ V ∥ + ∥ V ∥∥ C_{n} ∥_{p} ∥ U ∥∥ T - T_{n} ∥ q

\leq \frac{ε}{2} + ∥ C - C_{n} ∥_{p} τ + κ ∥ T - T_{n} ∥_{q} < ε

∣ v_{n} - \tilde{v}_{n} ∣

∣ v_{n} - \tilde{v}_{n} ∣

< \frac{ε}{2} + ∣ tr ((C_{n} - C) U_{n} T_{n} V_{n}) ∣ + ∣ tr (V_{n} C U_{n} (T_{n} - T)) ∣

\leq \frac{ε}{2} + ∥ C - C_{n} ∥_{p} τ + κ ∥ T - T_{n} ∥_{q} < ε . \qed

\displaystyle P_{C}(T):=\Big{\{}\sum\nolimits_{n=1}^{\infty}\lambda_{n}(C)\lambda_{\sigma(n)}(T)\,\Big{|}\,\sigma:\mathbb{N}\to\mathbb{N}\text{ is any permutation}\Big{\}}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Editor

\revisionMonth dd, yyyy

Von Neumann Type of Trace Inequalities for Schatten-Class Operators

Gunther Dirr and Frederik vom Ende

GUNTHER DIRR, Institute of Mathematics, University of Würzburg, D-97074 Würzburg, Germany

[email protected]

FREDERIK VOM ENDE, Department of Chemistry, Technische Universität München, D-85747 Garching, Germany–and–Munich Centre for Quantum Science and Technology (MCQST), D-80799 München, Germany

[email protected]

(Date: Month dd, yyyy)

Abstract.

We generalize von Neumann’s well-known trace inequality, as well as related eigenvalue inequalities for hermitian matrices, to Schatten-class operators between complex Hilbert spaces of infinite dimension. To this end, we exploit some recent results on the $C$ -numerical range of Schatten-class operators. For the readers’ convenience, we sketched the proof of these results in the Appendix.

1991 Mathematics Subject Classification:

4

keywords:

$C$ -numerical range; Schatten-class operators; trace inequality; von Neumann inequality

††volume-info: Volume 00, Number 0, 0000

7B10, 15A42, 47A12

1. INTRODUCTION

In the mid thirties of the last century, von Neumann [20, Thm. 1] derived the following beautiful and widely used trace inequality for complex $n\times n$ matrices:

Let $A,B\in\mathbb{C}^{n\times n}$ with singular values $s_{1}(A)\geq s_{2}(A)\geq\ldots\geq s_{n}(A)$ and $s_{1}(B)\geq s_{2}(B)\geq\ldots\geq s_{n}(B)$ , respectively, be given. Then

[TABLE]

where $\mathcal{U}_{n}$ denotes the unitary group.

In fact, the above result can be reinterpreted as a characterization of the image of the unitary double-coset $\{AUBV\,|\,U,V\in\mathbb{C}^{n\times n}\text{ unitary}\}$ under the trace-functional, i.e.

[TABLE]

with $r:=\sum_{j=1}^{n}s_{j}(A)s_{j}(B)$ and $K_{r}(0)=\{z\in\mathbb{C}\,,\,|z|\leq r\}$ being the closed disk of radius $r$ centred around the origin. This results from the elementary observation that the left-hand side of (1.2) is circular (simply replace $U$ by $e^{i\varphi}U$ ). Another well-known consequence of (1.1), a von Neumann inequality for hermitian matrices [10, Ch. 9.H.1], reads as follows.

Let $A,B\in\mathbb{C}^{n\times n}$ hermitian with respective eigenvalues $(\lambda_{j}(A))_{j=1}^{n}$ and $(\lambda_{j}(B))_{j=1}^{n}$ be given. Then

[TABLE]

where the superindeces $\downarrow$ and $\uparrow$ denote the decreasing and increasing sorting of the eigenvalue vectors, respectively.

The area of applications of von Neumann’s inequalities and, more generally, singular value decompositions (SVD) is enormous. It ranges from operator theory [6, 17] and numerics [8] to more applied fields like control theory [9], neural networks [14] as well as quantum dynamics and quantum control [7, 18]. An overview can be found in [10, 12]. Now the goal of this short contribution is to generalize these inequalities to Schatten-class operators on infinite-dimensional Hilbert spaces. In doing so, some recent results on the $C$ -numerical range of Schatten-class operators [3, 4] turn out to be quite helpful. For the readers’ convenience, we sketched the corresponding proofs in Appendix A.

This paper is organized as follows: Section 2 introduces the key notions and concepts of this work such as 2.1 Schatten classes, 2.2 convergence of compact sets via the Hausdorff metric as well as 2.3 the $C$ -numerical range for Schatten-class operators. Section 3 then presents the main results as mentioned above. Appendix A outlines the outsourced proof of some crucial geometrical results regarding the $C$ -numerical range.

2. NOTATION AND PRELIMINARIES

Unless stated otherwise, here and henceforth $\mathcal{X}$ and $\mathcal{Y}$ are arbitrary infinite-dimensional complex Hilbert spaces while $\mathcal{H}$ and $\mathcal{G}$ are reserved for infinite-dimensional separable complex Hilbert spaces. Moreover, let $\mathcal{B}(\mathcal{X},\mathcal{Y})$ , $\mathcal{U}(\mathcal{X},\mathcal{Y})$ , $\mathcal{K}(\mathcal{X},\mathcal{Y})$ , $\mathcal{F}(\mathcal{X},\mathcal{Y})$ and $\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ denote the set of all bounded, unitary, compact, finite-rank and $p$ -th Schatten-class operators between $\mathcal{X}$ and $\mathcal{Y}$ , respectively. As usual, if $\mathcal{X}$ and $\mathcal{Y}$ coincide we simply write $\mathcal{B}(\mathcal{X})$ , $\mathcal{U}(\mathcal{X})$ , etc.

Scalar products are conjugate linear in the first argument and linear in the second one. For an arbitrary subset $S\subset\mathbb{C}$ , the notations $\overline{S}$ and $\operatorname{conv}(S)$ stand for its closure and convex hull, respectively. Finally, given $p,q\in[1,\infty]$ , we say $p$ and $q$ are conjugate if $\frac{1}{p}+\frac{1}{q}=1$ .

2.1. INFINITE-DIMENSIONAL HILBERT SPACES AND THE SCHATTEN CLASSES

For a comprehensive introduction to Hilbert spaces of infinite dimension as well as Schatten-class operators, we refer to, e.g., [1, 11] and [5]. Here, we recall only some basic results which will be used frequently throughout this paper.

Lemma 2.1 (Schmidt decomposition).

For each $C\in\mathcal{K}(\mathcal{X},\mathcal{Y})$ , there exists a decreasing null sequence $(s_{n}(C))_{n\in\mathbb{N}}$ in $[0,\infty)$ as well as orthonormal systems $(f_{n})_{n\in\mathbb{N}}$ in $\mathcal{X}$ and $(g_{n})_{n\in\mathbb{N}}$ in $\mathcal{Y}$ such that

[TABLE]

where the series converges in the operator norm.

As the singular numbers $(s_{n}(C))_{n\in\mathbb{N}}$ in Lemma 2.1 are uniquely determined by $C$ , the $p$ -th Schatten-class $\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ is (well-)defined via

[TABLE]

for $p\in[1,\infty)$ . The Schatten- $p$ -norm

[TABLE]

turns $\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ into a Banach space. Moreover, for $p=\infty$ , we identify $\mathcal{B}^{\infty}(\mathcal{X},\mathcal{Y})$ with the set of all compact operators $\mathcal{K}(\mathcal{X},\mathcal{Y})$ equipped with the norm

[TABLE]

Note that $\|C\|_{\infty}$ coincides with the ordinary operator norm $\|C\|$ . Hence $\mathcal{B}^{\infty}(\mathcal{X},\mathcal{Y})$ constitutes a closed subspace of $\mathcal{B}(\mathcal{X},\mathcal{Y})$ and thus a Banach space, too.

Remark 2.2.

Evidently, if $C\in\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ for some $p\in[1,\infty]$ then the series (2.1) converges in the Schatten- $p$ -norm.

The following results can be found in [5, Coro. XI.9.4 & Lemma XI.9.9].

Lemma 2.3.

(a)

Let $p\in[1,\infty]$ . Then for all $S,T\in\mathcal{B}(\mathcal{X})$ , $C\in\mathcal{B}^{p}(\mathcal{X})$ :

[TABLE]

(b)

Let $1\leq p\leq q\leq\infty$ . Then $\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})\subseteq\mathcal{B}^{q}(\mathcal{X},\mathcal{Y})$ and $\|C\|_{p}\geq\|C\|_{q}$ for all $C\in\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ .

Note that due to (a), all Schatten-classes $\mathcal{B}^{p}(\mathcal{X})$ constitute–just like the compact operators–a two-sided ideal in the $C^{*}$ -algebra of all bounded operators $\mathcal{B}(\mathcal{X})$ .

Now for any $C\in\mathcal{B}^{1}(\mathcal{X})$ , the trace of $C$ is defined via

[TABLE]

where $(f_{i})_{i\in I}$ can be any orthonormal basis of $\mathcal{X}$ . The trace is well-defined, as one can show that the right-hand side of (2.2) is finite and does not depend on the choice of $(f_{i})_{i\in I}$ . Important properties are the following, cf. [5, Lemma XI.9.14].

Lemma 2.4.

Let $C\in\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ and $T\in\mathcal{B}^{q}(\mathcal{Y},\mathcal{X})$ with $p,q\in[1,\infty]$ conjugate. Then one has $CT\in\mathcal{B}^{1}(\mathcal{Y})$ and $TC\in\mathcal{B}^{1}(\mathcal{X})$ with

[TABLE]

In order to recap the well-known diagonalization result for compact normal operators, we first have to fix the term eigenvalue sequence of a compact operator $T\in\mathcal{K}(\mathcal{H})$ . In general, it is obtained by arranging the (necessarily countably many) non-zero eigenvalues in decreasing order with respect to their absolute value and each eigenvalue is repeated as many times as its algebraic multiplicity111By [11, Prop. 15.12], every non-zero element $\lambda\in\sigma(T)$ of the spectrum of $T$ is an eigenvalue of $T$ and has a well-defined finite algebraic multiplicity $\nu_{a}(\lambda)$ , e.g., $\nu_{a}(\lambda):=\dim\ker(T-\lambda I)^{n_{0}}$ , where $n_{0}\in\mathbb{N}$ is the smallest natural number $n\in\mathbb{N}$ such that $\ker(T-\lambda I)^{n}=\ker(T-\lambda I)^{n+1}$ . calls for. If only finitely many non-vanishing eigenvalues exist, then the sequence is filled up with zeros, see [11, Ch. 15]. For our purposes, we have to pass to a slightly modified eigenvalue sequence as follows:

•

If the range of $T$ is infinite-dimensional and the kernel of $T$ is finite-dimensional, then put $\operatorname{dim}(\operatorname{ker}T)$ zeros at the beginning of the eigenvalue sequence of $T$ .

•

If the range and the kernel of $T$ are infinite-dimensional, mix infinitely many zeros into the eigenvalue sequence of $T$ .

Because in Definition 2.12 arbitrary permutations will be applied to the modified eigenvalue sequence, we do not need to specify this mixing procedure further, cf. also [3, Lemma 3.6].

•

If the range of $T$ is finite-dimensional leave the eigenvalue sequence of $T$ unchanged.

Lemma 2.5 ([1], Thm. VIII.4.6).

Let $T\in\mathcal{K}(\mathcal{H})$ be normal, i.e. $T^{\dagger}T=TT^{\dagger}$ . Then there exists an orthonormal basis $(e_{n})_{n\in\mathbb{N}}$ of $\mathcal{H}$ such that

[TABLE]

where $(\lambda_{j}(T))_{j\in\mathbb{N}}$ is the modified eigenvalue sequence of $T$ .

2.2. SET CONVERGENCE

In order to transfer results about convexity and star-shapedness of the $C$ -numerical range from matrices to Schatten-class operators, we need a concept of set convergence. We will use the Hausdorff metric on compact subsets (of $\mathbb{C}$ ) and the associated notion of convergence, see, e.g., [13].

The distance between $z\in\mathbb{C}$ and a non-empty compact subset $A\subseteq\mathbb{C}$ is given by $d(z,A):=\min_{w\in A}d(z,w)=\min_{w\in A}|z-w|$ , based on which the Hausdorff metric $\Delta$ on the set of all non-empty compact subsets of $\mathbb{C}$ is defined via

[TABLE]

The following characterization of the Hausdorff metric is readily verified.

Lemma 2.6.

Let $A,B\subset\mathbb{C}$ be two non-empty compact sets and let $\varepsilon>0$ . Then $\Delta(A,B)\leq\varepsilon$ if and only if for all $z\in A$ , there exists $w\in B$ with $d(z,w)\leq\varepsilon$ and vice versa.

With this metric one can introduce the notion of convergence for sequences $(A_{n})_{n\in\mathbb{N}}$ of non-empty compact subsets of $\mathbb{C}$ such that the maximum- as well as the minimum-operator are continuous in the following sense.

Lemma 2.7.

Let $(A_{n})_{n\in\mathbb{N}}$ be a bounded sequence of non-empty, compact subsets of $\mathbb{R}$ which converges to $A\subset\mathbb{R}$ . Then the sequences of real numbers $(\max A_{n})_{n\in\mathbb{N}}$ and $(\min A_{n})_{n\in\mathbb{N}}$ are convergent with

[TABLE]

Proof.

Let $\varepsilon>0$ . By assumption, there exists $N\in\mathbb{N}$ such that $\Delta(A_{n},A)<\varepsilon$ for all $n\geq N$ . Hence by Lemma 2.6 one finds $a_{n}\in A_{n}$ with $|\max A-a_{n}|<\varepsilon$ and thus $\max A<a_{n}+\varepsilon<\max A_{n}+\varepsilon\,.$ Similarly, there exists $a\in A$ such that $|\max A_{n}-a|<\varepsilon$ , so $\max A_{n}<a+\varepsilon<\max A+\varepsilon\,.$ Combining both estimates, we get $|\max A-\max A_{n}|<\varepsilon$ . The case of the minimum is shown analogously. ∎

2.3. THE $C$ -NUMERICAL RANGE OF SCHATTEN-CLASS OPERATORS

In this subsection, we present a few approximation results and collect some material on the $C$ -numerical range of Schatten-class operators which is of fundamental importance in Section 3. Because said results appeared only in an addendum [4] to another publication [3] on trace-class operators, we decided to sketch the proof in the appendix for the readers’ convenience.

Definition 2.8.

Let $p,q\in[1,\infty]$ be conjugate. Then for $C\in\mathcal{B}^{p}(\mathcal{X})$ and $T\in\mathcal{B}^{q}(\mathcal{X})$ , the $C$ -numerical range of $T$ is defined to be

[TABLE]

Following (1.2), for $C\in\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ and $T\in\mathcal{B}^{q}(\mathcal{Y},\mathcal{X})$ with $p,q\in[1,\infty]$ conjugate one may actually introduce the more general set (now invoking the unitary equivalence orbit $UTV$ of $T$ instead of the unitary similarity orbit $U^{\dagger}TU$ )

[TABLE]

Note that all traces involved are well-defined due to Lemma 2.3 and 2.4.

Lemma 2.9.

Let $p\in[1,\infty]$ , $C\in\mathcal{B}^{p}(\mathcal{X})$ and $(S_{n})_{n\in\mathbb{N}}$ be a sequence in $\mathcal{B}(\mathcal{X})$ which converges strongly to $S\in\mathcal{B}(\mathcal{X})$ . Then one has $S_{n}C\to SC$ , $CS_{n}^{\dagger}\to CS^{\dagger}$ , and $S_{n}CS_{n}^{\dagger}\to SCS^{\dagger}$ for $n\to\infty$ with respect to the norm $\|\cdot\|_{p}$ .

Proof.

The cases $p=1$ and $p=\infty$ are proven in [3, Lemma 3.2]. As the proof for $p\in(1,\infty)$ is essentially the same, we sketch only the major differences. First, choose $K\in\mathbb{N}$ such that

[TABLE]

where $\kappa>0$ satisfies $\|S\|\leq\kappa$ and $\|S_{n}\|\leq\kappa$ for all $n\in\mathbb{N}$ . The existence of the constant $\kappa>0$ is guaranteed by the uniform boundedness principle. Then decompose $C=\sum\nolimits_{k=1}^{\infty}s_{k}(C)\langle e_{k},\cdot\rangle f_{k}$ into $C=C_{1}+C_{2}$ with $C_{1}:=\sum\nolimits_{k=1}^{K}s_{k}(C)\langle e_{k},\cdot\rangle f_{k}$ finite-rank. By Lemma 2.3 one has

[TABLE]

Thus, what remains is to choose $N\in\mathbb{N}$ such that $\|SC_{1}-S_{n}C_{1}\|_{p}<\varepsilon/3$ for all $n\geq N$ . To this end, consider the estimate

[TABLE]

Then the strong convergence of $(S_{n})_{n\in\mathbb{N}}$ yields $N\in\mathbb{N}$ such that

[TABLE]

for $k=1,\dots,K$ and all $n\geq N$ . This shows $\|SC-S_{n}C\|_{p}\to 0$ as $n\to\infty$ . All other assertions are an immediate consequence of $\|A\|_{p}=\|A^{\dagger}\|_{p}$ for $A\in\mathcal{B}^{p}(\mathcal{X})$ and

[TABLE]

Proposition 2.10.

Let $C\in\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ , $T\in\mathcal{B}^{q}(\mathcal{Y},\mathcal{X})$ with $p,q\in[1,\infty]$ conjugate and let $(C_{n})_{n\in\mathbb{N}}$ and $(T_{n})_{n\in\mathbb{N}}$ be sequences in $\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ and $\mathcal{B}^{q}(\mathcal{Y},\mathcal{X})$ , respectively, such that $\lim_{n\to\infty}\|C-C_{n}\|_{p}=\lim_{n\to\infty}\|T-T_{n}\|_{q}=0\,.$ Then

[TABLE]

If, additionally, $\mathcal{X}=\mathcal{Y}$ then

[TABLE]

Proof.

W.l.o.g. let $C_{n},T_{n}\neq 0$ for some $n\in\mathbb{N}$ –else all the involved sets would be trivial–so we may introduce the positive but (as seen via the reverse triangle inequality) finite numbers

[TABLE]

Let $\varepsilon>0$ . By assumption there exists $N\in\mathbb{N}$ such that

[TABLE]

for all $n\geq N$ . We shall first tackle (2.3), as (2.4) can be shown in complete analogy. The goal will be to satisfy the assumptions of Lemma 2.6 in order to show $\Delta(\overline{S_{C}(T)},\overline{S_{C_{n}}(T_{n})})<\varepsilon$ for all $n\geq N$ .

Let $w\in\overline{S_{C}(T)}$ so one finds $U\in\mathcal{U}(\mathcal{X})$ , $V\in\mathcal{U}(\mathcal{Y})$ such that $w^{\prime}:=\operatorname{tr}(CUTV)$ satisfies $|w-w^{\prime}|<\frac{\varepsilon}{2}$ . Thus for $w_{n}:=\operatorname{tr}(C_{n}UT_{n}V)$ by Lemma 2.3 and 2.4

[TABLE]

for all $n\geq N$ .

Similarly, let $n\geq N$ . Then for $v_{n}\in\overline{S_{C_{n}}(T_{n})}$ one finds $U_{n}\in\mathcal{U}(\mathcal{X})$ , $V_{n}\in\mathcal{U}(\mathcal{Y})$ such that $v_{n}^{\prime}:=\operatorname{tr}(C_{n}U_{n}T_{n}V_{n})$ satisfies $|v_{n}-v_{n}^{\prime}|<\frac{\varepsilon}{2}$ . Thus for $\tilde{v}_{n}:=\operatorname{tr}(CU_{n}TV_{n})$ we obtain

[TABLE]

The preceding proposition together with Lemma 2.9 immediately entails the next result.

Corollary 2.11.

Let $C\in\mathcal{B}^{p}(\mathcal{H})$ , $T\in\mathcal{B}^{q}(\mathcal{H})$ with $p,q\in[1,\infty]$ conjugate. Then $\lim_{k\to\infty}\overline{W_{C}(\Pi_{k}T\Pi_{k})}=\overline{W_{C}(T)}\,,$ where $\Pi_{k}$ is the orthogonal projection onto the span of the first $k$ elements of an arbitrarily chosen orthonormal basis $(e_{n})_{n\in\mathbb{N}}$ of $\mathcal{H}$ .

Here we used the well-known fact that the orthogonal projections $\Pi_{k}$ strongly converge to the identity $\operatorname{id}_{\mathcal{H}}$ for $k\to\infty$ , cf., e.g., [3, Lemma 3.2].

Definition 2.12 ( $C$ -spectrum).

Let $p,q\in[1,\infty]$ be conjugate. Then, for $C\in\mathcal{B}^{p}(\mathcal{H})$ with modified eigenvalue sequence $(\lambda_{n}(C))_{n\in\mathbb{N}}$ and $T\in\mathcal{B}^{q}(\mathcal{H})$ with modified eigenvalue sequence $(\lambda_{n}(T))_{n\in\mathbb{N}}$ , the $C$ -spectrum of $T$ is defined via

[TABLE]

Hölder’s inequality and the standard estimate $\sum\nolimits_{n=1}^{\infty}|\lambda_{n}(C)|^{p}\leq\sum\nolimits_{n=1}^{\infty}s_{n}(C)^{p}$ , cf. [11, Prop. 16.31], yield

[TABLE]

showing that the elements of $P_{C}(T)$ are well-defined and bounded by $\|C\|_{p}\|T\|_{q}$ .

Now, if the operators $C$ and $T$ are particularly “nice”, one can connect the $C$ -numerical range and the $C$ -spectrum of $T$ as follows:

Theorem 2.13 ([4]).

Let $C\in\mathcal{B}^{p}(\mathcal{H})$ and $T\in\mathcal{B}^{q}(\mathcal{H})$ with $p,q\in[1,\infty]$ conjugate. Then the following statements hold.

(a)

$\overline{W_{C}(T)}$ * is star-shaped with respect to the origin.*

(b)

If either $C$ or $T$ is normal with collinear eigenvalues, then $\overline{W_{C}(T)}$ is convex.

(c)

If $C$ and $T$ both are normal, then $P_{C}(T)\subseteq W_{C}(T)\subseteq\operatorname{conv}(\overline{P_{C}(T)})$ . If, in addition, the eigenvalues of $C$ or $T$ are collinear then $\overline{W_{C}(T)}=\operatorname{conv}(\overline{P_{C}(T)})$ .

As stated in the beginning, a sketch of the proof can be found in Appendix A.

3. MAIN RESULTS

Considering the inequalities (1.1) and (1.3) from the introduction, it arguably is easier to generalize the former, i.e. to generalize von Neumann’s “original” trace inequality to Schatten-class operators. To start with we first investigate the finite-rank case.

Lemma 3.1.

Let $C\in\mathcal{F}(\mathcal{X},\mathcal{Y})$ , $T\in\mathcal{F}(\mathcal{Y},\mathcal{X})$ and $k:=\max\{\operatorname{rk}(C),\operatorname{rk}(T)\}\in\mathbb{N}_{0}$ . Then $S_{C}(T)=K_{r}(0)$ where $r:=\sum_{j=1}^{k}s_{j}(C)s_{j}(T)$ .

Proof.

Defining $k$ as above, Lemma 2.1 yields orthonormal systems $(e_{j})_{j=1}^{k}$ , $(h_{j})_{j=1}^{k}$ in $\mathcal{X}$ and $(f_{j})_{j=1}^{k}$ , $(g_{j})_{j=1}^{k}$ in $\mathcal{Y}$ such that

[TABLE]

Note that forcing both sums to have same summation range means that, potentially, some of the singular values have to be complemented by zeros, which is not of further importance.

“ $\subseteq$ ”: Let any $U\in\mathcal{U}(\mathcal{X})$ , $V\in\mathcal{U}(\mathcal{Y})$ be given. Then

[TABLE]

by direct computation. Now consider the subspaces

[TABLE]

so there exist orthonormal bases of the form

[TABLE]

of $Z_{1}$ and $Z_{2}$ for some $N,N^{\prime}\geq k$ , respectively. W.l.o.g.222This can be done for example by sufficiently expanding the “smaller” orthonormal systems in $\mathcal{X}$ or $\mathcal{Y}$ and possibly passing to new subspaces $Z_{1}^{\prime}\supset Z_{1}$ or $Z_{2}^{\prime}\supset Z_{2}$ which is always doable because we are in infinite dimensions. The particular choice of $Z_{1}^{\prime}$ and $Z_{2}^{\prime}$ is irrelevant because we only need the orthonormal systems which represent $C$ and $T$ to be contained within these finite-dimensional subspaces. we can assume $N=N^{\prime}$ and define

[TABLE]

for $j=1,\ldots,k$ . This yields $N\times N$ matrices

[TABLE]

which satisfy $\operatorname{tr}(C^{\prime}T^{\prime})=\sum\nolimits_{i,j=1}^{k}s_{i}(T)s_{j}(C)\langle e_{j},Uh_{i}\rangle\langle g_{i},Vf_{j}\rangle$ . By construction, one readily verifies that $(a_{j})_{j=1}^{k},(b_{j})_{j=1}^{k}$ are orthonormal systems in $\mathbb{C}^{N}$ so $s_{j}(T^{\prime})=s_{j}(T)$ for all $j=1,\ldots,N$ . Thus von Neumann’s original result (1.1) yields

[TABLE]

“ $\supseteq$ ”: We first consider unitary operators $U_{T}\in\mathcal{B}(\mathcal{X})$ , $V_{T}\in\mathcal{B}(\mathcal{Y})$ such that $U_{T}h_{j}=e_{j}$ and $V_{T}f_{j}=g_{j}$ for all $j=1,\ldots,k$ . This is always possible by completing the respective orthonormal systems $(e_{j})_{j=1}^{k}$ , $\ldots$ to orthonormal bases $(e_{j})_{j\in J}$ , $\ldots$ which can then be transformed into each other via some unitary. This allows us to construct $\tilde{T}:=U_{T}TV_{T}=\sum\nolimits_{j=1}^{k}s_{j}(T)\langle f_{j},\cdot\rangle e_{j}$ such that

[TABLE]

for any $\tilde{U}\in\mathcal{U}(\mathcal{X})$ , $\tilde{V}\in\mathcal{U}(\mathcal{Y})$ . Of course $S_{C}(T)=S_{C}(\tilde{T})$ and the latter satisfies

•

$r\in S_{C}(\tilde{T})$ : choose $\tilde{U}=\operatorname{id}_{\mathcal{X}}$ , $\tilde{V}=\operatorname{id}_{\mathcal{Y}}$ and also

•

$0\in S_{C}(\tilde{T})$ : choose $\tilde{U}$ , $\tilde{V}$ as cyclic shift on the first $k$ basis elements, i.e.

[TABLE]

and similarly $\tilde{V}$ (on $\{f_{1},\ldots,f_{k}\}$ ).

Now because the unitary group $\mathcal{U}(\mathcal{Y})$ on any Hilbert space $\mathcal{Y}$ is path-connected 333The standard argument for this goes as follows, cf. [16, Proof of Thm. 12.37]: For every $U\in\mathcal{U}(\mathcal{Y})$ there exists self-adjoint $Q\in\mathcal{B}(\mathcal{Y})$ such that $U=\exp(iQ)$ . Then $t\mapsto T(t):=\exp(itQ)$ is a continuous mapping of $[0,1]$ into $\mathcal{U}(\mathcal{Y})$ with $T(0)=\operatorname{id}_{\mathcal{Y}}$ and $T(1)=U$ . Thus every unitary operator is path-connected to the identity which implies path-connectedness of $\mathcal{U}(\mathcal{Y}$ ). and because the mapping $f:\mathcal{B}(\mathcal{X})\times\mathcal{B}(\mathcal{Y})\to\mathbb{C}$ , $(U,V)\mapsto\operatorname{tr}(CU\tilde{T}V)$ is continuous, the image $f(\mathcal{U}(\mathcal{X})\times\mathcal{U}(\mathcal{Y}))$ has to be path-connected as well. In particular, [math] and $r$ are path-connected within $S_{C}(T)$ , i.e. for every $s\in[0,r]$ there exists $\phi(s)\in[0,2\pi)$ such that $se^{i\phi(s)}\in S_{C}(\tilde{T})=S_{C}(T)$ .

Finally, we can use the fact that $S_{C}(T)$ is circular–which follows easily by replacing $U$ by $e^{i\varphi}U\in\mathcal{U}(\mathcal{X})$ with $\varphi\in[0,2\pi]$ –to conclude $S_{C}(T)\supseteq K_{r}(0)$ and thus $S_{C}(T)=K_{r}(0)$ . ∎

Theorem 3.2.

Let $C\in\mathcal{B}^{p}(\mathcal{X},\mathcal{Y})$ , $T\in\mathcal{B}^{q}(\mathcal{Y},\mathcal{X})$ with $p,q\in[1,\infty]$ conjugate. Then

[TABLE]

In particular, one has $\overline{S_{C}(T)}=K_{r}(0)$ with $r:=\sum_{j=1}^{\infty}s_{j}(C)s_{j}(T)$ .

Proof.

By Lemma 2.1 $C=\sum_{j=1}^{\infty}s_{j}(C)\langle e_{j},\cdot\rangle f_{j}$ , $T=\sum_{j=1}^{\infty}s_{j}(T)\langle g_{j},\cdot\rangle h_{j}$ for some orthonormal systems $(e_{j})_{j\in\mathbb{N}}$ , $(h_{j})_{j\in\mathbb{N}}$ in $\mathcal{X}$ and $(f_{j})_{j\in\mathbb{N}}$ , $(g_{j})_{j\in\mathbb{N}}$ in $\mathcal{Y}$ . This allows us to define finite rank approximations $C_{n}:=\sum_{j=1}^{n}s_{j}(C)\langle e_{j},\cdot\rangle f_{j}$ and $T_{n}:=\sum_{j=1}^{n}s_{j}(T)\langle g_{j},\cdot\rangle h_{j}$ To pass to the original operators $C,T$ , we use Remark 2.2 to see

[TABLE]

Because of this we may apply Proposition 2.10 and Lemma 3.1 to obtain

[TABLE]

with $r_{n}:=\sum_{j=1}^{n}s_{j}(C)s_{j}(T)$ . Using the obvious fact $\Delta(K_{r}(0),K_{r_{n}}(0))=|r-r_{n}|$ for all $n\in\mathbb{N}$ one readily verifies $\overline{S_{C}(T)}=\lim_{n\to\infty}K_{r_{n}}(0)=K_{r}(0)$ with $r=\sum_{j=1}^{\infty}s_{j}(C)s_{j}(T)$ . ∎

Remark 3.3.

To see that the supremum in (3.1) is not necessarily a maximum, consider $\mathcal{H}=\ell_{2}(\mathbb{N})$ with standard basis $(e_{j})_{j\in\mathbb{N}}$ . Now the positive definite trace-class operator $C=\sum_{j=1}^{\infty}\frac{1}{2^{j}}\langle e_{j},\cdot\rangle e_{j}$ as well as the compact operator $T=\sum_{k=1}^{\infty}\frac{1}{2^{k}}\langle e_{k+1},\cdot\rangle e_{k+1}$ satisfy

[TABLE]

for any $U,V\in\mathcal{U}(\mathcal{H})$ . We know that $\sup_{U,V\in\mathcal{U}(\mathcal{H})}|\operatorname{tr}(CUTV)|=\sum_{j=1}^{\infty}(\frac{1}{2^{j}})^{2}$ but if this was a maximum, then by the above calculation $\langle e_{j},Ue_{k+1}\rangle=\langle e_{k+1},Ve_{j}\rangle=\delta_{jk}$ for all $j,k\in\mathbb{N}$ . The only operators which satisfy these conditions are the left- and the right-shift, respectively, both of which are not unitary–a contradiction.

Finally, we are prepared to extend inequality (1.3) to Schatten-class operators on separable Hilbert spaces.

Theorem 3.4.

Let $C\in\mathcal{B}^{p}(\mathcal{H})$ , $T\in\mathcal{B}^{q}(\mathcal{H})$ both be self-adjoint with $p,q$ conjugate and let the positive semi-definite operators $C^{+},T^{+}$ and $C^{-},T^{-}$ denote the positive and negative part of $C,T$ , respectively (i.e. $C=C^{+}-C^{-}$ , $T=T^{+}-T^{-}$ ). Then

[TABLE]

as well as

[TABLE]

*In particular, one has:

$-\sum_{j=1}^{\infty}\big{(}\lambda_{j}^{\downarrow}(C^{+})\lambda_{j}^{\downarrow}(T^{-})+\lambda_{j}^{\downarrow}(C^{-})\lambda_{j}^{\downarrow}(T^{+})\big{)}\leq\operatorname{tr}(CT)\leq\sum_{j=1}^{\infty}\big{(}\lambda_{j}^{\downarrow}(C^{+})\lambda_{j}^{\downarrow}(T^{+})+\lambda_{j}^{\downarrow}(C^{-})\lambda_{j}^{\downarrow}(T^{-})\big{)}$

Proof.

Let $C\in\mathcal{B}^{p}(\mathcal{H})$ , $T\in\mathcal{B}^{q}(\mathcal{H})$ both be self-adjoint with $p,q$ conjugate and first assume that $T$ has at most $k\in\mathbb{N}$ non-zero eigenvalues. Then the following is straightforward to show:

[TABLE]

Note that in this case the (modified) eigenvalue sequences of $T$ contains infinitely many zeros. Now let us address the general case. Choose any orthonormal eigenbasis $(e_{n})_{n\in\mathbb{N}}$ of $T$ with corresponding modified eigenvalue sequence (Lemma 2.5). Moreover, let $\Pi_{k}=\sum\nolimits_{j=1}^{k}\langle e_{j},\cdot\rangle e_{j}$ the projection onto the span of the first $k$ eigenvectors of $T$ . Then $\Pi_{k}T\Pi_{k}$ has at most $k$ non-zero eigenvalues and our preliminary considerations combined with Corollary 2.11 and Theorem 2.13 (c) as well as Lemma 2.7 readily imply

[TABLE]

where we used the identity $(\Pi_{k}T\Pi_{k})^{\pm}=\Pi_{k}T^{\pm}\Pi_{k}$ . Now, the last step is to show that $(\sum\nolimits_{j=1}^{k}\lambda_{j}^{\downarrow}(C^{+})\lambda_{j}^{\downarrow}(\Pi_{k}T^{+}\Pi_{k}))_{k\in\mathbb{N}}$ converges to $\sum\nolimits_{j=1}^{\infty}\lambda_{j}^{\downarrow}(C^{+})\lambda_{j}^{\downarrow}(T^{+})$ . Let $\varepsilon>0$ (and w.l.o.g. $T\neq 0$ ). As $(\lambda_{j}^{\downarrow}(C^{+}))_{j\in\mathbb{N}}$ is a sequence in $\ell^{p}_{+}(\mathbb{N})$ we find $N\in\mathbb{N}$ with

[TABLE]

where for $p=\infty$ , the left-hand side becomes $\sup_{n>N}\lambda_{n}^{\downarrow}(C^{+})=\lambda_{N+1}^{\downarrow}(C^{+})\,$ .

Either way, associated to this $N$ one can choose $K\geq N$ such that the first $N$ largest eigenvalues of $T^{+}$ are listed in $(\lambda_{j}^{\downarrow}(\Pi_{K}T^{+}\Pi_{K}))_{j\in\mathbb{N}}$ and thus $\lambda_{j}^{\downarrow}(T^{+})=\lambda_{j}^{\downarrow}(\Pi_{K}T^{+}\Pi_{K})$ for all $j=1,\ldots,N$ . Putting things together and using Hölder’s inequality yields

[TABLE]

The case of $C^{-},T^{-}$ as well as the infimum-estimate are shown analogously which concludes the proof. ∎

Therefore if $C,T$ are self-adjoint (i.e. $W_{C}(T)\subseteq\mathbb{R}$ ), a path-connectedness argument similar to the proof of Lemma 3.1 shows $(a,b)\subseteq W_{C}(T)\subseteq[a,b]$ with $a$ ( $\leq 0$ ) given by (3.3) and $b$ ( $\geq 0$ ) given by (3.2). In particular, $\overline{W_{C}(T)}=[a,b]$ .

Acknowledgements.

This work was supported by the Bavarian excellence network enb via the International PhD Programme of Excellence Exploring Quantum Matter (exqm).

4. APPENDIX

A. PROOF OF THEOREM 2.13

The overall idea is to transfer properties of $W_{C}(T)$ from finite to infinite dimensions via the set convergence introduced in Section 2.2. However, we first need two auxiliary results to characterize the star-center of $\overline{W_{C}(T)}$ later on.

Lemma 4.1.

Let $T\in\mathcal{K}(\mathcal{X})$ and $(e_{k})_{k\in\mathbb{N}}$ be any orthonormal system in $\mathcal{X}$ . Then

(a)

$\displaystyle\sum\nolimits_{k=1}^{n}|\langle e_{k},Te_{k}\rangle|\leq\sum\nolimits_{k=1}^{n}s_{k}(T)$ * for all $n\in\mathbb{N}$ and*

(b)

$\lim_{k\to\infty}\langle e_{k},Te_{k}\rangle=0\,.$ **

Proof.

(a) Consider a Schmidt decomposition $\sum\nolimits_{m=1}^{\infty}s_{m}(T)\langle f_{m},\cdot\rangle g_{m}$ of $T$ so

[TABLE]

Defining $\lambda_{m}:=\sum_{k=1}^{n}|\langle e_{k},f_{m}\rangle\langle g_{m},e_{k}\rangle|$ for all $m\in\mathbb{N}$ , using Cauchy-Schwarz and Bessel’s inequality one gets

[TABLE]

for all $m\in\mathbb{N}$ . On the other hand, said inequalities also imply

[TABLE]

Hence, because $(s_{m}(T))_{m\in\mathbb{N}}$ is decreasing by construction, an upper bound of $\sum\nolimits_{m=1}^{\infty}s_{m}(T)\lambda_{m}$ is obtained by choosing $\lambda_{1}=\ldots=\lambda_{n}=1$ and $\lambda_{j}=0$ whenever $j>n$ . This shows the desired inequality. A proof of (b) can be found, e.g., in [11, Lemma 16.17]. ∎

Lemma 4.2.

Let $C\in\mathcal{B}^{p}(\mathcal{H})$ with $p\in(1,\infty]$ and let $q\in[1,\infty)$ such that $p,q$ are conjugate. Furthermore, let $(e_{n})_{n\in\mathbb{N}}$ be any orthonormal system in $\mathcal{H}$ . Then

[TABLE]

Proof.

First, let $p=\infty$ , so $q=1$ . As $C$ is compact, by Lemma 4.1 (b) one has $\lim_{k\to\infty}\langle e_{k},Ce_{k}\rangle=0$ , hence the sequence of arithmetic means converges to zero as well. Next, let $p\in(1,\infty)$ and $\varepsilon>0$ . Moreover, we assume w.l.o.g. $C\neq 0$ so $s_{1}(C)=\|C\|\neq 0$ . As $C\in\mathcal{B}^{p}(\mathcal{H})$ , one can choose $N_{1}\in\mathbb{N}$ such that $\sum_{k=N_{1}+1}^{\infty}s_{k}(C)^{p}<\frac{\varepsilon^{p}}{2^{p}}$ and moreover $N_{2}\in\mathbb{N}$ such that $\frac{1}{n^{1/q}}<\frac{\varepsilon}{2\sum\nolimits_{k=1}^{N_{1}}s_{k}(C)}$ for all $n\geq N_{2}$ . Then, for any $n\geq N:=\max\{N_{1}+1,N_{2}\}$ , Lemma 4.1 and Hölder’s inequality yield the estimate

[TABLE]

What we also need is some mechanism to associate bounded operators on $\mathcal{H}$ with matrices. In doing so, let $(e_{n})_{n\in\mathbb{N}}$ be some orthonormal basis of $\mathcal{H}$ and let $(\hat{e}_{i})_{i=1}^{n}$ be the standard basis of $\mathbb{C}^{n}$ . For any $n\in\mathbb{N}$ we define $\Gamma_{n}:\mathbb{C}^{n}\to\mathcal{H}$ , $\hat{e_{i}}\mapsto\Gamma_{n}(\hat{e}_{i}):=e_{i}$ and its linear extension to all of $\mathbb{C}^{n}$ . With this, let

[TABLE]

be the operator which “cuts out” the upper $n\times n$ block of (the matrix representation of) $A$ with respect to $(e_{n})_{n\in\mathbb{N}}$ . The key result now is the following:

Proposition 4.3.

Let $C\in\mathcal{B}^{p}(\mathcal{H})$ , $T\in\mathcal{B}^{q}(\mathcal{H})$ with $p,q\in[1,\infty]$ conjugate be given. Furthermore, let $(e_{n})_{n\in\mathbb{N}}$ and $(g_{n})_{n\in\mathbb{N}}$ be arbitrary orthonormal bases of $\mathcal{H}$ . Then

[TABLE]

where $[\,\cdot\,]_{k}^{e}$ and $[\,\cdot\,]_{k}^{g}$ are the maps given by (4.1) with respect to $(e_{n})_{n\in\mathbb{N}}$ and $(g_{n})_{n\in\mathbb{N}}$ , respectively. Moreover, if $C$ are $T$ both are normal then

[TABLE]

where $(e_{n})_{n\in\mathbb{N}}$ and $(g_{n})_{n\in\mathbb{N}}$ are the orthonormal bases of $\mathcal{H}$ which diagonalize $C$ and $T$ , respectively.

Proof.

For $p=1,q=\infty$ (or vice versa) proofs are given in [3, Thm. 3.1 & 3.6] which can be adjusted to $p,q\in(1,\infty)$ by minimal modifications. ∎

With these preparations we are ready for proving our main result about the $C$ -numerical range of Schatten-class operators.

Proof of Theorem 2.13.

(a): For arbitrary orthonormal bases $(e_{n})_{n\in\mathbb{N}}$ , $(g_{n})_{n\in\mathbb{N}}$ of $\mathcal{H}$ as well as any $n\in\mathbb{N}$ , it is readily verified that

[TABLE]

Both factors converge and, by Lemma 4.2, at least one of them goes to [math] as $n\to\infty$ . Moreover, $W_{[C]^{e}_{2n}}([T]^{g}_{2n})$ is star-shaped with respect to $(\operatorname{tr}([C]^{e}_{2n})\operatorname{tr}([T]^{g}_{2n})/(2n)$ for all $n\in\mathbb{N}$ , cf. [2, Thm. 4]. Because Hausdorff convergence preserves star-shapedness [3, Lemma 2.5 (d)], Proposition 4.3 implies that $\overline{W_{C}(T)}$ is star-shaped with respect to $0\in\mathbb{C}$ .

For what follows let $(e_{n})_{n\in\mathbb{N}},(g_{n})_{n\in\mathbb{N}}$ be the orthonormal bases of $\mathcal{H}$ which diagonalize $C$ and $T$ , respectively.

(b): W.l.o.g. let $C$ be normal with collinear eigenvalues. Since $C$ is compact (i.e. its eigenvalue sequence is a null sequence) there exists $\phi\in[0,2\pi)$ such that $e^{i\phi}C$ is self-adjoint and by Proposition 4.3 we obtain

[TABLE]

Moreover, as $[e^{i\phi}C]_{2n}^{e}\in\mathbb{C}^{2n\times 2n}$ is hermitian for all $n\in\mathbb{N}$ we conclude that $W_{[e^{i\phi}C]_{2n}^{e}}([e^{-i\phi}T]_{2n}^{e})$ is convex, cf. [15]. The fact that Hausdorff convergence preserves convexity [3, Lemma 2.5 (c)] then yields the desired result.

(c): The inclusion $P_{C}(T)\subseteq W_{C}(T)$ is shown exactly like [3, Thm. 3.4–first inclusion]. For the second inclusion, we note that by assumption $[C]^{e}_{n}$ and $[T]^{g}_{n}$ are diagonal and thus normal for all $n\in\mathbb{N}$ . Hence [19, Coro. 2.4] tells us

[TABLE]

for all $n\in\mathbb{N}$ . Using that Hausdorff convergence preserves inclusions [3, Lemma 2.5 (a)], (4.2) together with Proposition 4.3 yields

[TABLE]

Finally, applying the closure and the convex hull to the inclusions $P_{C}(T)\subseteq W_{C}(T)$ yields $\operatorname{conv}(\overline{P_{C}(T)})\subseteq\operatorname{conv}(\overline{W_{C}(T)})=\overline{W_{C}(T)}$ , where the last equality is due to (b), and thus $\overline{W_{C}(T)}=\operatorname{conv}(\overline{P_{C}(T)})$ . ∎

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Berberian , Introduction to Hilbert Space , Amer. Math. Soc., Chelsea 1976.
2[2] W.S. Cheung, N.K. Tsing , The C 𝐶 C -Numerical Range of Matrices is Star-Shaped, Lin. Multilin. Alg. , 41 (1996), 245–250.
3[3] G. Dirr, F. vom Ende , The C 𝐶 C -Numerical Range in Infinite Dimensions, Lin. Multilin. Alg. , 2018, In press: https://doi.org/10.1080/03081087.2018.1515884.
4[4] G. Dirr, F. vom Ende , Authors’ Addendum to "The C-Numerical Range in Infinite Dimensions", Lin. Multilin. Alg. , 2019, In press: https://doi.org/10.1080/03081087.2019.1604624.
5[5] N. Dunford, J. Schwartz , Linear Operators: Spectral Theory , Pure and applied mathematics, New York: Interscience Publishers, New York 1963.
6[6] K. Fan , Maximum Properties and Inequalities for the Eigenvalues of Completely Continuous Operators, Proc. Natl. Acad. Sci. USA , 37 (1951), 760–766.
7[7] S.J. Glaser, T. Schulte-Herbrüggen, M. Sieveking, et al. , Unitary Control in Quantum Ensembles: Maximising Signal Intensity in Coherent Spectroscopy, Science , 280 (1998), 421–424.
8[8] G.H. Golub, C.F. van Loan , Matrix Computations , The Johns Hopkins University Press, Baltimore 1989.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Von Neumann Type of Trace Inequalities for Schatten-Class Operators

Abstract.

1991 Mathematics Subject Classification:

keywords:

1. INTRODUCTION

2. NOTATION AND PRELIMINARIES

2.1. INFINITE-DIMENSIONAL HILBERT SPACES AND THE SCHATTEN CLASSES

Lemma 2.1** (Schmidt decomposition).**

Remark 2.2**.**

Lemma 2.3**.**

Lemma 2.4**.**

Lemma 2.5** ([1], Thm. VIII.4.6).**

2.2. SET CONVERGENCE

Lemma 2.6**.**

Lemma 2.7**.**

Proof.

2.3. THE CCC-NUMERICAL RANGE OF SCHATTEN-CLASS OPERATORS

Definition 2.8**.**

Lemma 2.9**.**

Proof.

Proposition 2.10**.**

Proof.

Corollary 2.11**.**

Definition 2.12** (CCC-spectrum).**

Theorem 2.13** ([4]).**

3. MAIN RESULTS

Lemma 3.1**.**

Proof.

Theorem 3.2**.**

Proof.

Remark 3.3**.**

Theorem 3.4**.**

Proof.

Acknowledgements.

4. APPENDIX

A. PROOF OF THEOREM 2.13

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Proposition 4.3**.**

Proof.

Proof of Theorem 2.13.

Lemma 2.1 (Schmidt decomposition).

Remark 2.2.

Lemma 2.3.

Lemma 2.4.

Lemma 2.5 ([1], Thm. VIII.4.6).

Lemma 2.6.

Lemma 2.7.

2.3. THE $C$ -NUMERICAL RANGE OF SCHATTEN-CLASS OPERATORS

Definition 2.8.

Lemma 2.9.

Proposition 2.10.

Corollary 2.11.

Definition 2.12 ( $C$ -spectrum).

Theorem 2.13 ([4]).

Lemma 3.1.

Theorem 3.2.

Remark 3.3.

Theorem 3.4.

Lemma 4.1.

Lemma 4.2.

Proposition 4.3.