Horn's problem and Harish-Chandra's integrals. Probability density   functions

Jean-Bernard Zuber

arXiv:1705.01186·math-ph·September 13, 2018

Horn's problem and Harish-Chandra's integrals. Probability density functions

Jean-Bernard Zuber

PDF

TL;DR

This paper computes the probability density functions of eigenvalues for sums of random Hermitian matrices, providing explicit results for small sizes and exploring patterns in symmetric and skew-symmetric cases through numerical experiments.

Contribution

It explicitly derives eigenvalue PDFs for sums of random Hermitian matrices for small sizes and investigates numerical patterns in symmetric and skew-symmetric cases.

Findings

01

Explicit eigenvalue PDFs for small n Hermitian matrices.

02

Numerical patterns of enhancement in symmetric and skew-symmetric cases.

03

Comparison of theoretical results with numerical experiments.

Abstract

Horn's problem -- to find the support of the spectrum of eigenvalues of the sum $C = A + B$ of two $n$ by $n$ Hermitian matrices whose eigenvalues are known -- has been solved by Knutson and Tao. Here the probability distribution function (PDF) of the eigenvalues of $C$ is explicitly computed for low values of $n$ , for $A$ and $B$ uniformly and independently distributed on their orbit, and confronted to numerical experiments. Similar considerations apply to skew-symmetric and symmetric real matrices under the action of the orthogonal group. In the latter case, where no analytic formula is known in general and we rely on numerical experiments, curious patterns of enhancement appear.

Figures40

Click any figure to enlarge with its caption.

Equations159

A = U diag (α_{1}, α_{2}, \dots, α_{n}) U^{†} .

A = U diag (α_{1}, α_{2}, \dots, α_{n}) U^{†} .

α_{1} \geq α_{2} \geq \dots \geq α_{n} .

α_{1} \geq α_{2} \geq \dots \geq α_{n} .

\underline{α} = diag (α_{1}, α_{2}, \dots, α_{n}) .

\underline{α} = diag (α_{1}, α_{2}, \dots, α_{n}) .

i = 1 \sum n γ_{i} = i = 1 \sum n (α_{i} + β_{i}),

i = 1 \sum n γ_{i} = i = 1 \sum n (α_{i} + β_{i}),

H (α, i x) = \int_{U (n)} D U exp (i tr \underline{x} U \underline{α} U^{†})

H (α, i x) = \int_{U (n)} D U exp (i tr \underline{x} U \underline{α} U^{†})

p (γ ∣ α, β) = const. Δ (γ)^{2} \int d^{n} x Δ (x)^{2} H (α, i x) H (β, i x) H (γ, i x)^{*},

p (γ ∣ α, β) = const. Δ (γ)^{2} \int d^{n} x Δ (x)^{2} H (α, i x) H (β, i x) H (γ, i x)^{*},

φ_{A} (X) := E (e^{i tr X A}) = \int_{U (n)} D U exp (i tr X U \underline{α} U^{†})

φ_{A} (X) := E (e^{i tr X A}) = \int_{U (n)} D U exp (i tr X U \underline{α} U^{†})

E (e^{i tr X C}) = φ_{A} (X) φ_{B} (X)

E (e^{i tr X C}) = φ_{A} (X) φ_{B} (X)

p (C ∣ α, β) = \frac{1}{( 2 π ) ^{n^{2}}} \int D X e^{- i tr X C} φ_{A} (X) φ_{B} (X),

p (C ∣ α, β) = \frac{1}{( 2 π ) ^{n^{2}}} \int D X e^{- i tr X C} φ_{A} (X) φ_{B} (X),

κ = (2 π)^{n (n - 1) /2} / p = 1 \prod n p!

κ = (2 π)^{n (n - 1) /2} / p = 1 \prod n p!

Δ (x) = i < j \prod (x_{i} - x_{j})

Δ (x) = i < j \prod (x_{i} - x_{j})

φ_{A} (X) = H (α, i x) φ_{B} (X) = H (β, i x)

φ_{A} (X) = H (α, i x) φ_{B} (X) = H (β, i x)

p (γ ∣ α, β)

p (γ ∣ α, β)

H (α, i x) = \overset{κ}{^} i^{- n (n - 1) /2} \frac{det e ^{i x_{i} α_{j}}}{Δ ( x ) Δ ( α )}

H (α, i x) = \overset{κ}{^} i^{- n (n - 1) /2} \frac{det e ^{i x_{i} α_{j}}}{Δ ( x ) Δ ( α )}

\overset{κ}{^} = p = 1 \prod n - 1 p! .

\overset{κ}{^} = p = 1 \prod n - 1 p! .

p (γ ∣ α, β) = \frac{κ ^{2} κ ^ ^{3}}{( 2 π ) ^{n^{2}}} i^{- n (n - 1) /2} \frac{Δ ( γ )}{Δ ( α ) Δ ( β )} \int \frac{d ^{n} x}{Δ ( x )} det e^{i x_{i} α_{j}} det e^{i x_{i} β_{j}} det e^{- i x_{i} γ_{j}} .

p (γ ∣ α, β) = \frac{κ ^{2} κ ^ ^{3}}{( 2 π ) ^{n^{2}}} i^{- n (n - 1) /2} \frac{Δ ( γ )}{Δ ( α ) Δ ( β )} \int \frac{d ^{n} x}{Δ ( x )} det e^{i x_{i} α_{j}} det e^{i x_{i} β_{j}} det e^{- i x_{i} γ_{j}} .

det e^{i x_{i} α_{j}}

det e^{i x_{i} α_{j}}

Δ (u) := 1 \leq i < j \leq n \prod (u_{i} + u_{i + 1} + \dots u_{j - 1})

Δ (u) := 1 \leq i < j \leq n \prod (u_{i} + u_{i + 1} + \dots u_{j - 1})

A_{j} (P, P^{'}, P^{''}) = k = 1 \sum j (α_{P (k)} + β_{P^{'} (k)} - γ_{P^{''} (k)}) - \frac{j}{n} k = 1 \sum n (α_{k} + β_{k} - γ_{k}) .

A_{j} (P, P^{'}, P^{''}) = k = 1 \sum j (α_{P (k)} + β_{P^{'} (k)} - γ_{P^{''} (k)}) - \frac{j}{n} k = 1 \sum n (α_{k} + β_{k} - γ_{k}) .

p (γ ∣ α, β)

p (γ ∣ α, β)

J_{n} (α, β; γ)

\frac{κ ^{2} κ ^ ^{3} n !}{( 2 π ) ^{n (n - 1)}} = \frac{\prod _{1}^{n - 1} p !}{n !},

\frac{κ ^{2} κ ^ ^{3} n !}{( 2 π ) ^{n (n - 1)}} = \frac{\prod _{1}^{n - 1} p !}{n !},

n! \int_{\sum _{i} γ _{i} = \sum _{i} α _{i} + \sum _{i} β _{i} γ _{n} \leq γ _{n - 1} \leq \dots \leq γ _{1}} d^{n - 1} γ p (γ ∣ α, β) = 1

n! \int_{\sum _{i} γ _{i} = \sum _{i} α _{i} + \sum _{i} β _{i} γ _{n} \leq γ _{n - 1} \leq \dots \leq γ _{1}} d^{n - 1} γ p (γ ∣ α, β) = 1

\int_{\sum _{i} γ _{i} = \sum _{i} α _{i} + \sum _{i} β _{i} γ _{n} \leq γ _{n - 1} \leq \dots \leq γ _{1}} d^{n - 1} γ \frac{Δ ( γ )}{Δ ( α ) Δ ( β )} J_{n} (α, β; γ) = \frac{1}{\prod _{1}^{n - 1} p !}

\int_{\sum _{i} γ _{i} = \sum _{i} α _{i} + \sum _{i} β _{i} γ _{n} \leq γ _{n - 1} \leq \dots \leq γ _{1}} d^{n - 1} γ \frac{Δ ( γ )}{Δ ( α ) Δ ( β )} J_{n} (α, β; γ) = \frac{1}{\prod _{1}^{n - 1} p !}

γ_{1, 2} = \frac{1}{2} [α_{1} + α_{2} + β_{1} + β_{2} \pm α_{12}^{2} + β_{12}^{2} + 2 α_{12} β_{12} cos ψ]

γ_{1, 2} = \frac{1}{2} [α_{1} + α_{2} + β_{1} + β_{2} \pm α_{12}^{2} + β_{12}^{2} + 2 α_{12} β_{12} cos ψ]

γ_{12} = \pm α_{12}^{2} + β_{12}^{2} + 2 α_{12} β_{12} cos ψ

γ_{12} = \pm α_{12}^{2} + β_{12}^{2} + 2 α_{12} β_{12} cos ψ

ρ (γ_{12}) = - \frac{1}{4} sin ψ \frac{d ψ}{d γ _{12}} = \frac{1}{2} \frac{∣ γ _{12} ∣}{α _{12} β _{12}},

ρ (γ_{12}) = - \frac{1}{4} sin ψ \frac{d ψ}{d γ _{12}} = \frac{1}{2} \frac{∣ γ _{12} ∣}{α _{12} β _{12}},

∣ α_{12} - β_{12} ∣ \leq γ_{12} \leq α_{12} + β_{12} \cup - (α_{12} + β_{12}) \leq γ_{12} \leq - ∣ α_{12} - β_{12} ∣,

∣ α_{12} - β_{12} ∣ \leq γ_{12} \leq α_{12} + β_{12} \cup - (α_{12} + β_{12}) \leq γ_{12} \leq - ∣ α_{12} - β_{12} ∣,

max (α_{1} + β_{2}, α_{2} + β_{1}) \leq γ_{1} \leq α_{1} + β_{1} α_{2} + β_{2} \leq γ_{2} \leq min (α_{1} + β_{2}, α_{2} + β_{1})

max (α_{1} + β_{2}, α_{2} + β_{1}) \leq γ_{1} \leq α_{1} + β_{1} α_{2} + β_{2} \leq γ_{2} \leq min (α_{1} + β_{2}, α_{2} + β_{1})

∣ α_{12} - β_{12} ∣ \leq γ_{12} \leq α_{12} + β_{12},

∣ α_{12} - β_{12} ∣ \leq γ_{12} \leq α_{12} + β_{12},

J_{2} (α, β; γ) = \frac{1}{2 π i} P, P^{'} \in S_{2} \sum ε_{P} ε_{P^{'}} \int_{R} \frac{d u}{u} e^{i u A (P, P^{'}, I)}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

*Sorbonne Université, UPMC Univ Paris 06, UMR 7589, LPTHE, F-75005, Paris, France

& CNRS, UMR 7589, LPTHE, F-75005, Paris, France

[email protected]

Horn’s problem – to find the support of the spectrum of eigenvalues of the sum $C=A+B$ of two $n$ by $n$ Hermitian matrices whose eigenvalues are known – has been solved by Klyachko and by Knutson and Tao. Here the probability distribution function (PDF) of the eigenvalues of $C$ is explicitly computed for low values of $n$ , for $A$ and $B$ uniformly and independently distributed on their orbit, and confronted to numerical experiments. Similar considerations apply to skew-symmetric and symmetric real matrices under the action of the orthogonal group. In the latter case, where no analytic formula is known in general and we rely on numerical experiments, curious patterns of enhancement appear.

Keywords: Horn problem. Harish-Chandra integrals.

Mathematics Subject Classification 2010: 15Bxx, 60B20

1 Horn’s problem for Hermitian matrices

1.1 A short review and summary of results

Let $H_{n}$ be the $n^{2}$ -dimensional (real) space of Hermitian matrices of size $n$ . Any matrix $A\in H_{n}$ may be diagonalized by a unitary transformation $U\in{\rm U}(n)$

[TABLE]

Since permutations of $S_{n}$ belong to ${\rm U}(n)$ , one may always assume that these (real) eigenvalues have been ordered according to

[TABLE]

In the following we are mostly interested in the generic case where all these inequalities are strict, with no pair of equal eigenvalues. We denote by $\alpha$ the multiplets of eigenvalues thus ordered and by $\underline{\alpha}$ the diagonal matrix

[TABLE]

Conversely, given such an $\alpha$ , the set of matrices $A$ with that spectrum of eigenvalues forms the orbit $\Omega_{\alpha}$ of $\underline{\alpha}$ under the adjoint action of ${\rm U}(n)$ .

Horn’s problem deals with the following question: given two multiplets $\alpha$ and $\beta$ ordered as in (2), and $A\in\Omega_{\alpha}$ and $B\in\Omega_{\beta}$ , what can be said about the eigenvalues $\gamma$ of $C=A+B\,$ ? Obviously $\gamma$ belongs to the hyperplane in $\mathbb{R}^{n}$ defined by

[TABLE]

expressing that ${\rm tr\,}C={\rm tr\,}A+{\rm tr\,}B$ .

Horn [1] had conjectured the form of a set of necessary and sufficient inequalities to be satisfied by $\gamma$ to belong to the spectrum of a matrix $C$ . After contributions by several authors, see in particular [2], and [3] for a history of the problem, these conjectures were proved by Knutson and Tao [4, 5], see also [6], through the introduction of combinatorial objects, honeycombs and hives, see examples below.

What makes Horn’s problem fascinating are its many facets [2, 3]. The problem has unexpected interpretations and applications in symplectic geometry, Schubert calculus, … and representation theory. In the latter, the above problem has a direct connection with the determination of Littlewood-Richardson (LR) coefficients, i.e., with the computation of multiplicities in the decomposition of the tensor product of two irreducible polynomial representations of GL( $n$ ).

In the present work, we show that for two random matrices $A$ and $B$ chosen uniformly on the orbits $\Omega_{\alpha}$ and $\Omega_{\beta}$ , respectively, (uniformly in the sense of the ${\rm U}(n)$ Haar measure on these orbits), the probability density function (PDF) $p(\gamma|\alpha,\beta)$ of $\gamma$ may be written in terms of the integral

[TABLE]

where $\underline{x}={\rm diag\,}(x_{1},x_{2},\cdots,x_{n})$ , in the general form

[TABLE]

see Proposition 1 below.

In the present case this integral ${\mathcal{H}}(\alpha,x)$ is well known and has a simple expression, the so-called HCIZ integral [7, 8]. Then the $x$ integration may be carried out, at least for low values of $n$ , resulting in explicit expressions for the PDF.

The method generalizes to other sets of matrices and their adjoint orbits under appropriate groups. We discuss the case of the real orthogonal group acting on real symmetric or skew-symmetric matrices. Similarities and differences between these cases are pointed out.

Equation (5) is reminiscent of a well known analogous formula for the determination of LR-coefficients in terms of characters. This is no coincidence, as there exist deep connections between the two problems: Horn’s problem may be regarded as a semi-classical limit of the Littlewood-Richardson one, as anticipated by Heckman [9] and made explicit in [4, 5]. We intend to return to these connections in a forthcoming paper [10].

The general formula (5) is an explicit realization of the content of Theorem 4 in [5] and may have been known to many people, see [11, 12, 13, 14] for related work. The main original results of the present paper are the detailed calculations carried out in various cases of low dimension, and their confrontation with numerical “experiments”. This work may thus be regarded as an exercise in concrete and experimental mathematics….

1.2 The probability density function (PDF)

Let $A$ be a random matrix of $H_{n}$ chosen uniformly on the orbit $\Omega_{\alpha}$ , i.e., $A=U\underline{\alpha}U^{\dagger}$ , with $U$ uniformly distributed in ${\rm U}(n)$ in the sense of the normalized Haar measure $DU$ . The characteristic function of the random variable $A$ may be written as

[TABLE]

where $X\in H_{n}$ . This is referred to as the Fourier transform of the orbital measure in the literature. For two independent random matrices $A\in\Omega_{\alpha}$ and $B\in\Omega_{\beta}$ , the characteristic function of the sum $C=A+B$ is the product

[TABLE]

from which the PDF of $C$ may be recovered by an inverse Fourier transform

[TABLE]

which is, a priori, a distribution (in the sense of generalized function).

Here $DX$ stands for the Lebesgue measure on Hermitian matrices. If $X=U_{X}\underline{x}U_{X}^{\dagger}$ , that measure may be expressed as $DX=\kappa\prod_{i}dx_{i}\Delta(x)^{2}DU_{X}$ , where111for this and other normalizing constants, see Appendix A

[TABLE]

and

[TABLE]

is the Vandermonde determinant of the $x$ ’s. It is clear that $\varphi_{A}(X)$ and $\varphi_{B}(X)$ depend only on the eigenvalues $\alpha_{i}$ , $\beta_{i}$ and $x_{i}$ of $A$ , $B$ and $X$ , namely

[TABLE]

in terms of the HCIZ integral introduced above. Also $p(C|\alpha,\beta)$ is invariant under conjugation of $C$ by unitary matrices of ${\rm U}(n)$ and is thus only a function of the eigenvalues $\gamma_{i}$ of $C$ . The PDF of the $\gamma$ ’s must incorporate the Jacobian from the measure, hence

[TABLE]

with three copies of the HCIZ integral

[TABLE]

where11footnotemark: 1

[TABLE]

Thus finally

Proposition 1

. The probability distribution function of eigenvalues $\gamma$ , given $\alpha$ and $\beta$ , is

[TABLE]

where $\kappa$ and $\hat{\kappa}$ are given in (8) and (12).**

Note that while $\alpha$ and $\beta$ are ordered as in (2), the integration over the group mixes the order of the $\gamma$ ’s and the PDF (13) thus applies to unordered $\gamma$ ’s. In particular $p$ is normalized by $\int_{\mathbb{R}^{n}}d^{n}\gamma\,\,p(\gamma|\alpha,\beta)=1$ .

Let’s us sketch the way the above integral may be handled. One writes for each determinant

[TABLE]

where $\varepsilon_{P}$ is the signature of permutation $P$ .

In the product of the three determinants, the prefactor $e^{\mathrm{i\,}\sum_{j=1}^{n}x_{j}\sum_{k=1}^{n}(\alpha_{k}+\beta_{k}-\gamma_{k})/n}$ yields, upon integration over $\frac{1}{n}\sum x_{j}$ , $2\pi$ times a Dirac delta of $\sum_{k}(\alpha_{k}+\beta_{k}-\gamma_{k})$ , expressing the conservation of the trace in Horn’s problem. One is left with an integration over $(n-1)$ variables222The Jacobian from $(x_{1},\cdots,x_{n})$ to $(\frac{1}{n}\sum x_{j},u_{1},\cdots,u_{n-1})$ is $(-1)^{n-1}$ . $u_{j}:=x_{j}-x_{j+1}$ of $(n!)^{3}$ terms of the form $\int_{\mathbb{R}^{n-1}}\frac{du_{j}}{\widetilde{\Delta}(u)}\prod_{j}e^{\mathrm{i\,}u_{j}A_{j}(P,P^{\prime},P^{\prime\prime})}$ where

[TABLE]

and

[TABLE]

It is also easy to see that one may absorb $P^{\prime\prime}$ through a redefinition of the $x$ ’s by $P^{\prime\prime}$ : $x_{j}\mapsto x_{P^{\prime\prime}(j)}$ (which introduces a welcome sign $\varepsilon_{P^{\prime\prime}}$ from the Vandermonde $\Delta(x)$ ) and a change of $P$ and $P^{\prime}$ into $P^{\prime\prime}P$ and $P^{\prime\prime}P^{\prime}$ . Thus $P^{\prime\prime}$ may be taken to be the trivial permutation $I$ in the above, with an overall factor $n!$ . Hence

[TABLE]

This is the expression that we are going to study in more detail for $n=2$ , $n=3$ and (to a lesser extent) $n=4,\ n=5$ . The constant in front of (17) reads

[TABLE]

which is equal to $\frac{1}{2},\frac{1}{3},\frac{1}{2},\frac{12}{5},\cdots$ for $n=2,3,4,5,\cdots$

**Remarks.

**1. Note that in that computation of $p$ , the last term in the r.h.s. of (16) drops out, because of the relation (3) embodied in the Dirac delta. The merit of that term is to make explicit the invariance of $A_{j}$ under a simultaneous translation of all $\gamma$ ’s: $\forall i,\ \gamma_{i}\to\gamma_{i}+c$ , expressing the fact that the PDF of eigenvalues of $C=A+B$ takes the same values as that of $A+B+cI$ , on a shifted support.

Convergence of ${\mathcal{J}}_{n}$ . ${\mathcal{J}}_{n}$ in (18) is a double sum over the symmetric group $S_{n}$ of the Fourier transform of ${\widetilde{\Delta}(u)}^{-1}$ evaluated at $A_{j}(P,P^{\prime},I)$ . Each of these integrals is absolutely convergent at infinity for $n>2$ , and is only semi-convergent for $n=2$ . Each one exhibits poles for vanishing partial sums $(u_{i}+u_{i+1}+\cdots u_{j-1})$ , (i.e., $x_{i}=x_{j}$ ), but the sum is regular at these points, as a result of the $(x_{i},x_{j})$ anti-symmetry of the determinant in (14). This enables us to introduce a Cauchy principal value prescription at each of these points, including infinity, and to compute each integral on the r.h.s. of (18) by repeated contour integrals (generalized Dirichlet integrals), see below. The resulting function of $\gamma$ is a piece-wise polynomial of degree $(n-1)(n-2)/2$ , a “box spline” as defined in [15].
In accordance with Theorem 4 of [5], the interpretation of ${\mathcal{J}}_{n}$ is that it gives the volume of the polytope in honeycomb space. This will be discussed in more detail in [10].
The normalization of ${\mathcal{J}}_{n}$ follows from that of $p$

[TABLE]

hence

[TABLE]

which equals $1,\frac{1}{2},\frac{1}{12},\frac{1}{288},\cdots$ for $n=2,3,4,5$ .

1.3 The case $n=2$

1.3.1 Direct calculation

For $n=2$ , the averaging of $B={\rm diag\,}(\beta_{1},\beta_{2})$ over the U(2) unitary group may be worked out directly, since in $UBU^{\dagger}$ , one may take simply $U=\exp-i\sigma_{2}\psi$ , $\sigma_{2}=\mbox{\footnotesize{\mbox{$ \begin{pmatrix}0&-\mathrm{i,}\ \mathrm{i,}&0\end{pmatrix} $}}}$ the Pauli matrix, $\psi$ an Euler angle between 0 and $\pi$ with the measure $\frac{1}{2}\sin\psi\,d\psi$ . The (unordered) eigenvalues of $A+UBU^{\dagger}$ are then

[TABLE]

(here and below, $\alpha_{12}:=\alpha_{1}-\alpha_{2}$ etc.) whence

[TABLE]

whose density is

[TABLE]

on its support

[TABLE]

in agreement with Horn’s inequalities. Indeed if we now choose $\gamma_{2}\leq\gamma_{1}$ , the latter read

[TABLE]

whence

[TABLE]

a triangular inequality familiar from the “rules of addition of angular momenta”, aka the Littlewood–Richardson coefficients for SU(2).

1.3.2 Applying eq. (17-18)

According to (18), for $n=2$ ,

[TABLE]

with

[TABLE]

Recall that $\alpha_{12},\beta_{12}\geq 0$ by convention, while $\gamma_{12}$ is unconstrained at this stage. As explained above, the $u$ integral, not absolutely convergent at infinity and with a pole at 0, is to be interpreted as a Cauchy principal value and then computed by a standard contour integral (Dirichlet integral)

[TABLE]

with $\epsilon$ the sign function. Thus

[TABLE]

if all $A(P,P^{\prime},I)\neq 0$ , which turns out to be expressible in terms of the characteristic (indicator) functions ${\bf 1}_{I}$ and ${\bf 1}_{-I}$ of the intervals $I=(|\alpha_{12}-\beta_{12}|,\alpha_{12}+\beta_{12})$ and $-I$

[TABLE]

If one of the arguments of the sign functions $\epsilon(\gamma_{12}\pm\alpha_{12}\pm\beta_{12})$ vanishes, i.e., if $\gamma_{12}$ stands at one of the end points of one of the intervals $I$ or $-I$ , one may see, returning to the original integral, that one must take the corresponding $\epsilon(0)=0$ , or equivalently the characteristic function ${\bf 1}$ takes the value $\frac{1}{2}$ at the end points of its support.

Our final result for the $n=2$ PDF thus reads

[TABLE]

which does integrate to 1 over $\mathbb{R}^{2}$ , as it should. In that case, the density is a discontinuous, piece-wise linear function over its support. This is in full agreement with the results (20), (22) and (23).

1.4 The case $n=3$

1.4.1 The inequalities and the polygon for $n=3$

Assuming the inequalities (2) satisfied by $\alpha,\beta$ and $\gamma$

[TABLE]

as well as (3), the Horn inequalities read

[TABLE]

These inequalities follow from Knutson-Tao’s inequalities on the honeycomb $\xi$ variable of Fig. 1

[TABLE]

Inequalities (30) are the necessary and sufficient conditions for $\gamma$ to belong to the polygon in the plane $\gamma_{1},\gamma_{2}$ (with $\gamma_{3}$ given by (3)). See [4] for a detailed discussion and proof. This polygon is at most an octagon, see Fig. 2. The red lines are AB: $\gamma_{3}=\gamma_{3min}$ , i.e., $\gamma_{1}+\gamma_{2}=\alpha_{1}+\alpha_{2}+\beta_{1}+\beta_{2}$ and DE: $\gamma_{3}=\gamma_{3max}$ ; and by (29), we retain only the part of the polygon below the diagonal $\gamma_{1}=\gamma_{2}$ (broken line IJ) and above HG: $\gamma_{3}=\gamma_{2}$ hence $\gamma_{1}+2\gamma_{2}=\sum\alpha_{i}+\beta_{i}$ (the blue line). Some of these lines may not cross the quadrangle CC’FF’, see figures below.

1.4.2 The PDF for $n=3$

According to (13-18), we may write for $n=3$

[TABLE]

where use has been made of (3). Integrating once again term by term by principal value and contour integrals, we find

[TABLE]

Note that in that expression, the vanishing of $A_{1}$ yields a vanishing result. The somewhat ambiguous value of the sign function at 0 is thus irrelevant. In the domain $\gamma_{3}\leq\gamma_{2}\leq\gamma_{1}$ , the corresponding sum of $2\times 6^{2}=72$ contributions vanishes if the set of Horn’s inequalities (30) is not satisfied, but conversely it is fairly difficult to read these inequalities off expression (35). When (3) and (29-30) are satisfied, it may be shown that this sum reduces to a sum of 4 terms

[TABLE]

where

[TABLE]

In Fig. 3, the three sectors in the $(\gamma_{1},\gamma_{2})$ plane where $\psi_{\alpha\beta}$ takes one of three values of (37) are depicted. It is manifest that $\psi_{\alpha\beta}$ is a continuous function of $\gamma$ , thanks to (3).

We recall that we have assumed that all $\alpha_{i}$ ’s on the one hand, and all $\beta_{j}$ ’s on the other, are distinct333Otherwise, ${\mathcal{J}}_{n}$ vanishes, by antisymmetry of the determinant in (14).. Then the function ${\mathcal{J}}_{3}$ is a piece-wise linear continuous function of the $\gamma$ ’s, making $p(\gamma|\alpha,\beta)$ a “piece-wise degree 4 polynomial” continuous function of those variables. The lines along which ${\mathcal{J}}_{3}$ is not differentiable are the segments of the three half-lines depicted on Fig. 3 that lie inside the polygon, those obtained when $\alpha$ and $\beta$ are swapped, and the inside segment of the line $\gamma_{2}=\alpha_{2}+\beta_{2}$ . These singular lines appear on some of the figures below.

Upon integration over $\gamma_{1},\gamma_{2}$ , the function $p$ of (32) sums to $1/6$ in the domain defined by (3, 29-30), hence to 1 on the $3!$ sectors obtained by relaxing (29).

Remark. There is an alternative expression of ${\mathcal{J}}_{3}$ that follows from its identification with the “volume” of the polytope of honeycombs, here simply the length of the $\xi$ -interval (31). This will be discussed in more detail in [10]. Thus we may also write, again when (3) and (29-30) are satisfied

[TABLE]

The non-differentiability of ${\mathcal{J}}_{3}$ occurs along lines where two arguments of the $\min$ or of the $\max$ functions coincide, but the detailed pattern is more difficult to grasp than on expression (36,37).

1.4.3 Examples

Take for example $\alpha=\beta=(1,0,-1)$ . Then $(\gamma_{1},\gamma_{2})$ subject to inequality (29) is restricted to a quadrangular domain ABDF with corners at $(2,0),\ (1,1),\ (0,0),\ (2,-1)$ . A typical plot of eigenvalues in that domain and their histogram obtained with samples of respectively 10,000 and $10^{6}$ random unitary matrices $U$ in ${\rm diag\,}(\alpha)+U{\rm diag\,}(\beta)U^{\dagger}$ is displayed in Fig. 4.a and 4.b, while the plot of the function $p(\gamma|\alpha,\beta)$ is in Fig. 4.c. Finally Fig. 4.d gives the full distribution when inequality (29) is relaxed.

Other examples are displayed in Fig. 5, exhibiting the lines of non-differentiability, as well as the sharp features of the PDF as two (or more) of the eigenvalues $\alpha$ or $\beta$ coalesce. All these plots, histograms and figures have been computed in Mathematica[16], making use in particular of the RandomVariate[CircularUnitaryMatrixDistribution[n]]

(resp. RandomVariate[CircularRealMatrixDistribution[n]] in sec. 2 and 3 below) to generate unitary, resp. real orthogonal matrices, uniformly distributed according to the Haar measure of ${\rm SU}(n)$ , resp ${\rm O}(n)$ or ${\rm SO}(n)$ .

Our result (36) is in excellent agreement with these numerical experiments, as seen on the figures.

1.5 The cases $n=4$ and $n=5$

The cases $n=4$ and $n=5$ have also been worked out, see Appendix B for some indications.

2 The probability density function (PDF) for real symmetric matrices

One may also consider Horn’s problem for real symmetric matrices of size $n$ .

Given two $n$ -plets of real eigenvalues $\alpha$ and $\beta$ , ordered as in (2), what is the range of eigenvalues $\gamma$ of ${\rm diag\,}(\alpha)+O\,{\rm diag\,}(\beta)\,O^{T}$ where now $O\in{\rm O}(n)$ , the group of real orthogonal matrices ? According to Fulton [3], the ordered $\gamma$ ’s still live in a convex domain given by the same conditions as in the Hermitian case. What about their PDF ? It turns out it looks quite different from the Hermitian case.

For $n=2$ , we have the sum rule $\gamma_{1}+\gamma_{2}=\alpha_{1}+\alpha_{2}+\beta_{1}+\beta_{2}$ . The difference $\gamma_{12}:=\gamma_{1}-\gamma_{2}$ , taken to be non negative by convention, depends only on $\alpha_{12}:=\alpha_{1}-\alpha_{2}\geq 0$ and $\beta_{12}\geq 0$ , namely $\gamma_{12}=\sqrt{\alpha_{12}^{2}+\beta_{12}^{2}+2\alpha_{12}\beta_{12}\cos(2\theta)}$ , with $0\leq\theta\leq 2\pi$ the angle of the relative O(2) rotation $O$ between $A$ and $B$ , whence a density $\rho(\gamma_{12})=-\frac{2}{\pi}\frac{d\theta}{d\gamma_{12}}$ , equal to

[TABLE]

with $\gamma_{12min}=|\alpha_{12}-\beta_{12}|,\ \gamma_{12max}=\alpha_{12}+\beta_{12}$ . This function is singular (but integrable) at the edges $\gamma_{12min}$ and $\gamma_{12max}$ of the support if $\gamma_{12min}\neq 0$ , and only at $\gamma_{12max}$ if $\gamma_{12min}=0$ , see Fig. 6.

For $n\geq 3$ , we have no analytic formula, but numerical experiments reveal curious enhanced regions and ridges in the density of points or histogram, see Figures 7. Empirically444M. Vergne (private communication) has shown that this is indeed the case., for $n=3$ , these enhancements take place along the same half-lines that appeared in the discussion of eq. (36-37), namely $(\gamma_{1}=\alpha_{1}+\beta_{2},\gamma_{2}\geq\alpha_{3}+\beta_{1})$ , $(\gamma_{2}=\alpha_{3}+\beta_{1},\gamma_{1}\leq\alpha_{1}+\beta_{2})$ , $(\gamma_{1}+\gamma_{2}=\alpha_{1}+\alpha_{3}+\beta_{1}+\beta_{2},\gamma_{1}\geq\alpha_{1}+\beta_{2})$ , restricted to their segments inside the polygon; the same with $\alpha$ and $\beta$ swapped; and the segment of the line $\gamma_{2}=\alpha_{2}+\beta_{2}$ inside the polygon. Similar features also occur for higher $n$ . The nature of these enhancements, presumably a weak integrable singularity, or even better, an analytic expression for the PDF, remain to be found.

3 The probability density function (PDF) for real skew-symmetric matrices

The same Horn’s problem may again be posed about real skew-symmetric matrices of size $n$ with the adjoint action of the group ${\rm O}(n)$ or ${\rm SO}(n)$ . Such matrices may always be block-diagonalized in the form

[TABLE]

We refer to such $\alpha$ ’s as the “eigenvalues” of $A$ . (The actual eigenvalues are in fact the $\pm i\alpha_{j}$ , $j=1,\cdots,m$ , together with 0 if $n=2m+1$ .) In the case of ${\rm O}(n)$ or ${\rm SO}(2m+1)$ , one may again order the $\alpha$ ’s as in (2) and choose them to be non negative. For the group ${\rm SO}(2m)$ , however, the matrix that swaps the sign of any $\alpha_{i}$ or $\beta_{i}$ is of determinant $-1$ : only an even number of sign changes are allowed but we may still impose

[TABLE]

and likewise for the $\beta_{j}$ ’s 555This reflects the structure of the Weyl group of type $B_{m}$ or $D_{m}$ .. As elsewhere in the present work, we focus on the case where the inequalities are strict.

Given two skew-symmetric matrices $A$ and $B$ and their eigenvalues $\alpha$ and $\beta$ , what is the range and density of the eigenvalues $\gamma$ of $A+OBO^{T}$ when $O$ runs over the real orthogonal group ${\rm O}(n)$ or ${\rm SO}(n)$ ?

In that case we have a Harish-Chandra integral at our disposal

[TABLE]

where on the second line, the primed sum $\sum^{\prime}$ runs over an even number of minus signs. In the denominator, $\Delta_{O}$ stands for

[TABLE]

if $m>1$ , while for $m=1$ , by convention $\prod_{1\leq i<j\leq m}(\alpha_{i}^{2}-\alpha_{j}^{2})\equiv 1$ . Finally the constants are (see Appendix A)

[TABLE]

(the numerators of which may also be regarded as the products $\prod_{i}m_{i}!$ of factorials of the Coxeter exponents of the Lie algebra $D_{m}=so(2m)$ (for $m\geq 4$ ), resp. of $B_{m}=so(2m+1)$ (for $m\geq 2$ )).

3.1 Case of even $n=2m$

A calculation similar to that of sect. 1.2 then leads to

[TABLE]

with as before, an even number of minus signs for $\varepsilon$ , and likewise for $\varepsilon^{\prime},\varepsilon^{\prime\prime}$ .

For $m=1$ , Horn’s problem is trivial: any skew-symmetric matrix $B=\mbox{\footnotesize{\mbox{$ \begin{pmatrix}0&\beta\ -\beta&0\end{pmatrix} $}}}$ commutes with an SO(2) rotation matrix while for the permutation $P=\mbox{\footnotesize{\mbox{$ \begin{pmatrix}0&1\ 1&0\end{pmatrix} $}}}$ that belongs to O(2) but not to SO(2), $P.B.P=-B$ . When $O\in{\rm O}(2)$ , resp. $\in{\rm SO}(2)$ , the “eigenvalues” of $A+O.B.O^{T}$ are $\pm\alpha\pm\beta$ with two independent signs, resp. simply $\alpha+\beta$ , which is precisely what is given by (43) when the $x$ integration is worked out :

[TABLE]

For $m=2$ (4 by 4 skew-symmetric matrices), using variables $s=(x_{1}+x_{2})$ and $t=(x_{1}-x_{2})$ , we write in the ${\rm SO}(4)$ case

[TABLE]

while in the O(4) case, each square bracket is replaced by

[TABLE]

After expansion and use of the formula

[TABLE]

one finds for SO(4)

[TABLE]

with the indicator functions of the intervals

[TABLE]

In the O(4) case, the result would be similar, with the big bracket in (45) replaced by

[TABLE]

and a sum over intervals

[TABLE]

where $\varepsilon,\varepsilon^{\prime}$ are two independent signs.

It is an easy exercise to check that $p$ integrates to 1 over the whole $\gamma$ -plane.

The resulting PDF is much more irregular than in the $n=4$ Hermitian case, with discontinuities across some lines. Its support is clearly convex in the ${\rm SO}(4)$ case, in accordance with general theorems. In the O(4) case, the support may be non convex, as apparent on Fig. 8. This is a consequence of the non connectivity of the group. When the contributions of the two connected parts ${\rm SO}(4)$ and ${\rm O}(4)\backslash{\rm SO}(4)$ are computed separately, one sees clearly that convexity of the support is restored for each666My thanks to Allen Knutson and Michèle Vergne for emphasizing the rôle of connectivity of the group in the convexity theorem..

3.2 Case of odd $n=2m+1$

We now write

[TABLE]

For $m=1$ , i.e., $n=3$ , the calculation is essentially identical to that of sect. 1.3.2 777indeed, the action of $U(2)$ on Hermitian matrices $\begin{pmatrix}\alpha&0\\ 0&-\alpha\end{pmatrix}$ and $\begin{pmatrix}\beta&0\\ 0&-\beta\end{pmatrix}$ resembles that of O(2) on skew-symmetric matrices $\begin{pmatrix}0&\alpha\\ -\alpha&0\end{pmatrix}$ and $\begin{pmatrix}0&\beta\\ -\beta&0\end{pmatrix}$ …

[TABLE]

thus a piece-wise linear and discontinuous function of $\gamma$ .

For $n=5$ , $m=2$ , we have

[TABLE]

We then make use as above of variables $s=(x_{1}+x_{2})$ and $t=(x_{1}-x_{2})$ and of the identity

[TABLE]

and the $x$ -integral in (50) reduces to

[TABLE]

We refrain from giving the full expression of ${\mathcal{I}}$ (a sum of $2^{7}$ terms …), which is a continuous and piecewise quadratic function of the $\gamma$ ’s, and just display a sample of results for explicit examples, see Fig. 9.

In general, the inequalities determining the support have been written by Belkale and Kumar [18].

4 Discussion

The same calculation could be carried out for quaternionic anti-selfdual matrices and their orbits under the action of the group Sp( $2m$ ), where again a Harish-Chandra formula is available. To keep this paper in a reasonable size, we refrain from discussing that case.

Both in the Hermitian/unitary and the skew-symmetric/orthogonal cases, we observe the same feature: the PDF tends to become more and more regular as $n$ increases: a sum of Dirac masses for the lowest values, ( $n=1$ , resp. $n=2$ ), then a discontinuous function for $n=2$ , resp. $n=3,4$ , and finally a continuous function of class $C^{n-3}$ for $n\geq 3$ , resp. $C^{p}$ with $p=\lfloor\frac{1}{2}(n-5)\rfloor$ for $n\geq 5$ . By Riemann-Lebesgue theorem, this is just a reflection of the increasingly fast decay of its Fourier transform at large $x$ .

We recall that our discussion has left aside the case where two or more eigenvalues coincide…

Acknowledgements

It is a pleasure to thank Michel Bauer for helpful suggestions and a careful reading of the manuscript, Denis Bernard, Robert Coquereaux and Philippe Di Francesco for their interest and encouragement, and Hugo Ricateau for his advises on Mathematica. I’m very grateful to Allen Knutson and especially to Michèle Vergne for inspiring exchanges and guidance in the literature.

Appendix A. Normalization constants

Consider the set ${\mathcal{X}_{n}}$ of Hermitian, resp real skew-symmetric, $n$ by $n$ matrices.

For $A\in{\mathcal{X}_{n}}$ , with eigenvalues $\alpha_{i}$ (in the sense of (40) in the skew-symmetric case), write the Lebesgue measure on $A$ as $DA=\kappa\Delta(\alpha)^{2}\prod_{i=1}^{r}d\alpha_{i}\,DU_{A}$ , with $U_{A}\in{\rm U}(n)$ , resp $\in{\rm O}(n)$ .

The constant $\kappa$ and the Harish-Chandra integral

[TABLE]

are given by the following Table.

$\begin{array}[]{c|c|c|c|c}{\mathcal{X}_{n}}&\Delta(\alpha)&\kappa&{\mathcal{H}}_{G}(\alpha,\beta)&\hat{\kappa}\\ &&&&\\ \hline\cr&&&&\\ \textrm{Hermitian}&\prod_{1\leq i<j\leq n}(\alpha_{i}-\alpha_{j})&\frac{(2\pi)^{n(n-1)/2}}{\prod_{p=1}^{n}p!}&\hat{\kappa}\frac{(\det e^{\alpha_{i}\beta_{j}})_{i,j=1,\cdots,n}}{\Delta(\alpha)\Delta(\beta)}&\prod_{p=1}^{n-1}p!\\ H_{n}&&&&\\ \hline\cr&&&&\\ \textrm{skew-symmetric}&\prod_{1\leq i<j\leq m}(\alpha_{i}^{2}-\alpha_{j}^{2})&\frac{2^{2m^{2}-\frac{3}{2}m}\pi^{m(m-1)}}{m!\prod_{p=1}^{m-1}(2p)!}&\hat{\kappa}\frac{(\det\cos{2\alpha_{i}\beta_{j}})_{i,j=1,\cdots,m}}{\Delta(\alpha)\Delta(\beta)}&\frac{(m-1)!\prod_{p=1}^{m-1}(2p-1)!}{2^{(m-1)^{2}}}\\ A_{2m}&&&&\\ \hline\cr&&&&\\ \textrm{skew-symmetric}&\prod_{i}\alpha_{i}\prod_{1\leq i<j\leq m}(\alpha_{i}^{2}-\alpha_{j}^{2})&\frac{2^{2m^{2}+\frac{1}{2}m}\pi^{m^{2}}}{m!\prod_{p=1}^{m}(2p)!}&\hat{\kappa}\frac{(\det\sin{2\alpha_{i}\beta_{j}})_{i,j=1,\cdots,m}}{\Delta(\alpha)\Delta(\beta)}&\frac{\prod_{p=1}^{m}(2p-1)!}{2^{m^{2}}}\\ A_{2m+1}&&&&\\ \hline\cr\end{array}$

The constant $\kappa$ may be determined by carrying out the calculation of a Gaussian integral in two different ways, integrating either over the original matrix elements, or over the eigenvalues.

The constant $\hat{\kappa}$ may be determined by considering the limit where all $\alpha_{i}$ are scaled to zero.

Appendix B. The cases of SU(4) and SU(5)

B.1 Horn’s inequalities for 4 by 4 Hermitian matrices

[TABLE]

following from the 41 so-called $(*IJK)$ inequalities [Fu]

[TABLE]

B.2 The PDF for $n=4$

[TABLE]

with $A_{j}$ is a shorthand notation for $A_{j}(P,P^{\prime},I)$ given in (16).

For $\gamma_{4}\leq\gamma_{3}\leq\gamma_{2}\leq\gamma_{1}$ , this sum vanishes if the inequalities (B.1-B.2) are not satisfied.

${\mathcal{J}}_{4}$ is normalized according to (19), i.e., $\int_{\mathrm{sector}\atop\gamma_{4}\leq\gamma_{3}\leq\gamma_{2}\leq\gamma_{1}}d^{3}\gamma\,\frac{\Delta(\gamma)}{\Delta(\alpha)\Delta(\beta)}{{\mathcal{J}}_{4}}=\frac{1}{12}$ .

Note that the above expression of ${\mathcal{J}}_{4}$ has the property that the two sign functions $\epsilon(A_{1})$ and $\epsilon(A_{2}-A_{1})$ are in front of expressions that vanish when $A_{1}$ , resp. $A_{2}-A_{1}$ , vanishes. The somewhat ambiguous value of the sign function at 0 is thus irrelevant.

B.3 A few words about $n=5$

For $n=5$ , Horn’s inequalities and the expression of ${\mathcal{J}}_{5}$ are too cumbersome to be given here – it is a spline function made of 628 terms of degree 6…–, but may be found on the web site http://www.lpthe.jussieu.fr/~zuber/Z_Unpub.html. We have checked a certain number of consistency relations, its vanishing when Horn’s inequalities are not satisfied, and the normalization condition (19), namely $\int_{\mathrm{sector}\atop\gamma_{5}\leq\gamma_{4}\leq\gamma_{3}\leq\gamma_{2}\leq\gamma_{1}}d^{4}\gamma\,\frac{\Delta(\gamma)}{\Delta(\alpha)\Delta(\beta)}{\mathcal{J}_{5}}=\frac{1}{288}$ .

Bibliography18

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Horn, Eigenvalues of sums of Hermitian matrices, Pacific J. Math. 12 (1962), 225–241.
2[2] A. A. Klyachko, Stable bundles, representation theory and Hermitian operators, Selecta Math. (N.S.) 4 (1998), 419–445.
3[3] W. Fulton, Eigenvalues, invariant factors, highest weights, and Schubert calculus, Bull. Amer. Math. Soc. 37 (2000), 209–249; http://arxiv.org/abs/math/9908012
4[4] A. Knutson, T. Tao, The honeycomb model of GL(n) tensor products I: proof of the saturation conjecture, J. Amer. Math. Soc. 12 (1999), 1055–1090; http://arxiv.org/abs/math/9807160
5[5] A. Knutson, T. Tao, Honeycombs and sums of Hermitian matrices, Notices Amer. Math. Soc. 48 (2000), 175–186; http://arxiv.org/abs/math/0009048
6[6] A. Knutson, T. Tao, C.Woodward, The honeycomb model of GL(n) tensor products II: Puzzles determine facets of the Littlewood-Richardson cone, J. Amer. Math. Soc. 17 (2004), 19–48; http://arxiv.org/abs/math/0107011
7[7] Harish-Chandra, Differential Operators on a Semisimple Algebra, Amer. J. Math. 79 (1957), 87–120
8[8] C. Itzykson, J.-B. Zuber, The planar approximation II, J. Math. Phys. 21 (1980), 411–421

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

1 Horn’s problem for Hermitian matrices

1.1 A short review and summary of results

1.2 The probability density function (PDF)

Proposition 1

1.3 The case n=2n=2n=2

1.3.1 Direct calculation

1.3.2 Applying eq. (17-18)

1.4 The case n=3n=3n=3

1.4.1 The inequalities and the polygon for n=3n=3n=3

1.4.2 The PDF for n=3n=3n=3

1.4.3 Examples

1.5 The cases n=4n=4n=4 and n=5n=5n=5

2 The probability density function (PDF) for real symmetric matrices

3 The probability density function (PDF) for real skew-symmetric matrices

3.1 Case of even n=2mn=2mn=2m

3.2 Case of odd n=2m+1n=2m+1n=2m+1

4 Discussion

Acknowledgements

Appendix A. Normalization constants

Appendix B. The cases of SU(4) and SU(5)

B.1 Horn’s inequalities for 4 by 4 Hermitian matrices

B.2 The PDF for n=4n=4n=4

B.3 A few words about n=5n=5n=5

1.3 The case $n=2$

1.4 The case $n=3$

1.4.1 The inequalities and the polygon for $n=3$

1.4.2 The PDF for $n=3$

1.5 The cases $n=4$ and $n=5$

3.1 Case of even $n=2m$

3.2 Case of odd $n=2m+1$

B.2 The PDF for $n=4$

B.3 A few words about $n=5$