A Proof of Vivo-Pato-Oshanin's Conjecture on the Fluctuation of von   Neumann Entropy

Lu Wei

arXiv:1706.08199·math-ph·August 11, 2017

A Proof of Vivo-Pato-Oshanin's Conjecture on the Fluctuation of von Neumann Entropy

Lu Wei

PDF

TL;DR

This paper provides a rigorous proof for a conjecture regarding the variance of von Neumann entropy in bipartite quantum systems, confirming a specific formula involving special functions.

Contribution

The paper offers the first complete proof of Vivo, Pato, and Oshanin's conjecture on the fluctuation of von Neumann entropy for quantum subsystems.

Findings

01

Confirmed the conjectured variance formula for von Neumann entropy

02

Validated the specific mathematical expression involving trigamma functions

03

Contributed to the theoretical understanding of quantum entropy fluctuations

Abstract

It was recently conjectured by Vivo, Pato, and Oshanin [Phys. Rev. E 93, 052106 (2016)] that for a quantum system of Hilbert dimension $mn$ in a pure state, the variance of the von Neumann entropy of a subsystem of dimension $m \leq n$ is given by \begin{equation*} -\psi_{1}\left(mn+1\right)+\frac{m+n}{mn+1}\psi_{1}\left(n\right)-\frac{(m+1)(m+2n+1)}{4n^{2}(mn+1)}, \end{equation*} where $ψ_{1} (\cdot)$ is the trigamma function. We give a proof of this formula.

Tables1

Table 1. Table 1: Special Cases

$m = 1$	$I_{A}$	$n (n + 1) ψ_{1} (n) + n (n + 1) ψ_{0}^{2} (n) + (4 n + 2) ψ_{0} (n) + 2$
	$I_{B}$	${(n ψ_{0} (n) + 1)}^{2}$
	$𝔼_{g} [T^{2}]$	$n (n + 1) ψ_{1} (n) + n (n + 1) ψ_{0}^{2} (n) + (4 n + 2) ψ_{0} (n) + 2$
$m = 2$	$I_{A}$	$2 (n (n + 2) ψ_{1} (n) + n (n + 2) ψ_{0}^{2} (n) + (7 n + 4) ψ_{0} (n) + n + 5)$
	$I_{B}$	$2 n (n + 1) ψ_{0}^{2} (n) + 2 (5 n + 1) ψ_{0} (n) + 2 n + 7$
	$𝔼_{g} [T^{2}]$	$2 (n (n + 2) ψ_{1} (n) + n (2 n + 1) ψ_{0}^{2} (n) + (8 n + 3) ψ_{0} (n) + 6)$
$m = n$	$I_{A}$	$\frac{1}{9} (- 18 n^{3} ψ_{1} (n) + 36 n^{3} ψ_{1} (1) + 18 n^{3} ψ_{0}^{2} (n) + 6 n (5 n^{2} + 3 n + 1) ψ_{0} (n) - 43 n^{3} + 33 n^{2} + 22 n + 6)$
	$I_{B}$	$\frac{1}{18} (- 72 n^{3} ψ_{1} (n) + 72 n^{3} ψ_{1} (1) + 18 (2 n - 1) n^{2} ψ_{0}^{2} (n) + 6 n (10 n^{2} - 3 n - 1) ψ_{0} (n) - 86 n^{3} + 57 n^{2} + 35 n + 12)$
	$𝔼_{g} [T^{2}]$	$\frac{1}{4} (8 n^{3} ψ_{1} (n) + 4 n^{2} (n^{2} + 1) ψ_{0}^{2} (n) + 4 n (n^{3} + n^{2} + 3 n + 1) ψ_{0} (n) + n (n + 1) (n^{2} + n + 2))$

Equations228

- ψ_{1} (mn + 1) + \frac{m + n}{mn + 1} ψ_{1} (n) - \frac{( m + 1 ) ( m + 2 n + 1 )}{4 n ^{2} ( mn + 1 )},

- ψ_{1} (mn + 1) + \frac{m + n}{mn + 1} ψ_{1} (n) - \frac{( m + 1 ) ( m + 2 n + 1 )}{4 n ^{2} ( mn + 1 )},

tr (XX^{†}) = 1.

tr (XX^{†}) = 1.

f (λ)

f (λ)

c = i = 1 \prod m Γ (n - i + 1) Γ (i) .

c = i = 1 \prod m Γ (n - i + 1) Γ (i) .

S = - i = 1 \sum m λ_{i} ln λ_{i},

S = - i = 1 \sum m λ_{i} ln λ_{i},

E_{f} [S] = ψ_{0} (mn + 1) - ψ_{0} (n) - \frac{m + 1}{2 n},

E_{f} [S] = ψ_{0} (mn + 1) - ψ_{0} (n) - \frac{m + 1}{2 n},

ψ_{0} (l) = - γ + k = 1 \sum l - 1 \frac{1}{k},

ψ_{0} (l) = - γ + k = 1 \sum l - 1 \frac{1}{k},

- ψ_{1} (mn + 1) + \frac{m + n}{mn + 1} ψ_{1} (n) - \frac{( m + 1 ) ( m + 2 n + 1 )}{4 n ^{2} ( mn + 1 )},

- ψ_{1} (mn + 1) + \frac{m + n}{mn + 1} ψ_{1} (n) - \frac{( m + 1 ) ( m + 2 n + 1 )}{4 n ^{2} ( mn + 1 )},

ψ_{1} (l) = \frac{π ^{2}}{6} - k = 1 \sum l - 1 \frac{1}{k ^{2}} .

ψ_{1} (l) = \frac{π ^{2}}{6} - k = 1 \sum l - 1 \frac{1}{k ^{2}} .

XX^{†} = \frac{YY ^{†}}{tr ( YY ^{†} )},

XX^{†} = \frac{YY ^{†}}{tr ( YY ^{†} )},

g (θ) = \frac{1}{c} 1 \leq i < j \leq m \prod (θ_{i} - θ_{j})^{2} i = 1 \prod m θ_{i}^{n - m} e^{- θ_{i}},

g (θ) = \frac{1}{c} 1 \leq i < j \leq m \prod (θ_{i} - θ_{j})^{2} i = 1 \prod m θ_{i}^{n - m} e^{- θ_{i}},

r = tr (YY^{†}) = i = 1 \sum m θ_{i}

r = tr (YY^{†}) = i = 1 \sum m θ_{i}

h_{mn} (r) = \frac{1}{Γ ( mn )} e^{- r} r^{mn - 1}, r \in [0, \infty) .

h_{mn} (r) = \frac{1}{Γ ( mn )} e^{- r} r^{mn - 1}, r \in [0, \infty) .

λ_{i} = \frac{θ _{i}}{r}, i = 1, \dots, m,

λ_{i} = \frac{θ _{i}}{r}, i = 1, \dots, m,

f (λ) h_{mn} (r) d r i = 1 \prod m d λ_{i} = g (θ) i = 1 \prod m d θ_{i} .

f (λ) h_{mn} (r) d r i = 1 \prod m d λ_{i} = g (θ) i = 1 \prod m d θ_{i} .

T = i = 1 \sum m θ_{i} ln θ_{i},

T = i = 1 \sum m θ_{i} ln θ_{i},

S = - i = 1 \sum m \frac{θ _{i}}{r} ln \frac{θ _{i}}{r} = r^{- 1} (r ln r - T) .

S = - i = 1 \sum m \frac{θ _{i}}{r} ln \frac{θ _{i}}{r} = r^{- 1} (r ln r - T) .

E_{f} [S]

E_{f} [S]

=

\int_{0}^{\infty} e^{- r} r^{a - 1} ln r d r = Γ (a) ψ_{0} (a), Re (a) > 0.

\int_{0}^{\infty} e^{- r} r^{a - 1} ln r d r = Γ (a) ψ_{0} (a), Re (a) > 0.

E_{g} [T] = mn ψ_{0} (n) + \frac{1}{2} m (m + 1),

E_{g} [T] = mn ψ_{0} (n) + \frac{1}{2} m (m + 1),

S^{2}

S^{2}

=

E_{f} [S^{2}] = \int_{λ} r^{- 2} (T^{2} + S 2 r^{2} ln r - r^{2} ln^{2} r) f (λ) i = 1 \prod m d λ_{i} .

E_{f} [S^{2}] = \int_{λ} r^{- 2} (T^{2} + S 2 r^{2} ln r - r^{2} ln^{2} r) f (λ) i = 1 \prod m d λ_{i} .

E_{f} [S^{2}] = \frac{1}{mn ( mn + 1 )} \int_{λ} \int_{r} T^{2} f (λ) h_{mn} (r) d r i = 1 \prod m d λ_{i} +

E_{f} [S^{2}] = \frac{1}{mn ( mn + 1 )} \int_{λ} \int_{r} T^{2} f (λ) h_{mn} (r) d r i = 1 \prod m d λ_{i} +

\frac{2}{mn ( mn + 1 )} \int_{λ} S f (λ) i = 1 \prod m d λ_{i} \int_{r} h_{mn} (r) r^{2} ln r d r -

\frac{1}{mn ( mn + 1 )} \int_{λ} f (λ) i = 1 \prod m d λ_{i} \int_{r} h_{mn} (r) r^{2} ln^{2} r d r .

\int_{0}^{\infty} e^{- r} r^{a - 1} ln^{2} r d r = Γ (a) (ψ_{1} (a) + ψ_{0}^{2} (a)), Re (a) > 0,

\int_{0}^{\infty} e^{- r} r^{a - 1} ln^{2} r d r = Γ (a) (ψ_{1} (a) + ψ_{0}^{2} (a)), Re (a) > 0,

E_{f} [S^{2}]

E_{f} [S^{2}]

mn (m + n) ψ_{1} (n) + mn (mn + 1) ψ_{0}^{2} (n) + m (m^{2} n + mn + m + 2 n + 1) ψ_{0} (n) + \frac{1}{4} m (m + 1) (m^{2} + m + 2),

mn (m + n) ψ_{1} (n) + mn (mn + 1) ψ_{0}^{2} (n) + m (m^{2} n + mn + m + 2 n + 1) ψ_{0} (n) + \frac{1}{4} m (m + 1) (m^{2} + m + 2),

ψ_{0} (l + n)

ψ_{0} (l + n)

ψ_{1} (l + n)

E_{g} [T^{2}]

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Proof of Vivo-Pato-Oshanin’s Conjecture

on the Fluctuation of von Neumann Entropy

Lu Wei

[email protected]

Department of Electrical and Computer Engineering

University of Michigan-Dearborn, MI 48128, USA

Abstract

It was recently conjectured by Vivo, Pato, and Oshanin [Phys. Rev. E $\bm{93}$ , 052106 (2016)] that for a quantum system of Hilbert dimension $mn$ in a pure state, the variance of the von Neumann entropy of a subsystem of dimension $m\leq n$ is given by

[TABLE]

where $\psi_{1}(\cdot)$ is the trigamma function. We give a proof of this formula.

††preprint: APS/123-QED

I Background and the Conjecture

Consider a composite quantum system that consists of two subsystems $A$ and $B$ of Hilbert space dimensions $m$ and $n$ . The Hilbert space $\mathcal{H}_{A+B}$ of the composite system is given by the tensor product of the Hilbert spaces of the subsystems, $\mathcal{H}_{A+B}=\mathcal{H}_{A}\otimes\mathcal{H}_{B}$ . The random pure state of the composite system is written as a linear combination of the random coefficients $x_{i,j}$ and the complete basis $\left\{\Ket{i^{A}}\right\}$ and $\left\{\Ket{j^{B}}\right\}$ of $\mathcal{H}_{A}$ and $\mathcal{H}_{B}$ , $\Ket{\psi}=\sum_{i=1}^{m}\sum_{j=1}^{n}x_{i,j}\Ket{i^{A}}\otimes\Ket{j^{B}}$ . The corresponding density matrix $\rho=\Ket{\psi}\Bra{\psi}$ has the natural constraint ${\mathrm{tr}}(\rho)=1$ . This implies that the $m\times n$ random coefficient matrix $\mathbf{X}=(x_{i,j})$ satisfies

[TABLE]

Without loss of generality, it is assumed that $m\leq n$ . The reduced density matrix $\rho_{A}$ of the smaller subsystem $A$ admits the Schmidt decomposition $\rho_{A}=\sum_{i=1}^{m}\lambda_{i}\Ket{\phi_{i}^{A}}\Bra{\phi_{i}^{A}}$ , where $\lambda_{i}$ is the $i$ -th largest eigenvalue of $\mathbf{XX}^{{\dagger}}$ . The conservation of probability (1) now implies the constraint $\sum_{i=1}^{m}\lambda_{i}=1.$ The probability measure of the random coefficient matrix $\mathbf{X}$ is the Haar measure, where the entries are uniformly distributed over all the possible values satisfying the constraint (1). The resulting eigenvalue density of $\mathbf{XX}^{{\dagger}}$ is well known (see, e.g., Page (1993)),

[TABLE]

where $\delta(\cdot)$ is the Dirac delta function and the constant

[TABLE]

The random matrix ensemble (2) is also known as the (unitary) fixed-trace ensemble. The considered bipartite quantum system is a fundamental model that describes the interaction between physical object and its environment. For example Page (1993), the subsystem $A$ is the black hole and the subsystem $B$ is the associated radiation field. In another example Majumdar , the subsystem $A$ is a set of spins and the subsystem $B$ represents the environment of a heat bath.

A measure of the entanglement of the considered bipartite quantum system is the von Neumann entropy

[TABLE]

where $S\in\left[0,\ln{m}\right]$ . Its mean value was conjectured by Page Page (1993) as

[TABLE]

where $\mathbb{E}_{f}\!\left[\cdot\right]$ denotes that the expectation is taken over the fixed-trace ensemble (2). Here, $\psi_{0}(x)=\,\mathrm{d}\ln\Gamma(x)/\,\mathrm{d}x$ is the digamma function (Psi function) Luke (1969) and for a positive integer $l$ ,

[TABLE]

where $\gamma\approx 0.5772$ is the Euler’s constant. The mean value formula (5) was proved independently by Foong-Kanno Foong and Kanno (1994), Sánchez-Ruiz Sánchez-Ruiz (1995), Sen Sen (1996), and Adachi-Toda-Kubotani Adachi et al. (2009). For the orthogonal and symplectic fixed-trace ensembles, the mean formulas of the von Neumann entropy were derived by Kumar-Pandey Kumar and Pandey (2011).

To gain more insights, one needs to know the fluctuation of the von Neumann entropy. In fact, its mean value turns out to be a poor representative that has led to an incorrect conclusion on the full distribution Page (1993). Recently, Vivo, Pato, and Oshanin conjectured (Vivo et al., 2016, eq. (57)), based on small $n$ and $m$ calculations from some complicated representations (Vivo et al., 2016, eqs. (54)–(56), (A3), (A9)), that the variance of the von Neumann entropy $\mathbb{V}\!_{f}\!\left[S\right]$ equals

[TABLE]

where $\psi_{1}(x)=\,\mathrm{d}^{2}\ln\Gamma(x)/\,\mathrm{d}x^{2}$ is the trigamma function Luke (1969)111The digamma and trigamma functions are the polygamma functions of order zero and one, respectively. and for a positive integer $l$ ,

[TABLE]

In this paper, we show that the conjecture (7) of Vivo-Pato-Oshanin (VPO) is indeed correct. The presentation of the proof is organized as follows. In Sec. II, we relate the variance of the von Neumann entropy to that of an induced one over the Laguerre ensemble, which is calculated explicitly. The derived induced variance is simplified to functions involving digamma and trigamma functions in Sec. III that leads to a proof of the conjecture. Most of the technical tools for the simplification are presented in the Appendix. Finally, we point out that even though the exact distribution of von Neumann entropy is unknown, its asymptotic distribution was obtained via the Coulomb gas approach by Nadal-Majumdar-Vergassola Nadal et al. (2011).

II Variance of an Induced Entropy in Laguerre Ensemble

II.1 Variance Relation

By the construction (1), the random coefficient matrix $\mathbf{X}$ has a natural relation with a Wishart matrix $\mathbf{YY}^{{\dagger}}$ as

[TABLE]

where $\mathbf{Y}$ is an $m\times n$ ( $m\leq n$ ) matrix of independently and identically distributed complex Gaussian entries. The density of the eigenvalues $0<\theta_{m}<\dots<\theta_{1}<\infty$ of $\mathbf{YY}^{{\dagger}}$ equals Forrester (2010)

[TABLE]

where $c$ is given by (3) and the above ensemble is known as the Laguerre ensemble. The trace of the Wishart matrix

[TABLE]

follows a gamma distribution with the density Vivo et al. (2016)

[TABLE]

The relation (9) induces the change of variables

[TABLE]

that leads to a well-known relation (see, e.g. Page (1993)) among the densities (2), (10), and (12) as

[TABLE]

This relation implies that $r$ is independent of each $\lambda_{i}$ , $i=1,\ldots,m$ , since their densities factorize.

Page Page (1993) exploited the relation (14) by relating the first moment of von Neumann entropy over the fixed-trace ensemble (2) to that of an induced entropy 222For convenience of the discussion, we refer the random variable $T$ as an induced entropy, which may not have physical meaning of an entropy.

[TABLE]

over the Laguerre ensemble (10) as follows. First, by using the relations (13), one has

[TABLE]

Then, the expected value of $S$ is evaluated as

[TABLE]

where the expectation $\mathbb{E}_{g}\!\left[\cdot\right]$ is taken over the Laguerre ensemble (10). Here, (17) is obtained by the identity $r^{-1}h_{mn+1}(r)=h_{mn}(r)/mn$ and the fact that $r$ is independent of $\bm{\lambda}$ , and (18) is established by the change of measures (14) and the identity

[TABLE]

Sánchez-Ruiz Sánchez-Ruiz (1995) and Sen Sen (1996) have calculated that

[TABLE]

and together with the relation (18) leads to their proofs of Page’s conjecture on the mean entropy (5).

We now show that the idea of Page Page (1993) can be generalized to find a relation between the second moments (hence the variances since the first moments are known) of $S$ and $T$ , which is the starting point of our calculations. First, using the result (16) we have

[TABLE]

The expression (22) is obtained by replacing only the first power of $T$ in (21) by the identity (16), and the reason for this replacement will become clear. The second moment of $S$ can now be written as

[TABLE]

To utilize the independence between $r$ and $\bm{\lambda}$ , we multiple (23) by an appropriate constant $1=\int_{r}h_{mn+2}(r)\,\mathrm{d}{r}$ , which, with the fact that $r^{-2}h_{mn+2}(r)=h_{mn}(r)/mn(mn+1)$ , leads to

[TABLE]

From the second line of the above equation, we see that the replacement of the first power of $T$ by $S$ in (21) makes it possible to evaluate the integrals over $r$ and $\bm{\lambda}$ separately. Finally, using the change of measures (14) as well as the identities (19) and

[TABLE]

we arrive at

[TABLE]

Inserting the mean formula (5) and the VPO’s conjecture (7) into the definition $\mathbb{E}_{f}\!\left[S^{2}\right]=\mathbb{V}\!_{f}\!\left[S\right]+\mathbb{E}_{f}^{2}\!\left[S\right]$ , and equating it to the derived relation (25), the VPO’s conjecture boils down to showing that $\mathbb{E}_{g}\!\left[T^{2}\right]$ is given by

[TABLE]

where we have used the identities (cf. (6) and (8))

[TABLE]

for the case $l=mn+1$ , $n=1$ .

We have so far converted the VPO’s conjecture (7) evaluated over the fixed-trace ensemble (2) to an equivalent conjecture (26) evaluated over the Laguerre ensemble (10). Instead of working directly with the complicated correlation functions of the fixed-trace ensemble as in Adachi et al. (2009); Kumar and Pandey (2011); Vivo et al. (2016), the induced variance over the well-investigated correlation functions of the Laguerre ensemble can be explicitly calculated as will be shown in Sec. II.2. The proposed ‘moments conversion’ approach may be generalized to study the higher moments of the von Neumann entropy as well as other entanglement measures such as the Tsallis entropy and the Rényi entropy.

II.2 Calculations of the Induced Variance

Since $T^{2}=\sum_{i=1}^{m}\theta_{i}^{2}\ln^{2}\theta_{i}+2\sum_{1\leq i<j\leq m}\theta_{i}\theta_{j}\ln\theta_{i}\ln\theta_{j}$ , the calculation of $\mathbb{E}_{g}\!\left[T^{2}\right]$ involves one and two arbitrary eigenvalue densities, denoted respectively by $g_{1}(x_{1})$ and $g_{2}(x_{1},x_{2})$ , of the Laguerre ensemble as

[TABLE]

In general, the joint density of $N$ arbitrary eigenvalues $g_{N}(x_{1},\dots,x_{N})$ is related to the $N$ -point correlation function

[TABLE]

as Forrester (2010) $g_{N}(x_{1},\dots,x_{N})=X_{N}\left(x_{1},\dots,x_{N}\right)(m-N)!/m!$ , where $\det(\cdot)$ is the matrix determinant and the symmetric function $K(x_{i},x_{j})$ is the correlation kernel. In particular, we have

[TABLE]

As a result, one can represent (II.2) as

[TABLE]

where

[TABLE]

and we have used the result (20) and the definition

[TABLE]

Before computing the integrals $I_{A}$ and $I_{B}$ , the following results on the correlation functions (29) are needed. The correlation kernel of the Laguerre ensemble can be explicitly written as Forrester (2010)

[TABLE]

where

[TABLE]

with

[TABLE]

being the (generalized) Laguerre polynomial of degree $k$ . The Laguerre polynomials satisfy the well-known orthogonality relation Forrester (2010)

[TABLE]

where $\delta_{kl}$ is the Kronecker delta function. It is known that the one-point correlation function (cf. (29)) admits a more convenient representation as Sánchez-Ruiz (1995); Forrester (2010)

[TABLE]

We also need the following identity, due to Schrödinger Schrödinger (1926), that generalizes the integral (36) to

[TABLE]

By taking the first and second derivative on both sides of (II.2) with respect to $q$ , we obtain two more integral identities as shown in (II.2.1) (see also Sánchez-Ruiz (1995)) and (II.2.1), which are respectively denoted by $B_{s,t}^{(\alpha,\beta)}(q)$ and $A_{s,t}^{(\alpha,\beta)}(q)$ . With the above preparations, we now proceed to the calculations of $I_{A}$ in (31) and $I_{B}$ in (32).

II.2.1 Calculating $I_{A}$

By the fact that (cf. (29))

[TABLE]

one inserts (37) into (31) to obtain

[TABLE]

where for convenience we have further defined (cf. (II.2.1))

[TABLE]

We now use (II.2.1), and the contribution to the sum

[TABLE]

consists of the cases when the binomial terms are zero ( $k=0,\dots,m-3$ ) with the polygamma functions being infinity and are nonzero ( $k=m-2,m-1$ ) with the polygamma functions being finite. Namely, we have

[TABLE]

which by interpreting the gamma and polygamma functions of negative integer arguments as the limit $\epsilon\to 0$ of

[TABLE]

leads to a well-defined limit

[TABLE]

In the same manner that has led to $\mathcal{A}_{m-1,m-1}$ , we obtain

[TABLE]

Finally, we insert (43), (II.2.1), (II.2.1) into (40) and simplify the expression by rearranging the sums as well as using (27) to obtain

[TABLE]

II.2.2 Calculating $I_{B}$

Inserting (33) into (32) and using the symmetry of the correlation kernel, the integral $I_{B}$ can be represented as

[TABLE]

where we have further defined (cf. (II.2.1))

[TABLE]

The identity (II.2.1) gives

[TABLE]

where $j=k-1,k$ provides the nonzero contribution to the sum and we have used (27a) for the simplification. In the same manner, one obtains

[TABLE]

and the cases $j=2,\dots,m-1$ are computed to be

[TABLE]

Inserting (52), (53), and (54) into (50), we arrive at

[TABLE]

III Simplification of Summations

The remaining task is to simplify the sums appear in $I_{A}$ and $I_{B}$ to polygamma functions. This is a straightforward but tedious task, for which we need several finite sum identities as listed in the Appendix. Some remarks on these identities are also provided in the Appendix. Though $I_{A}$ in (49) and $I_{B}$ in (55) are valid for any positive integers $m$ and $n$ with $m\leq n$ , as will be seen it is convenient to assume $n>m\geq 3$ in the following simplification. For this reason, we will first simplify $I_{A}$ and $I_{B}$ in the case $n>m\geq 3$ . The remaining special cases will be considered at the end of this section.

For ease of presentation, we cite the identities used in each step on top of the equality symbol. The argument of each of the resulting polygamma functions is shifted to one of the following $n-m+2$ , $m$ , $n$ , $1$ , with the help of (27). In addition, simplification by combining like terms is also performed in each step without being explicitly mentioned. We start with $I_{A}$ in (49), where by using partial fraction decomposition the first sum is simplified as

[TABLE]

Similarly, the second sum in (49) is simplified as

[TABLE]

Inserting (56) and (57) into (49), $I_{A}$ is simplified to

[TABLE]

We now simplify $I_{B}$ in (55), where the first two sums are

[TABLE]

The remaining double sums in $I_{B}$ needs some preprocessing before the sum of the types in the appendix appear. Specifically, by shifting the inner sum $k\to k-j$ , changing the summation order, and using partial fraction decomposition, we have

[TABLE]

where $\mathcal{I}_{1}$ and $\mathcal{I}_{2}$ collect terms involving $1/j$ and $1/j^{2}$ , respectively, as (the terms involving $1/j^{0}$ cancel)

[TABLE]

The sums in $\mathcal{I}_{1}$ are further simplified as

[TABLE]

The sums in $\mathcal{I}_{2}$ are further simplified as

[TABLE]

where we also changed the summation order between $j$ and $k$ to arrive at the last equality, and $b_{1}$ , $b_{2}$ , $b_{3}$ are

[TABLE]

With $\mathcal{I}_{1}$ and $\mathcal{I}_{2}$ being simplified as in (66) and (67), respectively, we now insert (62) and (63) into (55) to obtain

[TABLE]

We observe that $I_{A}$ in (58) and $I_{B}$ in (71) share many common terms, where by inserting (58) and (71) into (30) the remaining terms of the induced variance $\mathbb{E}_{g}\!\left[T^{2}\right]$ are

[TABLE]

where we have used the results

[TABLE]

obtained by comparing (59)–(61) to (72)–(74). This completes the proof of the induced conjecture (26) in the case $n>m\geq 3$ and hence the VPO’s conjecture (7) for the same case.

Since $m\leq n$ , the remaining cases to be shown are $m=1$ , $m=2$ , and $m=n$ , where $I_{A}$ in (49) and $I_{B}$ in (55) can be directly computed. We list the simplified expressions for $I_{A}$ , $I_{B}$ , and the induced variance $\mathbb{E}_{g}\!\left[T^{2}\right]$ in Table 1 as shown on top of the next page. Each of the special cases is proven by comparing the expression of $\mathbb{E}_{g}\!\left[T^{2}\right]$ in Table 1 with that of the corresponding induced conjecture (26). We complete the proof of the VPO’s conjecture (7).

Acknowledgements.

The author wishes to thank Michael Milgram, Gregory Schehr, and Yu Xiang for the inspiring discussion.

Appendix A Finite Sum Identities Useful in Section III

[TABLE]

Some Remarks on the Identities in the Appendix

The formulas of finite sums of polygamma functions of the types (75)–(83) are straightforward to show. The proofs essentially involve changing the order of the sums and making use of the lower order sums already obtained in a recursive manner. In particular, the formulas (75)–(78) are available in (Brychkov, 2008, ch. 5.1). The formulas (79)–(83) can be read off from the expressions in (Spieß, 1990, p. 861) by keeping in mind the difference between polygamma functions (6), (8) and harmonic numbers.

The last three formulas (84)–(86) play a crucial role in the simplification in Sec. III as they connect some of the sums in (49) and (55) to polygamma functions. The first of them (84) is known as Chu-Vandermonde identity (Luke, 1969, p. 99). The next formula (85) can be established as follows. First, the identity (27a) implies that

[TABLE]

By using the definition of digamma function (6), changing the order of sums, and evoking Chu-Vandermonde identity (84), the first term in (87) is represented as

[TABLE]

Similarly, we have

[TABLE]

Inserting (A) and (A) into (87), we obtain a recurrence relation of the sum (85) as

[TABLE]

where we denote

[TABLE]

Finally, by iterating $m-1$ times the relation (90), we arrive at

[TABLE]

where we have used the fact that $s(0,n-m)=0$ . Note that the formula (85) can be also obtained via its connection to a hypergeometric function of unit argument as (Luke, 1969, p. 111)

[TABLE]

To prove the last formula (86), we first observe from (27b) that

[TABLE]

Following the same idea that has led to (90), we also obtain a recurrence relation in this case as

[TABLE]

where we denote

[TABLE]

Iterating $m-1$ times the relation (94), we arrive at

[TABLE]

where by using the identity (Milgram, 2004, eq. (23))

[TABLE]

we obtain the claimed formula (86). Though the expression (86) still contains a sum of digamma functions that may not be further simplified, it is sufficient for the simplification purpose. As shown in Sec. III, the terms involving this remaining sum cancel each other. Finally, we note that as a result of the relation to the hypergeometric function

[TABLE]

the formula (86) implies a byproduct that generalizes a result of Luke (Luke, 1969, p. 111) as

[TABLE]

which may be of independent interest.

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Page (1993) D. N. Page, Phys. Rev. Lett. 71 , 1291 (1993).
2(2) S. N. Majumdar, in The Oxford Handbook of Random Matrix Theory , edited by G. Akemann, J. Baik, and P. Di Francesco, Chap. 37.
3Luke (1969) Y. L. Luke, The Special Functions and Their Approximations , Vol. 1 (Academic Press, New York, 1969).
4Foong and Kanno (1994) S. K. Foong and S. Kanno, Phys. Rev. Lett. 72 , 1148 (1994).
5Sánchez-Ruiz (1995) J. Sánchez-Ruiz, Phys. Rev. E 52 , 5653 (1995).
6Sen (1996) S. Sen, Phys. Rev. Lett. 77 , 1 (1996).
7Adachi et al. (2009) S. Adachi, M. Toda, and H. Kubotani, Ann. Phys. 324 , 2278 (2009).
8Kumar and Pandey (2011) S. Kumar and A. Pandey, J. Phys. A: Math. Theor. 44 , 445301 (2011).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A Proof of Vivo-Pato-Oshanin’s Conjecture

Abstract

I Background and the Conjecture

II Variance of an Induced Entropy in Laguerre Ensemble

II.1 Variance Relation

II.2 Calculations of the Induced Variance

II.2.1 Calculating IAI_{A}IA​

II.2.2 Calculating IBI_{B}IB​

III Simplification of Summations

Acknowledgements.

Appendix A Finite Sum Identities Useful in Section III

Some Remarks on the Identities in the Appendix

II.2.1 Calculating $I_{A}$

II.2.2 Calculating $I_{B}$