Theoretical study on thermalization in isolated quantum systems

Ryusuke Hamazaki

arXiv:1901.01481·cond-mat.stat-mech·January 8, 2019

Theoretical study on thermalization in isolated quantum systems

Ryusuke Hamazaki

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of thermalization mechanisms in isolated quantum systems, focusing on the eigenstate thermalization hypothesis and the emergence of generalized Gibbs ensembles in nonintegrable models.

Contribution

It offers a theoretical investigation into how ETH and GGE describe thermalization, including predictions from random matrix theory and the role of local symmetries.

Findings

01

ETH predictions align with RMT in finite-size systems

02

GGE emerges in nonintegrable systems with local symmetries

03

Finite-size corrections to ETH are characterized

Abstract

Understanding how isolated quantum systems thermalize has recently gathered renewed interest almost 100 years after the first work by von Neumann, thanks to the experimental realizations of such systems. Experimental and numerical pieces of evidence imply that nonintegrability of the system plays an important role in thermalization. Nonintegrable systems that conserve energy alone are expected to be effectively described by the (micro)canonical ensemble due to the so-called eigenstate thermalization hypothesis (ETH) in the thermodynamic limit. In contrast, it is expected that stationary states in integrable systems are described not by the canonical ensemble but by the generalized Gibbs ensemble (GGE) due to the existence of many nontrivial conserved quantities. In this thesis, we study thermalization and its mechanism in nonintegrable systems from two perspectives. We first study how…

Equations706

Γ = {(q, p) \in Γ : E - Δ E < H (q, p) \leq E + Δ E},

Γ = {(q, p) \in Γ : E - Δ E < H (q, p) \leq E + Δ E},

\overset{ρ}{^}_{mic} (E) := \frac{1}{dim [ H _{E, Δ E} ]} \hat{P}_{E, Δ E},

\overset{ρ}{^}_{mic} (E) := \frac{1}{dim [ H _{E, Δ E} ]} \hat{P}_{E, Δ E},

{∣ E_{α} ⟩ : \hat{H} ∣ E_{α} ⟩ = E_{α} ∣ E_{α} ⟩, E - Δ E < E_{α} \leq E + Δ E}

{∣ E_{α} ⟩ : \hat{H} ∣ E_{α} ⟩ = E_{α} ∣ E_{α} ⟩, E - Δ E < E_{α} \leq E + Δ E}

\hat{P}_{E, Δ E} := ∣ E_{α} ⟩ \in H_{E, Δ E} \sum ∣ E_{α} ⟩ ⟨ E_{α} ∣

\hat{P}_{E, Δ E} := ∣ E_{α} ⟩ \in H_{E, Δ E} \sum ∣ E_{α} ⟩ ⟨ E_{α} ∣

\overset{ρ}{^} (t) := e^{- i \hat{H} t} \overset{ρ}{^}_{0} e^{i \hat{H} t} = α β \sum e^{i (E_{α} - E_{β}) t} ⟨ E_{β} ⟩ \overset{ρ}{^}_{0} E_{α} ∣ E_{β} ⟩ ⟨ E_{α} ∣

\overset{ρ}{^} (t) := e^{- i \hat{H} t} \overset{ρ}{^}_{0} e^{i \hat{H} t} = α β \sum e^{i (E_{α} - E_{β}) t} ⟨ E_{β} ⟩ \overset{ρ}{^}_{0} E_{α} ∣ E_{β} ⟩ ⟨ E_{α} ∣

\hat{H} = i = 1 \sum N h_{i} \overset{σ}{^}_{i}^{z} + i = 1 \sum N - 1 J \hat{σ}_{i} \cdot \hat{σ}_{i + 1},

\hat{H} = i = 1 \sum N h_{i} \overset{σ}{^}_{i}^{z} + i = 1 \sum N - 1 J \hat{σ}_{i} \cdot \hat{σ}_{i + 1},

\hat{H}_{eff} = E_{0} + i \sum h_{i}^{'} \overset{τ}{^}_{i}^{z} + ij \sum J_{ij}^{'} \overset{τ}{^}_{i}^{z} \overset{τ}{^}_{j}^{z} + n = 3 \sum i_{1} \dots i_{n} \sum K_{i_{1} \dots i_{n}}^{(n)} \overset{τ}{^}_{i_{1}}^{z} \dots \overset{τ}{^}_{i_{n}}^{z},

\hat{H}_{eff} = E_{0} + i \sum h_{i}^{'} \overset{τ}{^}_{i}^{z} + ij \sum J_{ij}^{'} \overset{τ}{^}_{i}^{z} \overset{τ}{^}_{j}^{z} + n = 3 \sum i_{1} \dots i_{n} \sum K_{i_{1} \dots i_{n}}^{(n)} \overset{τ}{^}_{i_{1}}^{z} \dots \overset{τ}{^}_{i_{n}}^{z},

Tr [\overset{ρ}{^} \hat{P}_{f}] ≃ Tr [\overset{ρ}{^}_{mic} (E) \hat{P}_{f}]

Tr [\overset{ρ}{^} \hat{P}_{f}] ≃ Tr [\overset{ρ}{^}_{mic} (E) \hat{P}_{f}]

H = H_{S} \otimes H_{S^{c}},

H = H_{S} \otimes H_{S^{c}},

A_{MITE} = \cup_{S} A_{S},

A_{MITE} = \cup_{S} A_{S},

A_{S} := {\hat{O}_{S} \otimes \hat{I}_{S^{c}} : \hat{O}_{S} is a Hermitian operator acting on H_{S}} .

A_{S} := {\hat{O}_{S} \otimes \hat{I}_{S^{c}} : \hat{O}_{S} is a Hermitian operator acting on H_{S}} .

\overset{ρ}{^}_{S} = \overset{ρ}{^}_{mic, S}

\overset{ρ}{^}_{S} = \overset{ρ}{^}_{mic, S}

H = i = 1 ⨂ N H_{i},

H = i = 1 ⨂ N H_{i},

H_{S_{l_{0}}} = i = i_{0} ⨂ i_{0} + l_{0} - 1 H_{i}

H_{S_{l_{0}}} = i = i_{0} ⨂ i_{0} + l_{0} - 1 H_{i}

i = i_{0} \prod i_{0} + l_{0} - 1 \overset{σ}{^}_{i}^{μ_{i}} (μ_{i} = 0, x, y, z)

i = i_{0} \prod i_{0} + l_{0} - 1 \overset{σ}{^}_{i}^{μ_{i}} (μ_{i} = 0, x, y, z)

\overset{σ}{^}_{i_{1}}^{α_{i_{1}}} \overset{σ}{^}_{i_{2}}^{α_{i_{2}}} \dots \overset{σ}{^}_{i_{k}}^{α_{i_{k}}} (α_{i} = x, y, z)

\overset{σ}{^}_{i_{1}}^{α_{i_{1}}} \overset{σ}{^}_{i_{2}}^{α_{i_{2}}} \dots \overset{σ}{^}_{i_{k}}^{α_{i_{k}}} (α_{i} = x, y, z)

H_{S_{M}} = H_{i_{1}} \otimes H_{i_{2}} \otimes \dots \otimes H_{i_{M}} (1 \leq i_{1} < i_{2} < \dots < i_{M} \leq N) .

H_{S_{M}} = H_{i_{1}} \otimes H_{i_{2}} \otimes \dots \otimes H_{i_{M}} (1 \leq i_{1} < i_{2} < \dots < i_{M} \leq N) .

H_{mic} = {∣ ψ ⟩ = α \in S \sum z_{α} ∣ E_{α} ⟩ : z_{α} \in C, α \in S \sum ∣ z_{α} ∣^{2} = 1},

H_{mic} = {∣ ψ ⟩ = α \in S \sum z_{α} ∣ E_{α} ⟩ : z_{α} \in C, α \in S \sum ∣ z_{α} ∣^{2} = 1},

S = {α : ∣ E - E_{α} ∣ < Δ E} .

S = {α : ∣ E - E_{α} ∣ < Δ E} .

P ({z_{α}}) α \in S \prod d Re z_{α} d Im z_{α} = c \times δ (α \in S \sum ∣ z_{α} ∣^{2} - 1) α \in S \prod d Re z_{α} d Im z_{α},

P ({z_{α}}) α \in S \prod d Re z_{α} d Im z_{α} = c \times δ (α \in S \sum ∣ z_{α} ∣^{2} - 1) α \in S \prod d Re z_{α} d Im z_{α},

E [⟨ ψ ⟩ \hat{O} ψ]

E [⟨ ψ ⟩ \hat{O} ψ]

V [⟨ ψ ⟩ \hat{O} ψ]

V_{mic} (\hat{O})

V_{mic} (\hat{O})

= ⟨ \hat{O} \hat{P}_{E, Δ E} \hat{O} ⟩_{mic} - ⟨ \hat{O} ⟩_{mic}^{2},

E [z_{α}^{*} z_{β}] = \frac{δ _{α β}}{d}

E [z_{α}^{*} z_{β}] = \frac{δ _{α β}}{d}

E [∣ z_{α} ∣^{2} ∣ z_{β} ∣^{2}] = \frac{1 + δ _{α β}}{d ( d + 1 )} .

E [∣ z_{α} ∣^{2} ∣ z_{β} ∣^{2}] = \frac{1 + δ _{α β}}{d ( d + 1 )} .

E [⟨ ψ ⟩ \hat{O} ψ]

E [⟨ ψ ⟩ \hat{O} ψ]

= \frac{1}{d} α \in S \sum O_{α α}

= ⟨ \hat{O} ⟩_{mic}

V [⟨ ψ ⟩ \hat{O} ψ]

V [⟨ ψ ⟩ \hat{O} ψ]

= α, β, γ, δ \in S \sum E [z_{α}^{*} z_{β} z_{γ}^{*} z_{δ}] O_{α β} O_{γ δ} - ⟨ \hat{O} ⟩_{mic}^{2}

= α \in S \sum \frac{2 O _{α α}^{2}}{d ( d + 1 )} + α, γ \in S, α \neq = γ \sum \frac{O _{α α} O _{γ γ}}{d ( d + 1 )} + α, β \in S, α \neq = β \sum \frac{∣ O _{α β} ∣ ^{2}}{d ( d + 1 )} - ⟨ \hat{O} ⟩_{mic}^{2}

= α, β \in S \sum \frac{∣ O _{α β} ∣ ^{2}}{d ( d + 1 )} - \frac{1}{d ^{2} ( d + 1 )} [α \in S \sum O_{α α}]^{2}

= \frac{V _{mic} ( O ^ )}{d + 1} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum many-body systems · Opinion Dynamics and Social Influence · Quantum, superfluid, helium dynamics

Full text

Abstract

Understanding how isolated quantum systems thermalize has recently gathered renewed interest almost 100 years after the first work by von Neumann, thanks to the experimental realizations of such systems. Experimental and numerical pieces of evidence imply that nonintegrability of the system plays an important role in thermalization. Nonintegrable systems that conserve energy alone are expected to be effectively described by the (micro)canonical ensemble due to the so-called eigenstate thermalization hypothesis (ETH) in the thermodynamic limit. In contrast, it is expected that stationary states in integrable systems are described not by the canonical ensemble but by the generalized Gibbs ensemble (GGE) due to the existence of many nontrivial conserved quantities. In this thesis, we study thermalization and its mechanism in nonintegrable systems from two perspectives.

First, we study how well the ETH and its finite-size corrections can be predicted by random matrix theory (RMT). We first analytically calculate the finite-size corrections of the ETH using the RMT model and show that their statistics is universal, depending only on the symmetry (and what is called singularity) of the observables as well as the symmetry of the Hamiltonians. Then, we numerically show that, in nonintegrable systems (that conserve energy alone) and for a wide class of observables, the matrix elements are in good agreement with the prediction of RMT. We also remark, however, that counterexamples of the RMT prediction always exist even among simple observables.

Next, we present our study on the emergence of the GGE in a nonintegrable system with an extensive number of local symmetries. We have numerically investigated a nonintegrable model of hard-core bosons with an extensive number of local $\mathbb{Z}_{2}$ symmetries. We find that the expectation values of macroscopic observables in the stationary state are described by the GGE and not by the canonical ensemble. We also show that, if the model has a less than extensive number of local symmetries, the stationary state is described by the canonical ensemble.

1 Statistical physics of isolated quantum systems: an overview
1.1 Foundations of quantum statistical mechanics and the notion of typicality
1.2 Approach to thermal equilibrium
1.2.1 Equilibration and thermalization
1.2.2 Systems that approach non-thermal stationary states
1.3 Experiments of isolated quantum systems
1.3.1 Ultracold atoms
1.3.2 Trapped ions and other systems
1.4 Organization of this thesis
2 Equilibration and thermalization by unitary time evolutions
2.1 Definitions of thermal equilibrium and typicality
2.1.1 General framework
2.1.2 Microscopic thermal equilibrium (MITE)
2.1.3 Macroscopic thermal equilibrium (MATE)
2.1.4 A looser way to consider thermal equilibrium
2.2 Conditions for equilibration and thermalization
2.2.1 Equilibration
2.2.2 Thermalization
2.3 Approach to thermal equilibrium from any initial state: the eigenstate thermalization hypothesis
2.3.1 Off-diagonal matrix elements and equilibration
2.3.2 Diagonal matrix elements and thermalization
2.4 Roles of initial states for equilibration and thermalization
3 Review of the eigenstate thermalization hypothesis (ETH)
3.1 Histories
3.2 Possible explanations of the ETH
3.2.1 Arguments by von Neumann and Reimann
3.2.2 Some predictions from random matrix theory in nonintegrable systems
3.2.3 Argument by Deutsch
3.3 Numerical simulations of the ETH
3.3.1 Level-spacing statistics of hardcore-particle systems
3.3.2 Diagonal matrix elements
3.3.3 Off-diagonal matrix elements
3.4 Weak ETH
3.5 Summary and remarks
4 Observable-dependence of how random matrix theory can predict deviations from the ETH
4.1 Motivations
4.2 Statistics of the finite-size corrections of the ETH for the random matrix model
4.2.1 Universal ratios between diagonal and off-diagonal matrix elements
4.2.2 Observable-dependent probability densities of the off-diagonal matrix elements
4.2.3 Conjectures from the random matrix model
4.3 Numerical verifications of the random matrix predictions
4.3.1 Models
4.3.2 Few-body observables
4.3.3 Many-body correlations
4.3.4 Density matrices corresponding to pure states
4.3.5 A simple counterexample
4.4 Conclusions and Discussions
5 Generalized Gibbs ensemble (GGE) in integrable systems
5.1 Non-thermal stationary states due to conserved quantities
5.2 The GGE in essentially free systems
5.3 Importance of the locality of conserved quantities and the truncated GGE
5.4 The GGE in interacting systems solved by the Bethe ansatz
5.5 Conclustion
6 Generalized Gibbs ensemble in a nonintegrable system with an extensive number of local symmetries
6.1 Motivation
6.2 A model with an extensive number of local symmetries
6.3 Time evolutions from two initial states
6.4 Verification of the GGE by the finite-size scaling analysis
6.5 Verification of the ETH for each symmetry sector
6.6 Models with fewer local symmetries
6.7 Conclusions and discussions
7 Conclusions and Future prospects
7.1 Conclusions
7.2 Future prospects
A Review of random matrix theory (RMT)
A.1 History
A.2 Definitions and classifications
A.2.1 Gaussian ensembles
A.3 Statistics in Gaussian random matrices
A.3.1 Level-spacing statistics
A.3.2 Distributions of eigenstates
A.4 Ergodicity of Gaussian random matrices
B Detailed derivations in the main text
B.1 Derivation of Eq. (2.26)
B.2 Derivation of Eq. ()
B.3 Derivation of Eqs. (4.18-4.29)
B.4 Justification of $\sigma^{2}\simeq\mathcal{V}/d$ in the RMT model (Subsection 4.2.2)
B.5 Occupation ratios of each symmetry sector in Sec. 6.3
C Miscellaneous topics
C.1 Tasaki’s MATE
C.2 The numerical verification of the ETH for many-body correlations in Chapter 4
C.3 The ETH for the models (b) and (c) in Sec. 6.6

Chapter 1 Statistical physics of isolated quantum systems: an overview

1.1 Foundations of quantum statistical mechanics and the notion of typicality

Statistical mechanics, originally developed by Boltzmann, Gibbs and Einstein, has long been an indispensable tool for diverse areas of physics, ranging from cosmology to biology. It tells us how to compute macroscopic variables and their fluctuations in thermodynamics from the knowledge of microscopic theories [1, 2]. Without explicit knowledge about the complex dynamics, we can calculate pressures or entropies of gases, once microscopic Hamiltonians of the atoms or molecules are given.

One of the most important assumptions in statistical mechanics is that macroscopic observables at thermal equilibrium can be computed using the microcanonical ensemble. The microcanonical ensemble is a probabilistic model where each microstate is taken from a certain energy shell with a uniform probability distribution. For a classical system, the microcanonical distribution function features a uniform density $\rho_{\mathrm{mic}}(\mathbf{q},\mathbf{p})$ over an energy shell

[TABLE]

where $\Delta E$ is some small energy width. For a quantum system, the microcanonical ensemble can be represented in the form of the following density matrix:

[TABLE]

where $\mathcal{H}_{E,\Delta E}$ is a Hilbert space that is spanned by the set of eigenstates in an energy shell

[TABLE]

and

[TABLE]

is a projection operator onto $\mathcal{H}_{E,\Delta E}$ . In this case, statistical mechanics tells us to calculate the expectation value of an observable $\hat{\mathcal{O}}$ at thermal equilibrium as $\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}=\mathrm{Tr}[\hat{\rho}_{\mathrm{mic}}\hat{\mathcal{O}}]$ . This equal a priori probability postulate and the renowned Boltzmann’s entropy formula, $S=k_{B}\log W$ (where $W=\dim[\mathcal{H}_{E,\Delta E}]$ for a quantum system), can be regarded as two fundamental assumptions of statistical mechanics [3].

Although enormous previous studies have confirmed that statistical mechanics with the equal a priori probability postulate successfully predicts many thermal equilibrium physical phenomena quite accurately, the complete justification of this principle has not yet been made. The justification of the equal a priori probability postulate boils down to the following two questions:

What is the meaning of thermal equilibrium from a microscopic viewpoint? How is the microcanonical ensemble related to an actual microstate? 2. 2.

Why does thermal equilibrium emerge as a macroscopically stationary state? Can we prove it only by assuming microscopic kinetics, even without considering any thermal bath?

In fact, these questions were already actively investigated in the first half of the 20th century [4] (note that Boltzmann himself proposed the H theorem in an attempt to solve the second question). For isolated classical systems, the second question had been mainly studied in light of the ergodic theorem, which states that the long-time average of a physical quantity of a time-evolving microstate is equal to its phase-space average. We note that this theorem itself is now considered as being irrelevant to the foundation of statistical mechanics for several reasons [3, 5]. For isolated quantum systems, von Neumann tried to solve both of these questions in 1929 [6], which was just three years after the discovery of the Schrödinger equation. Although von Neumann’s discussion is worth notice even from the modern perspective, it had been forgotten until 2010 (see Chapter 2).

As an answer to the first question, the notion of “typicality” of thermal equilibrium has recently become popular [7, 8, 9, 10, 11, 12, 13, 14, 15]. The main idea of the typicality argument is that, if we can measure only a proper set $\mathcal{A}$ of restricted observables, almost all microstates in the energy shell are indistinguishable from the microcanonical ensemble. In other words, under the assumption of the typicality, most of the pure states can describe thermal equilibrium if we are interested in only observables in $\mathcal{A}$ . As an example, consider a box containing $N\gg 1$ classical particles. If we measure the number of particles in one half of the box, it is almost $N/2$ for most of the microstates, which is consistent with the prediction of the microcanonical distribution.

While in classical systems we may have to take a set of macroscopic observables for $\mathcal{A}$ in order to justify typicality, in quantum systems we are allowed to take a larger set of observables [13, 15]. In fact, we can rigorously show that most of the quantum pure states give the same expectation value of a general few-body operator as the microcanonical ensemble (see Chapter 2 for details) [7]. This difference implies that the applicability of quantum statistical mechanics is far wider than that of classical statistics mechanics.***Let us illustrate this with a simple example of $N$ (distinguishable) particles. For most of the microstates in the energy shell, the single-particle distribution of velocity is expected to obey the Maxwell-Boltzmann distribution if we make a histogram using $N$ particles. This is true both for classical and quantum cases, since we can write the single-particle distribution obtained from $N$ particles as macroscopic observables [3]. On the other hand, if we measure the velocity of a single particle (which is not a macroscopic observable), each microstate gives a definite value in the classical case and the Maxwell-Boltzmann distribution cannot be obtained from a single microstate. However, in the quantum case, the measurement results change because of quantum fluctuations, which allow us to obtain the Maxwell-Boltzmann distribution even from a single microstate [16]. Due to this fact, we mainly focus on quantum systems throughout this thesis.

Now, let us consider how the typicality is related to the second question, namely the approach to thermal equilibrium [14]. If the typicality holds true, most of the microstates in the Hilbert space are in thermal equilibrium, and non-equilibrium states are rare (see Fig. 1.1). From the figure, we expect that even if we prepare a non-equilibrium state as an initial state, it may rapidly develop into a thermal equilibrium by a unitary time evolution.

Though this argument of typicality seems natural, it is not quantitative enough. Actually, as we will see later, it is known both theoretically and experimentally that some systems cannot reach thermal equilibrium by a unitary time evolution even after an infinite time. Note that typicality of thermal equilibrium holds true in such systems, too. In order to know whether some initial states approach thermal equilibrium, we should straightforwardly begin with unitary time evolutions.

1.2 Approach to thermal equilibrium

1.2.1 Equilibration and thermalization

In this section we briefly explain how to formulate the approach to thermal equilibrium starting from unitary time evolutions (the details are discussed in Chapter 2). Let us consider an initial state $\hat{\rho}_{0}$ . Under a unitary time evolution, the state develops into

[TABLE]

at time $t$ (note that we set $\hbar=1$ throughout this thesis). We can immediately see that, as a microstate itself, $\hat{\rho}(t)$ will not be equivalent to a stationary (time-independent) microstate, much less a microcanonical ensemble $\hat{\rho}_{\mathrm{mic}}$ . In order to discuss if the system reaches thermal equilibrium, we have to restrict the set of observables $\mathcal{A}$ , as discussed in the previous section.

Here, we consider the approach to thermal equilibrium by dividing the problem into the following two steps:

Why does the state look stationary after a certain time? In other words, we want to know if $\braket{\hat{\mathcal{O}}}(t):=\mathrm{Tr}[\hat{\rho}(t)\hat{\mathcal{O}}]$ is almost equal to $\braket{\hat{\mathcal{O}}}_{\mathrm{stat}}:=\mathrm{Tr}[\hat{\rho}_{\mathrm{stat}}\hat{\mathcal{O}}]$ for some stationary state $\hat{\rho}_{\mathrm{stat}}$ and $\hat{\mathcal{O}}\in\mathcal{A}$ . We will call this the problem of equilibration. 2. 2.

Under the assumption of equilibration, can we justify the use of the microcanonical ensemble? In other words, we want to know if $\braket{\hat{\mathcal{O}}}_{\mathrm{stat}}:=\mathrm{Tr}[\hat{\rho}_{\mathrm{stat}}\hat{\mathcal{O}}]$ is nearly equal to $\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}:=\mathrm{Tr}[\hat{\rho}_{\mathrm{mic}}\hat{\mathcal{O}}]$ .

If we can show both of them, we will say that the system approaches thermal equilibrium. We will call this the problem of thermalization.

At first sight, it seems that the information about the initial state $\hat{\rho}_{0}$ is important to answer these questions. In fact, by imposing certain conditions on $\hat{\rho}_{0}$ , we can show that the system equilibrates [17, 18, 19, 20, 21] or thermalizes [22, 14, 23]. We will briefly discuss the recent development about these conditions in Section 2.4.

However, for a sufficiently large quantum system, it is known that the approach to thermal equilibrium occurs for any initial states, if the so-called eigenstate thermalization hypothesis (ETH) is satisfied [6, 24, 25, 16, 17, 26]. The ETH focuses on matrix elements of an observable $\hat{\mathcal{O}}$ with respect to the energy eigenstates $\ket{E_{\alpha}}$ in an energy shell $\mathcal{H}_{E,\Delta E}$ . Roughly speaking, the ETH states that, for any $\ket{E_{\alpha}},\ket{E_{\beta}}\in\mathcal{H}_{E,\Delta E}$ , and in the thermodynamic limit, (a) off-diagonal matrix elements satisfy $\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\beta}}\simeq 0\>(\alpha\neq\beta)$ , and (b) diagonal matrix elements satisfy $\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\alpha}}\simeq\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}(E_{\alpha})$ . The first condition is related to the problem on the equilibration, and the second condition corresponds to the other problem.

The ETH is actively investigated recently, based mainly on numerics, because rigorous proofs are highly nontrivial for general systems and observables. A number of numerical studies suggest that the ETH holds true for few-body observables in nonintegrable systems that conserve only energy [26, 27, 28, 29, 30, 31, 32, 33, 34], and that the ETH breaks down in integrable systems [35, 36] or systems in many-body localized phases [37, 38, 39] (see the next subsection). Analytically, it is shown that the ETH holds true for most of the observables [6, 40] (which may not be relevant to physical observables). In addition, relations to random matrix theory (RMT) are also proposed for nonintegrable systems [41]. Details of the ETH and how it is related to thermalization are explained in Chapter 2 and Chapter 3.

1.2.2 Systems that approach non-thermal stationary states

Although nonintegrable systems that conserve only energy are expected to approach thermal equilibrium, some systems are known to approach non-thermal stationary states. Integrable systems and systems in many-body localized (MBL) phases are two such famous examples, as confirmed both theoretically [42, 43, 44] and experimentally [45, 46]. In those systems, the usual ETH does not hold true in general, unlike in ordinary nonintegrable systems.

For nonequilibrium dynamics of integrable systems, models that are mappable to free systems [42, 47, 43, 48, 49, 50, 51, 52] or solvable by the Bethe ansatz [53, 54, 55, 56, 57, 58, 59, 60, 61] have intensively been investigated. Every energy eigenstate of these systems can be determined by the set of $N$ quantum numbers, such as quasi-momentum occupation numbers or rapidities ( $N$ is the size of the system). Such peculiarity of eigenstates is one of the properties of integrable systems, though the notion of quantum integrability is rather ambiguous [62]. In such integrable systems, there exist an extensive number of local conserved quantities, which prevent the system from approaching thermal equilibrium. Instead, stationary states are expected to be well described by the so-called generalized Gibbs ensemble (GGE) [63, 64, 42, 43], which takes initial values of the conserved quantities into account. We will review the integrable systems and the GGE in detail in Chapter 5.

Many-body localized (MBL) systems are quantum interacting systems where energy eigenstates are localized in space, triggered by (effective) disorder [65, 66, 67, 44, 68, 69, 70, 71, 72, 73, 74, 75]. Using perturbation theory, it was studied by Basko, Altschler, and Aleiner [65], who argued that the Anderson localization, which occurs in free systems with disordered potentials, survives even if electrons have sufficiently weak interactions. A few years later, Pal and Huse [44] numerically showed that the MBL occurs in an interacting Heisenberg chain with strong disordered transverse fields:

[TABLE]

where $h_{i}$ ’s are random in $i$ and $\hat{\sigma}$ is a Pauli operator. They also argued by the level-statistics analysis that delocalization-localization quantum phase transitions occur by changing the disorder strength, even at finite temperature. Such a property of phase transitions, or its critical properties, are still one of the open questions concerning the MBL [76, 77, 78].

In order to understand the property of the “fully” many-body localized systems (i.e. we assume all of the eigenstates are localized due to strong disorder), some phenomenology is proposed [70, 71, 72]. In this phenomenological argument, we note that a set of conserved quantities is almost localized in space, since there is no transport in localized systems. For example, for a Heisenberg chain in Eq. (1.6) with a large disorder, we expect that the effective Hamiltonian can be written as

[TABLE]

where $E_{0},h^{\prime}_{i},J^{\prime}_{ij},K_{i_{1}\cdots i_{n}}^{(n)}$ are constants, and $J^{\prime}_{ij}\>(K_{i_{1}\cdots i_{n}}^{(n)})$ ’s decay exponentially with $|i-j|\>\>(|i_{1}-i_{n}|)$ . The so-called localized bits (l-bits), $\hat{\vec{\tau}}_{i}$ , are quasi-localized conserved quantities, which have a large overlap with the operator $\hat{\vec{\sigma}}_{i}$ and have an exponentially small overlap with $\hat{\vec{\sigma}}_{j}\>(|i-j|\gg 1)$ . In other words, if the interaction is sufficiently weak, l-bits are expected to be constructed by dressing $\hat{\vec{\sigma}}_{i}$ perturbedly. This is in fact proven for some models [74, 75].

Since we have many (quasi-)local conserved quantities $\hat{\vec{\tau}}_{i}$ , it is expected that, the ETH and the approach to thermal equilibrium do not hold true in MBL systems. In fact, each energy eigenstate is determined by a set of $N$ conserved quantities, especially in fully many-body localized systems. This feature makes MBL systems akin to integrable systems [72]. However, unlike usual integrable systems, the MBL is robust against integrability-breaking perturbations as long as the disorder is sufficiently strong. For this reason, MBL systems are gathering attention as a useful phase that sustains the quantum order even at a finite temperature [79, 80, 81].

1.3 Experiments of isolated quantum systems

One of the reasons for the recent development of quantum isolated systems is the experimental realizations of such systems. While it is extremely difficult to simulate quantum many-body systems with a classical computer, we may realize them using complex quantum systems themselves, as Feynman pointed out [82]. In fact, current technologies allow us to control quantum models with various Hamiltonians using neutral atoms, ions, etc. [83, 84]. The approach to stationary states that are (not) thermal equilibrium is also observed by these (analogue) quantum simulators. In this section, we will briefly explain experiments that have addressed the issue of nonequilibrium dynamics in isolated systems.

1.3.1 Ultracold atoms

Ultracold atomic gases offer a suitable setting for analogue simulation of isolated quantum systems. By magnetic fields or optical dipole interactions, an atomic gas whose temperature is less than a microkelvin is trapped and isolated in a high vacuum chamber. The interactions between atoms can be tuned by a Feshbach resonance, which allows us to investigate novel phenomena of strongly correlated quantum matters [85, 86]. Moreover, by loading atoms onto optical lattices, various lattice models with controllable Hamiltonians are realized [87]. Controlling optical lattices, we can vary dimensionality of the models, or the shape of the lattices.

The approach to thermal equilibrium in an isolated nonintegrable system has been observed by Trotzky and coworkers [88] using 87Rb atoms in a one-dimensional optical lattice (see Fig. 1.2). By tuning a superlattice with bichromatic laser beams, they prepared a nonequilibrium initial state, where only one 87Rb atom resides at each “even” site (upper left figure in Fig. 1.2). Then, by quenching the height of the lattice potential, they let the system evolve according to the Bose-Hubbard Hamiltonian that is nonintegrable (upper middle figure). After a certain time, they make the optical lattice higher again to suppress further evolution, and readout the number density of “odd” sites (upper right figure). The quantum expectation value of the density $n_{\mathrm{odd}}(t)$ is shown as a function of time in the bottom figure of Fig. 1.2. We can see that $n_{\mathrm{odd}}(t)$ relaxes to $n_{\mathrm{odd}}(t)=0.5$ , which is consistent with the prediction at thermal equilibrium. They have also checked that the experimental results are in good agreement with the tDMRG numerical calculations, which confirms that the system actually undergoes a unitary time evolution.

Another notable example is the recent experiment that has demonstrated quantum thermalization in small systems [89]. In [89], Kaufman and coworkers have experimentally demonstrated that the approach to a thermal state takes place in a system with only six 87Rb atoms on six lattice sites. By controlling potentials of individual lattice sites with a digital micromirror device (DMD), they experimentally created a one-dimensional Bose-Hubbard model with six particles on six sites. They then succeeded in observing a unitary time evolution of the Renyi entanglement entropy or the local number density with the single-site microscopy. These quantities relax to some stationary values after a certain time. At that stage the entanglement entropy is nearly equal to thermal one, and local number densities coincide with the prediction at thermal equilibrium. They have further confirmed that the reduced density matrix restricted to local sites is nearly equal to the thermal ensemble, even though the entire system is small. These results are different from classical statistical mechanics that only considers macroscopic observables: they are expected to be genuine quantum thermalization that is related to the ETH.

Systems that do not approach thermal equilibrium are also realized. Kinoshita, Wenger and Weiss [45] conducted a pioneering experiment that demonstrates the absence of thermalization in a near-integrable system. They trapped 1D 87Rb gases in an anharmonic potential, and observed the time evolution of the momentum distribution of the gas. They found that the momentum distribution relaxes to some stationary value that cannot be described by the thermal ensemble. This result can be understood if we notice that the system is approximately described by the integrable Lieb-Liniger model. The group in Vienna published several papers on prethermalization of a 1D Lieb-Liniger gas [90, 91, 52]. After suddenly splitting the gas into two halves, they studied a time evolution of the correlation function of bosonic fields by interfering these two halves. In the experimentally observable timescale, the system seems to relax to a prethermalized state, which is a non-thermal quasi-stationary state emerging before complete thermalization [92, 93, 94]. They argued in the most recent paper [52] that the prethermalized state can be well fitted by the generalized Gibbs ensemble that considers occupation numbers of low-energy excited phonon modes.

Many-body localization has also been observed by Immanuel Bloch’s group [46, 96, 95, 97, 98]. While the first experiment [46] was done in a quasi-random optical lattice (i.e. the Aubry-André model), genuine random potentials in two-dimensional lattices have recently been realized, too [95]. In [95], the authors have initially prepared 87Rb atoms in the Mott insulator phase in a left half of a 2D optical lattice (see Fig. 1.3). Then they allow the system to evolve by the Bose-Hubbard Hamiltonian with disorder $\Delta$ , which can be generated by a DMD spatial light modulator. As shown in the figure, they have found that the atom population imbalance between left and right regions remains significant after a long time if the disorder is present. Using the population imbalance, they have observed the delocalization-localization phase transition as a function of the disorder strength. They argue that the critical disorder strength of the phase transition gets smaller when the interactions between atoms are weaker.

We note that ultracold atom experiments also allow us to pursue universal nonequilibrium phenomena before reaching stationary states. At a short timescale, light-cone-like spreading of two-point parity correlation functions ware observed [99], which indicates that the quasiparticle excitations propagate at a finite speed. This is consistent with the famous Lieb-Robinson bound [100], which imposes a bound on the speed of information spreading in systems with short-range interactions. Transport phenomena have also been observed. In Refs. [101, 102], the authors have studied expansion dynamics of fermionic [101] or bosonic [102] potassiums suddenly released from confining traps. They have succeeded in changing the interaction or the dimensionality of the systems, especially in Ref. [102]. Their main finding is that, while atoms spread ballistically in the integrable limit (i.e. the non-interacting limit in 1D and 2D, or the hard-core limit in 1D), a diffusive core appears on top of a ballistic background if we break integrability.

1.3.2 Trapped ions and other systems

Although current experiments of many-body quantum dynamics have been mainly done using cold atoms, other systems may also be useful. In fact, trapped ions offer a unique setting that is hard to obtain by ultracold atoms. Due to the balance between the Coulomb repulsion and the confinement by electromagnetic potentials, laser-cooled trapped ions have vibrational degrees of freedom, in addition to internal states (which we model with two-level pseudospins) [103]. Sideband transitions involving these internal and vibrational states lead to effective two-body interactions $J_{ij}^{\gamma}\>(\gamma=x,y,z)$ between distant spins at $i$ and $j$ , after eliminating the vibrational degrees of freedom [104]. Moreover, by detuning the laser frequency of the sideband transitions, the range $\alpha$ of the interaction $J_{ij}^{\gamma}\propto\frac{1}{|i-j|^{\alpha}}$ can be practically controlled like $0\lesssim\alpha\lesssim 3$ [105]. We can also create an effective transverse field, and, in fact, transverse-field Ising chains or XY chains with various interaction ranges have been realized [105]. By using such long-range spin models, the breakdown of the Lieb-Robinson bound were observed for the spreading of information, after a local quench with 40Ca+ by Blatt’s group [106] and a global quench with 171Yb+ by Monroe’s group [105]. Monroe’s group has also succeeded in observing the MBL using the Ising model with a disordered transverse field [107]. In Ref. [108], they also observed prethermalization in long-ranged Ising chain, where the quasi-stationary state cannot be described by the naive GGE. We note that thermalization of spins due to the coupling with the vibrational mode is observed by Clos and coworkers [109].

Up to now, compared to neutral atoms or ions, there are not so many experiments of thermalization in isolated systems that use other potential (analogue or digital) quantum simulators, including Rydberg atoms, polar molecules, superconducting qubits, photons, or NV centers in diamonds.†††We note that NV centers in diamonds have recently been used to demonstrate slow dynamics [110] and time-crystalline order [111] in disordered quantum many-body systems. One notable exception is the work by Neill and coworkers [112], where they have observed quantum thermalization of three superconducting qubits ( $S=\frac{1}{2}$ ) that are periodically driven by pulse sequences. Although the system does not conserve energy in this case, the dynamics can be written as unitary dynamics at each period of the cycles, if we neglect the decoherence from the environment. They observed the entanglement entropy between one qubit and the others after a long time. Then, what they found is follows: initial states that give thermal/low stationary entanglement entropies is related to initial states that go into chaotic/regular trajectories on a phase space of the corresponding classical dynamics ( $S\rightarrow\infty$ ). They also confirmed that the initial-state dependence of the stationary entanglement originates from the unitary dynamics of isolated quantum systems by checking that the decoherence due to the environment is independent of the initial states.

1.4 Organization of this thesis

In this thesis, we discuss the problem on thermalization by revisiting nonintegrable systems. We are especially motivated by the following two questions:

What is the underlying mechanism of the ETH, and that of the finite-size corrections from it in nonintegrable systems that conserve only energy? 2. 2.

Do nonintegrable systems relax to non-thermal stationary states (possibly described by the GGE) if they have additional conserved quantities due to symmetries?

With these motivations in mind, we organize our thesis as follows. In Chapter 2, we review the current understanding of equilibration and thermalization in isolated quantum systems. We especially explain how the ETH is relevant for equilibration and thermalization. In Chapter 3, we concentrate on the previous results that have addressed the first question raised above. We review some early-days explanations of the ETH from the viewpoint of random matrix theory (RMT). We also show some recent numerical simulations that have investigated the ETH and the finite-size corrections of it in quantum many-body systems. In Chapter 4, we show our first work related to the first question: are the ETH and its finite-size corrections predictable by the RMT in nonintegrable systems? We will first analytically calculate the finite-size corrections of the ETH using the RMT model and show that their statistics is universal which depends on the anti-unitary symmetries of the Hamiltonians and the observables.‡‡‡We will also see that the statistics will be further changed if the observable belongs to what we call the class of singular operators. Then, we will numerically show that, in nonintegrable systems and for a wide class of observables (including many-body operators), the matrix elements are in good agreement with the prediction of the RMT model. In Chapter 5, we review the previous works on the generalized Gibbs ensemble in integrable systems. We will stress the role of local conserved quantities. In Chapter 6, we show our second work (based on Ref. [113]) related to the second question: emergence of a non-thermal stationary state in a nonintegrable system with an extensive number of local symmetries. We have numerically investigated a nonintegrable model of hard-core bosons with an extensive number of local $\mathbb{Z}_{2}$ symmetries. We find that the expectation values of local observables in the stationary state are described by the GGE and not by the canonical ensemble. We also show that, if the model has less local symmetries, the stationary state is described by the canonical ensemble. In Chapter 7, we give the summary of the thesis with some remarks on the future prospect. The relations between the chapters are shown in Fig. 1.4.

Chapter 2 Equilibration and thermalization by unitary time evolutions

In this section, we review the current understanding of equilibration and thermalization in isolated quantum systems. Before considering the dynamics, we first explain some of the possible definitions of thermal equilibrium, following Refs. [13, 15]. Then, we consider how to formulate the approach to thermal equilibrium. We especially focus on the scenario of eigenstate thermalization hypothesis (ETH), which justifies thermalization for any initial state within the microcanonical energy shell. Finally, we explain the role of initial states on equilibration and thermalization when we do not assume the ETH.

Let us briefly summarize the history of quantum thermalization. Von Neumann tackled the problem of quantum thermalization as early as in 1929 [6]. He essentially showed that the ETH is a sufficient condition for the system to approach thermal equilibrium. He also proved that the ETH holds true for most of the decompositions of what he called macrospaces (see Subsection 2.1.3). Unfortunately, by 1950’s, his theorems were severely criticized as being meaningless by several researchers [114, 115, 116] because of the misunderstandings of the original statement. Due to these misunderstandings, von Neumann’s work had been forgotten for more than half a century until Goldstein and coworkers realized that it is of great value and published a commentary in 2010 [117]. Note, however, that important progresses were made both analytically [118, 25, 16, 17, 18, 19] and numerically [24, 26] even before the rediscovery of von Neumann’s work.

2.1 Definitions of thermal equilibrium and typicality

Here we review several definitions of thermal equilibrium, which slightly differ from one paper to another. Historically, von Neumann originally considered macroscopic observables and defined phase spaces using them, which he called macrospaces. Though he treated all macrospaces equally in order to discuss thermal equilibrium, Goldstein and coworkers realized that one special macrospace represents thermal equilibrium, and formulated macroscopic thermal equilibrium (MATE) [119, 117]. On the other hand, it turned out that the thermal ensemble can emerge by separating the entire system into a small subsystem and an environment through the quantum entanglement [9, 8]. These works lead to the notion of microscopic thermal equilibrium (MITE), which is a stronger condition than MATE. We note that these notions of thermal equilibrium are clearly reformulated only recently, with a more general framework [13, 15]. In this section, after discussing that general framework, we review MITE and MATE following Ref. [15], and then remark on a less strict view.***We note that some authors have proposed other formulations that are different from the formulation proposed in Refs. [13, 15]. For example, in Ref. [14], the author uses some tricks to treat local or few-body observables as macroscopic observables with the help of translational invariance or fictitious copies of the original system.

2.1.1 General framework

As we have mentioned in Chapter 1, we need to specify a set of observables, $\mathcal{A}$ , in order to discuss if a given (possibly pure) state $\hat{\rho}$ is close to thermal equilibrium. In Ref. [15], the authors have defined “thermal equilibrium relative to $\mathcal{A}$ ” essentially as follows: a state $\hat{\rho}$ is in thermal equilibrium relative to $\mathcal{A}$ if and only if for any $\hat{\mathcal{O}}\in\mathcal{A}$ , the probability distribution over the spectrum of $\hat{\mathcal{O}}$ with respect to $\hat{\rho}$ is approximately equal to that with respect to $\hat{\rho}_{\mathrm{mic}}(E)$ , where $E=\mathrm{Tr}[\hat{H}\hat{\rho}]$ . In other words, if $\hat{\mathcal{O}}=\sum_{f}f\hat{\mathcal{P}}_{f}$ , where $\hat{\mathcal{P}}_{f}$ is a projection operator with eigenvalue $f$ , then $\hat{\rho}$ satisfies

[TABLE]

for every $f$ . In the following, we see that we have to take operators on a small subsystem as $\mathcal{A}$ for MITE, and macroscopic observables as $\mathcal{A}$ for MATE.

2.1.2 Microscopic thermal equilibrium (MITE)

For MITE, we separate the entire system into a small subsystem and an environment, and take all observables that have supports on the subsystem. To see this, we first decompose the whole Hilbert space into a tensor product,

[TABLE]

where $S$ is a small subsystem and $S^{c}$ is an environment ( $c$ means the complement). We consider all possible subsystems $S$ that are small.†††For example, we can consider all spatially local regions $S$ that satisfy $\mathrm{diam}(S)\leq l_{0}\ll\mathrm{diam}(S\cup S^{c})$ for some $l_{0}$ , where $\mathrm{diam}(S)$ means the diameter of $S$ . MITE is thermal equilibrium relative to

[TABLE]

where

[TABLE]

MITE can also be simply written as follows: for all small $S$ ,

[TABLE]

is satisfied, where $\hat{\rho}_{S}:=\mathrm{Tr}_{S^{c}}[\hat{\rho}]$ .

While we usually consider spatially local subsystems for the choice of $S$ , we sometimes take a set of general few-body operators as $\mathcal{A}_{S}$ , too. As a simple example, let us consider a one-dimensional lattice quantum spin 1/2 chain of $N$ sites with a periodic boundary condition. The whole Hilbert space $\mathcal{H}$ is a direct product of the Hilbert space at each site $\mathcal{H}_{i}$ :

[TABLE]

where $\dim[\mathcal{H}_{i}]=2$ and $\dim[\mathcal{H}]=2^{N}$ . If we are interested in spatially local observables with length smaller than $l_{0}\ll N$ for MITE, then

[TABLE]

for some site $1\leq i_{0}\leq N$ . In this case,

[TABLE]

can be a basis set of the Hermitian operators $\hat{\mathcal{O}}_{S}$ acting on $\mathcal{H}_{S}$ , where $\hat{\sigma}_{i}^{0}:=\hat{\mathbb{I}}_{i}$ is the $2\times 2$ identity operator. On the other hand, we can also consider a set of general $k$ -body operators that satisfy $k\leq M$ for some $M$ . If $M\ll N$ , then they are called few-body operators. Here, we define $k$ -body operators as those whose basis set can be written as

[TABLE]

for $1\leq i_{1}<i_{2}<\cdots<i_{k}\leq N$ . In this case, the subsystem for MITE can be taken as

[TABLE]

In both cases of local and few-body operators, we usually assume that $\dim[\mathcal{H}_{S}]\ll\dim[\mathcal{H}]$ in considering MITE.‡‡‡We note, however, that the notion of MITE may be extended for subsystems that are as large as the half of the system size [120, 15, 121].

Proof of typicality for MITE

We can show the typicality of pure states in the microcanonical energy shell, $\mathcal{H}_{\mathrm{mic}}=\mathcal{H}_{E,\Delta E}$ for MITE that considers general few-body operators [9, 7, 8]. Note that $\mathcal{H}_{\mathrm{mic}}$ is explicitly written as

[TABLE]

where $\alpha$ labels energy eigenvalues that fall within the energy shell, and

[TABLE]

We consider picking up a pure state $\ket{\psi}\in\mathcal{H}_{\mathrm{mic}}$ randomly, and assume that $z_{\alpha}$ ’s are taken from the following probability distribution:

[TABLE]

where the constant $c$ is determined from $\int P(\left\{z_{\alpha}\right\})\prod_{\alpha\in\mathcal{S}}d\mathrm{Re}z_{\alpha}d\mathrm{Im}z_{\alpha}=1$ .

We first show that, for any operator $\hat{\mathcal{O}}$ ,

[TABLE]

where $\mathbb{E}$ and $\mathbb{V}$ are the expectation value and the variance over $P(\left\{z_{\alpha}\right\})$ , respectively, $||\hat{\mathcal{O}}||_{\mathrm{op}}$ is an operator norm of $\hat{\mathcal{O}}$ ,111The operator norm $||\hat{\mathcal{O}}||_{\mathrm{op}}$ is defined as the square root of the maximum eigenvalue of $\hat{\mathcal{O}}^{\dagger}\hat{\mathcal{O}}$ . If $\hat{\mathcal{O}}$ is Hermitian, it is the largest absolute eigenvalue of $\hat{\mathcal{O}}$ . $d=\dim[\mathcal{H}_{\mathrm{mic}}]$ , and

[TABLE]

where $\mathcal{O}_{\alpha\beta}=\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\beta}}$ . To show Eq. (2.14), we use expectation values of the moments of $\left\{z_{\alpha}\right\}$ . For second moments, we have

[TABLE]

and the only non-vanishing fourth moments are

[TABLE]

The first and third moments are all zero. Then we obtain

[TABLE]

and

[TABLE]

Moreover, we obtain

[TABLE]

From Chebyshev’s inequality, Eq. (2.14) implies

[TABLE]

for any $\epsilon>0$ , where $\mathbb{P}$ denotes the probability with respect to $P(\left\{z_{\alpha}\right\})$ . By taking $\epsilon=d^{-1/3}$ , we obtain

[TABLE]

Since we are discussing the typicality of MITE, we can assume that $\hat{\mathcal{O}}$ is an $M$ -body operator for some number $M$ , which we assume is independent of $N$ . In fact, however, we can also show the typicality for an operator that is written as the sum of the $M$ -body operators. Thus, we will consider such a general operator, which can be written for the case of a spin 1/2 model as

[TABLE]

The generalization to other models is straightforward. Here $f_{i_{1}\cdots i_{M};\alpha_{i_{1}}\cdots\alpha_{i_{M}}}$ ’s are constants whose absolute values are bounded by $f$ , which is independent of $N$ . Then, the operator norm of $\hat{\mathcal{O}}$ is bounded as

[TABLE]

Thus, $||\hat{\mathcal{O}}||_{\mathrm{op}}$ does not increase faster than the polynomial of $N$ . Since $d$ increases exponentially with $N$ , $d^{-1/3}$ and $\frac{||\hat{\mathcal{O}}||_{\mathrm{op}}^{2}}{d^{-2/3}(d+1)}$ appearing in Eq. (2.23) decrease rapidly with $N$ . Then, Eq. (2.23) means that, when $N$ is sufficiently large, most of the pure states $\ket{\psi}$ with respect to Eq. (2.13) give $\braket{\psi}{\hat{\mathcal{O}}}{\psi}\simeq\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}$ for any operator $\hat{\mathcal{O}}$ that can be written as the sum of few-body operators.

We can prove the typicality of MITE in the form of Eq. (2.5), too. Namely, we can prove

[TABLE]

for any $\epsilon>0$ . The proof is given in Appendix B.1. Here, we remember the equivalence of ensembles between the microcanonical ensemble and the canonical ensemble in a thermodynamically normal system, which has recently been proven rigorously under certain conditions [22, 122, 23]. Then, Eq. (2.26) further indicates that $\hat{\rho}_{S}\simeq\hat{\rho}_{\mathrm{can},S}$ holds true for most pure states, where $\hat{\rho}_{\mathrm{can}}=\frac{e^{-\beta\hat{H}}}{Z}$ is the canonical ensemble with $\beta$ being determined from the condition $\mathrm{Tr}[{\hat{H}\hat{\rho}}]=\mathrm{Tr}[{\hat{H}\hat{\rho}_{\mathrm{can}}}]$ . This type of the typicality is called the canonical typicality [8].

2.1.3 Macroscopic thermal equilibrium (MATE)

For MATE, we consider only a set of observables $\{\hat{M}^{\prime}_{1},\hat{M}^{\prime}_{2},\cdots,\hat{M}^{\prime}_{K}\}$ that are measured macroscopically, such as the magnetization density or the number of particles (invoking the usual thermodynamics). In general, $\{\hat{M}^{\prime}_{1},\hat{M}^{\prime}_{2},\cdots,\hat{M}^{\prime}_{K}\}$ are not commutable with one another. However, we expect that we can construct a set of commuting operators $\{\hat{M}_{1},\hat{M}_{2},\cdots,\hat{M}_{K}\}$ from $\{\hat{M}^{\prime}_{1},\hat{M}^{\prime}_{2},\cdots,\hat{M}^{\prime}_{K}\}$ , with $||\hat{M}_{l}-\hat{M}^{\prime}_{l}||\>(1\leq l\leq K)$ being small, if the commutators among $\{\hat{M}^{\prime}_{1},\hat{M}^{\prime}_{2},\cdots,\hat{M}^{\prime}_{K}\}$ are sufficiently small. Since this conjecture has been proven for several situations [123], we will use $\{\hat{M}_{1},\hat{M}_{2},\cdots,\hat{M}_{K}\}$ as macroscopic observables (we remark that the formalism that uses $\{\hat{M}^{\prime}_{1},\hat{M}^{\prime}_{2},\cdots,\hat{M}^{\prime}_{K}\}$ is proposed by Tasaki [14]. See Sec. C.1).

Since $\{\hat{M}_{1},\hat{M}_{2},\cdots,\hat{M}_{K}\}$ are commutable with one another, we can decompose the Hilbert space using the simultaneous eigenstates of these macroscopic observables. We denote such eigenstates by $\ket{\left\{\mu\right\},\lambda}=\ket{\mu_{1}\cdots\mu_{K},\lambda}\>(\hat{M}_{l}\ket{\left\{\mu\right\},\lambda}=\mu_{l}\ket{\left\{\mu\right\},\lambda})$ , where $\mu_{l}$ ’s are determined only macroscopically (i.e., only macroscopically different $\mu_{l}$ ’s can be distinguished), and $\lambda$ labels the degeneracies. Then the orthogonal decomposition of the Hilbert space is

[TABLE]

where the projection onto $\mathcal{H}_{\left\{\mu\right\}}$ can be written as

[TABLE]

We call $\mathcal{H}_{\left\{\mu\right\}}$ as a macrospace.

We now consider that $\hat{M}_{1}$ is a coarse-grained Hamiltonian ( $\hat{M}_{1}^{\prime}=\hat{H}$ ). Then, $\mu_{1}$ denotes the coarse-grained energy, $\mu_{1}\sim E\pm\Delta E$ , where $\Delta E$ represents the inaccuracy due to the coarse-graining. We consider the Hilbert space with $\mu_{1}\simeq E$ as the microcanonical energy shell at energy $E$ . Then this energy shell can be decomposed by the other macroscopic observables, $\{\hat{M}_{2},\cdots,\hat{M}_{K}\}$ , as

[TABLE]

We note that

[TABLE]

An important observation, which von Neumann did not realize, is that for one special set of $(\mu_{2},\cdots,\mu_{K})$ , we should usually expect

[TABLE]

In other words, the dimension of only one macrospace dominates, and we will call that macrospace as the thermal equilibrium subspace, $\mathcal{H}_{\mathrm{eq}}$ . As a result, for a small $\epsilon>0$ , we can decompose $\mathcal{H}_{\mathrm{mic}}$ as

[TABLE]

where we define a nonequilibrium subspace, $\mathcal{H}_{\mathrm{neq}}$ , as the direct sum of non-thermal equilibrium macrospaces.

We define that a state $\hat{\rho}$ is in MATE if and only if

[TABLE]

for small $\delta$ . MATE is also regarded as thermal equilibrium relative to $\mathcal{A}_{\mathrm{MATE}}=\{\hat{M_{1}},\hat{M}_{2},\cdots,\hat{M}_{K}\}$ . As for MITE, we can show the typicality of pure states for MATE. In fact, MITE generally implies MATE, since a macroscopic observable can be written as a sum of local operators. In that sense, MITE is a stronger assumption for thermal equilibrium than MATE.§§§As an example of a state that seems to be in MATE but not in MITE, consider a noninteracting spin 1/2 chain of $N$ sites, where $\hat{H}=0$ . Let us take a product state $\ket{\psi}=\bigotimes_{i=1}^{N}\ket{\psi_{i}}$ , where $\ket{\psi_{i}}\in\mathcal{H}_{i}$ (the local Hilbert space at each site). If each $\ket{\psi_{i}}$ is randomly chosen from $\mathcal{H}_{i}$ , we expect that $\hat{\rho}=\ket{\psi}\bra{\psi}$ and the microcanonical ensemble $\hat{\rho}_{\mathrm{mic}}=\frac{1}{D}\hat{\mathbb{I}}_{D\times D}\>(D=2^{N})$ are indistinguishable for macroscopic observables such as $\hat{M}_{z}=\sum_{i=1}^{N}\hat{\sigma}_{i}^{z}$ : $\mathrm{Tr}[\hat{\rho}\hat{M}_{z}]\simeq\mathrm{Tr}[\hat{\rho}_{\mathrm{mic}}\hat{M}_{z}]=0$ . On the other hand, if we only consider the first spin, we have $\hat{\rho}_{1}=\ket{\psi_{1}}\bra{\psi_{1}}\neq\frac{1}{2}\hat{\mathbb{I}}_{2\times 2}=(\hat{\rho}_{\mathrm{mic}})_{1}$ . This shows that $\ket{\psi}$ is not in MITE.

Although MATE seems a natural situation for considering thermodynamics (because it only considers macroscopic observables), we know that MITE is also meaningful in quantum statistical mechanics, so we will more often pay attention to MITE than MATE.

2.1.4 A looser way to consider thermal equilibrium

We have seen how to formulate thermal equilibrium by restricting observables, but it is, in general, not easy to test whether a state satisfies these criteria. Instead, many previous studies have investigated the expectation values of certain observables [16, 18, 26, 40], and simply checked if they are approximately equal to the prediction of the thermal ensemble:

[TABLE]

Although the expectation value of a single observable $\hat{\mathcal{O}}\in\mathcal{A}_{\mathrm{MITE/MATE}}$ would not tell us if the state is in MITE/MATE, at least it implies. Moreover, the definition of MITE/MATE may be too restrictive, and $\hat{\mathcal{O}}\not\in\mathcal{A}_{\mathrm{MITE/MATE}}$ may satisfy the condition (2.34), which implies that statistical mechanics is applicable even beyond MITE. This possibility has recently been pointed out in Refs. [120, 124, 121]. For these reasons, we will mainly consider (2.34) for certain observables in the following discussions, including our works in Chapters 4 and 6.

2.2 Conditions for equilibration and thermalization

Now, we consider the meanings and conditions for a nonequilibrium initial state $\ket{\psi_{0}}$ to approach thermal equilibrium under unitary time evolutions (for simplicity, we will consider pure states in this section). As we have mentioned in the overview, it is convenient to separate the problem into two. We first consider equilibration, namely, a phenomenon where a state seemingly relaxes to a stationary state. Then, we will discuss when the stationary state is indistinguishable to a thermal state when we measure $\hat{\mathcal{O}}$ .

2.2.1 Equilibration

We first consider the meaning of equilibration. Naively, we might expect that after some equilibration time, $T_{\mathrm{eq}}$ , the expectation value of an observable $\hat{\mathcal{O}}$ is almost equal to some stationary value $\mathcal{O}_{\mathrm{d}}$ (the meaning of the subscript “d” is explained later):

[TABLE]

where $\braket{\hat{\mathcal{O}}}(t)=\braket{\psi(t)}{\hat{\mathcal{O}}}{\psi(t)}$ with $\ket{\psi(t)}=e^{-i\hat{H}t}\ket{\psi_{0}}$ . This equation is wrong in general because of the quantum recurrence theorem [125]; we can rigorously show that, for an arbitrary small $\epsilon>0$ , there exists an infinite sequence $0<T_{i}\>(i=1,2,\cdots)$ such that

[TABLE]

This theorem also implies that there always exists $t=T_{\mathrm{rec}}$ that makes $\braket{\hat{\mathcal{O}}}(t)$ arbitrary close to $\braket{\hat{\mathcal{O}}}(0)$ . Then, if $|\braket{\hat{\mathcal{O}}}(0)-\mathcal{O}_{\mathrm{d}}|$ is large, Equation (2.35) does not hold true. The important thing to notice here is that the recurrence times become super-exponential of the system size, and that we rarely expect such recurrences. In other words, it is reasonable to regard equilibration as a phenomenon where the expectation value of an observable stays close to some stationary value for almost all times (see Fig. 2.1):

[TABLE]

In order to justify Eq. (2.37), we consider the average and the variance of $\braket{\hat{\mathcal{O}}}(t)$ over time $t$ . In doing this, we assume two assumptions about the energy eigenvalues $E_{\alpha}$ of the Hamiltonian $\hat{H}$ , namely,

(non-degeneracies)

[TABLE]

and 2. 2.

(non-resonances)

[TABLE]

We expand the initial state by the energy eigenstates as

[TABLE]

where $c_{\alpha}=\braket{E_{\alpha}}{\psi_{0}}$ . Then the long-time average of $\braket{\hat{\mathcal{O}}}(t)$ becomes

[TABLE]

where $\overline{\cdots}=\lim_{T\rightarrow\infty}\frac{1}{T}\int_{0}^{T}\cdots dt$ denotes the average over time. We have also introduced

[TABLE]

which is called the diagonal ensemble because it is diagonal in the basis of the energy eigenstates (the subscript “d” stands for “diagonal”). Similarly, the variance over time can be calculated as

[TABLE]

Using Chebyshev’s inequality, Eq. (2.2.1) and Eq. (2.2.1) lead to

[TABLE]

for any $\epsilon>0$ , where $\mathbb{P}$ denotes the probability with respect to the uniform measure $t\in[0,\infty)$ .¶¶¶This probability distribution is not normalizable in a rigorous sense, since $\int_{0}^{\infty}dt=\infty$ . We regard it as the uniform measure in $[0,T]$ for a sufficiently large $T$ . Under certain conditions, we can show that

$\displaystyle\frac{1}{T}\int_{0}^{T}dt[{\braket{\hat{\mathcal{O}}}(t)}-\mathcal{O}_{\mathrm{d}}]^{2}$

(2.45)

is small when $T$ is sufficiently large. Using Markov’s inequality, we can justify equilibration [21].

Then, when $\Delta\mathcal{O}_{t}^{2}$ is sufficiently smal, we can justify Eq. (2.37) with $\mathcal{O}_{\mathrm{d}}$ being the long-time average of $\braket{\hat{\mathcal{O}}}(t)$ . In Sec. 2.3 and Sec. 2.4, we will consider in what situations $\Delta\mathcal{O}_{t}^{2}$ is small in order to justify equilibration.

Timescales

Before considering thermalization, we shall make a remark about timescales of equilibration. Although we have taken the long-time average in Eq. (2.2.1) and Eq. (2.2.1), we usually do not have to wait for such a long time to observe equilibration. Indeed, we know that systems approach stationary state in an accessible time $T_{\mathrm{eq}}$ by experiments [88] and numerics [26]. Many authors have tried to estimate such timescales of equilibration. In Ref. [126], Goldstein, Hara and Tasaki estimated the timescale with which the state gets out of a typically chosen nonequilibrium subspace of MATE (see Eq. (2.1.3)). The obtained timescale is $\sim\frac{\hbar}{k_{B}T}$ , which is unphysically short in general. The lesson from this observation is that we should not rely on the typicality argument in considering the dynamics (note, however, that the typicality argument might be useful to describe quick prethermalization [127]). Another notable proposition is to consider the operator norm of the commutator between the Hamiltonian and an observable $\hat{\mathcal{O}}$ in order to estimate the slowest timescale of equilibration [128]. If $||[\hat{\mathcal{O}},\hat{H}]||_{\mathrm{op}}\leq\chi$ , then using the Heisenberg representation for $\hat{\mathcal{O}}$ , we obtain

[TABLE]

which implies that $|\braket{\hat{\mathcal{O}}}(t)-\braket{\hat{\mathcal{O}}}(0)|$ remains small for $t\ll\frac{1}{\chi}$ . Thus we expect that equilibration will not occur until $T_{\mathrm{eq}}\sim\frac{1}{\chi}$ if $|\mathcal{O}_{\mathrm{d}}-\braket{\hat{\mathcal{O}}}(0)|$ is large. In Ref. [128], the authors investigate $\chi$ to find that there are operators that decay slower than the diffusive modes. Finally, we notice that the Lieb-Robinson bound [100] can be used to estimate the timescale in a system with short-range interactions. Indeed, we can rigorously show that the effect of local operation in region A is negligible in region B till the time $t\sim L/v$ , where $L$ is the distance of regions A and B, and $v$ is the finite velocity that depends on the interactions [129].

Although many works are present, estimating timescales of equilibration is still an open question. One of the difficulties is that timescales of equilibration are highly observable- and system-dependent.∥∥∥We note that some systems may not show equilibration within accessible timescales [130, 128] Moreover, complex systems show prethermalization, which means that the equilibration cannot be characterized by a single timescale. The issue of timescales is interesting, but still much remains to be understood, and we will set the issue aside in the following discussions.

2.2.2 Thermalization

Next, we consider if the stationary state is close to the thermal state. Since we have seen that the expectation values at the stationary state is regarded as the expectation value with respect to the diagonal ensemble, what we have to prove is that $\mathrm{Tr}[\hat{\rho}_{\mathrm{d}}\hat{\mathcal{O}}]\simeq\mathrm{Tr}[\hat{\rho}_{\mathrm{mic}}\hat{\mathcal{O}}]$ , or more explicitly

[TABLE]

where $d=\dim[\mathcal{H}_{\mathrm{mic}}]$ . We will consider when Eq. (2.48) is justified in the following sections. We note, however, that the approach to MITE can also be formulated at the ensemble level:

[TABLE]

which requires that all of the operators acting on $S$ should thermalize.

2.3 Approach to thermal equilibrium from any initial state: the eigenstate thermalization hypothesis

In this section, we introduce the eigenstate thermalization hypothesis (ETH), which is one of the main subjects in this thesis. We will see that the ETH for off-diagonal terms is related to equilibration, and the ETH for diagonal terms is related to thermalization.

The ETH is a statement for matrix elements $\mathcal{O}_{\alpha\beta}$ in the thermodynamic limit. We say that the ETH holds true if for any $\alpha,\beta$ in a certain energy range,******In many nonintegrable systems, Eq. (2.50) seems to hold true for most of the eigenstates except for the edge of the spectrum.

[TABLE]

where the arrow denotes the thermodynamic limit, and $\mathcal{O}_{\mathrm{m}}(x)$ is a smooth function of $x$ . We note that there are some small deviations from the ETH in finite-size systems, and that they decrease with system sizes (see Chapter 3 for details).

2.3.1 Off-diagonal matrix elements and equilibration

First, Eq. (2.50) implies that every off-diagonal term vanishes in the thermodynamic limit, namely $\mathcal{O}_{\alpha\beta}\rightarrow 0\>(\alpha\neq\beta)$ . Then, we can see that equilibration is justified under the assumption of the ETH, since

[TABLE]

in the thermodynamic limit. This argument is applicable for any initial state $\ket{\psi_{0}}$ .

2.3.2 Diagonal matrix elements and thermalization

Next, we consider the validity of Eq. (2.48) from diagonal matrix elements in Eq. (2.50). In order to do this, we make one more assumption: the spread of the energy of $\ket{\psi_{0}}$ is macroscopically negligible, namely,

[TABLE]

where $E=\braket{\psi_{0}}{\hat{H}}{\psi_{0}}$ , and $V$ is a system size. This condition is satisfied for usual quench setups [26]. To see one example, we consider a quench where we change the Hamiltonian from $\hat{H}_{0}$ to $\hat{H}=\hat{H}_{0}+\hat{V}$ , with $\hat{H}_{0}$ and $\hat{V}$ expressed as the sums of local operators. We also assume that $\ket{\psi_{0}}$ is an eigenstate (e.g., a ground state) of $\hat{H}_{0}$ for simplicity. Then

[TABLE]

If we write $\hat{V}$ as $\hat{V}=\sum_{i}\hat{v}_{i}$ , where $i$ denotes a lattice site and $\hat{v}_{i}$ is a localized operator near $i$ , then

[TABLE]

If $\ket{\psi_{0}}$ has a cluster property, $\braket{\psi_{0}}{\hat{v}_{i}\hat{v}_{j}}{\psi_{0}}-\braket{\psi_{0}}{\hat{v}_{i}}{\psi_{0}}\braket{\psi_{0}}{\hat{v}_{j}}{\psi_{0}}$ is sufficiently small when $|i-j|$ becomes large. Therefore, $\delta E$ scales only subextensively with the system size.

Under the assumption of Eq. (2.52), we can show that, if the system is large enough,

[TABLE]

where $\mathcal{O}_{\mathrm{m}}^{\prime}(x)=\frac{\mathrm{d}\mathcal{O}_{\mathrm{m}}(x)}{\mathrm{d}x}$ . Similarly, we can show

[TABLE]

which leads to Eq. (2.48) in the thermodynamic limit ( $\Delta E$ is a microcanonical energy shell). This argument is applicable for any initial state $\ket{\psi_{0}}$ that satisfies Eq. (2.52).

We remark that the ETH for diagonal matrix elements is in a sense a necessary condition for the system to approach thermal equilibrium from any initial state as well. Consider a composite system with a small system and a bath, $\mathcal{H}_{S}\otimes\mathcal{H}_{S^{c}}$ , which is the setup for MITE. Roughly speaking, in Ref. [131], the authors showed the following: If all of the product states in the energy shell, $\rho_{S}\otimes\rho_{S^{c}}\in\mathcal{H}_{\mathrm{mic}}$ , relax to MITE as

[TABLE]

Then we obtain the ETH for diagonal matrix elements (in the MITE form) as

[TABLE]

where $\hat{\tau}_{\alpha}=\ket{\alpha}\bra{\alpha}$ .††††††They used the trace norm ( $||\hat{\mathcal{O}}||:=\mathrm{Tr}[\sqrt{\hat{\mathcal{O}}^{\dagger}\hat{\mathcal{O}}}]$ ) for the proof.

2.4 Roles of initial states for equilibration and thermalization

In this section, we review several conditions of initial states, with which equilibration or thermalization is justified. We have seen that equilibration and thermalization occur for any nonequilibrium initial state in the energy shell, if we admit the ETH. However, equilibration and thermalization is also shown to occur if the initial state satisfies certain conditions. For example, integrable systems often equilibrate, though the ETH breaks down in such systems [42, 52]. We will see that the so-called effective dimension,

[TABLE]

where $S_{2}(\hat{\rho}_{\mathrm{d}})=-\ln\left[\mathrm{Tr}[\hat{\rho}_{\mathrm{d}}^{2}]\right]$ is a Rényi-2 entropy, plays important roles for equilibration and thermalization.

First, we consider the condition for equilibration of a Hermitian operator $\hat{\mathcal{O}}$ , following Refs. [18, 21]. We can bound Eq. (2.2.1) from above as

[TABLE]

where we have used the Cauchy-Schwartz inequality $\left\{\mathrm{Tr}[\hat{A}\hat{B}^{\dagger}]\right\}^{2}\leq\mathrm{Tr}[\hat{A}\hat{A}^{\dagger}]\mathrm{Tr}[\hat{B}\hat{B}^{\dagger}]$ , and the fact that $\mathrm{Tr}[\hat{A}\hat{B}]\leq||\hat{A}||_{\mathrm{op}}\mathrm{Tr}[\hat{B}]$ for positive operator $\hat{A}$ and $\hat{B}$ . We can see that, if the effective dimension $d_{\mathrm{eff}}$ is much larger than $||\hat{\mathcal{O}}||_{\mathrm{op}}^{2}$ , equilibration occurs thanks to Eq. (2.44). In fact, $d_{\mathrm{eff}}$ increases exponentially with the system size in many setups regardless of integrability of the systems. Since we have seen that $||\hat{\mathcal{O}}||_{\mathrm{op}}^{2}$ increases at most as a polynomial in the system size if $\hat{\mathcal{O}}$ is a sum of few-body operators (see Sec. 2.1), we expect equilibration of such an operator.

Though the proof above relies on several assumptions, namely Eq. (2.38) and Eq. (2.39), we remark that these assumptions can be abandoned or weakened [20, 21]. In Ref. [21], Short and Farrelly discussed equilibration without assuming the non-degenerate and the strict non-resonance conditions. They considered a Hamiltonian that can be written as $\hat{H}=\sum_{\alpha}E_{\alpha}\hat{\mathcal{P}}_{\alpha}$ , where $\hat{\mathcal{P}}_{\alpha}$ is a projection operator onto the Hilbert space with an associated energy $E_{\alpha}$ (note that we allow degeneracies). They also introduced the so-called density of energy gaps, which is defined as

[TABLE]

where $\#\left\{\right\}$ means the number of the elements in the set $\left\{\right\}$ . We note that Eq. (2.39) corresponds to assuming $\lim_{\epsilon\rightarrow 0+}N(\epsilon)=1$ . Then they showed that

[TABLE]

for any $\epsilon>0$ , where $\tilde{d}_{\mathrm{eff}}^{-1}:=\sum_{\alpha}\left\{\braket{\psi_{0}}{\hat{\mathcal{P}}_{\alpha}}{\psi_{0}}\right\}^{2}$ .‡‡‡‡‡‡They derived a more general inequality concerning the finite time average, but we will not discuss it here.

Next, we comment on the condition about initial states with which the system thermalizes. Here let us consider a macroscopic observable, which can be written as the average of local operators like $\hat{\mathcal{O}}=\frac{1}{V}\sum_{i}\hat{o}_{i}$ . In Ref. [132], Mori showed that, in certain systems (e.g., 1D short-range interacting lattice systems), an exponentially small fraction of the energy eigenstates in $\mathcal{H}_{\mathrm{mic}}$ give the different expectation value from the microcanonical ensemble. In other words, for an arbitrary $\delta>0$ , there exists $\gamma>0$ such that

[TABLE]

where $\mathbb{P}$ denotes the probability in the uniform distribution of $\alpha$ with $\ket{E_{\alpha}}\in\mathcal{H}_{\mathrm{mic}}$ , and $N$ is the system size. This is a stronger form of what is called a weak ETH (see Sec. 3.4). Using this, the difference between $\braket{\hat{\mathcal{O}}}_{\mathrm{d}}$ and $\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}$ can be estimated. Let us denote ${\tilde{\mathcal{S}}}(\subset\mathcal{S})$ as the set of $\alpha$ ’s that satisfy $|\mathcal{O}_{\alpha\alpha}-\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}|>\delta$ . In a large system, we can approximate $\ket{\psi_{0}}\in\mathcal{H}_{\mathrm{mic}}$ and thus

[TABLE]

Since $\sum_{\alpha\in{\tilde{\mathcal{S}}}}1\rightarrow de^{-\gamma N}$ for a large $N$ , the condition

[TABLE]

justifies that $|\braket{\hat{\mathcal{O}}}_{\mathrm{d}}-\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}|$ is small for a large $N$ . The condition of thermalization in Eq. (2.65), or similar conditions to it [14, 23], are tighter than the condition of equilibration (which requires exponentially large $d_{\mathrm{eff}}$ ). In fact, it is not easy to show in what conditions Eq. (2.65) is achieved.

Chapter 3 Review of the eigenstate thermalization hypothesis (ETH)

As we have seen in Chapter 2, the ETH is expected to play a crucial role in quantum thermalization of nonintegrable isolated systems. Though many numerical simulations suggest that the ETH holds true in nonintegrable systems for few-body observables, there are hardly any mathematical proofs of the ETH for a given set of a Hamiltonian and an observable. We do not have definite criteria of when the ETH holds true, either. Despite the lack of the complete understanding, possible analytical explanations of the ETH have been proposed since the notable work by von Neumann [6]. These explanations and numerical verifications will provide important clues for mathematical proofs and definite criteria.

In this chapter, we review the previous detailed studies on the ETH. After looking back into the history of the ETH, we review three possible analytical explanations of why the ETH seems true for a wide variety of systems. In particular, some important relations between the ETH in nonintegrable systems and random matrix theory (RMT) will be discussed in Subsection 3.2.2. We also introduce an ansatz that describes the finite-size deviations from the ETH. Then, we will show some previous numerical results that tested the ETH. Finally, we remark on what is called the weak ETH. The structure of this chapter is summarized in Fig. 3.1.

3.1 Histories

Historically, von Neumann first tried to justify what is now essentially regarded as the ETH [6]. He originally showed that for almost all decompositions of macrospaces, the ETH holds true. Thus, his argument is similar (but not equivalent) to the typicality argument that we have reviewed in Chapter 2. Since the ETH holds true, thermalization also occurs for almost all decompositions of macrospaces. Von Neumann called this fact the quantum ergodic theorem, but it is actually irrelevant to the classical ergodicity, which incurred the misunderstandings of his work in 1950’s. We note that, though von Neumann originally considered only macroscopic observables, his argument has recently been extended to arbitrary observables by Reimann [40].

The study of the ETH greatly developed from the late 1970’s to the 1990’s, motivated by the relation between the quantum chaos theory and random matrix theory (RMT). This relation was especially investigated for quantum systems that have semiclassical limits ( $\hbar\rightarrow 0$ ). One important achievement is the establishment of two conjectures about the statistics of the energy eigenvalues of the Hamiltonian. If the corresponding classical system is chaotic, the level-spacing distribution of the eigenvalues shows the Wigner-Dyson distribution, which is predicted by RMT. This conjecture is called the Bohigas-Giannoni-Schmit (BGS) conjecture [133]. On the other hand, if the corresponding classical system is integrable, the level-spacing distribution of the eigenvalues shows the Poisson distribution. This conjecture is called the Berry-Tabor conjecture [134]. Similarly, statistics of the energy eigenstates was investigated especially for semiclassical models. It is conjectured that, the energy eigenstates are delocalized in phase space if the corresponding classical system is chaotic (Berry’s conjecture [135]), which is also consistent with RMT.***In fact, physical systems may have a negligible fraction of non-delocalized energy eigenstates even in a classically chaotic system, which is called a scar. However, the presence of scars will not be important in the following discussions [16]. These observations lead to the work by Peres [118], who conjectured that matrix elements of observables in the basis of the Hamiltonian look random and suggested the notion of the ETH and its finite-size corrections. Several authors [136, 137] also numerically verified that, matrix elements of observables are distributed as Gaussian in systems whose corresponding classical systems are chaotic. Srednicki also applied Berry’s conjecture to show that thermalization occurs thanks to the ETH [16].

The connection to RMT also encouraged researchers to investigate the ETH from the viewpoint of (non)integrability, even if many-body quantum systems have no classical counterparts (we have summarized the relations between RMT and (non)integrability in Fig. 3.2). In fact, analogously to the semiclassical situations, level distributions of the eigenvalues are different depending on the integrability of the Hamiltonian. If it is a nonintegrable system that conserves only energy, the level spacings show the Wigner-Dyson distribution; if it is integrable (e.g., mappable to free systems or solvable by the Bethe ansatz), the level spacings show the Poisson distribution. A similar classification in light of nonintegrability was sought for eigenstates and matrix elements in general many-body quantum systems. In 1985, Jensen and Shanker numerically investigated the ETH for nonintegrable and integrable transverse spin chains [24]. In 1991, Deutsch proposed the origin of the ETH by considering the integrable model perturbed by random interactions [25] (see Sec. 3.2.3). In 1999, Srednicki developed his semiclassical arguments, and conjectured a general form of matrix elements in nonintegrable systems [138], which describes the ETH and its finite-size corrections (see Sec. 3.2.2). We note that his conjecture is testable for general many-body quantum systems.

These works were rediscovered after Rigol, Dunjko, and Olshanii numerically demonstrated the validity of the ETH in a nonintegrable many-body quantum system and its breakdown in an integrable system [26]. After that, many numerical simulations appeared that tested the ETH for both diagonal and off-diagonal matrix elements of (a sum of) few-body operators [27, 139, 140, 141, 30, 31, 33]. Some of them investigated how the finite-size corrections of the ETH behave [141, 30, 31, 33, 142], sometimes referring to the Srednicki’s argument (see Eq. (3.35)).

Though many works have been done, the complete understanding of the ETH is yet to be made. Rigorous proofs of the ETH for a given set of a Hamiltonian and an observable are hardly obtained. We do not have definite criteria of when the ETH holds true, either.†††For example, even though most of the numerical simulations have tested the ETH only for (a sum of) few-body observables in nonintegrable systems that conserve only energy, several studies indicate that the ETH may hold true in wider situations [120, 124, 121]. In seeking for mathematical proofs and definite criteria, it is important to understand underlying mechanisms of why the ETH is valid (note that even qualitative understanding has not sufficiently been obtained). In that sense, the possible analytical explanations and numerical verifications have the meanings of providing clues for understanding such mechanisms of the ETH.

3.2 Possible explanations of the ETH

In this section, we review three possible explanations of the ETH. First, we review the argument by von Neumann [6] and Reimann [40]; they investigated the ETH using the notion similar to the typicality. Second, in Subsection 3.2.2, we review the analogy between nonintegrable systems and random matrices; this analogy leads to the explanations of the ETH. We introduce a general form of matrix elements which describes the ETH and its finite-size corrections, which are predicted by Srednicki [138]. Topics treated in this subsection is especially relevant to Chapter 4. Third, we briefly explain the argument by Deutsch, who modeled a nonintegrable Hamiltonian as the sum of an integrable Hamiltonian and a random perturbation.

3.2.1 Arguments by von Neumann and Reimann

In this section, we review von Neumann’s original work on the justification of the ETH, using the typicality-like argument. He only considered macroscopic observables, but Reimann has recently extended Von Neumann’s argument to an arbitrary observable with a more modern method. Thus, we follow the paper by Reimann [40].

The setup and the statement

As a setup, we consider only one microcanonical energy shell and assume that the initial state is in this energy shell ( $\ket{\psi_{0}}\in\mathcal{H}_{\mathrm{mic}}$ ). Then, we need to show the ETH for matrix elements $\mathcal{O}_{\alpha\beta}$ with $\ket{E_{\alpha}},\ket{E_{\beta}}\in\mathcal{H}_{\mathrm{mic}}$ to justify thermalization. Thus, we can consider an observable $\hat{O}:=\mathcal{\hat{P}}_{\mathrm{mic}}\hat{\mathcal{O}}\mathcal{\hat{P}}_{\mathrm{mic}}$ , instead of $\hat{\mathcal{O}}$ itself (note that $\mathcal{O}_{\alpha\beta}={O}_{\alpha\beta}$ , where $\hat{\mathcal{P}}_{\mathrm{mic}}$ is the projection into $\mathcal{H}_{\mathrm{mic}}$ ). If we diagonalize $\hat{O}$ as $\hat{O}=\sum_{i=1}^{d}a_{i}\ket{a_{i}}\bra{a_{i}}$ , where $d=\dim[\mathcal{H}_{\mathrm{mic}}]$ , we can define the transformation $U:\mathcal{H}_{\mathrm{mic}}\rightarrow\mathcal{H}_{\mathrm{mic}}$ between the bases $\left\{\ket{E_{\alpha}}\right\}$ and $\left\{\ket{a_{i}}\right\}$ , whose matrix elements are $U_{\alpha i}:=\braket{E_{\alpha}}{a_{i}}$ . We note that $U$ can be an arbitrary $d\times d$ unitary matrix if the Hamiltonian has neither unitary nor anti-unitary symmetry.

Von Neumann and Reimann showed that, for almost all $U$ (with respective to the unitary Haar measure), the ETH holds true. Namely, they showed that for any $\epsilon>0$ ,

[TABLE]

where $\mathbb{P}$ denotes the probability over $U$ with respect to the unitary Haar measure, and

[TABLE]

Before proving the inequalities, let us explain the meanings of these inequalities. When $d$ is large enough, the right-hand sides of Eq. (3.1) and Eq. (3.2) become negligibly small, and the ETH occurs for almost all $U$ . Thus, for a physically relevant set of a Hamiltonian and an observable as well, it is not unnatural to consider $U$ as being “typical,” which leads to the ETH. We note that the uniform Haar distribution of $U$ can be formally regarded as taking a randomly sampled matrix for $\mathcal{\hat{P}}_{\mathrm{mic}}\hat{H}\mathcal{\hat{P}}_{\mathrm{mic}}$ with a fixed observable.‡‡‡In fact, a random matrix whose probability distribution is invariant under arbitrary unitary transformations has eigenstates that are distributed uniformly with respect to the unitary Haar measure. See Appendix A. Instead, it can also be regarded as taking a randomly sampled $\hat{O}$ with a fixed Hamiltonian, which is close to the original argument of macrospaces by von Neumann.

Proof

Here we prove Eq. (3.1) and Eq. (3.2). In order to deal with the probability with respect to the unitary Haar measure, we use Levy’s lemma [143, 9]:

[TABLE]

for any $\epsilon>0$ . Here “ $\mathrm{Prob}$ ” means the probability with respect to uniformly distributed random points $\phi$ on a $d^{\prime}$ -dimensional unit sphere $\mathbb{S}^{d^{\prime}}\subset\mathbb{R}^{d^{\prime}+1}$ , and $\braket{\cdots}_{\phi}$ denotes the average over $\phi$ . A function $g(\phi):\mathbb{S}^{d^{\prime}}\rightarrow\mathbb{R}$ is a Lipshitz continuous function with a Lipshitz constant $\eta$ . In our case, $\ket{\phi}\in\mathcal{H}_{\mathrm{mic}}$ represents a point $\phi$ on a $(2d-1)$ -dimensional unit sphere ( $d^{\prime}=2d-1$ ). Moreover, we can show that $g(\phi)=\braket{\phi}{\hat{O}}{\phi}$ is a Lipschitz continuous with $\eta=\Delta_{\hat{O}}$ because [143]

[TABLE]

where $\hat{O^{\prime}}:=\hat{O}-X_{\hat{O}}/2$ and $X_{\hat{O}}:=\max_{i}a_{i}+\min_{i}a_{i}$ . We also note that $\braket{g}_{\phi}=\braket{\hat{O}}_{\mathrm{mic}}$ which can be obtained by the same calculation done in obtaining Eq. (2.14). Then, observing that randomizing $\ket{\phi}$ and randomizing $U$ are equivalent, we obtain

[TABLE]

Now, we will prove Eq. (3.1). We have

[TABLE]

for any $\ket{E_{\alpha}}\in\mathcal{H}_{\mathrm{mic}}$ . To deal with “max” in Eq. (3.1), we note that for an arbitrary set of functions $\{f_{\rho}\}_{\rho}$ , we have

[TABLE]

Here $\theta(\cdot)$ denotes a step function. Applying this, we obtain

[TABLE]

This is equivalent to Eq. (3.1).

To prove Eq. (3.2), we use the following inequality which is proven in Appendix B.2. Namely, for any $\epsilon>0$ ,

[TABLE]

where $\ket{\phi}$ is some state in the energy shell, and $\alpha\neq\beta$ . Applying this and Eq. (3.2.1), we obtain

[TABLE]

which is equivalent to Eq. (3.2).

3.2.2 Some predictions from random matrix theory in nonintegrable systems

In the previous subsection, we saw that we can rigorously show the ETH for almost all $U$ ’s. However, there is no a priori reason to believe that the typicality of $U$ with respect to the unitary Haar measure is physically meaningful. In fact, we can easily find physical systems that have an atypical $U$ by taking them integrable or many-body localized systems, as we have seen in the overview. From this observation, it is reasonable to attribute the validity of the ETH to nonintegrability of the system. Actually, many previous studies suggested that nonintegrable systems have in common with random matrix theory (RMT), which also indicates how matrix elements of an observable behave.

In this section, we explain how nonintegrability of the system is related to RMT and the ETH. We first review the BGS conjecture, which connects the level-spacing statistics of nonintegrable systems with those of RMT. Next, we consider eigenvectors and matrix elements. We explain how the ETH is derived from a model of RMT, with a brief review of the works by Srednicki [16] and others. Finally, we introduce a general form of matrix elements in nonintegrable systems conjectured by Srednicki [138], which describes the ETH and its finite-size corrections. We summarize the results of this subsection in Fig. 3.3. For the sake of self-containedness, we summarize the basics of RMT in Appendix A.

Level-spacing statistics of random matrices and nonintegrable systems

We first consider level-spacing distributions of the Gaussian random matrices introduced by Dyson, namely the Gaussian unitary ensemble (GUE), the Gaussian orthogonal ensemble (GOE), and the Gaussian symplectic ensemble (GSE) (see Appendix A). Roughly speaking, matrices without any anti-unitary symmetry $\hat{T}$ belong to the GOE, matrices with only one anti-unitary symmetry $\hat{T}\>(\hat{T}^{2}=1)$ belong to the GOE, and matrices with only one anti-unitary symmetry $\hat{T}\>(\hat{T}^{2}=-1)$ belong to the GSE. A level-spacing distribution $P(s)$ is defined as the probability density for two neighboring energy levels $S_{\alpha+1}$ and $S_{\alpha}$ to have a spacing equal to $s$ . Here we assume that the spectrum $\left\{S_{\alpha}\right\}$ is obtained by renormalizing the original spectrum $\left\{E_{\alpha}\right\}$ by the so-called unfolding procedure [144], and that the mean level density of $\left\{S_{\alpha}\right\}$ is set to unity. It is known that the level-spacing distributions for $D\times D$ Gaussian random matrices are well approximated by those of $2\times 2$ Gaussian random matrices. For each of the three ensembles, the level-spacing distribution is given by the following Wigner-Dyson distribution:§§§As we will see, a matrix in the GSE has a doubly degenerate spectrum (the Kramers degeneracy). In this case, the level-spacing distribution $P_{\mathrm{GSE}}(s)$ is defined from the non-degenerate neighboring levels.

[TABLE]

where we have assumed normalization conditions for a probability density as $\int_{0}^{\infty}dsP(s)=1$ , and for a first moment of $s$ as $\int_{0}^{\infty}dssP(s)=1$ . The important feature of these distributions is that they all show $p(s\rightarrow 0+)=0$ , which means the level repulsions. This feature cannot be seen in the uncorrelated level statistics that leads to the following Poissonian form:

[TABLE]

To clarify the meaning of the statistics, we introduce the notion of what is called the “ergodicity of random matrices,” which relates the spectral statistics and the ensemble statistics.222Note that this notion is rather different from the usual ergodicity, which relates the long-time average and the phase-space average. Although the discussion of the previous paragraph considered the ensemble statistics of random Hamiltonians, we can also consider the spectral statistics for a randomly sampled single Hamiltonian, where we make a histogram of $S_{\alpha+1}-S_{\alpha}$ for $d_{s}$ different $\alpha$ ’s. The statement of the ergodicity of random matrices is the following: when the dimension of the random matrices $D$ and the number of the samplings $d_{s}$ is sufficiently large, the ensemble statistics and the spectral statistics coincide to a certain accuracy, for almost all fixed Hamiltonians randomly sampled from the ensemble. The ergodicity is proven for certain quantities, which include level-spacing statistics [145, 146]. Thus, we assume that we can also regard Eq. (3.12) and the other statistics as the spectral statistics of a fixed Hamiltonian.

The BGS conjecture states that the level spacings of quantum systems whose classical counterparts exhibit chaos show the Wigner-Dyson statistics. According to the symmetry of the system, the level statistics change to those of the random matrix with the same symmetry class. Despite the absence of the complete proof, the BGS conjecture is verified in many concrete situations. We note that for quantum systems whose classical counterparts are completely integrable, the Poisson statistics in Eq. (3.15) are expected to be applicable as implied by the Berry-Tabor conjecture [134].

Though the BGS conjecture was originally proposed for quantum systems that have classical counterparts, it is now known that the level-spacing statistics seems to be related to nonintegrability of general quantum systems that may not have classical counterparts. Many integrable systems, which include noninteracting systems, systems mappable to free systems and systems solvable by the Bethe ansatz, show the level-spacing statistics that is Poissonian or more degenerate.¶¶¶For example, $P(s)$ has a delta-function peak at $s=1$ for a single-mode harmonic oscillator. On the other hand, nonintegrable systems that conserve only energy are expected to show the Wigner-Dyson statistics. We note that if the nonintegrable system has unitary symmetries, the Hamiltonian is block-diagonalized and the level repulsions become unclear because eigenstates from different symmetry sectors are uncorrelated. In that case, by restricting the symmetry sectors, the Wigner-Dyson distributions are obtained within the sectors [147].

We comment on the analogy of the level statistics of Gaussian random matrices and nonintegrable systems. We have seen that the level-spacing distributions are similar between the two, but they only measure level correlations in local energy scale. On the other hand, if we consider quantities related to the global energy scale such as the level density $\rho(E)=\frac{1}{D}\sum_{\alpha=1}^{D}\delta(E-E_{\alpha})$ , these two are different. In fact, the Gaussian random matrices predict the following the “semicircle law” [148, 146, 144]:

[TABLE]

where $\overline{\cdots}$ denotes the ensemble average, and $\lambda=\sqrt{D\overline{|H_{ij}|^{2}}}$ for the GUE. This expression is not valid in realistic nonintegrable systems. One of the reasons for the discrepancy is that the Hamiltonian in physical systems consists of few-body and local interactions, unlike the Hamiltonian of Gaussian random matrices. Some other random matrices are proposed to deal with such physical structures [146], but we will not discuss them because it is difficult to analyze them in general (see, however, Subsection 3.2.3).

Matrix elements from the viewpoint of RMT

Next, let us examine how the eigenstates of random matrices predict matrix elements of observables, following the discussion similar to Ref. [41]. We calculate the ensemble average of diagonal and off-diagonal matrix elements of an observable $\hat{O}=\sum_{i}a_{i}\ket{a_{i}}\bra{a_{i}}$ . The matrix elements can be written as

[TABLE]

where $U_{\alpha i}:=\braket{E_{\alpha}}{a_{i}}$ denotes a basis transformation. Let us assume that the Hamiltonian belongs to the GUE and that the observable is fixed. Then, it is known that $U$ is distributed uniformly with respect to the unitary Haar measure (see Appendix A). In this case, we have the following moments of $U$ [146]:

[TABLE]

where $\overline{\cdots}$ denotes the ensemble average (the average with respect to the unitary Haar measure), and $d$ denotes the dimension of the matrices. These lead to the following average and the variance of the matrix elements:∥∥∥We note that in Ref. [41] the variance is different from our calculations because the authors of Ref. [41] ignore some correlations such as in Eq. (3.21), which contribute to the lowest-order term after the summation of $i$ and $j$ .

[TABLE]

and

[TABLE]

If $d$ is sufficiently large, matrix elements are written as

[TABLE]

where $R_{\alpha\beta}$ is a random variable that satisfies $\overline{R_{\alpha\beta}}=0$ and $\overline{|R_{\alpha\beta}|^{2}}=1$ . Note that the second term in Eq. (3.25) is much smaller than the first term due to the factor $\frac{1}{\sqrt{d}}$ .

Though the discussion above is based on the ensemble average, we can reinterpret this as the spectral average, if we assume the ergodicity of random matrices for a function $g$ of the matrix elements. Here we assume that we make samplings from the eigenstates in some Hilbert space $\mathcal{H}_{s}$ with $\dim[\mathcal{H}_{s}]=d_{s}\>(1\ll d_{s}\leq d)$ . We define $\mathcal{T}$ as a set of labels of the eigenstates in $\mathcal{H}_{s}$ . In this case, the ergodicity states that, for most of the fixed Hamiltonians randomly sampled from the ensemble, we have

[TABLE]

where

[TABLE]

and

[TABLE]

denotes the spectral average for the diagonal and the off-diagonal matrix elements, respectively. The ergodicity is proven for a wide class of $g$ [146], so we will assume it in the following discussions. Then, Eq. (3.25) can be regarded as the matrix elements for a fixed Hamiltonian which fluctuate from eigenstates to eigenstates, satisfying $\braket{R_{\alpha\alpha}}_{\mathcal{T}}=\braket{R_{\alpha\beta}}_{\mathcal{TT}}=0$ and $\braket{}{R_{\alpha\alpha}}{{}^{2}}_{\mathcal{T}}=\braket{}{R_{\alpha\beta}}{{}^{2}}_{\mathcal{TT}}=1$ . Note, however, that the Hermiticity of the observable requires that $R_{\alpha\beta}=R_{\beta\alpha}^{*}$ .

Relations to nointegrable systems

The statistics of energy eigenstates and that of matrix elements of observables in physical systems have been investigated especially in systems that have classical counterparts. In Ref. [135], Berry conjectured that, in quantum systems whose classical counterparts are chaotic, an energy eigenfunction $\psi_{\alpha}(\mathbf{x})$ is approximated as a Gaussian random function of $\mathbf{x}$ . He also suggested that such a Gaussian structure does not arise in systems whose classical counterparts are integrable. As Peres [118] and Srednicki [16] pointed out, the randomness of the eigenstates in quantum chaotic systems leads to the randomness of matrix elements of observables. In particular, Srednicki replaced the statistics of $\mathbf{x}$ used by Berry with the spectral statistics within the energy shell******In Srednicki’s 1994 paper, he used the term “eigenstate ensemble” for the statistics. However, his subsequent papers [149, 138] suggest that he also considered the spectral statistics. and derived the ETH for momentum distributions of a single particle in a dilute gas.

We expect that the RMT predictions for matrix elements of an observable $\hat{\mathcal{O}}$ also apply to general nonintegrable systems in analogy with the BGS conjecture. In this case, we may have to consider a sufficiently narrow energy shell in applying Eq. (3.25). We thus consider projecting an observable onto an energy shell $\mathcal{H}_{\mathrm{sh}}$ as $\hat{O}:=\mathcal{\hat{P}}_{\mathrm{sh}}\hat{\mathcal{O}}\mathcal{\hat{P}}_{\mathrm{sh}}=\sum_{i=1}^{d_{\mathrm{sh}}}a_{i}\ket{a_{i}}\bra{a_{i}}$ , where $d_{\mathrm{sh}}=\dim[\mathcal{H}_{\mathrm{sh}}]$ . Here $\mathcal{H}_{\mathrm{sh}}$ is a Hilbert space spanned by the energy eigenstates $\left\{\ket{E_{\alpha}}\right\}_{\alpha\in\mathcal{T}_{\mathrm{sh}}}$ , where

[TABLE]

and $\mathcal{\hat{P}}_{\mathrm{sh}}$ is a projection onto $\mathcal{H}_{\mathrm{sh}}$ . In nonintegrable systems, the transformation $U$ between $\left\{\ket{E_{\alpha}}\right\}_{\alpha\in\mathcal{T}_{\mathrm{sh}}}$ and $\left\{\ket{a_{i}}\right\}$ is expected to be so complex that we can conjecture that the RMT model Eq. (3.25) applies within this energy shell. Rewriting Eq. (3.25), we obtain

[TABLE]

where

[TABLE]

Note that $\omega_{\mathrm{sh}}$ is expected to be determined from the Hamiltonian and the observables. In numerics, we take samplings from $d_{s}\>(1\ll d_{s}\leq d_{\mathrm{sh}})$ energy eigenstates that satisfy $\mathcal{T}=\left\{\alpha:|E-E_{\alpha}|<\omega_{s}(<\omega_{\mathrm{sh}})\right\}$ .

The important point in Eq. (3.32) is that RMT predicts that nonintegrable systems have matrix elements with the following properties:

The diagonal matrix elements fluctuate around $\braket{\hat{\mathcal{O}}}_{\mathrm{sh}}(E_{\alpha})$ , and the off-diagonal matrix elements fluctuate around zero. The fluctuations decrease exponentially with the system size (because of the factor $\frac{1}{\sqrt{d_{\mathrm{sh}}}}$ ). 2. 2.

The statistics of the fluctuations is the same for any choice of $\alpha$ and $\beta$ within the energy shell. 3. 3.

For the GUE, the ratio $r$ of variances between diagonal and off-diagonal matrix elements is universally one. (We will see in Chapter 4 that the change of the symmetry class leads to different values of $r$ .)

The ETH and its finite-size corrections

Though Eq. (3.32) only concerns the matrix elements within the energy shell $\mathcal{H}_{\mathrm{sh}}$ , Srednicki [138] predicted that the matrix elements for the entire spectrum can be written down as

[TABLE]

where $E:=\frac{E_{\alpha}+E_{\beta}}{2},\omega:=E_{\alpha}-E_{\beta},O_{\mathrm{m}}(E):=\braket{\hat{\mathcal{O}}}_{\mathrm{sh}}(E)=\mathcal{O}_{\mathrm{m}}(E/V)$ (see Chapter 2), $S(E)$ is the microcanonical entropy at energy $E$ , and $f(E,\omega)$ is a mildly varying function of $E$ and $\omega$ . In addition, Hermiticity requires that $R_{\alpha\beta}=R_{\beta\alpha}^{*}$ and $f(E,\omega)={f(E,-\omega)}^{*}$ . The factor $e^{-S(E)/2}$ ensures that the second term vanishes exponentially with the system size, which is also the case in Eq. (3.32).††††††We note that the number of states $e^{S(E)}$ has some ambiguity because it depends on the width of the energy shell. In this thesis, we do not care about the exact value of $e^{S(E)}$ and just notice that $e^{S(E)}$ is expected to increase exponentially with the system size. Thus, if $\mathrm{Prob}[|R_{\alpha\beta}|\gg 1]$ is sufficiently small, the ETH is satisfied. In other words, the second term describes the finite-size corrections from the ETH. We also note that RMT requires that $f(E,\omega)$ is almost constant for $\omega<\omega_{\mathrm{sh}}$ .

Although Eqs. (3.32) and (3.35) can be tested in general quantum many-body systems, there are less numerical or analytical studies on them than the ETH (for numerical studies, see Sec. 3.3). In particular, it is not yet clear how universally Eqs. (3.32) and (3.35) describe the matrix elements of observables in nonintegrable systems. We will investigate these problems in Chapter 4.

3.2.3 Argument by Deutsch

Finally, we briefly explain the essence of Deutsch’s argument, following his original paper [25] and recent developments [150, 151]. In his formulation, we model a nonintegrable system as a Hamiltonian $\hat{H}$ , which can be written as an integrable Hamiltonian $\hat{H}_{0}$ plus an integrability-breaking perturbation $\hat{V}$ :

[TABLE]

For example, we can consider a situation in which $\hat{H}$ is the Hamiltonian of a non-interacting gas and $\hat{V}$ is a weak interaction between the particles. Let $\ket{n}{}_{0}$ be an eigenstate of $\hat{H}_{0}$ with an eigenvalue $E_{m}^{0}$ . Then

[TABLE]

is expected to be a sparse, banded matrix whose matrix elements rapidly decay with increasing $|n-m|$ . We will treat $V_{nm}^{0}$ as being sampled from an ensemble of random matrices that imitate certain physical properties such as the banded structure or sparsity. If we can show the ETH for almost all $\hat{V}$ , then we expect that it is true for physical perturbations, which is the spirit of the typicality (this is similar to what we saw in Subsection 3.2.1).

The randomness of $\hat{V}$ leads to the randomness of $\ket{E_{\alpha}}$ which is the eigenstate of $\hat{H}$ . Thus the transformation of the basis between perturbed and unperturbed eigenstates

[TABLE]

is also randomized. We define matrix elements of $\hat{\mathcal{O}}$ in the basis of the unperturbed energy eigenstates

[TABLE]

in addition to those in the basis of the perturbed eigenstates, $\mathcal{O}_{\alpha\beta}:=\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\beta}}$ . Note that $\mathcal{O}_{nm}^{0}$ is a non-random quantity. Using $\mathcal{U}$ , these matrix elements can be related to each other as

[TABLE]

In order to justify the ETH (for diagonal matrix elements) for most $\hat{V}$ , we have to show that $\mathcal{O}_{\alpha\alpha}-\mathcal{O}_{\beta\beta}$ is sufficiently small for most of the $\hat{V}$ ’s, if $E_{\alpha}$ and $E_{\beta}$ are close to each other (note that we do not consider an explicit energy shell in contrast to the previous subsections). In Ref. [150], Reimann justified this by the following two steps. First, he showed that the difference between the expectation values with respect to neighboring eigenstates

[TABLE]

is sufficiently small, where $\braket{\cdots}_{V}$ denotes the average over $\hat{V}$ . Second, he showed that the variance over $\hat{V}$ ,

[TABLE]

is sufficiently small. From the smallness of Eq. (3.41), we can say that $|\braket{\mathcal{O}_{\alpha\alpha}}_{V}-\braket{\mathcal{O}_{\beta\beta}}_{V}|$ is small if $|\alpha-\beta|$ is sufficiently small. Moreover, $\mathcal{O}_{\alpha\alpha}\simeq\braket{\mathcal{O}_{\alpha\alpha}}_{V}$ from the smallness of Eq. (3.42) for almost all $\hat{V}$ ’s, so $\mathcal{O}_{\alpha\alpha}\simeq\mathcal{O}_{\beta\beta}$ is concluded.

In order to prove that Eq. (3.41) and Eq. (3.42) are small, we need to calculate the moments of $\mathcal{U}$ ,

[TABLE]

Calculating the averages over a banded random matrix $\hat{V}$ is much more difficult than calculating the full random matrix average that we used in the previous subsections. Therefore, here we only mention some known results for the second moments. These results enable us to believe that Eq. (3.41) is small. For simplicity, we model a banded random matrix $\hat{H}=\hat{H}_{0}+\hat{V}$ as follows. Diagonal elements can be written as $H_{nn}^{0}=E_{n}^{0}=n\delta$ , where $\delta$ is a mean level spacing. Off-diagonal matrix elements are random and banded: we have $\braket{H^{0}_{nm}}_{V}=\braket{V^{0}_{nm}}_{V}=0$ , and

[TABLE]

where $v$ is the strength of the perturbation and we assume that a cutoff of the band is determined by the temperature $T^{-1}=\frac{\partial S}{\partial E}$ . In this model, if the dimension of the matrix is infinitely large, we can expect that the second moment of $\mathcal{U}$ takes the following Breit-Wigner form [25, 152, 40]‡‡‡‡‡‡The exact forms of $\braket{}{\mathcal{U}_{\alpha n}}{{}^{2}}_{V}$ slightly differ depending on models one assumes. However, it is expected that the following discussions are qualitatively correct for such models as well.

[TABLE]

for a relatively small $v$ (i.e., $\delta\ll\frac{2\pi v^{2}}{\delta}\ll T$ ). Here we note that $u_{2}(x)$ takes the maximum value at $x=0$ :

[TABLE]

Moreover, $u_{2}(x)$ monotonically increases (decreases) with $x$ when $x<0\>(x>0)$ .

Now, assuming Eq. (3.2.3), we consider the smallness of Eq. (3.41). We also assume that other second moments and a first moment vanish. First, we note

[TABLE]

Then

[TABLE]

Since $u_{2}(0)$ is expected to be small, Eq. (3.41) is small. We remark that the smallness of Eq. (3.42) or the ETH for the off-diagonal matrix elements can also be justified using higher moments of $\mathcal{U}$ [150].

3.3 Numerical simulations of the ETH

Here, we review some recent numerical simulations that investigate the ETH and its finite-size corrections for few-body operators in quantum many-body systems. First, we show some results found in Ref. [147] which discusses the relation between random matrices and nonintegrable systems of hardcore bosons or spinless fermions. Then, we review the previous works on the ETH and its finite-size corrections both for diagonal matrix elements and off-diagonal matrix elements.

3.3.1 Level-spacing statistics of hardcore-particle systems

In Ref. [147], Santos and Rigol demonstrate that the level-spacing distributions change in the course of integrable-nonintegrable transitions in systems of hardcore bosons or spinless fermions. Let us consider hardcore bosons here (the results are almost the same for spinless fermions). They consider $M$ hardcore bosons on a one-dimensional lattice with $N$ sites. The Hamiltonian of the system is

[TABLE]

where $\hat{b}_{i}$ is an annihilation operator of a hardcore boson at the site $i$ , and the periodic boundary condition is imposed. If $t^{\prime}=J^{\prime}=0$ , $\hat{H}$ is integrable since $\hat{H}_{0}$ can be mapped to the spin 1/2 XXZ model. If $\hat{V}$ becomes comparable to $\hat{H}_{0}$ , $\hat{H}$ is expected to be nonintegrable.

Santos and Rigol investigate the level-spacing distibutions $P(s)$ for various values of $t^{\prime}=V^{\prime}$ (by setting $t=V=1$ ). Since the system is translationally invariant, the Hamiltonian is decomposed into several quasi-momentum sectors. This means that we have to calculate the level spacings for each sector, not for the entire spectrum.******In fact, there are some sectors that have further symmetries. For example, the sector with zero quasi-momentum has a parity symmetry. We avoid using such sectors to obtain the level distributions. The obtained level-spacing distributions are shown in Fig. 3.4. We can see that, as $t^{\prime}$ becomes larger, $P(s)$ changes from the Poisson distribution $P_{\mathrm{P}}(s)$ to the Wigner-Dyson distribution $P_{\mathrm{GOE}}(s)$ . This result clearly shows that the nonintegrability of the system is well captured by the level-spacing distributions predicted by RMT. We note that the obtained distribution $P_{\mathrm{GOE}}(s)$ reflects the time-reversal symmetry of the Hamiltonian in Eq. (3.3.1).

3.3.2 Diagonal matrix elements

The ETH for diagonal matrix elements has extensively been investigated recently especially for few-body observables. After the notable work by Rigol [26] that uses hardcore bosons on a 2D lattice, the ETH has been numerically verified in various nonintegrable systems, including systems with spinless [27] or spinful [140] fermions, interacting spin chains [30], and Bose-Hubbard models [139]. It has also been known that the ETH breaks down in integrable [26] and MBL systems [37]. Recently, the coexistence of the energy ranges that do or do not satisfy the ETH is also gathering attention. Such phenomena are expected to be observed in the mobility edge of MBL systems [76], excited-state quantum phase transitions in Dicke and other models [153], and spontaneous symmetry breaking in 2D transverse Ising models [34, 154].

In Ref. [31], the authors investigate the ETH and its finite-size corrections for diagonal matrix elements of few-body operators. They consider a ladder composed of $(L=2p+1)$ spins with neighboring XXZ interactions (see the inset of Fig. 3.5). In the figure, the interactions of the dotted bonds are $\lambda$ times stronger than those of the solid bonds. If $\lambda=0$ , the ladder is decoupled to two spin chains and thus integrable. We also note that $\lambda\rightarrow\infty$ again makes the system integrable. If $\lambda\sim 1$ , the ladder becomes nonintegrable. Since the total magnetization is conserved, they use the fixed sector with $N_{\uparrow}=p$ up spins.

In Fig. 3.5, the diagonal matrix elements $A_{\alpha\alpha}=\braket{E_{\alpha}}{\hat{S}_{2}^{z}}{E_{\alpha}}$ are plotted for all of the eigenstates as a function of $E_{\alpha}/L$ . The upper row is for $\lambda=0$ (integrable) and the lower row is for $\lambda=1$ (nonintegrable). This figure shows that if we increase the system, the fluctuations of $A_{\alpha\alpha}$ rapidly decay for the entire spectrum (except for the edge) only when the system is nonintegrable. The authors in Ref. [31] investigated the (spectral) variance of the matrix elements and showed that it decays proportionally to $\frac{1}{\sqrt{D}}$ , where $D$ is the dimension of the Hilbert space. In another work [30], the Gaussian distribution of the fluctuations for the diagonal matrix elements of current operators has been reported using nonintegrable spin chains. This means that $R_{\alpha\alpha}$ in Eq. (3.35) obeys a Gaussian distribution for these operators.

3.3.3 Off-diagonal matrix elements

The ETH for off-diagonal matrix elements in many-body quantum systems has been less investigated than the ETH for diagonal matrix elements. We note, though, that it was already pointed out in Rigol’s paper [26] that the off-diagonal terms are very small.

In Ref. [33], the authors investigate the ETH and its finite-size corrections for off-diagonal matrix elements of few-body operators, similarly to Ref. [31]. They use the same model as shown in Fig. 3.5 and investigate off-diagonal matrix elements.

In Fig. 3.6, the off-diagonal matrix elements $|A_{\alpha\beta}|=|\braket{E_{\alpha}}{\hat{S}_{2}^{z}\hat{S}_{p+2}^{z}}{E_{\beta}}|$ are plotted for all of the eigenstates as a function of $E_{\alpha}$ and $E_{\beta}$ . The left figure is for $\lambda=0.5$ (nonintegrable) and the right figure is for $\lambda=5$ (near-integrable). If the system is nonintegrable, the behavior of the matrix elements seems to change mildly with the change of the energy. In other words, within a small energy shell $\left\{(E_{\alpha},E_{\beta}):|E_{\alpha}-E_{1}|<\omega_{s,1},|E_{\beta}-E_{2}|<\omega_{s,2}\right\}$ , typical magnitude of $A_{\alpha\beta}$ ’s seems constant. We note that the entire structure depends on the global energy, such as the bandlike structure as a function of $|E_{\alpha}-E_{\beta}|$ as shown in Fig. 3.6. On the other hand, if the system is integrable, we can see the block-like structure. The authors in Ref. [31] showed that the variance of the matrix elements decays proportionally to $\frac{1}{\sqrt{D}}$ only in the nonintegrable case, as is the case with diagonal matrix elements. They also found the Gaussian distributions of the matrix elements within the small energy shells in that case (similar results are found in Ref. [30]). This means that $R_{\alpha\beta}$ in Eq. (3.35) obeys a Gaussian distribution for these operators. We note that the Gaussian distributions have been investigated using systems that have classical counterparts in Refs. [136, 137].

Several studies have investigated off-diagonal matrix elements motivated by Eq. (3.35) [141, 41, 142, 155]. In Ref. [141], the authors have investigated the $\omega$ -dependence of off-diagonal matrix elements (namely, the behavior of $f(E,\omega)$ in Eq. (3.35)). The important findings are that $f(E,\omega)$ has a plateau for $\omega\lesssim\omega_{\mathrm{sh}}$ , which indicates the validity of Eq. (3.35), and that $f(E,\omega)$ rapidly decays when $\omega$ is large. We note, however, that this result is not enough to justify the mechanism of RMT in the energy shell. For example, it has been reported [30] that the variances of diagonal and off-diagonal matrix elements of current operators do not seem to be related to each other, contrary to the prediction of RMT. The mechanism and the justification of Eq. (3.35) (or Eq. (3.32)) have been still under investigation.

3.4 Weak ETH

We have mainly discussed the ETH that requires Eq. (2.50) for every eigenstate within certain energy ranges, which is sometimes called a strong ETH. However, some authors investigate a bit weaker statement, which is sometimes called a weak ETH [139, 156]. The weak ETH states that, the variance of diagonal matrix elments*†††The weak ETH is mostly discussed only for diagonal matrix elements. within some energy shell vanishes in the thermodynamic limit:

[TABLE]

The condition in Eq. (3.50) implies that most of the eigenstates satisfy

[TABLE]

because of Chebyshev’s inequality.

Although the strong ETH holds true only in nonintegrable systems, the weak ETH holds true for a wider class of systems. Several numerical studies show that the weak ETH holds true even in interacting integrable systems [55, 60]. In this case, $\Delta\mathcal{O}_{\mathrm{d}}$ decreases as a polynomial with system size $N$ , in contrast to nonintegrable systems, where $\Delta\mathcal{O}_{\mathrm{d}}$ is expected to decrease exponentially with $N$ . Moreover, it is rigorously shown that, in certain systems (e.g., 1D short-range interacting lattice systems) and for macroscopic observables that can be written as the average of local operators, an exponentially small fraction of the energy eigenstates violates Eq. (3.51) [132] (see Sec. 2.4). This is a refined statement of the weak ETH.

We note that the weak ETH does not justify thermalization from all initial states, unlike the strong ETH [139, 132]. When the weak ETH holds true and the strong ETH does not, we can find an eigenstate $\ket{E_{\alpha}}$ that violates Eq. (3.51) even in the thermodynamic limit. Then, if we take an initial state that has a peak at $\rho_{\alpha\alpha}$ , we expect to obtain a non-thermal stationary state. A crucial point here is that, for integrable systems, we can actually prepare such initial states that do not relax to thermal equilibrium with physically accessible protocols. Therefore, the relation between the weak ETH and thermalization is not simple.111As reviewed in Sec. 2.4, we impose the condition on initial states, namely Eq. (2.65), for systems to approach thermal equilibrium. This condition does not seem to hold true for integrable systems.

3.5 Summary and remarks

Let us summarize this chapter and make some critical comments. Though the rigorous proofs and definite criteria for the validity of the ETH have hardly been found, possible analytical explanations and numerical simulations provide clues for understanding the mechanisms of the ETH. We have reviewed such explanations in Sec. 3.2 and numerical simulations in Sec. 3.3. In particular, we explained the analogies between nonintegrable systems and random matrices in Subsection 3.2.2.

If we assume the analogy for matrix elements of observables, it predicts the ETH and its finite-size corrections within a small energy shell (see Eq. (3.32)), but the validity of this analogy is nontrivial. In fact, there are not many verifications of this analogy in actual nonintegrable systems: even though some numerical studies have investigated the finite-size fluctuations of the matrix elements in nonintegrable systems, their relations to the RMT predictions are not clear [30, 31, 33]. It is also unclear how the RMT prediction is relevant to other previous works concerning the validity of the ETH. For example, few-body properties of observables are often stressed as the validity of the ETH*‡‡‡Such arguments are made as follows: small subsystems can be regarded as being thermal through the quantum entanglement of the energy eigenstate; it is related to the ETH in the form of microscopic thermal equilibrium (MITE).; however, the relation to the RMT predictions remains to be clarified, as we did not use such properties in Subsection 3.2.2.

Chapter 4 Observable-dependence of how random matrix theory can predict deviations from the ETH

4.1 Motivations

The ETH is the possible scenario for thermalization as we have seen in Chapter 2 and it is expected to be related to RMT as reviewed in Chapter 3. If we assume the analogy between nonintegrable systems and RMT for matrix elements of observables, it predicts not only the ETH but also its finite-size corrections within a small energy shell (see Eqs. (3.32) and (3.35)).

However, the conjecture of Eq. (3.35) is not well verified in actual nonintegrable systems; matrix elements in such systems have been investigated, but the evidences of the RMT conjecture are few. For example, the Gaussian distributions found in Refs. [30, 31, 33] have not been attributed to the conjecture of RMT, yet. Moreover, the result of Ref. [30] does not seem consistent with the RMT conjecture. One of the reasons of the lack of the evidences is that the statistics $R_{\alpha\beta}$ in Eq. (3.35) has not been completely obtained yet.

To clarify to what extent RMT can predict actual situations, we should first refine and generalize Eq. (3.35) (or Eq. (3.32)) to make it applicable for a wide variety of situations, and then verify it by thorough numerics. We especially consider how the conjecture of RMT is influenced by observables we take, which has not been well investigated. Since the distributions of the matrix elements are determined not by $\ket{E_{\alpha}}$ alone but by the transformation of the basis $U_{\alpha i}=\braket{E_{\alpha}}{a_{i}}$ and the behavior of $\{a_{i}\}$ , the change of observables may alter the statistics of finite-size deviations of the ETH. We thus need to generalize the RMT prediction for arbitrary observables as well as for Hamiltonians (we note that previous studies focused only on nonintegrability of the Hamiltonians [31, 33, 142]).

The importance of observables also suggests that the nonintegrability of the Hamiltonian is not enough to justify that the matrix elements in the actual systems are predicted by those of RMT. Actually, we can easily find an observable for which the RMT conjecture and the ETH break down even in nonintegrable systems. To see this, take an energy eigenstate $\ket{E_{\delta}}$ of the Hamiltonian of the system and define $\hat{\mathcal{O}}=\ket{E_{\delta}}\bra{E_{\delta}}$ . Then, we trivially obtain

[TABLE]

Since $\braket{\hat{\mathcal{O}}}_{\mathrm{sh}}(E_{\delta})\rightarrow 0$ is expected in the thermodynamic limit, the ETH does not hold true in this case. Moreover, we can find an observable for which the ETH is not valid in a sufficiently large subsystem.*** This is understood by the following example. We take a one-dimensional spin 1/2 system with $N\gg 1$ sites with a local Hamiltonian. Consider a subsystem $\mathcal{M}$ with $M(>N/2)$ sites and a reduced density matrix of an energy eigenstate $\ket{E_{\alpha}}$ as $\hat{\rho}_{\mathcal{M}}=\mathrm{Tr}_{\mathcal{M}^{c}}[\ket{E_{\alpha}}\bra{E_{\alpha}}]$ . If all observables on $\mathcal{M}$ satisfied the ETH, $\hat{\rho}_{\mathcal{M}}$ would be written as

$\displaystyle\hat{\rho}_{\mathcal{M}}=\mathrm{Tr}_{\mathcal{M}^{c}}\left[\frac{e^{-\beta\hat{H}}}{Z}\right]\simeq\frac{e^{-\beta\hat{H}_{\mathcal{M}}}}{Z_{\mathcal{M}}}\>\>\>(\mathrm{wrong}),$

(4.2)

where $\hat{H}_{\mathcal{M}}$ denotes a Hamiltonian restricted onto $\mathcal{M}$ , $Z=\mathrm{Tr}[e^{-\beta\hat{H}}]$ and $Z_{\mathcal{M}}=\mathrm{Tr}_{\mathcal{M}}[e^{-\beta\hat{H}_{\mathcal{M}}}]$ . The first equality is the ETH in the MITE form and the second approximation comes from the locality of the Hamiltonian. Thus, this equation would lead to an extensive von Neumann entropy $S_{\mathrm{vN}}(\hat{\rho}_{\mathcal{M}})\simeq S_{\mathrm{vN}}(\frac{e^{-\beta\hat{H}_{\mathcal{M}}}}{Z_{\mathcal{M}}})\propto M$ . However, the property of pure states leads to $S_{\mathrm{vN}}(\hat{\rho}_{\mathcal{M}})=S_{\mathrm{vN}}(\hat{\rho}_{\mathcal{M}^{c}})\leq(N-M)\ln 2$ . We can see an apparent contradiction of these two representations by taking $M=\frac{3}{4}N$ . This contradiction arises from the false assumption of the ETH.

The unsolved question is to what type of observables the RMT conjecture does not apply. Such observables might be many-body observables, or something else.

In this chapter, we show that RMT can predict the finite-size corrections of the ETH in nonintegrable systems and for a wide class of observables, including many-body operators. In Section 4.2, we first refine and generalize the finite-size corrections of the ETH from the random matrix model. We will especially see that that the ratios between standard deviations of diagonal and off-diagonal matrix elements become universal ones that depend only on anti-unitary symmetries of the Hamiltonian and those of the observable. We also show that the probability densities of off-diagonal matrix elements obey the statistics that is determined by what we call singularity of observables as well as anti-unitary symmetries. In Section 4.3, we numerically investigate matrix elements of various observables in nonintegrable systems that only conserve energy. We will demonstrate that the finite-size corrections of the ETH are in excellent agreement with the predictions of RMT for a wide class of observables with various symmetries, including many-body correlations and singular operators. We also remark, though, that counterexamples always exist even for simple observables. We compare previous studies and our results in Fig. 4.1.

4.2 Statistics of the finite-size corrections of the ETH for the random matrix model

In this section, we make a refined RMT conjecture about the matrix elements within an energy shell, focusing on a change of the statistics $R_{\alpha\beta}$ . By calculating the ratios of standard deviations between diagonal and off-diagonal matrix elements, we show that RMT predicts the universal ratios that depend only on anti-unitary symmetries of the Hamiltonian and those of the observable. Next, we examine the probability densities of the off-diagonal matrix elements, and find that RMT predicts Gaussian statistics for a wide class of observables in consistent with previous numerical studies [30, 33] (in the case of the GOE), but that it predicts other statistics if observables are “singular.” We summarize our results and conjectures in Fig. 4.2 (see Subsection 4.2.3).

4.2.1 Universal ratios between diagonal and off-diagonal matrix elements

First, let us consider the ratio of standard deviations between diagonal and off-diagonal matrix elements that is defined as

[TABLE]

where

[TABLE]

If the Hamiltonian has Kramers degeneracies, the degenerate eigenstates can be written as $\ket{E_{\alpha}}$ and $\ket{\tilde{E_{\alpha}}}:=\hat{T}\ket{E_{\alpha}}$ . Because we can consider $\mathcal{O}_{\alpha\tilde{\alpha}}:=\braket{E_{\alpha}}{\hat{\mathcal{O}}}{\tilde{E_{\alpha}}}$ in this case, we introduce the corresponding ratio as†††We note that in the case of the GSE, we have a freedom to choose two orthogonal energy eigenstates in the Kramers degenerate space. In the numerical calculation in Sec. 4.3, we use two eigenstates that are directly obtained by the exact-diagonalization programming.

[TABLE]

where

[TABLE]

We assume that the symmetry of the system is at most one anti-unitary symmetry, the corresponding operator of which commutes with the Hamiltonian.‡‡‡In other words, we assume that the system has no unitary symmetry, whose corresponding operator (anti)commutes with the Hamiltonian. We also assume that the system has no anti-unitary symmetry, the corresponding operator of which anticommutes with the Hamiltonian. Then, the Hamiltonian in RMT belongs to the GUE, the GOE, or the GSE. The Hamiltonian that belongs to the GOE has an anti-unitary symmetry $\hat{T}$ that satisfies $\hat{T}^{2}=1$ . In contrast, the Hamiltonian that belongs to the GSE has an anti-unitary symmetry $\hat{T}$ that satisfies $\hat{T}^{2}=-1$ . In these two cases, we can consider two types of observables that satisfy either $\hat{T}\hat{\mathcal{O}}\hat{T}^{-1}=\hat{\mathcal{O}}$ or $\hat{T}\hat{\mathcal{O}}\hat{T}^{-1}=-\hat{\mathcal{O}}$ . We will call the former and latter observables as the even and odd operators, respectively.§§§Though operators that are neither even nor odd exist, we will not consider such operators for simplicity.

RMT predicts that $r$ and $r^{\prime}$ become universal values that depend only on the symmetries of the Hamiltonian and those of the observable. If the Hamiltonian belongs to the GUE, $r_{\mathrm{GUE}}\rightarrow 1\>(d\rightarrow\infty)$ is expected as we have seen in Sec. 3.2.2. However, we will show that the change in the symmetry affects the values of $r$ and $r^{\prime}$ as illustrated in the upper left table in Fig. 4.2.

The GOE

First consider the case in which the Hamiltonian belongs to the GOE and the observables are even under $\hat{T}$ . We assume that neither the Hamiltonian nor the observable has a degeneracy. As in Sec. 3.2.2, we define $\hat{O}=\sum_{i=1}^{d}a_{i}\ket{a_{i}}\bra{a_{i}}$ . We note that we can assume that $\hat{T}\ket{a_{i}}=\ket{a_{i}}$ and $\hat{T}\ket{E_{\alpha}}=\ket{E_{\alpha}}$ without loss of generality.111Let us consider $\ket{E_{\alpha}}$ . Since $\hat{H}\hat{T}\ket{E_{\alpha}}=\hat{T}\hat{H}\ket{E_{\alpha}}=E_{\alpha}\hat{T}\ket{E_{\alpha}}$ and no degeneracy exists, we obtain $\hat{T}\ket{E_{\alpha}}=e^{i\theta}\ket{E_{\alpha}}$ for some $\theta\in[0,2\pi)$ . If we redefine the eigenstate as $\ket{E_{\alpha}^{\prime}}:=e^{i\theta/2}\ket{E_{\alpha}}$ , we obtain $\hat{T}\ket{E_{\alpha}^{\prime}}=\ket{E_{\alpha}^{\prime}}$ . Then the matrix elements can be taken as being real because

[TABLE]

where $(\vec{a},\vec{b})$ denotes an inner product of $\vec{a}$ and $\vec{b}$ and we have used $(\hat{T}\vec{a},\hat{T}\vec{b})^{*}=(\vec{a},\vec{b})$ .

In this case, we can assume that the basis transformation $U_{\alpha i}:=\braket{E_{\alpha}}{a_{i}}$ is distributed uniformly with respect to the orthogonal Haar measure from RMT.222This reflects the fact that $U_{\alpha i}$ can be taken as being real: $\braket{E_{\alpha}}{a_{i}}=(\hat{T}\ket{E_{\alpha}},\hat{T}\ket{a_{i}})^{*}=\braket{E_{\alpha}}{a_{i}}^{*}$ . Some of the moments of $U$ can be written as

[TABLE]

Using Eqs. (4.9)-(4.12), $r_{\mathrm{GOE,even}}=\sqrt{2}$ is obtained as follows. The averages and the variances of diagonal and off-diagonal matrix elements can be calculated as

[TABLE]

which can be obtained with calculations similar to those made in Sec. 3.2.2. If we assume the ergodicity of random matrices, we can regard the ensemble average above as the spectral average for most of the randomly sampled Hamiltonians. Therefore, if $d$ is sufficiently large, we obtain

[TABLE]

which is different from $r_{\mathrm{GUE}}=1$ . We note that this ratio has been predicted in several studies [157, 158, 41].

Next, if the Hamiltonian belongs to the GOE and the observable is odd under $\hat{T}$ , $r_{\mathrm{GOE,odd}}=0$ is obtained. This results comes from the vanishing diagonal matrix elements:

[TABLE]

The GSE

If the Hamiltonian belongs to the GSE, it can be written as $\hat{H}=\sum_{\alpha=1}^{d/2}E_{\alpha}(\ket{E_{\alpha}}\bra{E_{\alpha}}+\ket{\tilde{E_{\alpha}}}\bra{\tilde{E_{\alpha}}})$ . Here, we note that $d$ is always even number in the GSE. Let us first consider that the observable is even. In this case, the observable can be written as $\hat{O}=\sum_{i=1}^{d}a_{i}\ket{a_{i}}\bra{a_{i}}=\sum_{i^{\prime}=1}^{d/2}a_{i^{\prime}}(\ket{a_{i^{\prime}}}\bra{a_{i^{\prime}}}+\ket{\tilde{a_{i^{\prime}}}}\bra{\tilde{a_{i^{\prime}}}})$ , where $\ket{\tilde{a_{i^{\prime}}}}:=\hat{T}\ket{a_{i^{\prime}}}$ and $\hat{O}\ket{\tilde{a_{i^{\prime}}}}=a_{i^{\prime}}\ket{\tilde{a_{i^{\prime}}}}$ . We assume that the Hamiltonian and the observable have no degeneracy except for Kramers degeneracies. By calculating (higher) moments of $\braket{E_{\alpha}}{a_{i^{\prime}}}$ and $\braket{E_{\alpha}}{\tilde{a_{i^{\prime}}}}$ using RMT, we obtain the averages and the variances of diagonal and off-diagonal matrix elements as follows (see Appendix B.3):

[TABLE]

Assuming the ergodicity of random matrices, we can regard the ensemble averages above as the spectral averages. Thus we obtain

[TABLE]

if $d$ is sufficiently large. For the ratio concerning the Kramers pair, we obtain $r^{\prime}_{\mathrm{GSE,even}}=0$ . This ratio comes from the vanishing ${O}_{\alpha\tilde{\alpha}}$ :

[TABLE]

Here we have used $\hat{T}^{2}=-1$ .

Finally, we consider the case in which the Hamiltonian belongs to the GSE and the observable is odd under $\hat{T}$ . In this case, the observable can be written as $\hat{O}=\sum_{i=1}^{d}a_{i}\ket{a_{i}}\bra{a_{i}}=\sum_{a_{i^{\prime}}>0}a_{i^{\prime}}(\ket{a_{i^{\prime}}}\bra{a_{i^{\prime}}}-\ket{\tilde{a_{i^{\prime}}}}\bra{\tilde{a_{i^{\prime}}}})$ , where $\ket{\tilde{a_{i^{\prime}}}}:=\hat{T}\ket{a_{i^{\prime}}}$ and $\hat{O}\ket{\tilde{a_{i^{\prime}}}}=-a_{i^{\prime}}\ket{\tilde{a_{i^{\prime}}}}$ . By calculating the (higher) moments of inner products such as $\braket{E_{\alpha}}{a_{i^{\prime}}}$ and $\braket{E_{\alpha}}{\tilde{a_{i^{\prime}}}}$ using RMT, we obtain the averages and the variances of matrix elements (see Appendix B.3). The averages of the matrix elements are all zero since $\sum_{i}a_{i}=0$ :

[TABLE]

For the variances, we obtain

[TABLE]

Assuming the ergodicity, we expect that

[TABLE]

if $d$ is sufficiently large.

4.2.2 Observable-dependent probability densities of the off-diagonal matrix elements

Next, we consider the probability densities of the off-diagonal matrix elements $|O_{\alpha\beta}|$ using RMT. To do this, we apply the method by Brody et al. [146] to the GUE and the GOE (in Ref. [146], the calculation is done only for the GOE).¶¶¶We will not consider the probability densities of the matrix elements in the case for the GSE for simplicity. We only consider off-diagonal matrix elements, since they are suitable for obtaining many samplings in later numerical calculations.

To calculate the probability densities of $|\braket{E_{\alpha}}{\hat{O}}{E_{\beta}}|$ , we first move $\ket{E_{\alpha}}$ uniformly in the $(d-1)$ -dimensional Hilbert space that is orthogonal to $\ket{E_{\beta}}$ and then move $\ket{E_{\beta}}$ uniformly in the $d$ -dimensional Hilbert space. We note that

[TABLE]

where $\mathcal{\hat{P}}_{(d-1)}$ is a projection operator onto a Hilbert space that is orthogonal to $\ket{E_{\beta}}$ . Since $\ket{v}:=\frac{\mathcal{\hat{P}}_{(d-1)}\hat{O}\ket{E_{\beta}}}{||\mathcal{\hat{P}}_{(d-1)}\hat{O}\ket{E_{\beta}}||}$ is a fixed normalized vector in the $(d-1)$ -dimensional Hilbert space, $\braket{E_{\alpha}}{v}$ is uniformly distributed on a high-dimensional unit sphere when we move $\ket{E_{\alpha}}$ . Consequently, the probability densities of $x:=|\braket{E_{\alpha}}{v}|^{2}$ obeys the following Porter-Thomas distribution in the large $d$ limit [146]:111The following results of the GOE hold true whether observables are even or odd. However, if observables are neither even nor odd, the results may not be valid.

[TABLE]

Since $|O_{\alpha\beta}|^{2}=||\mathcal{\hat{P}}_{(d-1)}\hat{O}\ket{E_{\beta}}||^{2}x:=s_{\beta}^{2}x$ , the probability densities of $y=|O_{\alpha\beta}|^{2}$ with a fixed $s_{\beta}$ is

[TABLE]

If we denote the probability densities of $s_{\beta}$ by $\rho_{s_{\beta}}(z)$ , the probability densities of $|O_{\alpha\beta}|^{2}$ can be written as

[TABLE]

We note that

[TABLE]

where $\hat{O}^{\prime}:=\hat{O}-\frac{1}{d}\sum_{\alpha}O_{\alpha\alpha}$ .

Nonsingular operators

Next, we consider the probability densities of $s_{\beta}$ by moving $\ket{E_{\beta}}$ . We first diagonalize $\hat{O}^{\prime}$ as

[TABLE]

where $a_{i}^{\prime}=a_{i}-\frac{1}{d}\sum_{\alpha}O_{\alpha\alpha}$ . Then

[TABLE]

The second moment of $s_{\beta}$ is thus calculated as

[TABLE]

where $\mathrm{O}(\cdot)$ denotes Landau’s symbol and we have used $\frac{1}{d}\sum_{i}a_{i}^{\prime}=0$ . Similarly, we obtain the fourth moment as

[TABLE]

From this expression, we can expect that if

[TABLE]

$s_{\beta}^{2}$ fluctuates little around the average value $\overline{s_{\beta}^{2}}=\frac{1}{d}\sum_{i=1}^{d}{a^{\prime}}_{i}^{2}$ because $\frac{\overline{s_{\beta}^{4}}-(\overline{s_{\beta}^{2}})^{2}}{(\overline{s_{\beta}^{2}})^{2}}\rightarrow 0$ . We will call $\hat{O}$ that satisfies Eq. (4.40) as nonsingular operators, following Ref. [146].

For nonsingular operators, $\rho_{|O_{\alpha\beta}|^{2}}(y)$ can be calculated by noticing $\rho_{s_{\beta}}(z)\rightarrow\delta(z-\sqrt{\mathcal{V}})$ , where $\mathcal{V}:=\overline{s_{\beta}^{2}}$ . The result is

[TABLE]

We can also write down the probability densities for $|O_{\alpha\beta}|$ as follows:

[TABLE]

where $0<y$ . Moreover, we assume that we can replace $\mathcal{V}/d$ with the spectral variance of the off-diagonal matrix elements:

[TABLE]

for sufficiently large $d$ and $d_{s}$ (see Appendix B.4). Therefore, we obtain the following expression:

[TABLE]

We note that the expression for the GOE is Gaussian, as numerically indicated in Refs. [30, 33]. We remark, though, that the probability densities of $|O_{\alpha\beta}|$ are not Gaussian for the GUE.∥∥∥It is expected that $\mathrm{Re}[O_{\alpha\beta}]$ and $\mathrm{Im}[O_{\alpha\beta}]$ independently obey Gaussian distributions. Assuming the ergodicity of random matrices, we can reconsider the probability densities in Eq. (4.44) as spectral statistics.

Singular operators

Here we consider an example that does not satisfy Eq. (4.40). The simplest example is an observable whose (modified) spectrum $\left\{a^{\prime}_{i}\right\}$ can be written as

[TABLE]

In this case,

[TABLE]

and the fluctuations of $s_{\beta}$ are not negligible. We will call such operators that do not satisfy Eq. (4.40) as singular operators, following Ref. [146].

Let us calculate the probability densities of a singular operator that can be written as follows:

[TABLE]

where $a$ is some real number. We will call this type of operators as the “most singular.” We first calculate the probability densities of $|O_{\alpha\beta}|^{2}=a^{2}|\braket{E_{\alpha}}{\psi}|^{2}|\braket{E_{\beta}}{\psi}|^{2}$ for $E_{\alpha}\neq E_{\beta}$ . If $d$ is large, $|\braket{E_{\alpha}}{\psi}|^{2}$ and $|\braket{E_{\beta}}{\psi}|^{2}$ become independent of each other [146]. Indeed, each of them follows the Porter-Thomas distributions in Eq. (4.32). The probability densities of $|O_{\alpha\beta}|^{2}$ can be calculated as

[TABLE]

where $K_{0}(0,y)=\int_{0}^{\infty}dze^{-y\mathrm{cosh}z}$ is a modified Bessel function of the second kind. Changing the variables, we obtain

[TABLE]

We note that

[TABLE]

which allows us to rewrite Eq. (4.49) as

[TABLE]

Finally, in terms of $\sigma^{2}$ , we can express Eq. (4.51) as

[TABLE]

Thus, we obtain the non-Gaussian distributions for singular operators. Assuming the ergodicity of random matrices, we can reconsider the probability densities in Eq. (4.52) as spectral statistics.

4.2.3 Conjectures from the random matrix model

We summarize the results of the matrix-element statistics for the random matrix models and clarify the conjecture about the statistics for actual nonintegrable models that conserve only energy. As we have seen in Sec. 3.2.2, the analogy between RMT and the nonintegrable systems seems to hold true in a small energy shell $\mathcal{H}_{\mathrm{sh}}$ . Thus, we consider $\hat{O}:=\hat{\mathcal{P}}_{\mathrm{sh}}\hat{\mathcal{O}}\hat{\mathcal{P}}_{\mathrm{sh}}$ for an observable $\hat{\mathcal{O}}$ in nonintegrable systems.

As is the case with the level-spacing statistics, the system is expected to be related to the random matrices with the same anti-unitary symmetry class. Hamiltonians without any anti-unitary symmetry are said to belong to “Class A,” the term adapted from the mathematical terminology. Similarly, Hamiltonians with only one anti-unitary symmetry $\hat{T}$ that satisfies $\hat{T}^{2}=1$ belong to “Class AI,” and Hamiltonians with only one anti-unitary symmetry $\hat{T}$ that satisfies $\hat{T}^{2}=-1$ belong to “Class AII.” We note that matrices in the GUE, the GOE and the GSE belong to Class A, Class AI, and Class AII, respectively (see Fig. 4.3).

As illustrated in the left column of Fig. 4.2, we conjecture that the ratios of standard deviations between diagonal and off-diagonal matrix elements within $\mathcal{H}_{\mathrm{sh}}$ become universal in nonintegrable systems, and that the universal values are determined from those of the random matrix models with the same symmetry class. We note that even and odd properties of $\hat{\mathcal{O}}$ are the same as those of $\hat{O}$ . In the language of Srednicki’s conjecture in Eq. (3.35), $\braket{}{R_{\alpha\alpha}}{{}^{2}}_{\mathcal{T}}=r^{2}\braket{}{R_{\alpha\beta}}{{}^{2}}_{\mathcal{TT}}\>(E_{\alpha}\neq E_{\beta})$ and $\braket{}{R_{\alpha\tilde{\alpha}}}{}_{\mathcal{T}}^{2}=r^{\prime 2}\braket{}{R_{\alpha\beta}}{{}^{2}}_{\mathcal{TT}}$ for $\omega<\omega_{\mathrm{sh}}$ , where $r$ and $r^{\prime}$ are determined from the symmetries of the Hamiltonian and those of the observable.

Moreover, we conjecture that the probability densities $\rho_{|\mathcal{O_{\alpha\beta}}|}(y)$ within $\mathcal{H}_{\mathrm{sh}}$ are those predicted by random matrix models, as shown in the right column of Fig. 4.2. In the language of Srednicki’s conjecture in Eq. (3.35), $R_{\alpha\beta}$ is Gaussian (or $y\times$ Gaussian) only when the observable is nonsingular for $\omega<\omega_{\mathrm{sh}}$ ; if the observable is the most singular, $K_{0}$ functions appear.

4.3 Numerical verifications of the random matrix predictions

In this section, we show some numerical results that investigate the statistics of the matrix elements of various observables in nonintegrable systems. In particular, focusing on the ratio $r,r^{\prime}$ and the statistics of off-diagonal matrix elements, we ask if the RMT predictions in the previous section hold true in actual situations.

4.3.1 Models

We first introduce one-dimensional spin chain models that contain Ising interactions, transverse fields and Dzyaloshinskii-Moriya interactions as follows:

[TABLE]

where $N$ denotes the number of spins, $\vec{D}=D\frac{1}{\sqrt{2}}(\vec{e}_{x}+\vec{e}_{z})$ , and we impose the open boundary condition. In addition, we assume that $J_{i}=J(1+\epsilon_{i})$ is a random variable that breaks the reflection symmetry of sites ( $i\rightarrow N-i$ ), where $\epsilon_{i}$ is uniformly chosen from $[-0.1,0.1]$ at each site.******As we will see in the following discussions, the randomness is sufficiently weak and no localization arises.

The model in Eq. (4.53) can be a nonintegrable model that conserves only energy by changing the strength of $\hat{H}_{\mathrm{TF}}$ and $\hat{H}_{\mathrm{DM}}$ . Further, by changing the parameter of the interactions and $N$ , nonintegrable systems that belong to Class A, AI, and AII are obtained. We note that our model is unique in a sense that all these three classes are achievable by changing only a few parameters. In the followings, we assume $J=1,h^{\prime}=-2.1h$ and consider three nonintegrable models (a), (b), and (c), which are determined by the parameters $h$ and $D$ .

First, model (a) is a model without any anti-unitary symmetry, which is obtained by taking $h=0.5$ and $D=0.9$ (i.e., model (a) belongs to Class A). If we calculate the level-spacing statistics of model (a), it obeys statistics similar to the level statistics of the GUE, $P_{\mathrm{GUE}}(s)=\frac{32s^{2}}{\pi^{2}}e^{-\frac{4s^{2}}{\pi}}$ , as shown in Fig. 4.4(i). Here, we also show the Poisson statistics $P_{\mathrm{P}}(s)=e^{-s}$ , the GOE statistics $P_{\mathrm{GOE}}(s)=\frac{\pi s}{2}e^{-\frac{\pi s^{2}}{4}}$ , and the GSE statistics $P_{\mathrm{GSE}}(s)=\frac{2^{18}s^{4}}{3^{6}\pi^{3}}e^{-\frac{64s^{2}}{9\pi}}$ for comparison (see Fig. 4.3).

Next, model (b) is a model with one anti-unitary symmetry $\hat{T}=\hat{K}\>(\hat{T}^{2}=1)$ , which is obtained by taking $h=0.5$ and $D=0$ (i.e., model (b) belongs to Class AI). Here, $\hat{K}$ denotes the complex conjugate operator. Note that $\hat{K}\hat{\sigma}_{i}^{x}\hat{K}^{-1}=\hat{\sigma}_{i}^{x},\hat{K}\hat{\sigma}_{i}^{y}\hat{K}^{-1}=-\hat{\sigma}_{i}^{y}$ , and $\hat{K}\hat{\sigma}_{i}^{z}\hat{K}^{-1}=\hat{\sigma}_{i}^{z}$ are satisfied. If we calculate the level-spacing statistics of model (b), it obeys statistics similar to the level statistics of the GOE, as shown in Fig. 4.4(ii).

Finally, model (c) is a model with one anti-unitary symmetry $\hat{T}=\hat{T}_{0}:=\left(\prod_{i=1}^{N}[i\hat{\sigma}_{i}^{y}]\right)\hat{K}$ , which is obtained by taking $h=0$ and $D=0.9$ . Since $\hat{T}_{0}^{2}=(-1)^{N}$ , model (c) belongs to Class AI if $N$ is even and Class AII if $N$ is odd. Indeed, when $N=13$ , the model obeys statistics similar to the level statistics of the GSE, as shown in Fig. 4.4(iii). On the other hand, when $N=12$ , the level statistics resembles that of the GOE, as shown in Fig. 4.4(iv).

4.3.2 Few-body observables

Using the models defined above, we first investigate matrix elements of few-body observables. We consider the $z$ -component of a spin at a certain site $\hat{\mathcal{O}}_{1}:=\hat{\sigma}^{z}_{\left[N/2\right]+1}$ and the correlation of two spins $\hat{\mathcal{O}}_{2}:=\hat{\sigma}^{z}_{\left[N\right/2]+1}\hat{\sigma}^{z}_{\left[N\right/2]+2}$ , where $[x]$ denotes the maximum integer that does not exceed $x$ . Since $\hat{\mathcal{O}}_{1}$ satisfies $\hat{K}\hat{\mathcal{O}}_{1}\hat{K}^{-1}=\hat{\mathcal{O}}_{1}$ , it is an even operator for models (a) and (b). On the other hand, since $\hat{T}_{0}\hat{\mathcal{O}}_{1}\hat{T}_{0}^{-1}=-\hat{\mathcal{O}}_{1}$ , it is odd for model (c). As for $\hat{\mathcal{O}}_{2}$ , it is an even operator for all of the models because $\hat{K}\hat{\mathcal{O}}_{2}\hat{K}^{-1}=\hat{\mathcal{O}}_{2},\hat{T}_{0}\hat{\mathcal{O}}_{2}\hat{T}_{0}^{-1}=\hat{\mathcal{O}}_{2}$ .

We first show the example of diagonal and off-diagonal matrix elements of $\hat{\mathcal{O}}=\hat{\mathcal{O}}_{1}$ for model (b) in Figs. 4.5(i)-(iv). Figure (i) shows the diagonal matrix elements $\mathcal{O}_{\alpha\alpha}$ for all of the eigenstates as a function of $E_{\alpha}$ . Similarly, Figure (ii) shows the density plot of the absolute value of the off-diagonal matrix elements as a function of $E_{\alpha}$ and $E_{\beta}$ . Both of these figures show that the behavior of the matrix elements depends on the global energy: for example, Figure (ii) shows that the typical magnitude of $|\mathcal{O}_{\alpha\beta}|$ vanishes as $|E_{\alpha}-E_{\beta}|$ becomes large. However, if we stick to some small energy shell $\mathcal{H}_{s}$ , the matrix elements $\mathcal{O}_{\alpha\beta}=O_{\alpha\beta}\>(\alpha,\beta\in\mathcal{T})$ seems to fluctuate randomly from one eigenstate to another eigenstate with a constant amplitude. Indeed, if we take an energy shell with width $2\omega_{s}=0.5$ and plot matrix elements in that energy shell, we obtain Figure (iii) for the diagonal matrix elements and Figure (iv) for the off-diagonal matrix elements as a function of $|E_{\beta}-E_{\alpha}|$ . Using the eigenstates within the energy shell, we can consider $r=\frac{\Delta\mathcal{O}_{\mathrm{d}}}{\Delta\mathcal{O}_{\mathrm{od}}},r^{\prime}=\frac{\Delta\mathcal{O}_{\mathrm{K}}}{\Delta\mathcal{O}_{\mathrm{od}}}$ and the probability densities of $\mathcal{O}_{\alpha\beta}$ .††††††We note that $\Delta\mathcal{O}_{\mathrm{d}}$ is numerically calculated from modified fluctuations where the effect of the energy shell is reduced. We make a linear fitting $O_{\mathrm{m}}(\tilde{E})=a\tilde{E}+b\>(|\tilde{E}-E|<\omega_{s})$ within a small energy shell instead of a constant $O_{\mathrm{m}}(E)$ . Then we consider the variance of $\mathcal{O}_{\alpha\alpha}-O_{\mathrm{m}}(\tilde{E}=E_{\alpha})$ .

The universal ratios

We show the results of $r$ calculated for models (a), (b), and (c) with $N=12$ as a function of energy in Figs. 4.6(i) and (ii). We have calculated standard deviations from the eigenstates within the range $[E-\omega_{s},E+\omega_{s}]$ , which is obtained by dividing the entire spectrum into $F$ regions. Here, $\omega_{s}:=\frac{E_{\mathrm{max}}-E_{\mathrm{min}}}{2F}$ ( $E_{\text{max(min)}}$ is the maximum (mimimum) energy eigenvalue) and we assume $\omega_{s}<\omega_{\mathrm{sh}}$ .222We have confirmed that the small change of $\omega_{s}$ does not affect the discussion below. Figure 4.6(i) shows the results for $\hat{\mathcal{O}}_{1}$ . For a wide range of spectrum (except for the edges), $r_{\mathrm{(a)}}\simeq 1,r_{\mathrm{(b)}}\simeq\sqrt{2}$ and $r_{\mathrm{(c)}}=0$ are obtained, where the subscript indicates the type of the model. Similarly, from the results for $\mathcal{\hat{O}}_{2}$ shown in Figure 4.6(ii), we obtain $r_{\mathrm{(a)}}\simeq 1,r_{\mathrm{(b)}}\simeq\sqrt{2}$ , and $r_{\mathrm{(c)}}\simeq\sqrt{2}$ . These results are consistent with the RMT conjecture that predicts $r_{\mathrm{A,even}}=1,r_{\mathrm{AI,even}}=\sqrt{2},r_{\mathrm{AI,odd}}=0$ .

We next show the results of $r$ calculated for models (a), (b), and (c) (and $r^{\prime}$ for model (c)) with $N=13$ as a function of energy in Figs. 4.6(iii) and (iv). Figure 4.6(iii) shows that the results for $\hat{\mathcal{O}}_{1}$ with $N=13$ are almost the same as those with $N=12$ if we consider models (a) or (b). On the other hand, we have $r_{\mathrm{(c)}}\simeq 1$ and $r^{\prime}_{\mathrm{(c)}}\simeq\sqrt{2}$ for model (c). Figure 4.6(iv) shows that the results for $\hat{\mathcal{O}}_{2}$ with $N=13$ are again almost the same as those with $N=12$ if we consider models (a) and (b). For model (c), we have $r_{\mathrm{(c)}}\simeq 1$ and $r^{\prime}_{\mathrm{(c)}}=0$ . These results are consistent with the RMT conjecture that predicts $r_{\mathrm{AII,even}}=1,r_{\mathrm{AII,odd}}=1,r^{\prime}_{\mathrm{AII,even}}=0$ , and $r^{\prime}_{\mathrm{AII,odd}}=\sqrt{2}$ .

To show that these results indeed depend only on the parity of $N$ , we show the $N$ -dependences of $\tilde{r}$ and $\tilde{r^{\prime}}$ in Fig. 4.7, where $\tilde{r}$ and $\tilde{r^{\prime}}$ are the average values of $r(E)$ and $r^{\prime}(E)$ in the middle of the spectrum, respectively. As shown in the graphs, $\tilde{r}$ is $1$ and $\sqrt{2}$ for models (a) and (b) independent of $N$ , respectively, since model (a)/(b) belongs to Class A/AI irrespective of $N$ . (Note that both $\hat{\mathcal{O}}_{1}$ and $\hat{\mathcal{O}}_{2}$ are even operators.) On the other hand, for model (c), $\tilde{r}$ and $\tilde{r^{\prime}}$ depend on the parity of $N$ because the model belongs to Class AI/AII when $N$ is even/odd.

These results indicate that the ratio of standard deviations in nonintegrable systems for few-body observables is consistent with the conjecture of the random matrix model, even when the anti-unitary symmetry of the systems and the observables are varied. In contrast to the previous study [30] where the authors claim that the diagonal and off-diagonal matrix elements seem unrelated, our results indicate that they are actually related. This fact strengthens the validity of the RMT predictions that may be related to the underlying mechanism of the ETH (for the small energy shell).

Probability densities of the off-diagonal elements

Next, we consider the probability densities of the off-diagonal matrix elements for few-body observables and show that they obey the conjecture of the random matrix models. For simplicity, we consider $\hat{\mathcal{O}}=\hat{\mathcal{O}}_{1}$ and $N=12$ . In Figure 4.8, we show the probability density of $|\mathcal{O}_{\alpha\beta}|\>(\alpha,\beta\in\mathcal{T})$ for models (a), (b), and (c). Here, we take an energy shell such that

[TABLE]

where $D$ is the dimension of the entire Hilbert space. As a reference, we also show the predictions of random matrices in Eq. (4.44). We note that $\sigma=\Delta\mathcal{O}_{\mathrm{od}}$ can be calculated numerically. We can see that the probability density for model (a) obeys the statistics corresponding to the GUE and that the probability density for models (b) and (c) obeys the statistics corresponding to the GOE. These results are consistent with the conjecture of the random matrix model.

4.3.3 Many-body correlations

In this subsection, we investigate the matrix elements of (nonsingular) many-body operators. We especially consider $l$ -body spin correlations that are defined by

[TABLE]

For models (a) and (b), $\hat{\mathcal{O}}_{l}$ is always an even operator. For model (c), $\hat{\mathcal{O}}_{l}$ is even if $l$ is even, and $\hat{\mathcal{O}}_{l}$ is odd if $l$ is odd.

As for few-body observables, we first show the example of diagonal and off-diagonal matrix elements of $\hat{\mathcal{O}}=\hat{\mathcal{O}}_{l}$ for model (b) in Figs. 4.9(i)-(iv). Figure (i) shows the diagonal matrix elements $\mathcal{O}_{\alpha\alpha}$ for all of the eigenstates as a function of $E_{\alpha}$ . Similarly, Figure (ii) shows the density plot of the absolute value of the off-diagonal matrix elements as a function of $E_{\alpha}$ and $E_{\beta}$ . Both of these figures show that the behavior of the matrix elements depends on the global energy. However, compared with the case of few-body observable (see Fig. 4.5), the dependence of the global energy is evident neither for diagonal nor off-diagonal matrix elements. Anyway, we take some small energy shell with width $2\omega_{s}=0.5$ and plot matrix elements. Then, we obtain Figure (iii) for the diagonal matrix elements and Figure (iv) for the off-diagonal matrix elements as a function of $|E_{\beta}-E_{\alpha}|$ . Using the eigenstates within the energy shell, we can again consider $r=\frac{\Delta\mathcal{O}_{\mathrm{d}}}{\Delta\mathcal{O}_{\mathrm{od}}}$ and the probability densities of $\mathcal{O}_{\alpha\beta}$ .

The universal ratios

We show $l$ -dependence of $\tilde{r}$ and $\tilde{r^{\prime}}$ for $N=12$ and $N=13$ in Fig. 4.10. As shown in the graphs, $\tilde{r}$ and $\tilde{r^{\prime}}$ become universal even for the large $l$ ’s that are comparable to $N$ , namely for many-body correlations. Indeed, $\tilde{r}$ is $1$ and $\sqrt{2}$ for models (a) and (b) independent of $N$ , respectively, since model (a)/(b) belongs to Class A/AI irrespective of $N$ and $\hat{\mathcal{O}}_{l}$ is always even. On the other hand, for model (c), $\tilde{r}$ and $\tilde{r^{\prime}}$ depend on the parity of $N$ and $l$ because of the changes of the symmetry.

Probability densities of the off-diagonal elements

Next, we consider the probability densities of the off-diagonal matrix elements of many-body observables $\hat{\mathcal{O}}=\hat{\mathcal{O}}_{l}$ and show that they also obey the conjecture of the random matrix models. For simplicity, we consider model (a) and $N=12$ . In Figure 4.11, we show the probability density of $|\mathcal{O}_{\alpha\beta}|\>(\alpha,\beta\in\mathcal{T})$ for $l=3,5,7,9$ , and 11. As a reference, we also show the predictions from RMT in Eq. (4.44). We can see that the probability density obeys the statistics corresponding to the GUE for all $l$ . This result indicates that the conjecture of the random matrix model is valid even for the many-body observables.

The results of the ratios and the probability densities indicate that the RMT conjecture about the ETH and its finite-size corrections within the small energy shell may be valid even for many-body observables. In fact, numerical simulations suggest that the ETH seems true for many-body operators, too (see Appendix C.2). Since the ETH seems true, even many-body observables are expected to relax to stationary values that are describable by the microcanonical ensemble. This fact is beyond the conventional notion of the thermal equilibrium introduced in Chapter 2, namely MITE and MATE.111In Ref. [124], it is reported that many-body operators may satisfy the ETH for diagonal matrix elements. However, it is difficult to interpret the possible origin of the ETH from their measure of justifying the ETH. Our results imply that local, macroscopic or few-body nature of observables is not a necessary condition for proving the (strong) ETH in a nonintegrable system (in contrast to the suggestion by Ref. [22]). We rather have to investigate why the random matrix description is valid in describing the finite-size corrections of the ETH for actual situations.

4.3.4 Density matrices corresponding to pure states

Here, we investigate the statistics of singular operators. As a singular operator, we take a density matrix corresponding to a pure state with a form $\hat{\mathcal{O}}=\ket{\psi}\bra{\psi}$ .222Note that $\hat{\mathcal{O}}$ is also the most singular observable even after projecting onto $\mathcal{H}_{\mathrm{sh}}$ . Indeed, since

$\displaystyle\mathcal{\hat{P}}_{\mathrm{sh}}\ket{\psi}\bra{\psi}\mathcal{\hat{P}}_{\mathrm{sh}}=\braket{\psi}{\mathcal{\hat{P}}_{\mathrm{sh}}}{\psi}\frac{\mathcal{\hat{P}}_{\mathrm{sh}}\ket{\psi}}{\sqrt{\braket{\psi}{\mathcal{\hat{P}}_{\mathrm{sh}}}{\psi}}}\frac{\bra{\psi}\mathcal{\hat{P}}_{\mathrm{sh}}}{\sqrt{\braket{\psi}{\mathcal{\hat{P}}_{\mathrm{sh}}}{\psi}}},$

(4.59)

it can be regarded as the form of Eq. (4.47) with $a=\braket{\psi}{\mathcal{\hat{P}}_{\mathrm{sh}}}{\psi}$ .

We note that the matrix elements $\mathcal{O}_{\alpha\beta}$ are relevant for the $\rho_{\alpha\beta}$ that appeared in the previous chapters if we regard $\ket{\psi}$ as an initial pure state.

If we take an eigenstate of $\hat{H}$ as $\ket{\psi}$ , the ETH does not hold true as we have seen in Sec. 4.1. In this case, the conjecture of the random matrix models is not valid, which means that there is always a counterexample of this conjecture even in nonintegrable systems. So, is there any pure state that satisfies the conjecture of the random matrix models?

To see this is the case, we take a $\gamma$ -th energy eigenstate of another Hamiltonian $\hat{H}_{0}$ as $\ket{\psi}$ , where $\hat{H}_{0}$ is a Hamiltonian with $h=0.05,D=0$ in Eq. (4.53).333We assume that the bond-dependent interaction is the same for $\hat{H}$ and $\hat{H}_{0}$ . Then $\mathcal{O}_{\alpha\beta}$ is relevant for the quench setup where the Hamiltonian is suddenly changed from $\hat{H}_{0}$ to $\hat{H}$ . For simplicity, we take $\gamma=D/2+1$ (a highly excited state) and $\gamma=1$ (a ground state). We note that $\hat{\mathcal{O}}$ is an even operator for both values of $\gamma$ in model (b).444Since $\hat{K}\hat{H}_{0}\hat{K}^{-1}=\hat{H}_{0}$ and $\hat{H}_{0}$ has no degeneracies (which is confirmed with numerics), we can assume that an eigenstate $\ket{E_{\gamma}}_{0}$ satifies $\hat{K}\ket{E_{\gamma}}_{0}=\ket{E_{\gamma}}_{0}$ .

In Fig. 4.12, we show the probability densities of the off-diagonal matrix elements in models (a) and (b) with $N=13$ for different values of $\gamma$ . As a reference, we also show the predictions of random matrices for nonsingular and the most singular cases (see Eqs. (4.44) and (4.52)). We can see that for both values of $\gamma$ , the probability density for model (a)/(b) obeys the GUE/GOE statistics for the most singular observables in Eq. (4.52) rather than the statistics for nonsingular observables in Eq. (4.44). These results are consistent with the conjecture of the random matrix model.

We note that the universal ratios are also found for these singular operators. Indeed, for model (a) with $N=13$ , we have $\tilde{r}=0.981$ and $\tilde{r}=0.984$ for $\gamma=D/2+1=4097$ and $\gamma=1$ , respectively. They are consistent with the RMT prediction of $r_{\mathrm{A}}=1$ . For model (b) with $N=13$ , we have $\tilde{r}=1.41$ and $\tilde{r}=1.43$ for $\gamma=D/2+1=4097$ and $\gamma=1$ , respectively. They are consistent with the RMT prediction of $r_{\mathrm{AI,even}}=\sqrt{2}$ .

4.3.5 A simple counterexample

Previous results show that the conjecture of the random matrix models applies to a wide class of observables with various symmetries including many-body or singular ones. Nevertheless, we can easily find counterexamples of the conjecture in Fig. 4.2 (and Srednicki’s conjecture in Eq. (3.35)) among simple observables. To illustrate this fact, we show the off-diagonal matrix elements of $\hat{\mathcal{O}}_{y}=\hat{\sigma}_{[N/2]+1}^{y}$ for model (b) in Figs. 4.13. The upper figure shows the density plot of the off-diagonal matrix elements for all of the eigenstates as a function of $E_{\alpha}$ and $E_{\beta}$ . As for the cases with $\hat{\mathcal{O}}_{1}$ and $\hat{\mathcal{O}}_{2}$ , the typical magnitude of $\mathcal{O}_{\alpha\beta}$ rapidly decays when $|E_{\alpha}-E_{\beta}|$ becomes large. On the other hand, contrary to the previous examples, the typical magnitude is small for $|E_{\alpha}-E_{\beta}|\simeq 0$ as well. To investigate this in detail, we take some small energy shell with width $2\omega_{s}=0.5$ and plot matrix elements as a function of $|E_{\alpha}-E_{\beta}|$ in the bottom figure of Fig. 4.13. The figure shows that the typical magnitude of $|\mathcal{O}_{\alpha\beta}|$ vanishes with $|E_{\alpha}-E_{\beta}|\rightarrow 0$ . This is in contradiction to the conjecture in Fig. 4.2 and Srednicki’s conjecture in Eq. (3.35), both of which predict the plateau-like structure of $|\mathcal{O}_{\alpha\beta}|$ for $\omega<\omega_{\mathrm{sh}}$ .‡‡‡‡‡‡Indeed, even if we make $\omega_{s}$ smaller, no plateau-like structure is obtained.

We plot the probability densities of the off-diagonal matrix elements $|\braket{E_{\alpha}}{\hat{\mathcal{O}}_{y}}{E_{\beta}}|$ for models (a), (b), and (c) in Fig. 4.14. As shown in the figure, the prediction of the random matrix models breaks down in model (b), whereas it holds true in models (a) and (c).

These results can be understood by realizing that $\hat{\mathcal{O}}=\hat{\sigma}_{[N/2]+1}^{y}=-\frac{i}{2h^{\prime}}[\hat{H},\hat{\sigma}_{[N/2]+1}^{z}]$ for model (b), where $h^{\prime}$ is defined in Eq. (4.53). We obtain

[TABLE]

Since $|\braket{E_{\alpha}}{\hat{\sigma}_{[N/2]+1}^{z}}{E_{\beta}}|$ behaves as in Fig. 4.5(iv) for $\omega<\omega_{s}$ (i.e., form a plateau-like structure), $|\mathcal{O}_{\alpha\beta}|$ behaves as shown in Fig. 4.13. This is the case only for model (b) because we do not have the relation in Eq. (4.60) for models (a) and (c).

The important point is that even simple (i.e., few-body and local) operators can break the RMT conjecture for matrix elements, and that such operators can always be easily constructed. In fact, by taking a commutator between the Hamiltonian and a local observable, we can obtain another local observable $\hat{\mathcal{O}}$ (if the Hamiltonian is composed of local interactions). Such an observable $\hat{\mathcal{O}}$ is expected to break the RMT conjecture, since the relation similar to Eq. (4.60) is obtained.

4.4 Conclusions and Discussions

We have shown that RMT can predict the finite-size corrections of the ETH (within some energy shell) in nonintegrable systems and for a wide class of observables, including many-body operators. We have first refined and generalized the RMT predictions for the finite-size corrections of the ETH (see Fig. 4.2). We have seen that the ratios between standard deviations of diagonal and off-diagonal matrix elements become universal ones that depend only on anti-unitary symmetries of the Hamiltonian and those of the observable. We have also shown that the probability densities of the off-diagonal matrix elements $\rho_{|\mathcal{O}_{\alpha\beta}|}(y)$ are different depending on the singularity of the observable as well as the symmetries of the Hamiltonian. Next, we have numerically investigated matrix-element statistics of various observables in nonintegrable systems that only conserve energy. We have demonstrated that the finite-size corrections of the ETH are in excellent agreement with the predictions of RMT for a wide class of observables with various symmetries, including many-body correlations and singular operators. We have also remarked that counterexamples can always be constructed even among simple observables.

Our results suggest that for a wide class of observables, the ETH holds with the mechanism related to RMT, which also tells us its finite-size corrections. Unlike the ETH for MITE or MATE, we show that even many-body operators can satisfy the ETH due to that mechanism.111We note that not all many-body operators satisfy the ETH as we have discussed in Sec. 4.1. This is expected because the crucial assumption is the behavior of $U_{\alpha i}$ , which is not directly related to few-body or macroscopic properties of observables. We thus expect that for the achievement of a rigorous proof of the ETH, it is important to investigate why the behavior of $U_{\alpha i}$ is mimicked by RMT. We note that counterexamples are easily constructed by taking $\hat{\mathcal{O}}=\ket{E_{\gamma}}\bra{E_{\gamma}}$ or by taking commutators of the Hamiltonian and another observable. These counterexamples are somehow “related” to the Hamiltonian. From this observation, it is important to understand such “relations” quantitatively for clarifying the criteria for the validity of the conjecture of the RMT model.******We note that the ETH may be valid for counterexamples of the RMT conjecture. Thus, the conjecture of the RMT model is not a necessary condition.

Let us comment on future perspectives. First, as we have mentioned above, quantifying the criteria for the breakdown of the RMT conjecture seems unavoidable for understanding the mechanisms of the ETH and its finite-size corrections. Operators that are written as $\hat{\mathcal{O}}=[\hat{H},[\hat{H},\cdots,[\hat{H},\hat{\mathcal{O}}^{\prime}]\cdots]]$ and operators that are conserved $[\hat{\mathcal{O}},\hat{H}]=0$ will break the RMT conjecture, but how about operators that approximately satify such relations? Commutators that involve the Hamiltonian may be utilized for quantifying them, but this is a future problem. Second, we need to extend our results in Subsection 4.2.2 to the off-diagonal matrix elements $\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\beta}}$ with $|E_{\alpha}-E_{\beta}|>\omega_{\mathrm{sh}}$ . We believe that within a small energy shell $\left\{(E_{\alpha},E_{\beta}):|E_{\alpha}-E_{1}|<\omega_{\mathrm{sh,1}},|E_{\beta}-E_{2}|<\omega_{\mathrm{sh,2}}\right\}$ , the probability densities of $O_{\alpha\beta}$ will obey the similar statistics as discussed in Subsection 4.2.2.222The RMT prediction should be extended for such cases. We expect to do this by a similar technique done in Subsection 4.2.2. Finally, it is interesting if we can relate our findings about the deviations from the ETH to measurable fluctuations (e.g., temporal fluctuations of the expectation value in Eq. (2.2.1)) in isolated small systems. As we have mentioned in Chapter 1, the dynamics in such small systems are realized in Ref. [89]. We expect that the RMT conjecture may help us understand finite-size effects that are not captured by the standard statistical mechanics.

Chapter 5 Generalized Gibbs ensemble (GGE) in integrable systems

In this chapter, we review the generalized Gibbs ensemble (GGE) in integrable systems. After introducing a general form of the GGE and raising several open questions, we consider simple integrable systems that can be mapped to free-quasiparticle systems. Then, we stress the importance of the locality of conserved quantities in constructing the GGE. In particular, we introduce the notion of the so-called “truncated GGE.” Finally, we briefly review the results for interacting integrable systems that are solvable by the Bethe ansatz.

5.1 Non-thermal stationary states due to conserved quantities

As we saw in the previous chapters, isolated systems can equilibrate if we only consider a restricted set of observables. One of the open questions is how to characterize the stationary state from the information of the initial state. Under certain conditions, we can assume that the stationary state can effectively be described by the diagonal ensemble introduced in Chapter 2: $\hat{\rho}_{\mathrm{d}}=\sum_{\alpha=1}^{D}|c_{\alpha}|^{2}\ket{E_{\alpha}}\bra{E_{\alpha}}$ . However, the diagonal ensemble is constructed from $D$ parameters, unlike the microcanonical ensemble that requires only macroscopic energy for its construction. What we want is a statistical ensemble that is constructed from a few parameters, such as the microcanonical, canonical and grandcanonical ensemble.

Nonintegrablity of the system is expected to play a key role in determining whether the stationary state is described by the usual (micro)canonical ensemble. In a nonintegrable system that conserves energy alone, we expect that the canonical ensemble describes the stationary state because of the ETH (see Chapters 2 and 3). On the other hand, the ETH does not hold true in integrable systems due to the existence of many conserved quantities. In this case, the stationary states are not described by the canonical ensemble in general (note that the equilibration often occurs without the ETH because of the large effective dimension). Are the stationary states in integrable systems describable by some other statistical ensembles?

The generalized Gibbs ensemble (GGE) is a candidate for a statistical ensemble that describes the stationary state in an integrable system. Let us denote the set of conserved quantities by $\{\hat{I}_{m}\}\>(m=1,2,\cdots)$ . For simplicity, we assume that $\hat{I}_{m}$ ’s commute with one another. The GGE is defined as [63, 64, 42]

[TABLE]

where $Z_{\mathrm{GGE}}:=\mathrm{Tr}[e^{-\sum_{m}\lambda_{m}\hat{I}_{m}}]$ and $\lambda_{m}$ ’s are determined from the initial information of the conserved quantities:

[TABLE]

Then, assuming the diagonal ensemble, we want the GGE to satisfy the following relation for (the sum of) few-body observables $\hat{\mathcal{O}}$ in the thermodynamic limit:***In the followings, we will only consider (sums of) few-body operators as observables. Furthermore, we will often consider spatially local observables.

[TABLE]

In the form of the microscopic thermal equilibrium (MITE), we want

[TABLE]

for a small subsystem $S$ .

We remark that there is another way to define the stationary state in the thermodynamic limit. While Eq. (5.4) conderns the diagonal ensemble for a finite system with the size $L$ and then take the thermodynamic limit $L\rightarrow\infty$ , we can first consider the thermodynamic limit and then the long-time limit:

[TABLE]

Then we can examine if

[TABLE]

holds true. The former definition in Eq. (5.4) is adopted in, e.g., Refs. [42, 43, 36], and the latter definition in Eq. (5.6) is adopted in, e.g., Refs. [159, 160, 35].111We note that the former definition can treat finite-size effects, and the latter definition is convenient for treating nonequilibrium steady states [161, 162, 163]. Two definitions are equivalent if

[TABLE]

Although this condition is nontrivial, it is proven for certain systems [35]. We note that we adopted the definition similar to the former (using the (micro)canonical ensemble instead of the GGE) in the previous chapters. We will also use the former definition in our work in Chapter 6.

The applicability of the GGE has not completely been understood yet. We raise two open questions.

To what kind of systems and observables is the GGE applicable? Do we need specific initial states to justify the GGE? 2. 2.

How should we choose the minimal set of $\{\hat{I}_{m}\}$ ? Are there more important conserved quantities?

Many previous studies investigated these questions using various integrable models. We will review some of them in the following sections.

5.2 The GGE in essentially free systems

First we consider an integrable system whose Hamiltonian can be mapped to a quadratic form:

[TABLE]

where $\hat{b}_{k}$ is an annihilation operator of some quasiparticle and $\epsilon_{k}$ is the dispersion relation. After the notable work using a one-dimensional lattice system with hard-core bosons [42, 43], the GGE has extensively been investigated in these essentially free-quasiparticle systems, including transverse-field Ising models [49, 50, 51], XY models [50], Luttinger liquids [159, 50], a system of hard-core anyons [164], and quantum field theories [160, 165, 166].

Let us illustrate a simple example in which the GGE is applicable, following Ref. [35]. We consider a one-dimensional fermionic paring model as follows:

[TABLE]

where $\hat{c}_{i}$ is an annihilation operator†††The anticommutation relations $\{\hat{c}_{i},\hat{c}_{j}\}=\{\hat{c}_{i}^{\dagger},\hat{c}_{j}^{\dagger}\}=0$ and $\{\hat{c}_{i},\hat{c}_{j}^{\dagger}\}=\delta_{ij}$ are satisfied. of a fermion at the site $i$ and we impose a periodic boundary condition. By the Fourier transformation $\hat{c}_{i}=\frac{1}{\sqrt{L}}\sum_{k}e^{-ikx_{i}}\hat{c}(k)$ , where $x_{i}=i$ is the coordinate of site $i$ (we set the lattice constant to unity), we obtain

[TABLE]

Using the Bogoliubov transformation

[TABLE]

we have the diagonalized Hamiltonian

[TABLE]

Here

[TABLE]

and

[TABLE]

We consider a quench from a pre-Hamiltonian $\hat{H}_{0}=\hat{H}_{f}(\Delta_{0},\mu)\>\>(\Delta_{0}\neq 0)$ to a post-Hamiltonian $\hat{H}=\hat{H}_{f}(0,\mu)$ at time $t=0$ . Note that $\hat{H}=-\sum_{k}(2J\cos(k)+\mu)\hat{c}^{\dagger}(k)\hat{c}(k)$ . We take an initial state as the Bogoliubov fermion vacuum:

[TABLE]

Using the Heisenberg representation $\hat{c}(k,t)=e^{i\hat{H}t}\hat{c}(k)e^{-i\hat{H}t}$ , we obtain

[TABLE]

Then, we can calculate the two-point functions at $t>0$ as

[TABLE]

In the position space we have

[TABLE]

The crucial aspect of our initial state is that we can use Wick’s theorem to calculate the multi-point correlation functions. For example,

[TABLE]

In the thermodynamic limit and the long-time limit, we have

[TABLE]

We thus conclude that the stationary state of a given subsystem is completely characterized by $\lim_{L\rightarrow\infty}f_{L}(l)$ .

The GGE is constructed from the set of mode occupation numbers $\hat{n}_{k}:=\hat{c}_{k}^{\dagger}\hat{c}_{k}$ with the quasi-momentum $k=0,\frac{2\pi}{L},\frac{4\pi}{L},\cdots,\frac{2\pi(L-1)}{L}$ as

[TABLE]

where $Z_{\mathrm{GGE}}=\mathrm{Tr}\left[e^{-\sum_{k}\lambda_{k}\hat{n}_{k}}\right]$ . Here $\lambda_{k}$ is determined by

[TABLE]

Since the GGE has a quadratic form, Wick’s theorem holds true. Then, due to the relation in Eq. (5.2), the stationary state and the GGE give the same multi-point correlation functions. Therefore, we prove that the stationary-state expectation values of observables on a small subsystem are correctly predicted by the GGE.

The method for using Wick’s theorem was first introduced in Ref. [50], which discussed transverse-field Ising models, Luttinger models, and XY models. To justify Wick’s theorem of the initial states, we can take the canonical ensemble or the ground state of Hamiltonians that are written as quadratic forms. Further, in Ref. [165], the authors proved the validity of the GGE in Eq. (5.24) for massive free quantum field theories by assuming the cluster decomposition properties of the initial state. This extends the applicability of the GGE to non-Gaussian initial states. However, it was also shown that the GGE in Eq. (5.24) fails for massless free field theories if the initial state is not Gaussian [166].

5.3 Importance of the locality of conserved quantities and the truncated GGE

In this section we discuss the importance of the locality of conserved quantities in describing the expectation values of local observables. While we have constructed the GGE from $L$ mode occupation numbers in Eq. (5.24), we can construct the GGE using an extensive conserved quantities that can be written as sums of local operators. Moreover, numerical simulations suggest that if conserved quantities become more local, they become more important in describing local observables.

To illustrate the notion of the locality of conserved quantities, consider the fermionic Hamiltonian $\hat{H}(\Delta=0,\mu)$ introduced in the previous section. As we have done in constructing the GGE in Eq. (5.24), we can use the mode occupation number $\hat{n}_{k}=\hat{c}_{k}^{\dagger}\hat{c}_{k}$ to construct the GGE. On the other hand, we can also consider an equivalent set of extensive conserved quantities as follows:

[TABLE]

Each conserved quantity is an extensive quantity that is the sum of local operators. We especially call these local operators as $(n+1)$ -local operators, since $\hat{c}_{i}^{\dagger}\hat{c}_{i+n}$ has a support on $(n+1)$ -neighboring sites.‡‡‡We note that an operator in Eq. (2.8) is $l_{0}$ -local if $\mu_{i_{0}}\neq 0$ and $\mu_{i_{0}+l_{0}-1}\neq 0$ . Then the GGE can be reconstructed from these conserved quantities as

[TABLE]

Here $\mu_{n,\pm}$ ’s are determined from the initial condition of the conserved quantities. We note that there are cases where the equivalence between mode occupation numbers and extensive conserved quantities does not exist [59].

The importance of the locality of conserved quantities was first realized by Fagotti and Essler [51] using the 1D transverse-field Ising model in the thermodynamic limit:

[TABLE]

This model is diagonalized with the Jordan-Wigner transformation followed by the Bogoliubov transformation. Even though this procedure is complicated because the Jordan-Wigner transformation is a nonlocal transformation, we can find a set of extensive conserved quantities instead of the mode occupation numbers as

[TABLE]

where

[TABLE]

The important point is that $\hat{I}^{(n,\pm)}$ ’s are $(n+2)$ -local because they can be written as the sums of $(n+2)$ -neighboring spin correlations. The GGE is constructed as

[TABLE]

The authors in Ref. [51] have confirmed that this ensemble well describes the stationary state after a certain quench.

Next, we want to reduce the number of the conserved quantities in the GGE without losing the validity of describing the stationary state for the subsystem with the size $l$ . In Ref. [51], the authors introduced what is called the “truncated generalized Gibbs ensemble” (tGGE):

[TABLE]

where $Z_{\mathrm{tGGE}}^{(y)}=\mathrm{Tr}\left[e^{-\sum_{n=0}^{y-1}\sum_{\sigma=\pm}\lambda_{n,\sigma}^{(y)}\hat{I}_{n,\sigma}}\right]$ , and we note that $\lambda_{y,n,\sigma}\neq\lambda_{n,\sigma}$ in general. In the limit $y\rightarrow\infty$ , $\hat{\rho}_{\mathrm{tGGE}}^{(y)}$ is equivalent to $\hat{\rho}_{\mathrm{GGE}}$ .

The authors in Ref. [51] investigated how close $\hat{\rho}_{\mathrm{tGGE},l}^{(y)}$ and $\hat{\rho}_{\mathrm{GGE},l}$ are for various $y$ , where the subscript $l$ means that the density matrices are reduced to the subsystem with the size $l$ . Figure 5.1 shows the distance $\mathcal{D}_{\infty}^{(y)}=\mathcal{D}(\hat{\rho}_{\mathrm{tGGE},l}^{(y)},\hat{\rho}_{\mathrm{GGE},l})$ between $\hat{\rho}_{\mathrm{tGGE},l}^{(y)}$ and $\hat{\rho}_{\mathrm{GGE},l}$ as a function of $y$ for various $l$ , where $\mathcal{D}(\hat{\rho}_{1},\hat{\rho}_{2})$ is defined as§§§They used the Frobenius norm that is defined by $||\hat{A}||_{F}:=\sqrt{\mathrm{Tr}[\hat{A}^{\dagger}\hat{A}]}$ .

[TABLE]

The different plots denote the different subsystem sizes with $l=5$ (the leftmost), 10, 15, $\cdots,50$ (the rightmost). The figure shows that the distances start to decrease rapidly only for $l\lesssim y$ . This result implies that the conserved quantities $\hat{I}_{n,\sigma}$ that satisfy $n\lesssim l$ are the most important in describing the subsystem with the size $l$ , and that the less local operators with $n\gg l$ play negligible roles. Simply put, we can say that if conserved quantities become more local (in a sense that they can be written as sums of local operators), they become more important in constructing the GGE that can describe local observables.

5.4 The GGE in interacting systems solved by the Bethe ansatz

The GGE in interacting systems that can be solved by the Bethe ansatz is also investigated. In contrast to essentially free-quasiparticle systems that we have reviewed in the previous two sections, we cannot use the mode occupation numbers to construct the GGE in interacting systems. In fact, the applicability of the GGE for such systems is an open question. However, efforts have been made for XXZ models [167, 58, 61], Lieb-Liniger models [53, 54, 56], quantum field theories [57, 59], and so on. In this section, we briefly review the recent development on the stationary states of the XXZ models.

In general, the eigenstates $\ket{E_{\alpha}}$ of the Bethe-ansatz-solvable systems are characterized by a set of complex quantum numbers $\{\lambda_{k}\}_{k}$ , as $\ket{E_{\alpha}}=\ket{\{\lambda_{k}\}_{k}}$ . The quantum numbers $\{\lambda_{k}\}_{k}$ are called rapidities and obtained by the so-called Bethe equations. For example, consider a 1D XXZ model as

[TABLE]

where $J>0$ and $\Delta=\cosh\eta\geq 1$ . If we consider the fixed sector with a magnetization $S_{\mathrm{tot}}^{z}=\frac{N}{2}-M$ , the Bethe equations for $\{\lambda_{k}\}_{k=1}^{M}$ can be written as [168, 61]

[TABLE]

for all $j$ . Note that these equations are very complicated because they are nonlinear and $\lambda_{j}$ ’s are related to one another.

The Bethe equations in Eq. (5.36) are obtained either from the so-called coordinate Bethe ansatz or the algebraic Bethe ansatz. In the coordinate Bethe ansatz, we assume a specific form of the wave functions (called the Bethe ansatz wavefunctions) as the eigenstates of the Hamiltonian [168]. Then, by imposing a periodic boundary condition, we obtain the Bethe equations. In the algebraic Bethe ansatz, we introduce some operators (called the R-matrix, the L-matrix, and the monodromy matrix) that satisfy certain algebras (i.e., the Yang-Baxter equations). We then construct the so-called transfer matrix $T(\lambda)$ from the monodromy matrix such that $[T(\lambda),T(\mu)]=0$ is satisfied for different two rapidities $\lambda$ and $\mu$ . If we choose the appropriate R-matrix, the transfer matrix becomes a generating function of a certain Hamiltonian (e.g., the Heisenberg, the XXZ or the Lieb-liniger Hamiltonian) and other extensive conserved quantities that are written as the sums of local operators. In this case, the transfer matrix and these conserved quantities (including the Hamiltonian) are simultaneously diagonalized. Finally, we construct the eigenstates of the transfer matrix using a creation operator that is determined from the monodoromy matrix. The Bethe equations in Eq. (5.36) are obtained as the consistency condition of this construction.

In Refs. [167, 58], the failure of the “naive” GGE in describing the stationary state is reported. In Ref. [58], the authors construct the GGE using the extensive conserved quantities:

[TABLE]

where $\hat{\mathcal{I}}_{j,j+1,\cdots,j+n}$ is an $(n+1)$ -local operator. We note that $\hat{I}_{n}$ is obtained from the $n$ th derivative of the transfer matrix. By comparing the expectation values of observables for the GGE and those for the stationary state,¶¶¶In general, it is not easy to treat the Bethe equations in Eq. (5.36). However, the so-called generalized thermodynamic Bethe ansatz (gTBA) is developed to approximately examine properties of the system in the thermodynamic limit [53]. In this approach, we replace the set of the rapidities with a distribution function of the rapidities. Then, we find the equation (i.e., the gTBA equation) for the distribution function from the knowledge of the ensembles under consideration. For such ensembles, we can take either the GGE or the stationary-state ensemble. The method for using the latter ensemble is called a quench action method [169, 170]. they found that there is a discrepancy between these two results. This result indicates that the GGE constructed only from the extensive sums of local operators cannot be used for the XXZ models.

A more improved GGE is proposed and investigated in Ref. [61] by taking the so-called quasi-local operators into account. While local operators have supports exactly on the finite number of neighboring sites, for quasi-local operators we allow quantities whose overlaps with less local sites are present but decay sufficiently fast (for an exact definition, see Ref. [171]). In XXZ models, it is shown that extensive conserved quantities that can be written as sums of such quasi-local operators are constructed by applying the algebraic Bethe ansatz in a more technical way than usual [171, 61].∥∥∥More concretely, the possible class of the $L$ matrices and the $R$ matrices are extended. The authors in Ref. [61] find that the GGE constructed from the extensive conserved quantities that are written as the sums of quasi-local as well as local operators well describes the expectation values of local observables in the stationary state. This result implies that the GGE is valid if we can include all conserved quantities that have significant overlaps with local observables.******We note that the importance of quasi-local operators has already been recognized in the context of the nonequilibrium transport in XXZ chains [172] and the GGE in quantum field theories [59]. We note, however, that a recent paper in Ref. [173] claims that even quasi-local operators may not be enough to characterize the stationary state in XXZ models. Thus, the applicability of the GGE in these interacting integrable models is still an open question.

5.5 Conclustion

We have reviewed the applicability of the GGE for two types of integrable systems: systems that can be mapped to free-quasiparticle systems and systems that are solved by the Bethe ansatz. The Hamiltonians of the systems of both types have eigenstates that are characterized by sets of certain quantum numbers. In contrast, it seems more important to pay attention to conserved quantities that can be written as sums of (quasi-)local operators. In fact, the locality of conserved quantities seems crucial in constructing the GGE that describes the expectation values of local observables in the stationary state, as the success of the truncated GGE indicates. Even though controversial discussions exist, many researchers believe that the GGE constructed from (quasi-)local operators will be valid in certain initial conditions.

Chapter 6 Generalized Gibbs ensemble in a nonintegrable system with an extensive number of local symmetries

6.1 Motivation

In the previous chapters, we have seen that conserved quantities play an important role for thermalization in isolated quantum systems. In nonintegrable systems that conserve energy alone, the stationary state is expected to be described by the (micro)canonical ensemble because of the ETH. On the other hand, the stationary state cannot be described by the canonical ensemble in systems that are integrable or many-body localized, since there exist many nontrivial conserved quantities in these systems.

As we have seen in Chapter 5, the GGE is the promising candidate for describing stationary states in integrable systems. These integrable systems have sets of conserved quantities from which each energy eigenstate can be identified. We note that this feature is also expected to exist in systems that show the “fully” many-body localization (see Subsection 1.2.2). In this case, each energy eigenstate is expected to be characterized by the localized bits (see Eq. (1.7)) [70, 71, 72].

To clarify the importance of conserved quantities for the appearance of non-thermal stationary states, it is interesting to study models with less numbers of conserved quantities than the usual integrable systems. Previous studies showed two extreme cases: the stationary state seems to be described by the canonical ensemble if the system conserves only energy, and the GGE is necessary when sufficiently many conserved quantities exist so that every eigenstate is identified. Then, it is of interest how many conserved quantities the system should possess for the appearance of the stationary states that are described by the GGE, not by the canonical ensemble. We note that such systems are nonintegrable in a sense that the sets of conserved quantities cannot characterize each energy eigenstate.

In this chapter, we discuss our work based on Ref. [113]. We show that the stationary state is described by the GGE if the system has an extensive number of local symmetries, even when it is a nonintegrable system. We have investigated a nonintegrable model of hard-core bosons with an extensive number of local $\mathbb{Z}_{2}$ symmetries by the exact-diagonalization analysis. We show that the expectation values of observables in the stationary state are described by the GGE rather than the canonical ensemble. In this case, the usual ETH does not hold true. Instead, the ETH for each symmetry sector, which we call the restricted ETH (rETH), holds true and we argue that the rETH plays an important role for our system to approach the GGE. We have also examined a model that has only one global $\mathbb{Z}_{2}$ symmetry, and a model with a size-independent number of local $\mathbb{Z}_{2}$ symmetries. We show that the usual canonical ensemble well describes the stationary states and that we do not have to use the GGE for these two models.

6.2 A model with an extensive number of local symmetries

As shown in Fig. 6.1(i), we consider a nonintegrable model of $N_{b}$ hard-core bosons on $N_{s}$ -number of lattice sites that are arranged in $L$ layers in the shape of triangles $(N_{s}=3L)$ . Each site $i\>(1\leq i\leq N)$ is labeled by two indices $(s,l)$ , where $l\>(=1,2,\dots,L)$ labels the layer and $s\>(=\mathrm{L,M,R})$ labels the position in each layer.

The Hamiltonian can be written as

[TABLE]

where $\hat{b}_{i}$ is the annihilation operator of a hard-core boson at the site $i$ , $t_{ij}\in\mathbb{R}$ is the hopping energy between two sites $i$ and $j$ , and $\langle i,j\rangle\>(i<j)$ represents a pair of neighboring sites.

We assume that

[TABLE]

is satisfied for the hopping energy $t_{ij}$ , which allows the system to have a local $\mathbb{Z}_{2}$ symmetry with the corresponding operator $\hat{P}_{l}\>(1\leq l\leq L)$ for each layer. This operator swaps two sites (L, $l$ ) and (R, $l$ ) in Fig. 6.1 (i), and satisfies

[TABLE]

We note that $\hat{P}_{l}$ can be written as

[TABLE]

where $[\hat{H},\hat{P}_{l}]=0$ and $\hat{P}_{l}^{2}=1$ are satisfied. We call the eigenvalues of $\hat{P}_{l}$ , namely $q_{l}=\pm 1$ , as positive and negative $\mathbb{Z}_{2}$ parities. If we map the hard-core bosons to the spin 1/2 operators, we can interpret $\hat{P}_{l}$ as the projection operator onto the spin singlet $(q_{l}=-1)$ and triplet $(q_{l}=+1)$ states that involve the spins on (L, $l$ ) and (R, $l$ ).

By this construction, the system has a symmetry group that can be written as

[TABLE]

Since $G$ is abelian, we can divide the set of energy eigenstates into the $|G|=2^{L}$ symmetry sectors [174] that are determined by a set of $\mathbb{Z}_{2}$ parities $\mathbf{q}:=(q_{l})_{l=1}^{L}$ . If we denote the symmetry sectors by $\mathbf{q}$ , the entire Hilbert space $\mathcal{H}$ of the system is decomposed as

[TABLE]

To remove unwanted symmetries and degeneracies, we add randomness to $t_{ij}$ . We assume that $t_{\mathrm{M}l,\mathrm{L}l}$ (= $t_{\mathrm{M}l,\mathrm{R}l}$ ), $t_{\mathrm{L}l,\mathrm{R}l}$ , and $t_{\mathrm{L}l,\mathrm{R}(l+1)}$ can be written as

[TABLE]

where the randomness $\epsilon_{ij}$ is uniformly chosen from $[-0.5,0.5]$ . We have confirmed that this randomness romoves all degeneracies and most of the symmetries except for the $\mathbb{Z}_{2}$ symmetry.

We note that the level-spacing statistics obeys the Wigner-Dyson statistics within each symmetry sector $\mathcal{H}_{\mathbf{q}}$ that contains sufficiently many eigenstates. As we have seen in Chapter 3, in nonintegrable systems that conserve energy alone, the level-spacing statistics is expected to resemble the Wigner-Dyson statistics, $P_{\mathrm{WD}}(s)=\frac{\pi}{2}se^{-\frac{\pi}{4}s^{2}}$ .***Note that we use the statistics for the GOE because the Hamiltonian in Eq. (6.1) is invariant under the time-reversal operation. On the other hand, the statistics in systems with additional conserved quantities obeys the one without level repulsions such as the Poisson statistics $P_{\mathrm{P}}(s)=e^{-s}$ . Figure 6.2 (i) shows the level-spacing statistics for the entire spectrum in our model. It is close to the Poisson statistics, not to the Wigner-Dyson statistics. This result reflects the fact that our model has $\mathbb{Z}_{2}$ symmetries [28]. Figures 6.2 (ii) and (iii) show the level-spacing statistics of the eigenstates that belong to the sectors with $\mathbf{q}_{1}:=(+1,+1,...,+1)$ and $\mathbf{q}_{2}:=(-1,+1,...,+1)$ , respectively. They obey the Wigner-Dyson statistics rather than the Poisson statistics.

6.3 Time evolutions from two initial states

As initial states, we consider two cases as $\ket{\psi_{0}}=\ket{\psi^{A}_{0}}$ and $\ket{\psi^{B}_{0}}$ , where bosons are placed at $(\mathrm{L},l)$ and $(\mathrm{M},l)$ , respectively (see Fig. 6.1 (ii)). We will call time evolutions from these initial states as Case A and Case B. We consider the cases of 1/3-filling, where $N_{b}=L$ and $l=1,2,\dots,L$ , and 1/6-filling, where $N_{b}=L/2$ and $l=2,4,\dots,L$ . While $\ket{\psi^{A}_{0}}$ extends over the different $\mathcal{H}_{\mathbf{q}}$ ’s, $\ket{\psi^{B}_{0}}$ belongs to only one sector $\mathcal{H}_{\mathbf{q}_{1}}$ , where $\mathbf{q}_{1}=(+1,+1,\dots,+1)$ (see Appendix B.5).

The state at time $t$ is obtained as $\ket{\psi(t)}=e^{-\frac{i\hat{H}t}{\hbar}}\ket{\psi_{0}}=\sum_{\alpha}c_{\alpha}e^{-\frac{i{E}_{\alpha}t}{\hbar}}\ket{E_{\alpha}}$ , where $c_{\alpha}=\braket{E_{\alpha}}{\psi_{0}}$ . The long-time average of a local observable ${\hat{\mathcal{O}}}$ is described by the diagonal ensemble under certain conditions (see Chapter 2):

[TABLE]

where $\hat{\rho}_{\mathrm{d}}:=\sum_{\alpha}|c_{\alpha}|^{2}\ket{E_{\alpha}}\bra{E_{\alpha}}$ .

We define the canonical ensemble and the GGE that may describe the stationary state with a few parameters. We define the canonical ensemble as

[TABLE]

where $Z_{\mathrm{can}}=\mathrm{Tr}[e^{-\beta\hat{H}}]$ and the inverse temperature $\beta$ is determined from the total energy $E_{0}:=\braket{\psi_{0}}{\hat{H}}{\psi_{0}}=\mathrm{Tr}[\hat{H}\hat{\rho}_{\mathrm{can}}].$ On the other hand, we define the GGE as

[TABLE]

where $Z_{\mathrm{GGE}}=\mathrm{Tr}[e^{-\tilde{\beta}\hat{H}-\sum_{l=1}^{L}\lambda_{l}\hat{P}_{l}}]$ . Here $\tilde{\beta}$ and $\lambda_{l}\>(1\leq l\leq L)$ are uniquely determined from the initial conditions as follows:

[TABLE]

Note that our definition of the GGE uses an extensive number of truly local conserved operators, whereas the usual GGE in integrable systems takes the sum of (quasi)-local operators as conserved quantities.

We note that both of the initial states have the total conserved energy

[TABLE]

which leads to the infinite temperature ( $\beta=0$ ) in the canonical ensemble. To show this, we should solve the equation for $\beta$ ,

[TABLE]

where $Z_{\mathrm{can}}=\sum_{\alpha}e^{-\beta E_{\alpha}}$ . Since the right-hand side of this equation monotonically decreases with respect to $\beta$ , we have a unique solution. Moreover, for the Hamiltonian in Eq. (6.1), we can show

[TABLE]

Here we have used $\mathrm{Tr}[\hat{b}^{\dagger}_{i}\hat{b}_{j}]=0$ for $i\neq j$ , which can be understood by treating the trace with the Fock basis on the sites. Consequently, we obtain $\mathrm{Tr}[\hat{H}]=\sum_{\alpha}E_{\alpha}=0$ , which leads to $\beta=0$ . Note that the canonical ensemble at the infinite temperature is proportional to the identity operator as $\hat{\rho}_{\mathrm{can}}=\frac{1}{D}$ , where $D:=\mathrm{dim}[\mathcal{H}]$ is the dimension of the entire Hilbert space.

As observables, we consider the Fourier transform of the hard-core boson operators and a (renormalized) mode occupation number with $\mathbf{k}=(k_{x},k_{y},k_{z})$ . Then we take a marginal distribution of the occupation number by integrating out $k_{z}$ , as $\hat{n}(k_{x},k_{y})=\frac{1}{2^{2}N_{b}}\sum_{{i,j}}\delta_{z_{i},z_{j}}e^{-i\mathbf{k\cdot(r_{i}-r_{j})}}\hat{b}^{\dagger}_{i}\hat{b}_{j}$ . Here we denote $\mathbf{r_{i}}=(x_{i},y_{i},z_{i})$ by the coordinate of the site $i$ (the lattice constant is set to unity). We will especially consider $\hat{n}_{00}:=\hat{n}(0,0),{n}_{01}:=\hat{n}(0,\pi)$ , and $\hat{n}_{11}:=\hat{n}(\pi,\pi)$ in the following discussions. We note that these observables are macroscopic in the sense that they can be written as the averages of local operators on each layer.

We show typical time evolutions of the expectation value of $\hat{n}_{01}$ for Case A in Figure 6.3. The left and right figures respectively show the result of the 1/3-filling ( $L=N_{b}=6$ ) and that of the 1/6-filling ( $L=8,N_{b}=4$ ). We also show the predictions of the diagonal ensemble, the canonical ensemble, and the GGE, which are respectively given by $\braket{\hat{n}_{01}}_{\mathrm{d}}:=\mathrm{Tr}[\hat{n}_{01}\hat{\rho}_{\mathrm{d}}]$ , $\braket{\hat{n}_{01}}_{\mathrm{can}}:=\mathrm{Tr}[\hat{n}_{01}\hat{\rho}_{\mathrm{can}}]$ and $\braket{\hat{n}_{01}}_{\mathrm{GGE}}:=\mathrm{Tr}[\hat{n}_{01}\hat{\rho}_{\mathrm{GGE}}]$ . We can see that the expectation values for large $t$ are well described by the prediction of the diagonal ensemble with small temporal fluctuations. As shown in the figure, we find that the GGE coincides with the prediction of the diagonal ensemble (Fig. 6.3) very well, whereas the canonical ensemble does not. In fact, the canonical ensemble at $\beta=0$ always gives††† For example, for ${\hat{n}_{01}}$ , we obtain

$\displaystyle\braket{\hat{n}_{01}}_{\mathrm{can}}=\frac{1}{2^{2}N_{b}D}\sum_{i,j}e^{-i\mathbf{k\cdot(r_{i}-r_{j})}}\delta_{z_{i},z_{j}}\mathrm{Tr}[\hat{b}^{\dagger}_{i}\hat{b}_{j}].$

(6.17)

Since the trace for $i\neq j$ vanishes, it becomes $\frac{1}{2^{2}N_{b}D}\sum_{i}\mathrm{Tr}[\hat{b}^{\dagger}_{i}\hat{b}_{i}]$ . Then, by treating the trace using the energy eigenstates, we have

$\displaystyle\braket{\hat{n}_{01}}_{\mathrm{can}}=\frac{1}{2^{2}N_{b}D}\sum_{i}\sum_{\alpha}\braket{E_{\alpha}}{\hat{b}^{\dagger}_{i}\hat{b}_{i}}{E_{\alpha}}=\frac{1}{4},$

(6.18)

where we have used $\sum_{i}\braket{E_{\alpha}}{\hat{b}^{\dagger}_{i}\hat{b}_{i}}{E_{\alpha}}=N_{b}$ . Similarly, we obtain $\braket{\hat{n}_{00}}_{\mathrm{can}}=\braket{\hat{n}_{11}}_{\mathrm{can}}=\frac{1}{4}$ .

[TABLE]

which is not equal to $\braket{\hat{n}_{00}}_{\mathrm{d}},\braket{\hat{n}_{01}}_{\mathrm{d}}$ , and $\braket{\hat{n}_{11}}_{\mathrm{d}}$ in general. This result highlights our key finding independent of the value of fillings: we need the GGE to describe the stationary state in a nonintegrable system with an extensive number of local symmetries. We will verify this observation in more detail by focusing on the case of the 1/3-filling ( $L=N_{b}$ ) in the following sections.

6.4 Verification of the GGE by the finite-size scaling analysis

In this section, we quantitatively analyze how well the GGE describes the stationary state compared with the canonical ensemble by the finite-size scaling analysis. We define the relative difference between the canonical ensemble/GGE and the diagonal ensemble as follows:

[TABLE]

Here $\hat{n}$ represents $\hat{n}_{00},\hat{n}_{01}$ or $\hat{n}_{11}$ , and $\overline{\cdot\cdot\cdot}$ denotes the average over 20 sample Hamiltonians with different values of randomness in $t_{ij}$ (see Eq. (6.8)).

As shown in Fig. 6.4, the relative difference of the GGE is about ten times smaller than that of the canonical ensemble (note that the graph is displayed using the semi-log scale). We note that the relative difference stays more than 10% for the canonical ensemble even if we increase $L$ , whereas it rapidly decreases with $L$ for the GGE.

Figure 6.4 also shows that there is some difference between Case A and Case B if we consider the $L$ -dependence of the relative difference of the GGE: the relative difference decreases less rapidly in Case A than in Case B with increasing the system size. This results from the mixing of the symmetry sectors with negative parities in Case A. We will examine this difference in detail in the next section.

6.5 Verification of the ETH for each symmetry sector

In this section, we analyze the ETH for diagonal matrix elements of observables $\hat{\mathcal{O}}$ and seek for the reason why the GGE works well and the canonical ensemble does not work in our model. We note that we will only treat the ETH for diagonal matrix elements and call it just as “the ETH” in the following discussions. As we have seen in Chapters 2 and 3, the ETH states that $\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\alpha}}$ is equal to the spectral average within a small energy shell in the thermodynamic limit. We will call $\mathcal{O}_{\alpha\alpha}=\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\alpha}}$ as the eigenstate expectation value (EEV). When the $|c_{\alpha}|$ ’s have a sharp peak around the average energy, the ETH justifies the microcanonical ensemble and the canonical ensemble (see Chapter 2).‡‡‡We assume the equivalence of the microcanonical ensemble and the canonical ensemble.

Figure 6.5 shows the EEVs for $\hat{n}_{01}$ , indicating that the ETH does not hold true for the entire spectrum. The fluctuations of EEVs (EEV fluctuations) $\Delta\mathcal{O}_{\alpha}$ indicated by a pair of arrows in Fig. 6.5 do not decrease even when $L$ becomes larger.§§§We note that $\Delta\mathcal{O}_{\alpha}$ is regarded as the second term on the right-hand side of Eq. (3.35) for $\alpha=\beta$ . We note that similar results are found for $\hat{n}_{00}$ and $\hat{n}_{11}$ .

Nevertheless, we have found that the EEV fluctuations decrease if we consider only eigenstates that are restricted to each symmetry sector. For example, in Fig. 6.5, each region encircled by dotted curves shows the restricted eigenstates that belong to $\mathcal{H}_{\mathbf{q}_{1}}$ . In this sector, the EEV fluctuations seem to decrease with increasing the system size. To be more precise, we define the EEV fluctuation $\Delta\mathcal{O}_{\gamma}^{(\mathbf{q})}$ in sector $\mathcal{H}_{\mathbf{q}}$ by

[TABLE]

where $\ket{E_{\gamma}^{(\mathbf{q})}}$ is an energy eigenstate in $\mathcal{H}_{\mathbf{q}}$ with an energy $E_{\gamma}^{(\mathbf{q})}$ , and $\gamma\>(1\leq\gamma\leq\mathrm{dim}[\mathcal{H}_{\mathbf{q}}])$ is a label of the eigenstate. We have also defined the spectral average in the sector $\mathcal{H}_{\mathbf{q}}$ within a small energy shell (cf. Eq. (3.28)):

[TABLE]

where $\mathcal{N}^{(\mathbf{q})}_{E,\omega_{s}}$ is the number of the energy eigenstates in $\mathcal{H}_{\mathbf{q}}$ within the energy shell $[E-\omega_{s},E+\omega_{s}]$ . We also define the average of (the generalized version of) the microcanonical ensemble $\braket{\hat{\mathcal{O}}}_{\mathrm{mic}}^{(\mathbf{q})}(E)$ in the sector $\mathcal{H}_{\mathbf{q}}$ by replacing $\omega_{s}$ in Eq. (6.22) with the microcanonical energy width $\Delta E$ .¶¶¶We note that, in a marcroscopic system, $\Delta E$ may be subextensive with the system size, whereas $\omega_{s}(<\omega_{\mathrm{sh}})$ is expected to remain small in many cases [41] (also see Chapter 3).

Figure 6.6 illustrates the validity of the ETH for each symmetry sector. We evaluate the typical magnitude of $\Delta\mathcal{O}_{\gamma}^{(\mathbf{q})}$ with $\sigma[{\Delta{\mathcal{O}}^{(\mathbf{q})}}]$ , where $\sigma[{\Delta{\mathcal{O}}^{(\mathbf{q})}}]$ is the standard deviation of $\braket{E_{\gamma}^{(\mathbf{q})}}{\hat{\mathcal{O}}}{E_{\gamma}^{(\mathbf{q})}}$ within the energy shell $[E-\omega_{s},E+\omega_{s}]$ for $\mathbf{q}_{1}$ and $\mathbf{q}_{2}:=(-1,+1,...,+1)$ .∥∥∥We note that $\sigma[{\Delta{\mathcal{O}}^{(\mathbf{q})}}]$ is a generalized version of the first equation in Eq. (4.4). As shown in the figure, both $\sigma[{\Delta{\mathcal{O}}^{(\mathbf{q_{1}})}}]$ and $\sigma[{\Delta{\mathcal{O}}^{(\mathbf{q_{2}})}}]$ rapidly decrease with increasing the system size $L$ .******Strictly speaking, the decay of the standard deviations is directly related to the weak ETH. However, (exponentially) fast decrease of them is observed in systems where the strong ETH is also expected to hold [31]. We also note that the EEV fluctuations are evaluated by the deviations from linear fittings within the small energy shell (see the first footnote of Subsection 4.3.2).

Assuming the ETH for each sector, the diagonal ensemble is approximately written as a statistical mixture of the microcanonical ensembles in all sectors:

[TABLE]

which is obtained by applying the derivation in Eq. (2.3.2) for each sector. Here,

[TABLE]

represents the occupation ratio of the sector $\mathcal{H}_{\mathbf{q}}$ , $\mathcal{\hat{P}}_{\mathbf{q}}$ is the projection operator onto the sector $\mathcal{H}_{\mathbf{q}}$ , and $c_{\gamma}^{(\mathbf{q})}:=\braket{E_{\gamma}^{(\mathbf{q})}}{\psi_{0}}$ . Moreover, we define

[TABLE]

as the average energy in sector $\mathcal{H}_{\mathbf{q}}$ . To derive Eq. (6.23), we have assumed that $|c_{\gamma}^{(\mathbf{q})}|$ ’s have a sharp peak around $E_{\mathbf{q}}$ (cf. Eq. (2.3.2)). Note that Eq. (6.23) depends on $2|G|=2^{L+1}$ parameters $p_{\mathbf{q}}$ and $E_{\mathbf{q}}$ , whereas the diagonal ensemble depends on $\mathrm{dim}[\mathcal{H}]=\frac{(3L)!}{L!(2L)!}(\gg 2|G|)$ parameters.

Now, we define what we call the “restricted GGE (rGGE)” with $2^{L+1}$ conserved quantities from which $p_{\mathbf{q}}$ and $E_{\mathbf{q}}$ are determined. By taking $\hat{Q}_{0}:=\hat{H},\hat{Q_{l}}:=\hat{P}_{l}\>(1\leq l\leq L)$ and their higher-order correlations as such conserved quantities, we construct the rGGE as

[TABLE]

where $Z_{\mathrm{rGGE}}:=\mathrm{Tr}[\exp(\cdots)]$ (see Refs. [47, 175, 176] for similar concepts). Parameters $\{\kappa_{lm\cdot\cdot\cdot}\}$ are uniquely determined from the initial condition as

[TABLE]

From Eq. (6.26) we obtain $\mathrm{Tr}[\hat{\rho}_{\mathrm{rGGE}}\mathcal{\hat{P}}_{\mathbf{q}}]=p_{\mathbf{q}}$ and $\frac{1}{p_{\mathbf{q}}}\mathrm{Tr}[\hat{\rho}_{\mathrm{rGGE}}\mathcal{\hat{P}}_{\mathbf{q}}\hat{H}\mathcal{\hat{P}}_{\mathbf{q}}]=E_{\mathbf{q}}$ , which justifies the rGGE as the ensemble that describes a stationary state.

We conjecture that the GGE defined in Eq. (6.11) can approximate the rGGE if we consider observables that can be written as the sums (or averages) of operators whose supports lie in each layer. As we have seen in Sec. 5.3, a related conjecture (i.e., the conjecture of the truncated GGE) was made in Ref. [51], which states that we can remove those conserved quantities that are less local than the observables in constructing the GGE. In our model, the multiple products of $\hat{Q}_{l}$ ’s in Eq. (6.26) have supports over the multiple layers. They are thus expected to be excluded from the rGGE for $\hat{n}_{00},\hat{n}_{01}$ , and $\hat{n}_{11}$ , which can be written as the averages of the local operators that have supports in each layer.

Before closing this section, we briefly explain why $\overline{\delta n_{\mathrm{GGE}}}$ is less sensitive to the change of $L$ for Case A than for Case B. In usual nonintegrable systems, the EEV fluctuations $\Delta\mathcal{O}_{\alpha}$ rapidly decrease with increasing $\mathrm{dim}[\mathcal{H}]$ as we have seen in Chapter 3. We expect that the restricted EEV fluctuations $\Delta\mathcal{O}_{\gamma}^{(\mathbf{q})}$ also decrease with increasing $\mathrm{dim}[\mathcal{H}_{\mathbf{q}}]$ . When more negative $\mathbb{Z}_{2}$ parities ( $q_{l}=-1$ ) exist in the sectors, they have smaller Hilbert dimensions, which results in a larger $\Delta\mathcal{O}_{\gamma}^{(\mathbf{q})}$ . Then the EEV fluctuations remain larger for Case A because of the sectors with negative $\mathbb{Z}_{2}$ parities when $L$ increases. Thus, $\overline{\delta n_{\mathrm{GGE}}}$ is less sensitive to $L$ in Case A than in Case B.

6.6 Models with fewer local symmetries

In this section, we show that the canonical ensemble well describes our macroscopic observables and that the GGE is not necessary when the number of the local symmetries does not increase with increasing $L$ . To demonstrate this, we first introduce two models with fewer local $\mathbb{Z}_{2}$ symmetries.

In Figure 6.7 (ii), we show model (b), which has only one global $\mathbb{Z}_{2}$ symmetry. The difference from model (a) is that bosons can hop vertically between the L (or R) sites of the neighboring layers. We assume that $t_{\mathrm{L}l,\mathrm{L}(l+1)}=t_{\mathrm{R}l,\mathrm{R}(l+1)}\neq 0$ , which leads to a global conserved quantity $\prod_{l=1}^{L}\hat{P}_{l}$ . This operator swaps the sites R and L on each layer simultaneously.

In Fig. 6.7 (iii), we show model (c), which has an $L$ -independent number $F\>(F=0,1,2,3)$ of local $\mathbb{Z}_{2}$ symmetries. In this model $t_{\mathrm{M}l,\mathrm{L}l}=t_{\mathrm{M}l,\mathrm{R}l}$ is satisfied only for $l\leq F$ due to the additional randomness introduced in the other layers. Then, it has local $\mathbb{Z}_{2}$ symmetries only at the layers with $1\leq l\leq F$ for $F>0$ . In addition, model (c) with $F=0$ is a usual nonintegrable system that conserves only energy.

Figure 6.8 shows the validity of the canonical ensemble in the models (b) and (c) by showing the $L$ -dependence of $\overline{\delta n_{01,\mathrm{can}}}$ . In the models (b) and (c) with $F=0$ , $\overline{\delta n_{01,\mathrm{can}}}$ rapidly decreases with increasing the system size down to about one tenth compared with (a) at $L=6$ . These results justify use of the canonical ensemble in these models. In the models (c), the $L$ -dependence is much less evident for $F\geq 1$ than $F=0$ . Nevertheless, $\overline{\delta n_{01,\mathrm{can}}}$ decreases even for $F=3$ , which again implies the validity of the canonical ensemble. Similar results are obtained for other macroscopic observables such as $\overline{\delta n_{00,\mathrm{can}}}$ and $\overline{\delta n_{11,\mathrm{can}}}$ . We attribute these results to the usual ETH, which holds weakly for $F\geq 1$ (see Appendix C.3).

Figure 6.9 illustrates the $F$ -dependence of $\overline{\delta n_{\mathrm{can}}}$ with $L=6$ . The figure shows that the canonical ensemble works better when the value of $F$ (or equivalently, $F/L$ ) is smaller. This result implies that the expectation values of macroscopic observables in the stationary state can be predicted by the canonical ensemble if the number of symmetries are much less than the system size.

6.7 Conclusions and discussions

Let us summarize this chapter and make some discussions. We have shown that stationary states for the nonintegrable model with an extensive number of local $\mathbb{Z}_{2}$ symmetries (Fig. 6.1) can be described by the GGE and not by the canonical ensemble. We find that the ETH breaks down for the entire spectrum, but it holds true for each symmetry sector. We have discussed that this restricted ETH leads to the GGE if we neglect multiple correlations among local conserved quantities. Next, by studying the models with only one global $\mathbb{Z}_{2}$ symmetry or the $L$ -independent number of local $\mathbb{Z}_{2}$ symmetries, we find that the canonical ensemble works well for predicting the expectation values of the macroscopic observables in these models. Our results have clarified that the GGE is necessary to describe stationary states if the system has an extensive number of local symmetries, even if they do not label every energy eigenstate.

We have several open problems about the relation between our GGE and stationary states. First, the initial states that we have used are almost homogeneous over different layers, which makes $\lambda_{l}$ ’s in Eq. (6.11) close to one another. It is of interest to investigate whether Eq. (6.11) is valid even for inhomogeneous initial states. Second, model (a) has an extensive number of the most local conserved quantities $\hat{P}_{l}$ , from which we can construct the GGE that describes the observables defined in each layer. On the other hand, in total, this model has more than extensive number of the local conserved quantities such as $\hat{P}_{1}\hat{P}_{2}$ , which may affect the expectation values of less local observables. Therefore, it is an open problem how far we can truncate the rGGE to predict the expectation values of given observables in stationary states. The third problem is to clarify how many symmetries are enough to prevent macroscopic observables from relaxing to the stationary states that can be described by the canonical ensemble. In other words, what will stationary state be when the system has local symmetries which increase with increasing $L$ in a non-extensive manner? Since $L$ increases much faster than the number of local symmetries in this case, this question cannot be answered by our exact diagonalization analysis. These questions are left for the future investigation.

Chapter 7 Conclusions and Future prospects

7.1 Conclusions

In this thesis, we have investigated how and when isolated quantum systems approach thermal equilibrium with an emphasis on the nonintegrability of systems. Previous studies have strongly indicated that nonintegrable systems that conserve only energy approach stationary states that are described by the (micro)canonical ensemble. The eigenstate thermalization hypothesis (ETH) is one possible candidate that justifies thermalization in isolated quantum systems. However, the rigorous proofs and definite criteria for the validity of the ETH are far from trivial. Thus, for understanding the mechanisms of the ETH, it is important to provide clues by possible analytical explanations and numerical simulations.

In the first part of our work, we have shown that random matrix theory can predict the ETH and its finite-size corrections (within some energy shell) in nonintegrable systems and for a wide class of observables. We have first refined and generalized the RMT predictions to investigate finite-size corrections of the ETH. We have especially focused on two types of quantities of matrix elements: one is the ratios between standard deviations of diagonal and off-diagonal matrix elements, and the other is the probability densities of the off-diagonal matrix elements. The RMT predicts that these quantities have universal features that depend on the anti-unitary symmetries of Hamiltonians, the anti-unitary symmetries of observables, and the “singularities” of observables.***Here singular observables are those that do not satisfy Eq. (4.40). Next, we have numerically investigated matrix-element statistics of various observables in nonintegrable systems that only conserve energy. We have demonstrated that the finite-size corrections of the ETH are in excellent agreement with the predictions of RMT for a wide class of observables with various symmetries, including many-body correlations and singular operators. We have also remarked, however, that counterexamples can always be constructed even among simple observables.

Nonintegrable systems with additional conserved quantities have been investigated much less. It is expected that stationary states are effectively described by the GGE in integrable systems, which have conserved quantities that determine every energy eigenstate. Then, it is of interest whether nonintegrable systems relax to non-thermal stationary states (possibly described by the GGE) if they have additional conserved quantities due to symmetries.

In the second part of our work, we have shown that stationary states for a nonintegrable model with an extensive number of local $\mathbb{Z}_{2}$ symmetries can effectively be described by the GGE and not by the canonical ensemble. For this model, the ETH holds true only for each symmetry sector instead of the entire spectrum. We have discussed that this restricted ETH leads to the GGE if we neglect multiple correlations among local conserved quantities. We have also studied the models with only one global $\mathbb{Z}_{2}$ symmetry or the $L$ -independent number of local $\mathbb{Z}_{2}$ symmetries. We find that the canonical ensemble works well for predicting the expectation values of the macroscopic observables in these models. Our results have clarified that the GGE is necessary to describe stationary states in the presence of an extensive number of local symmetries, even if they do not label each energy eigenstate.

7.2 Future prospects

Before concluding this thesis, we would like to discuss several future prospects.

First, we believe that the success of the RMT model in nonintegrable systems for various operators will enable us to investigate finite-size corrections of quantum statistical mechanics. Although we have considered only matrix elements of observables in this thesis, we expect that our method can similarly be applied to more directly observable quantities in small nonintegrable quantum systems such as temporal fluctuations after a quench which have in fact experimentally been observed [89]. The foundations of statistical mechanics in such systems may be important for the basics of quantum thermodynamics in small systems [177]. We also expect that properly applying RMT might reveal nontrivial aspects of nonequilibrium dynamics, which has attracted a lot of attention from high-energy physics to condensed matter physics [178, 179, 180, 181, 182].

Second, we wonder how the notion of the GGE can be developed and applied to characterize nonequilibrium dynamics in macroscopic systems. It is known that if prethermalization occurs due to approximate conserved quantities, the prethermalized plateau is well described by the GGE constructed from these quantities [94]. Recent studies have also shown that the GGE can be applied to describe nonequilibrium stationary states [161, 162, 163] and “generalized hydrodynamics” [183] in integrable systems. Our work has suggested that these works may be generalized to systems with sufficiently many conserved quantities irrespective of integrability. We expect that by properly choosing approximately conserved quantities, nonequilibrium dynamics can be captured even in nonintegrable systems.

Appendix A Review of random matrix theory (RMT)

In this appendix we give a brief review of random matrix theory (RMT). We first briefly review the history of RMT. Next, we explain the general definitions and classifications of the Gaussian random matrices. Then we explain the statistics that universally emerges in RMT, as a supplement to the main text. Finally, we review the notion of the ergodicity of random matrices. More detailed calculations and miscellaneous topics are given in Refs. [148, 184, 146, 185, 144, 186, 151].

A.1 History

Random matrix theory (RMT) was first applied to physics in 1951 [187] by Eugene P. Wigner, who investigated the excitation spectrum of nuclei. He conjectured that statistical properties of the eigenvalues of complex nuclei can be described by the eigenvalues of randomly generated matrices. After Wigner’s seminal work, Dyson published a series of papers [188, 189, 190, 191, 192] on the mathematical formulations and generalizations of random matrices in 1962. Of particular importance is what is called the “threefold way,” the classification of random matrices in terms of an anti-unitary symmetry operator that commutes with the Hamiltonian. He also formulated other important concepts, such as the circular ensembles or the Brownian motion of energy levels [148].

After its basics were established by Dyson, RMT was actively applied in the field of nuclear physics in the 1960’s and the 1970’s. On one hand, experimentalists verified that the level fluctuations in the spectrum of nuclei coincide with RMT prediction. On the other hand, theorists made efforts to develop RMT for a better description of the experiments. For example, they introduced the notion of the random S matrices and the embedded random ensembles. Although the applications of RMT were mainly made in nuclear physics in these decades, we remark that RMT was also applied to the field of ecological systems [193].

RMT greatly developed both in foundations and applications in the 1980’s and the 1990’s. One important finding was the connection between RMT and mesoscopic physics in quantum transport phenomena (including the theory of the universal conductance fluctuations) or disordered systems (including the Anderson localization transition). Another notable application was made in the field of quantum chaos, where it was conjectured that the spectrum of a quantum Sinai billiard and the spectra of more general quantum chaotic systems are described by those of random matrices (i.e., the Bohigas-Giannoni-Schmit conjecture). As a mathematical development, the technique of the supersymmetry was introduced, which enabled one to easily calculate the propagator of a random Hamiltonian. We also remark that the classification of random matrices in terms of symmetries was enlarged by Altland and Zirnbauer in 1997 [194] (known as the “tenfold way”).

RMT has further been utilized in describing various fields of physics recently. For example, distributions of height fluctuations that appear in the Kardar-Parisi-Zhang equations [195] have turned out to be described by the Tracy-Widom distributions***The Tracy-Widom distribution is a distribution of the maximum eigenvalue of the random matrix. of RMT [196]. Topological insulators and superconductors are classified and investigated using the Altland-Zirnbauer tenfold way [197]. Last but not least, RMT is closely related to the eigenstate thermalization hypothesis (ETH) in isolated quantum systems [41] (see Chapter 3). It is expected that the complexity of the energy eigenstates of nonintegrable systems are modeled by RMT. RMT is also expected to be related to the transition between the delocalized phase (where the ETH holds true) and the many-body localized phase (where the ETH does not hold) [198].

A.2 Definitions and classifications

A.2.1 Gaussian ensembles

Let $\hat{H}$ be a $D\times D$ random matrix which is chosen from the probability $P(\hat{H})[d\hat{H}]$ , where $[d\hat{H}]$ is the volume element of $d\hat{H}$ . In the Gaussian ensembles, each matrix element $H_{ij}$ is independent and identically follows a Gaussian distribution. Moreover, $P(\hat{H})$ is invariant under certain symmetry transformations, which we will explain below.

Threefold way by Dyson

Dyson classified Gaussian random matrices in terms of an anti-unitary operator $\hat{T}$ that commutes with $\hat{H}$ . If there exists no such $\hat{T}$ , the ensemble is called the Gaussian unitary ensemble (GUE) because it is invariant under any $D\times D$ unitary matrix. If there exists $\hat{T}$ that satisfies $\hat{T}^{2}=1$ , the ensemble is called the Gaussian orthogonal ensemble (GOE), since it is invariant under any $D\times D$ orthogonal matrix. Finally, if $\hat{T}^{2}=-1$ , the ensemble is called the Gaussian symplectic ensemble (GSE), which is invariant under any symplectic transformation.

Firstly, matrices in the GUE have $D^{2}$ independent degrees of freedom. The probability measure for the GUE can be written as follows:

[TABLE]

where $d^{2}H_{ij}=d\mathrm{Re}[H_{ij}]d\mathrm{Im}[H_{ij}]$ and $c_{2}$ is a normalization constant.

Secondly, matrices in the GOE have $D(D+1)/2$ independent degrees of freedom because each element can be taken as real variables (i.e., $H_{ij}=H_{ij}^{*}=H_{ji}$ ). The probability measure for the GOE can be written as follows:

[TABLE]

where $c_{1}$ is a normalization constant.

Finally, matrices in the GSE are written in terms of the quaternion notation as

[TABLE]

where $\hat{\sigma}^{(\gamma)}$ ’s are the Pauli matrices and $\hat{h}^{(\mu)}$ ’s $\>(\mu=0,1,2,3)$ are $(D/2)\times(D/2)$ matrices (we assume that $D$ is even in this case). Let us assume that $\hat{T}$ is the time-reversal operator (similar discussions can be applied to other anti-unitary symmetries). Then the $\hat{T}$ -invariance and the Hermiticity lead to the conditions $h_{nm}^{(\mu)}=h_{nm}^{(\mu)*}\>(\mu=0,1,2,3)$ , $h_{mn}^{(0)}=h_{nm}^{(0)}$ and $h_{mn}^{(\gamma)}=-h_{nm}^{(\gamma)}\>(\gamma=1,2,3)$ ; we thus have $D(D-1)/2$ independent variables. The probability measure for the GSE can be written as follows:

[TABLE]

where $c_{4}$ is a normalization constant. Note that we have used the notation that enables us to write down

[TABLE]

for all symmetry classes ( $\beta=1,2,$ and 4 for the GOE, the GUE, and the GSE, respectively).

Next, we consider the distributions of eigenvalues and eigenvectors of random matrices. Let us first consider the case with the GUE. In this case the Hamiltonian can be diagonalized by a unitary matrix $U$ as

[TABLE]

where $E=\mathrm{diag}(x_{1},\cdots,x_{D})$ represents a set of the eigenvalues. By taking the derivative, we obtain

[TABLE]

and then

[TABLE]

where we have used $dU^{\dagger}U=-U^{\dagger}dU$ . Each matrix element on the right-hand side of this equation can be written down as

[TABLE]

Since $[d\hat{H}]$ is the invariant measure of the unitary transformation, namely $[U^{\dagger}d\hat{H}U]=[d\hat{H}]$ (for a proof, see Ref. [199]), we obtain

[TABLE]

Here we note that the complex nature of the matrix elements leads to the factor $|x_{i}-x_{j}|^{2}$ , which represents the quadratic level repulsion. In fact, by integrating out the variables for eigenvectors and taking the Gaussian weight, we obtain the eigenvalue distributions for the GUE as follows:

[TABLE]

The eigenvector part, $\prod_{i>j}\mathrm{Re}(U^{\dagger}dU)_{ij}\mathrm{Im}(U^{\dagger}dU)_{ij}$ , is invariant under an arbitrary unitary transformation, so is the Haar measure on the unitary group. Similar considerations hold true for the case with the GOE and the GSE. For the GOE, we find

[TABLE]

where $\prod_{i>j}(O^{T}dO)_{ij}$ is the Haar measure on the orthogonal group. We can obtain the result for the GSE, too, and finally obtain the unified formula for the distributions of eigenvalues as

[TABLE]

Tenfold way by Altland and Zirnbauer

Do we have universality classes other than the GUE, the GOE, and the GSE? In fact, in the context of the QCD, the so-called chiral random matrix ensembles (the chGUE, the chGOE, and the chGSE) were found. Matrices in these ensembles have chiral symmetries: the corresponding operator is unitary and anti-commutes with the Hamiltonians. Altland and Zirnbauer then added four more classes and completed the ten classes focusing on the role of symmetries. These classes are distinguished by an anti-unitary symmetry operator $\hat{T}$ that commutes with the Hamiltonian (which we call a time-reversal symmetry in this subsection), an anti-unitary symmetry operator $\hat{\Pi}$ that anti-commutes with the Hamiltonian (a particle-hole symmetry), and a unitary symmetry operator $\hat{C}$ that anti-commutes with the Hamiltonian (a chiral/sublattice symmetry).

In Fig. A.1, we show the ten classifications with respect to these symmetries. Here, we assume that the values of $\hat{T}^{2}$ and $\hat{\Pi}^{2}$ will be either $+1$ or $-1$ (times the identity ooperator). We note that we do not have to consider the case where two symmetries of the same type exist. For example, if two anti-unitary symmetry operators $\hat{T}_{1}$ and $\hat{T}_{2}$ exist, $\hat{T}_{1}\hat{T}_{2}$ also becomes a symmetry operator that commutes with the Hamiltonian. Since $\hat{T}_{1}\hat{T}_{2}$ is unitary, we can decompose the Hamiltonian into irreducible blocks by this symmetry and treat these individual blocks again. This discussion can be used for $\hat{\Pi}$ and $\hat{C}$ as well. Similarly, if $\hat{\Pi}$ and $\hat{T}$ exist, $\hat{T}\hat{\Pi}$ becomes a unitary symmetry operator that anti-commutes with the Hamiltonian, which plays the role of $\hat{C}$ . Thus, the presence of the time-reversal and particle-hole symmetry necessarily leads to the chiral symmetry. We also note that these classes are called Class A, AI, …, CI, which are adapted from the mathematical terminology due to Élie Cartan.

A.3 Statistics in Gaussian random matrices

In this section, we explain some details of the statistics of RMT to complement the main text.

A.3.1 Level-spacing statistics

As we have seen in Chapter 3, the level-spacing distributions of the random matrix are approximately described by Eq. (3.12). In fact, this is the result obtained from Eq. (A.13) with $D=2$ . We can write the level-spacing distribution as

[TABLE]

where $x_{1}^{\prime}$ and $x_{2}^{\prime}$ are renormalized levels. From the normalization conditions $\int dsP(s)=\int dssP(s)=1$ , we can determine $C_{\beta}$ and $A_{\beta}$ , which results in Eq. (3.12). This is the result for $N=2$ , but it is known that it can approximate the level-spacing distribution for $N\gg 1$ as well [148, 184].

A.3.2 Distributions of eigenstates

Let us consider a single eigenstate $\ket{E_{\alpha}}$ and its components with respect to a fixed basis set $\{\ket{a_{i}}\}\>(1\leq i\leq d)$ . In the case of the GUE, the joint probability of finding $z_{1}=\mathrm{Re}[\braket{a_{1}}{E_{\alpha}}],z_{2}=\mathrm{Im}[\braket{a_{1}}{E_{\alpha}}],\cdots,z_{2d-1}=\mathrm{Re}[\braket{a_{d}}{E_{\alpha}}],z_{2d}=\mathrm{Im}[\braket{a_{d}}{E_{\alpha}}]$ is

[TABLE]

By integrating out $z_{2l+1},\cdots,z_{2d}$ , we obtain the marginal distribution of $z_{1},\cdots,z_{2l}$ as

[TABLE]

In particular, we can take $l=1$ and find the distribution of $y=z_{1}^{2}+z_{2}^{2}=|\braket{a_{1}}{E_{\alpha}}|^{2}$ as

[TABLE]

in the large- $d$ limit. This distribution is called the Porter-Thomas distribution.

Similarly, for the case with GOE, we can consider the joint probability of finding $z_{1}=\braket{a_{1}}{E_{\alpha}},\cdots,z_{d}=\braket{a_{d}}{E_{\alpha}}$ as†††We take the basis set such that each $z_{l}$ becomes real.

[TABLE]

Consequently, we have

[TABLE]

and the Porter-Thomas distribution (the probability of finding $y=z_{1}^{2}=|\braket{a_{1}}{E_{\alpha}}|^{2}$ ) for a large $d$ as

[TABLE]

We briefly consider the eigenvector statistics for the GSE in Appendix B.3.

A.4 Ergodicity of Gaussian random matrices

In this section, we review the so-called “ergodicity” of random matrices, which connects the spectral average and the ensemble average. We first consider the general framework and then prove the ergodicity for the second moments of off-diagonal matrix elements.

The ergodicity of random matrices means that, for most of the fixed Hamiltonians that are sampled from certain random ensemble, the spectral average is approximated by the ensemble average.‡‡‡As we have mentioned in Chapter 3, the ergodicity of random matrices is different from the usual terminology of the ergodicity of dynamical systems, which states that the phase-space average is equal to the long-time average. Let us begin with a function $g_{\alpha}$ , which depends on a single energy eigenstate $\ket{E_{\alpha}}$ (or its eigenvalue $E_{\alpha}$ ). We can consider the spectral average of $g_{\alpha}$

[TABLE]

where $\mathcal{T}$ denotes a set of the labels of the samplings and $d_{s}$ is the number of the samplings. We can also consider the ensemble average as

[TABLE]

We note that the ensemble average is often easy to calculate analytically by RMT.

To prove the ergodicity, we first have to show

[TABLE]

which is valid if $\overline{g_{\alpha}}$ is constant in $\alpha\in\mathcal{T}$ . Next, we need to show

[TABLE]

for $d,d_{s}\rightarrow\infty$ . If Eqs. (A.23) and (A.24) are satisfied in this limit, we have

[TABLE]

for most of the Hamiltonians in the random ensemble.

We prove the ergodicity for the second moments of the off-diagonal matrix elements. For simplicity, we consider a nonsingular observable $\hat{O}$ and $d\times d$ random matrices in the GUE. Since we have seen that $|O_{\alpha\beta}|^{2}\sim d^{-1}$ in the main text, we define $g_{\alpha\beta}=d|O_{\alpha\beta}|^{2}\>(\alpha\neq\beta)$ to get a nontrivial result. In this case, we define the spectral average

[TABLE]

where we assume that there is no degeneracy in the spectrum.

First, Eq. (A.23) is valid because

[TABLE]

where

[TABLE]

for a large $d$ .

Next, for Eq. (A.24), we have

[TABLE]

where $\alpha\neq\alpha^{\prime}$ and $\beta\neq\beta^{\prime}$ . Since $\left(\overline{g_{\alpha\beta}g_{\gamma\delta}}-\overline{g_{\alpha\beta}}\>\overline{g_{\gamma\delta}}\right)$ is at most of order one, the first and second terms in the final expression of Eq. (A.4) vanish when $d_{s}$ is large. Moreover, when $d$ is large, the correlation between $g_{\alpha\beta}$ and $g_{\alpha^{\prime}\beta^{\prime}}$ is expected to vanish: $\overline{g_{\alpha\beta}g_{\alpha^{\prime}\beta^{\prime}}}=\overline{g_{\alpha\beta}}\>\overline{g_{\alpha^{\prime}\beta^{\prime}}}+\mathrm{o}(1)$ [146].§§§To show this, we apply the method in Subsection 4.2.2: we first move $\ket{E_{\alpha}}$ in the $(d-3)$ -dimensional Hilbert space that is orthogonal to $\ket{E_{\beta}},\ket{E_{\alpha^{\prime}}}$ , and $\ket{E_{\beta^{\prime}}}$ . Then, by moving $\ket{E_{\beta}}$ in the $(d-2)$ -dimensional Hilbert space that is orthogonal to $\ket{E_{\alpha^{\prime}}}$ and $\ket{E_{\beta^{\prime}}}$ , we obtain the result.

Then Eq. (A.24) holds true in the limit $d,d_{s}\rightarrow\infty$ . We expect that the ergodicity holds true similarly for higher moments and distributions of diagonal and off-diagonal matrix elements in other classes of random matrices. Some other aspects of the ergodicity (e.g., level densities and level-spacing distributions) are reviewed in Ref. [146].

Appendix B Detailed derivations in the main text

B.1 Derivation of Eq. (2.26)

We follow the proof of Refs. [7, 200] for finite-dimensional lattice systems. Let us denote the basis set of operators in the subsystem $S$ by $\{\hat{A}_{l}\}_{l=1}^{d_{S}^{2}}$ , where $d_{S}:=\dim[\mathcal{H}_{S}]$ . We can assume the orthonormality condition

[TABLE]

A given operator $\hat{\rho}$ on $S$ can be expanded as

[TABLE]

Then we have

[TABLE]

where $||\hat{\rho}||_{F}:=\sqrt{\mathrm{Tr}[\hat{\rho}^{2}]}$ is the Frobenius norm and we have used the relation $||\hat{\rho}||_{\mathrm{op}}\leq||\hat{\rho}||_{F}$ . Therefore,

[TABLE]

Using Markov’s inequality, we obtain Eq. (2.26).

B.2 Derivation of Eq. (3.2.1)

We follow the supplement of Ref. [40]. First notice an inequality

[TABLE]

Next, for a fixed pair $\rho\neq\sigma$ , we define the following four vectors:

[TABLE]

Then we find

[TABLE]

and hence

[TABLE]

This leads to $\left|\braket{\sigma}{\hat{O}}{\rho}\right|\leq\sum_{\phi=1}^{4}|\delta_{\phi}|/2$ and thus

[TABLE]

where we have used Eq. (B.2).

B.3 Derivation of Eqs. (4.18-4.29)

We first consider the statistics of the eigenstates for the Gaussian symplectic ensemble (GSE). We expand an eigenstate $\ket{E_{\alpha}}$ with respect to the symplectic basis set $\ket{a_{1}},\ket{\tilde{a_{1}}},\ket{a_{2}},\ket{\tilde{a_{2}}},\cdots\ket{a_{d/2}},\ket{\tilde{a_{d/2}}}$ , where $\ket{\tilde{a_{i^{\prime}}}}=\hat{T}\ket{a_{i^{\prime}}}\>(i^{\prime}=1,\cdots,d/2)$ . In this case, the joint (marginal) probability distribution for finding $z_{1}=\mathrm{Re}[\braket{a_{1}}{E_{\alpha}}],z_{2}=\mathrm{Im}[\braket{a_{1}}{E_{\alpha}}],z_{3}=\mathrm{Re}[\braket{\tilde{a_{1}}}{E_{\alpha}}],z_{4}=\mathrm{Im}[\braket{\tilde{a}_{1}}{E_{\alpha}}],\cdots,z_{2l-1}=\mathrm{Re}[\braket{\tilde{a_{l/2}}}{E_{\alpha}}],z_{2l}=\mathrm{Im}[\braket{\tilde{a_{l/2}}}{E_{\alpha}}]$ is***Precisely speaking, in the case of the GSE, we have room to choose the Kramers pair after we sample a Hamiltonian. In other words, we have the freedom to rotate two degenerate eigenstates as $\ket{a_{i^{\prime}}}\rightarrow s\ket{a_{i^{\prime}}}+t\ket{\tilde{a_{i^{\prime}}}},\ket{\tilde{a_{i^{\prime}}}}\rightarrow-t^{*}\ket{a_{i^{\prime}}}+s^{*}\ket{\tilde{a_{i^{\prime}}}}\>\>(|s|^{2}+|t|^{2}=1)$ . We assume that the random average is invariant under this rotation in the degenerate space.

[TABLE]

This distribution is equivalent to the case with the GUE, from which we obtain moments such as

[TABLE]

and

[TABLE]

Next, consider the case where two eigenstates ( $\ket{E_{\alpha}},\ket{E_{\beta}}$ or $\ket{E_{\alpha}},\ket{\tilde{E_{\alpha}}}$ ) are involved. By switching the roles of $\ket{a_{i^{\prime}}}$ and $\ket{E_{\alpha}}$ in the previous results, we get

[TABLE]

Moreover, noting that $\braket{E_{\alpha}}{\tilde{a_{i^{\prime}}}}=-\braket{a_{i^{\prime}}}{\tilde{E_{\alpha}}}$ , we get

[TABLE]

We also consider the equality

[TABLE]

and its random average

[TABLE]

From these equations, we obtain

[TABLE]

because we assume that the random average is invariant under the rotation in the degenerate space (see the footnote). Similarly, we get

[TABLE]

Finally, we consider the random averages that are related to four different inner products. Since $\ket{E_{\beta}}$ and $\ket{\tilde{E_{\beta}}}$ are not statistically distinct with respect to $\ket{E_{\alpha}}$ , we have

[TABLE]

We note that the right-hand side of this equation is

[TABLE]

From this, we obtain

[TABLE]

Similarly, we obtain

[TABLE]

for $i^{\prime}\neq j^{\prime}$ . Next, we consider

[TABLE]

where $i=1,2,\cdots,d$ and $\ket{\tilde{a_{i^{\prime}}}}=\ket{a_{d-i^{\prime}+1}}$ . The three sums consist of $d,d/2\times 2$ , and $d(d-2)$ terms, respectively. Taking the average of this equation, we get

[TABLE]

and then

[TABLE]

Using these formula, we calculate the ensemble average and the variance of the matrix elements. For even observables $\hat{O}\>([\hat{O},\hat{T}]=0)$ , the diagonal term can be similarly calculated as in the case for the GUE, and the matrix elements with respect to the Kramers pairs vanish because of the symmetry (see the main text). For the variance of the off-diagonal matrix elements, we have

[TABLE]

Next, consider odd observables $\hat{O}\>(\{\hat{O},\hat{T}\}=0)$ . The average for diagonal matrix elements is

[TABLE]

For the variance, we have

[TABLE]

For the matrix elements $\braket{E_{\alpha}}{\hat{{O}}}{\tilde{E_{\alpha}}}$ , we have the average $\overline{\braket{E_{\alpha}}{\hat{O}}{\tilde{E_{\alpha}}}}=0$ . The variance can be obtained as

[TABLE]

Finally, for the off-diagonal matrix elements with respect to the eigenstates with different energy, we can find that the average is zero and that the variance is

[TABLE]

B.4 Justification of $\sigma^{2}\simeq\mathcal{V}/d$ in the RMT model (Subsection 4.2.2)

From Eq. (4.38), we have

[TABLE]

Then, if we assume the ergodicity of random matrices (see Appendix A.4), we can replace $\overline{|O_{\alpha\beta}|^{2}}$ with the spectral average. Thus, we have $\frac{\mathcal{V}}{d}\simeq\sigma^{2}$ .

B.5 Occupation ratios of each symmetry sector in Sec. 6.3

In this section, we calculate the occupation ratios $p_{\mathbf{q}}$ in Eq. (6.24) for the 1/3-filling. We first note that the relation

[TABLE]

holds true. We then note that the state in the curly brackets on the right-hand side is an eigenstate of the symmetry operators $(\hat{P}_{1},...,\hat{P}_{L})$ with the eigenvalues $(q_{1},...,q_{L})$ . Thus the normalized projection operator onto $\mathbf{q}$ is written as $\hat{\mathcal{P}}_{\mathbf{q}}=\frac{1}{2^{L}}\prod_{l=1}^{L}(1+q_{l}\hat{P}_{l})$ . Since $\hat{P_{l}}$ is an operator that swaps two sites on the $l$ -th layer, we obtain $\braket{\psi_{0}^{A}}{\hat{{P}}_{l_{1}}\hat{{P}}_{l_{2}}\dots}{\psi_{0}^{A}}=0$ in Case A and $\braket{\psi_{0}^{B}}{\hat{{P}}_{l_{1}}\hat{{P}}_{l_{2}}\dots}{\psi_{0}^{B}}=1$ in Case B $(l_{1}<l_{2}<\dots)$ . Expanding $\hat{\mathcal{P}}_{\mathbf{q}}$ and using the above results, we obtain $p_{\mathbf{q}}=\frac{1}{2^{L}}\prod_{l=1}^{L}(1+0)$ for Case A and $p_{\mathbf{q}}=\frac{1}{2^{L}}\prod_{l=1}^{L}(1+q_{l})$ for Case B. Therefore, we obtain

[TABLE]

We note that the result for Case B is the same as in Eq. (B.35) even for the case of the 1/6-filling.

Appendix C Miscellaneous topics

C.1 Tasaki’s MATE

It is often difficult in practice to construct the set of mutually commuting observables $\left\{\hat{M}_{1},\cdots,\hat{M}_{K}\right\}$ from $\left\{\hat{M}^{\prime}_{1},\cdots,\hat{M}^{\prime}_{K}\right\}$ . Tasaki [14] avoids the step of making the observables commute and defines the equilibrium subspace in a bit different manner (his definition is called TMATE [13]). First, the microcanonical equilibrium values of the original observables $\left\{\hat{M}^{\prime}_{1},\cdots,\hat{M}^{\prime}_{K}\right\}$ are defined as

[TABLE]

We define the following projection operator for each observable

[TABLE]

which is the projection associated with the eigenvalues $\mu^{\prime}_{j}$ (the corresponding eigenvector is denoted as $\ket{\mu^{\prime}_{j}}$ ) of $\hat{M}^{\prime}_{j}$ that lie within some resolutions $\Delta\mu^{\prime}_{j}$ . Then the equilibrium subspace can be defined as

[TABLE]

MATE and TMATE are similar to each other.

C.2 The numerical verification of the ETH for many-body correlations in Chapter 4

In this section, we numerically show that the ETH seems to hold true even for many-body correlations in Eq. (4.58). For simplicity, we show the result of the ETH for diagonal matrix elements (we have also checked the ETH for off-diagonal matrix elements).

In Fig. C.1, we show the eigenstate expectation values (EEVs) $\braket{E_{\alpha}}{\hat{\mathcal{O}}_{N}}{E_{\alpha}}$ for the many-body correlations $\hat{\mathcal{O}}_{N}$ (i.e., we take $l=N$ in Eq. (4.58)) in integrable and nonintegrable systems. For an integrable system, we take a disorder-free transverse-field Ising model with the open boundary condition whose Hamiltonian can be written as

[TABLE]

where we take $J=1$ and $h^{\prime}=-1.05$ . For a nonintegrable system, we take model (b) defined in Chapter 4. Figure C.1 shows that the fluctuations of the EEVs rapidly decrease with $N$ for nonintegrable systems, whereas they remain large in integrable systems.***We note that for the integrable model, the EEVs have certain symmetric structures. Namely, we obtain $\braket{E_{\alpha}}{\hat{\mathcal{O}}_{N}}{E_{\alpha}}=\braket{-E_{\alpha}}{\hat{\mathcal{O}}_{N}}{-E_{\alpha}}$ for $N=8,12$ and $\braket{E_{\alpha}}{\hat{\mathcal{O}}_{N}}{E_{\alpha}}=-\braket{-E_{\alpha}}{\hat{\mathcal{O}}_{N}}{-E_{\alpha}}$ for $N=10$ . This symmetry is due to the chiral symmetry operator $\hat{C}$ that transforms the Pauli operators as $\hat{\sigma}_{i}^{x}\rightarrow-\hat{\sigma}_{i}^{x},\hat{\sigma}_{i}^{y}\rightarrow\hat{\sigma}_{i}^{y}$ and $\hat{\sigma}_{i}^{z}\rightarrow(-1)^{i}\hat{\sigma}_{i}^{z}$ . Since $\{\hat{H},\hat{C}\}=0$ , we have a pair of eigenstates $\ket{E_{\alpha}}$ and $\ket{-E_{\alpha}}=\hat{C}\ket{E_{\alpha}}$ , where $\hat{H}\ket{-E_{\alpha}}=-E_{\alpha}\ket{-E_{\alpha}}$ is satisfied. This symmetry leads to the condition

$\displaystyle\braket{E_{\alpha}}{\hat{\mathcal{O}}_{N}}{E_{\alpha}}=(-1)^{N/2}\braket{-E_{\alpha}}{\hat{\mathcal{O}}_{N}}{-E_{\alpha}},$

(C.5)

which explains the numerical results.

This result implies that the ETH does and does not hold true in nonintegrable and integrable systems, respectively, even for many-body correlations. We note that we obtain similar results for other values of $l$ .†††We have shown the scaling for $\hat{\mathcal{O}}_{N}$ with respect to $N$ . We can make another scaling, where only the system size changes with a fixed observable $\hat{\mathcal{O}}_{l}$ . In any case, we find that the EEV fluctuations decrease with increasing the system size $N$ for nonintegrable systems.

C.3 The ETH for the models (b) and (c) in Sec. 6.6

In Figs. C.2 (i) and (ii), we show the EEVs for $\hat{n}_{01}$ in the models (b) and (c), respectively. In Fig. C.3, we also show $\sigma[\Delta\mathcal{O}]$ , a typical magnitude of the EEV fluctuations $\Delta\mathcal{O}_{\alpha}$ in the middle of the spectrum. We define $\sigma[\Delta\mathcal{O}]$ as the standard deviation of $\braket{E_{\alpha}}{\hat{\mathcal{O}}}{E_{\alpha}}$ within a small energy shell $[E-\omega_{s},E+\omega_{s}]$ .‡‡‡We note that this is equivalent to $\Delta\mathcal{O}_{\mathrm{d}}$ in Chapter 4. In Fig. C.2 (i), while the splittings of the EEVs are seen due to the global $\mathbb{Z}_{2}$ symmetry, this splitting seems to move to the edge of the spectrum with increasing the system size. Therefore, the ETH is expected to hold true in the thermodynamic limit especially in the middle of the spectrum (see Fig. C.3). Next, Figs. C.2 (ii) and C.3 show that even though $\Delta\mathcal{O}_{\alpha}$ and $\sigma[\Delta\mathcal{O}]$ decrease with increasing the system size, their $L$ -dependences are weaker for $F\geq 1$ than for $F=0$ . This result is consistent with the behavior of the relative difference in model (c): the $L$ -dependence is much less sensitive for $F\geq 1$ than for $F=0$ .

Acknowledgements

First of all, I would like to express my deepest gratitude to my supervisor, Professor Masahito Ueda. He is both an extraordinary physicist and an exceptional teacher. Every discussion with him was full of thoughtful criticism and encouragement: for leading me to seek for the theme that has an impact on the development of physics, he has patiently tried to clarify my clumsy ideas without ever denying them and given me insightful advices. He has also spared a lot of time for reading the manuscript of this thesis and giving me enormous comments. I also thank him for recommending me the main subject of this thesis: thermalization in isolated quantum systems.

I am also deeply grateful to my collaborator, Assistant Prof. Tatsuhiko N. Ikeda. Even though he had been writing his own dissertation, he was willing to discuss the GGE in nonintegrable systems reviewed in Chapter 6 with me. He taught me a lot of things without which this thesis would be far from completed: the basics and advancement of thermalization in isolated quantum systems, technical methods for calculations, the computer programming, and how to write a paper and give a presentation. Most of all, his attitude toward research has impressed me a lot.

I have been fortunate to ask questions and have discussions with many others. Assistant Prof. Shunsuke Furukawa in our group has taught me many things from the computer programming to condensed matter physics. Whenever I ask him (often stupid) questions, he kindly taught me in a clear manner. We would also like to thank great researchers that tackle the problem of dynamics in quantum many-body systems, especially Assistant Prof. Takashi Mori, Dr. Sho Sugiura, Dr. Kazuya Fujimoto, Mr. Yuto Ashida, Ms. Mamiko Tatsuta, and Mr. Zongping Gong. Discussions with them have always been helpful and stimulating. Moreover, their intriguing works have motivated me to become a better physicist. We also thank all the members in Masahito Ueda Group for providing me an exciting and enjoyable environment for doing my research.

Finally, I acknowledge the financial support and lectures through the Program for Leading Graduate Schools (ALPS).

Bibliography200

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Lev Davidovich Landau and EM Lifshitz. Statistical physics, part i, 1980.
2[2] Herbert B Callen. Thermodynamics & an Intro. to Thermostatistics . John wiley & sons, 2006.
3[3] Akira Shimizu. http://as 2.c.u-tokyo.ac.jp/lecture_note/statmech.pdf .
4[4] D. Ter Haar. Foundations of statistical mechanics. Rev. Mod. Phys. , 27:289–338, Jul 1955.
5[5] 田崎晴明. 統計力学 . 培風館, 2008.
6[6] J v Neumann. Beweis des ergodensatzes und des h-theorems in der neuen mechanik. Zeitschrift für Physik , 57(1-2):30–70, 1929.
7[7] 杉田歩. 量子統計力学の基礎付けについて. 数理解析研究所講究録 , 1507:147–159, 2006.
8[8] Sheldon Goldstein, Joel L. Lebowitz, Roderich Tumulka, and Nino Zanghì. Canonical typicality. Phys. Rev. Lett. , 96:050403, Feb 2006.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Abstract

Contents

Chapter 1 Statistical physics of isolated quantum systems: an overview

1.1 Foundations of quantum statistical mechanics and the notion of typicality

1.2 Approach to thermal equilibrium

1.2.1 Equilibration and thermalization

1.2.2 Systems that approach non-thermal stationary states

1.3 Experiments of isolated quantum systems

1.3.1 Ultracold atoms

1.3.2 Trapped ions and other systems

1.4 Organization of this thesis

Chapter 2 Equilibration and thermalization by unitary time evolutions

2.1 Definitions of thermal equilibrium and typicality

2.1.1 General framework

2.1.2 Microscopic thermal equilibrium (MITE)

Proof of typicality for MITE

2.1.3 Macroscopic thermal equilibrium (MATE)

2.1.4 A looser way to consider thermal equilibrium

2.2 Conditions for equilibration and thermalization

2.2.1 Equilibration

Timescales

2.2.2 Thermalization

2.3 Approach to thermal equilibrium from any initial state: the eigenstate thermalization hypothesis

2.3.1 Off-diagonal matrix elements and equilibration

2.3.2 Diagonal matrix elements and thermalization

2.4 Roles of initial states for equilibration and thermalization

Chapter 3 Review of the eigenstate thermalization hypothesis (ETH)

3.1 Histories

3.2 Possible explanations of the ETH

3.2.1 Arguments by von Neumann and Reimann

The setup and the statement

Proof

3.2.2 Some predictions from random matrix theory in nonintegrable systems

Level-spacing statistics of random matrices and nonintegrable systems

Matrix elements from the viewpoint of RMT

Relations to nointegrable systems

The ETH and its finite-size corrections

3.2.3 Argument by Deutsch

3.3 Numerical simulations of the ETH

3.3.1 Level-spacing statistics of hardcore-particle systems

3.3.2 Diagonal matrix elements

3.3.3 Off-diagonal matrix elements

3.4 Weak ETH

3.5 Summary and remarks

Chapter 4 Observable-dependence of how random matrix theory can predict deviations from the ETH

4.1 Motivations

4.2 Statistics of the finite-size corrections of the ETH for the random matrix model

4.2.1 Universal ratios between diagonal and off-diagonal matrix elements

The GOE

The GSE

4.2.2 Observable-dependent probability densities of the off-diagonal matrix elements

Nonsingular operators

Singular operators

4.2.3 Conjectures from the random matrix model

4.3 Numerical verifications of the random matrix predictions

4.3.1 Models

4.3.2 Few-body observables

The universal ratios

Probability densities of the off-diagonal elements

4.3.3 Many-body correlations

The universal ratios

Probability densities of the off-diagonal elements

4.3.4 Density matrices corresponding to pure states

4.3.5 A simple counterexample

4.4 Conclusions and Discussions

Chapter 5 Generalized Gibbs ensemble (GGE) in integrable systems

5.1 Non-thermal stationary states due to conserved quantities

5.2 The GGE in essentially free systems

5.3 Importance of the locality of conserved quantities and the truncated GGE

5.4 The GGE in interacting systems solved by the Bethe ansatz

5.5 Conclustion

Chapter 6 Generalized Gibbs ensemble in a nonintegrable system with an extensive number of local symmetries

6.1 Motivation

B.4 Justification of $\sigma^{2}\simeq\mathcal{V}/d$ in the RMT model (Subsection 4.2.2)