TASI lectures on Matrix Theory from a modern viewpoint

Henry W. Lin

arXiv:2508.20970·hep-th·September 30, 2025

TASI lectures on Matrix Theory from a modern viewpoint

Henry W. Lin

PDF

Open Access

TL;DR

This paper reviews the BFSS matrix quantum mechanics from a modern, post-AdS/CFT perspective, discussing gravity duals, strong coupling extrapolation, and the matrix bootstrap method.

Contribution

It provides a comprehensive modern viewpoint on D0-brane matrix theory, including gravity duals and bootstrap techniques, which are less explored in traditional literature.

Findings

01

Clarifies the gravity dual in the 't Hooft regime

02

Extrapolates matrix theory to strong coupling

03

Applies matrix bootstrap to D0-brane quantum mechanics

Abstract

These notes review the D0-brane or Banks-Fischler-Shenker-Susskind (BFSS) matrix quantum mechanics from a post-AdS/CFT perspective. We start from the decoupling argument for D0-branes and discuss the gravity dual in the 't Hooft regime, before extrapolating to strong coupling. In the second part of these notes, we review the matrix bootstrap method and its application to the D0-brane quantum mechanics.

Tables1

Table 1. Table 1 : Relation between gauge theory, Type IIA and M-theory parameters

gauge theory	Type IIA strings	M-theory
$N = rank of matrices$	$N = # of D0 branes$	$N = KK momentum number$
$g_{YM}^{2}$	$g_{s} / (4 π^{2} ℓ_{s}^{3})$	$R^{3} / (4 π^{2} ℓ_{p}^{6})$

Equations240

I = \frac{1}{( 2 π ) ^{7} ℓ _{s}^{8}} \int d^{10} x g [e^{- 2 ϕ} (R + 4 (\nabla ϕ)^{2}) - \frac{1}{4} F_{μν}^{2}]

I = \frac{1}{( 2 π ) ^{7} ℓ _{s}^{8}} \int d^{10} x g [e^{- 2 ϕ} (R + 4 (\nabla ϕ)^{2}) - \frac{1}{4} F_{μν}^{2}]

d s^{2}

d s^{2}

e^{- 2 ϕ}

H

M

M

\frac{M}{N}

T = \frac{7}{4 π r _{0} cosh α}

T = \frac{7}{4 π r _{0} cosh α}

(\frac{d s ^{2}}{ℓ _{p}^{2}})_{black hole}

(\frac{d s ^{2}}{ℓ _{p}^{2}})_{black hole}

(\frac{d s ^{2}}{ℓ _{p}^{2}})_{black string}

(\frac{d s ^{2}}{ℓ _{p}^{2}})_{black string}

\displaystyle\binom{t}{x_{11}}\to\binom{t^{\prime}}{x_{11}^{\prime}}=\left(\begin{array}[]{cc}\cosh\alpha&\sinh\alpha\\ \sinh\alpha&\cosh\alpha\end{array}\right)\binom{t}{x_{11}}.

\displaystyle\binom{t}{x_{11}}\to\binom{t^{\prime}}{x_{11}^{\prime}}=\left(\begin{array}[]{cc}\cosh\alpha&\sinh\alpha\\ \sinh\alpha&\cosh\alpha\end{array}\right)\binom{t}{x_{11}}.

d s_{11}^{2}

d s_{11}^{2}

g_{s} ≪ 1 holding fixed: ℓ_{s} Δ E \sim g_{s}^{1/3}, N .

g_{s} ≪ 1 holding fixed: ℓ_{s} Δ E \sim g_{s}^{1/3}, N .

S \sim \frac{1}{g _{YM}^{2}} \int d t [\frac{1}{2} \Tr \dot{X}^{2} + \frac{1}{4} \Tr [X_{I}, X_{J}]^{2} + fermions + higher derivatives]

S \sim \frac{1}{g _{YM}^{2}} \int d t [\frac{1}{2} \Tr \dot{X}^{2} + \frac{1}{4} \Tr [X_{I}, X_{J}]^{2} + fermions + higher derivatives]

[g_{YM}^{2}] = energy^{3 - p}, g_{YM}^{2} = \frac{g _{s}}{( 2 π ) ^{2 - p} ℓ _{s}^{3 - p}} .

[g_{YM}^{2}] = energy^{3 - p}, g_{YM}^{2} = \frac{g _{s}}{( 2 π ) ^{2 - p} ℓ _{s}^{3 - p}} .

r_{0} / ℓ_{s} \sim g_{s}^{1/3} .

r_{0} / ℓ_{s} \sim g_{s}^{1/3} .

ρ = \frac{r}{α ^{'}} \frac{1}{( d _{0} g _{YM}^{2} N ) ^{1/3}} .

ρ = \frac{r}{α ^{'}} \frac{1}{( d _{0} g _{YM}^{2} N ) ^{1/3}} .

E = H^{- 1/4} E_{p} \sim (g_{s} N)^{1/3} ρ^{7/4} E_{p}

E = H^{- 1/4} E_{p} \sim (g_{s} N)^{1/3} ρ^{7/4} E_{p}

\frac{d s ^{2}}{α ^{'}}

\frac{d s ^{2}}{α ^{'}}

e^{- ϕ}

H_{IIA string theory} \approx H_{flat space} \otimes H_{near horizon region}

H_{IIA string theory} \approx H_{flat space} \otimes H_{near horizon region}

H_{IIA string theory} \approx H_{flat space} \otimes H_{SYM quantum mechanics}

H_{IIA string theory} \approx H_{flat space} \otimes H_{SYM quantum mechanics}

\frac{d s ^{2}}{α ^{'}}

\frac{d s ^{2}}{α ^{'}}

e^{ϕ}

d s_{F 1}^{2}

d s_{F 1}^{2}

B_{t 1}

\tilde{g}_{θ θ} = \frac{1}{g _{θ θ}}, \tilde{g}_{θ μ} = \frac{B _{θ μ}}{g _{θ θ}}, \tilde{g}_{μν} = g_{μν} - \frac{g _{θ μ} g _{θ ν} - B _{θ μ} B _{θ ν}}{g _{θ θ}},

\tilde{g}_{θ θ} = \frac{1}{g _{θ θ}}, \tilde{g}_{θ μ} = \frac{B _{θ μ}}{g _{θ θ}}, \tilde{g}_{μν} = g_{μν} - \frac{g _{θ μ} g _{θ ν} - B _{θ μ} B _{θ ν}}{g _{θ θ}},

\tilde{B}_{θ μ} = \frac{g _{θ μ}}{g _{θ θ}}, \tilde{B}_{μν} = B_{μν} - \frac{g _{θ μ} B _{θ ν} - g _{θ ν} B _{θ μ}}{g _{θ θ}}, e^{\tilde{ϕ}} = e^{ϕ} g_{θ θ}^{- 1/2} .

\frac{d s ^{2}}{α ^{'}}

\frac{d s ^{2}}{α ^{'}}

+ (2 π) d_{1} (g_{YM}^{2})^{2} \frac{N}{U ^{6}} (d \tilde{x}_{1} - \frac{U ^{6}}{α ^{'} d _{1} g _{YM}^{2} N} d t)^{2},

\tilde{B}

\frac{r}{ℓ _{p}} \sim N^{1/9}

\frac{r}{ℓ _{p}} \sim N^{1/9}

S_{SYM} \sim \int d t \Tr (e^{- ϕ (t, X^{I} (t))} - g (X^{I} (t)) - A_{0} (t, X^{I} (t)))

S_{SYM} \sim \int d t \Tr (e^{- ϕ (t, X^{I} (t))} - g (X^{I} (t)) - A_{0} (t, X^{I} (t)))

S_{SYM} \to S_{SYM} + N k \sum \frac{1}{k !} \int d t \partial_{I_{1}} \dots \partial_{I_{k}} φ \Tr (F^{2} X^{(I_{1}} \dots X^{I_{k})}) .

S_{SYM} \to S_{SYM} + N k \sum \frac{1}{k !} \int d t \partial_{I_{1}} \dots \partial_{I_{k}} φ \Tr (F^{2} X^{(I_{1}} \dots X^{I_{k})}) .

O_{k + 2} = \Tr X^{(I_{1}} \dots X^{I_{k + 2})},

O_{k + 2} = \Tr X^{(I_{1}} \dots X^{I_{k + 2})},

\frac{d s ^{2}}{α ^{'}}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMatrix Theory and Algorithms

Full text

TASI lectures on Matrix Theory from a modern viewpoint

Henry W. Lina,b

a Leinweber Institute for Theoretical Physics, Stanford University, Stanford, CA 94305, USA

b Jadwin Hall, Princeton University, Princeton, NJ 08540, USA

Abstract

These notes review the D0-brane or Banks-Fischler-Shenker-Susskind (BFSS) matrix quantum mechanics from a post-AdS/CFT perspective. We start from the decoupling argument for D0-branes and discuss the gravity dual in the ’t Hooft regime, before extrapolating to strong coupling. In the second part of these notes, we review the matrix bootstrap method and its application to the D0-brane quantum mechanics.

1 Introduction

The Banks-Fischler-Shenker-Susskind (BFSS) conjecture [1] was proposed in 1996. It is probably older than the median TASI audience member. Why should we revisit it now? On the one hand, the motivation for studying the model is as strong as ever:

It provides a non-perturbative definition of the M-theory scattering matrix. When BFSS made their conjecture, there was no other candidate for the non-perturbative S-matrix. However, post-AdS/CFT we may also define the M-theory S-matrix by taking the flat space limit; see 4.2 for more on this. So a sharper version of the conjecture is that all of these different definitions in fact compute the same S-matrix. 2. 2.

It contains black holes, including ones that are well described by Einstein gravity. 3. 3.

It is “just” a quantum mechanical model (as opposed to a field theory), and therefore should be somewhat easier to simulate via classical or quantum [2] algorithms. Conceptually, it is a distillation111Perhaps the IKKT model [3] is an even more potent distillation, where all of spacetime emerges from just an integral. of the holographic Mystery, since all of space emerges from just several matrices sitting in a “quantum dot.” Furthermore, in the scattering setup relevant for the BFSS conjecture, spacetime emerges in a way that seems qualitatively different than, say, in the flat space limit of AdS/CFT.

We have two main advantages over physicists in the 90s:

(1)

We understand other examples of gauge/gravity duality in much more detail; presumably this collective experience can inspire progress222Besides the tremendous amount of progress in AdS/CFT, I would like to highlight some recent progress [4, 5, 6] in a lower dimensional cousin of BFSS, the IKKT model. Recently a massive deformation of this model has been studied that is a bit analogous to the massive deformation of BFSS, the so-called pp wave or Berenstein-Maldacena-Nastase (BMN) model [7].. 2. (2)

We have better numerics, both algorithms and machines. We are perhaps a little less intimidated by strong coupling and large $N$ .

In part one of these lectures, I will review our understanding from the gauge/gravity point of view, taking advantage of (1). I will not review the original arguments for BFSS which involve thinking about the infinite momentum frame, see [8, 9, 10, 11] for reviews which cover this. Instead I will try to spell out the logic from the point of view of someone who pretty much takes AdS/CFT for granted.

In part two, we will review some recent numerical approaches to matrix theories (2). This will be hugely biased towards the bootstrap approach, although lattice Monte Carlo is a very important method [12, 13, 14, 15, 16, 17, 18, 19], see also [20, 21, 22, 23, 19] for other recent numerical approaches to related models. Some simple exercises are also included; the reader is encouraged to do them!

2 Review of D0-brane holography

Let us review the gravity dual of SYM in $0+1$ spacetime dimensions, following the logic of [24, 25], (see also [9]).

2.1 D0 black hole

We start by recalling the D0 black hole solution. Since D0 branes carry Ramond-Ramond charge, this is a charged black hole. We use conventions where the relevant part of the Type II supergravity action is

[TABLE]

From the spacetime point of view, the D0 branes source the (string-frame) metric, dilaton, and Ramond-Ramond fields [26, 27, 28, 29]:

[TABLE]

We can exchange the parameters $r_{0},\alpha$ for the mass and charge $(M,Q)$ of the black hole:

[TABLE]

The formula (6) is the gravitational manifestation of the BPS bound. The temperature of the black hole is:

[TABLE]

We see that in the extremal limit $T\to 0$ at fixed $N$ , we recover the familiar fact that a D0 brane has mass $1/g_{s}$ in string units.

How do we get this solution? We will take a somewhat scenic route. Recall that Type IIA can be viewed as M-theory in 11d compactified on a small circle of radius $R=g_{s}\ell_{s}$ . The dilaton is interpreted as the size of the circle, and the RR 1-form comes from the off-diagonal components of the metric. This immediately suggests that we should search for an 11d solution which is pure metric. The most naive guess is simply the Schwarzschild black hole with $d=11$ :

[TABLE]

This is not a bad guess, but there are two problems. First, we want a solution that is uniform in the 11th dimension, so that we can do dimensional reduction. Second, we want the solution to carry a large amount of RR charge. There is a simple fix to both issues. First, we consider the black string in 11d. This means we take the Schwarzschild solution in $d=10$ and add a dimension:

[TABLE]

This fixes objection (1). To address objection (2), let’s perform a Lorentz boost:

[TABLE]

For large boost parameter, we now have a candidate solution which is uniform in the 11th dimension, and contains lots of Kaluza Klein momentum:

[TABLE]

where $H(r)$ is defined as before (4).

**Exercise 1: Reduction of 11d black string **

Perform the change of coordinates (12) and check that we arrive at (13).

Check that the solution (13) agrees with the type IIA reduction (2), via $\mathrm{d}s_{11}^{2}=e^{4\phi/3}\left(\mathrm{d}z^{2}+A^{\mu}\mathrm{d}x_{\mu}\right)^{2}+e^{-2\phi/3}\mathrm{d}s_{10}^{2}.$

**Exercise 2: Gregory Laflamme Instability **

Give a poor man’s argument for the dynamical instability [30] of the black string: in $d+1$ spacetime dimensions, compute the entropy of a black string of length $L$ and compare it to the entropy of a black hole with the same mass $M$ . At what scale do the entropies become comparable?

2.2 Decoupling limit

Now, following [24, 25] we take the decoupling limit:

[TABLE]

This means we consider type IIA string theory in asymptotically flat space with $N$ D0 branes and take the $g_{s}\to 0$ limit. We focus on the low-energy states in the Hilbert space with energy above extremality333The BPS bound ensures there are no states below extremality. $\Delta E\sim g_{s}^{1/3}/\ell_{s}$ .

The obvious states are massless modes (e.g. supergravitons) with very low energy. But there are also excitations of the D0 branes. The D0 brane theory is described by super Yang Mills with SU( $N$ ) gauge group $+$ higher derivative interactions:

[TABLE]

Since we are only considering very low energy states (much lower than the string scale), we can ignore the higher derivative interactions. Then the only dimensionful scale in the SYM theory is given by the coupling444The SYM theory is conventionally defined so that the Hamiltonian measures the energy above extremality. E.g., we subtract off the rest mass of the D0 branes $N/(g_{s}\ell_{s})$ .. For SYM theory in $p+1$ dimensions, the Yang-Mills coupling has units of

[TABLE]

So from the SYM description, the natural energy scale is $\Delta E\sim(g^{2}_{\mathrm{YM}})^{1/3}\sim g_{s}^{1/3}/\ell_{s}$ which is being held fixed in this limit.

Now let us analyze the same set of states from the bulk spacetime perspective. We are instructed to consider a black hole with fixed charge $N$ . Demanding that the energy above extremality (5) scales like $E-N\sim g_{s}^{1/3}/\ell_{s}\sim\frac{r_{0}^{7}}{g_{s}^{2}\ell_{s}^{8}}$ , we find

[TABLE]

So the parameter $r_{0}$ is small in string units, which tells us that the black hole is near extremality from the flat space point of view. Alternatively, we can demand that $\beta/\ell_{s}\sim g_{s}^{-1/3}$ . Then the scaling of $r_{0}$ follows from (7). We sketch the Penrose diagram in the near-extremal limit in Figure 1. The black hole singularity formally becomes null in the extremal limit555In the decoupled geometry, it is a causal feature of the Penrose diagram that the singularity bends down. But the precise shape of the singularity depends on the conformal transformation we use to draw the diagram. In the decoupled geometry (with flat space asymptotics) the shape of the singularity is arbitrary but in the strict extremal limit the singularity becomes null. I thank Douglas Stanford for some discussion about this point, see also [31]..

So far we have argued that the black hole geometry (near extremality) survives the decoupling limit. But we would like to know what kinds of perturbations to this background also survive. Consider a particle that is located in the near horizon region, e.g., at $r/\ell_{s}\sim g_{s}^{1/3}$ . More precisely, let us introduce a new radial coordinate:

[TABLE]

We consider a particle at a fixed $\rho$ in the decoupling limit. Now let us calculate the energy of the particle relative to the flat space region. We must account for the redshift factor $\sqrt{g_{tt}}=H^{-1/4}$ . This means that the energy as measured by the boundary system (or as seen from asymptotically flat space) is quite a bit lower:

[TABLE]

We see that even if the particle has energy $E_{p}\sim 1/\ell_{s}$ (e.g. an excited string state), it will survive the decoupling limit. This means that a generic object in string theory will survive the decoupling limit if it resides in the near horizon region666One could also ask about D-branes which have parametrically larger masses than $1/\ell_{s}$ . But note that the effective string coupling in the throat is finite, so these objects are not parameterically heavier in terms of $1/g_{s}$ where $g_{s}$ is the string coupling in the flat space region. Hence they too survive decoupling..

Based on these considerations, it is convenient to define a new dimensionless time coordinate $\tau=\left(d_{0}g^{2}_{\mathrm{YM}}N\right)^{1/3}t$ . The energy $\mathcal{E}$ conjugate to $\tau$ is the dimensionless energy measured in units of the ’t Hooft coupling (which survives the decoupling limit), e.g., $\mathcal{E}=H/(d_{0}g^{2}_{\mathrm{YM}}N)^{1/3}$ . In these coordinates, the near horizon region is

[TABLE]

Note that even though we sent $g_{s}\to 0$ the effective string coupling is finite in the decoupling limit (e.g., independent of the flat space $g_{s}$ ). This has two implications. It means that excitations that are heavier than massive strings, e.g., D-branes, also survive the decoupling limit777For some work on D-branes in the decoupled D $p$ -brane backgrounds, see [32, 33]. These works discuss operators in the boundary theory that create giant gravitons, mimicking to some extent the discussion of giant graviton operators in AdS/CFT.. Second, to make the effective string coupling $\sim 1/N$ small (so that we can trust the semi-classical IIA approximation), we must take $N\to\infty$ and work in the ’t Hooft limit. Let us emphasize that whether or not the supergravity description is reliable is secondary, it is not crucial to the decoupling argument.

To summarize, according to the gravity solution, we expect that there are two kinds of excitations in the $g_{s}\to 0$ , low energy limit. There are the low energy massless excitations of weakly coupled IIA string theory in flat space. And there are generic excitations (massless or massive excitations) of the bulk theory in the near-horizon region. So the Hilbert space approximately factorizes:

[TABLE]

On the other hand, from the worldvolume (worldline in the D0 case) perspective:

[TABLE]

We have illustrated this in Figure (2). Cancelling the Hilbert space $\mathcal{H}_{\text{flat space}}$ in both equations (22) (23) leads us to identify $\mathcal{H}_{\text{near horizon region}}=\mathcal{H}_{\text{SYM quantum mechanics}}$ .

**Exercise 3: Black hole in a box **

It is often said that AdS is a convenient way to put black holes in a box. Show that the D0 brane geometry is also a “box” in the sense that particles cannot escape to infinity. Show that the only exception to this rule is a D0 brane itself; the D0 brane experiences a gravitational force which is essentially equal and opposite of the electric force. (The BPS bound forbids an object with a greater charge to mass ratio.) This means that the black hole can only Hawking evaporate into D0 branes. Estimate the lifetime of the black hole. (For a detailed estimate, see [34].)

**Exercise 4: D0 brane thermodynamics **

Work out the black hole thermodynamics of the above metric. Start by computing the temperature of the metric (20). Then compute $S=\frac{A}{4G_{N}}$ , where $A$ is the area of the horizon in Einstein frame, and express it in terms of the temperature. Finally use this to derive the energy of the black hole as a function of temperature.

Consider a higher derivative curvature correction $\sim\alpha^{\prime 3}\int\sqrt{g}\,e^{-2\phi}R^{4}$ . How does this affect the thermodynamics? Use this to give a parametric upper bound on the temperature for which the supergravity analysis is valid. Even at low temperatures, argue that the Type IIA solution is valid for $r/\ell_{p}\ll N^{1/3}$ .

When $e^{\phi}\sim 1$ , the 10d solution should be replaced by the 11d black string. Work out the temperature scale where this occurs. Argue that even at these low temperatures, we can still use the 10d solution as long as $r/\ell_{p}\gg N^{1/7}$ . At even lower temperatures, the results of exercise 2.1 imply that the 11d black string will transition into an 11d black hole. Use these results to give a more complete picture of the thermodynamics at very low temperatures [35, 25]. For recent work, see [36].

How big is the “throat” in the decoupling limit? At low temperatures, the proper length from the horizon to the “boundary” or flat space region goes like $\Delta s\sim\ell_{s}\tilde{\beta}^{-3/10}$ where $\tilde{\beta}=\beta(d_{0}g^{2}_{\mathrm{YM}}N)$ is the dimensionless inverse temperature in ’t Hooft units. We see that for order 1 temperatures, the entire region is string scale. This suggests that the metric is unreliable at these temperatures due to $\alpha^{\prime}$ corrections. Indeed, in exercise 2.2, you will check that the geometry is only trustworthy at low temperatures.

Further reading 1: Other Dp brane geometries

Study the other Dp brane geometries for $p\leq 4$ in the decoupling limit [25], and understand their IR behavior.

For $p=1$ , the effective string coupling grows in the radial direction. Since we have a IIB solution, we should use S-duality to find a more convenient description of the IR. Then $\phi\to-\phi$ and the coupling starts to decrease towards the IR. (In exercise 2.2 you are asked to study the holographic dual in more detail.) On the boundary, the IR behavior is “matrix string theory” [37], whose effective description is a free symmetric orbifold CFT perturbed by irrelevant operators. There is an interesting conjecture [37, 38] that directly connects conformal perturbation theory in the leading irrelevant operator (a supersymmetric twist operator) with the genus expansion in the worldsheet approach to Type IIA scattering amplitudes. In the matrix string conjecture the string coupling is identified with $1/(g_{\mathrm{YM}}R)$ , where the 1+1D SYM theory is compactified on a spatial circle of radius $R$ .

For $p=2$ , there is an M-theory description of the IR in terms of the M2 brane solution. The boundary description is the ABJM CFT.

$p=3$ is the familiar case of $\mathcal{N}=4$ SYM. Note in the decoupling limit we set $g_{s}=$ constant.

$p=4$ the SYM interaction is irrelevant. We view this theory as the effective field theory of M5 branes wrapped on a circle. Note that in the decoupling limit, we send $g_{s}\to\infty$ . The UV is strongly coupled, and the bulk geometry becomes $\mathrm{AdS}_{7}\times S_{4}$ .

$p=-1$ , this is known as IKKT [39]. A mass deformation of this model has been the subject of some interesting new developments, see [4, 5, 6].

**Exercise 5: The holographic dual of 1+1D SYM **

In this exercise, you are asked to work out the holographic dual of the maximally supersymmetric Yang Mills. Start with the Type IIB extremal black D1 brane solution:

$\displaystyle\frac{\mathrm{d}s^{2}}{\alpha^{\prime}}$ $\displaystyle=\frac{U^{3}}{g_{\mathrm{YM}}\sqrt{2^{6}\pi^{3}N}}dx_{\|}^{2}+\frac{g_{YM}\sqrt{2^{6}\pi^{3}N}}{U^{3}}dU^{2}+g_{\mathrm{YM}}\frac{\sqrt{2^{6}\pi^{3}N}}{U}d\Omega_{8}^{2}$

(24)

$\displaystyle e^{\phi}$ $\displaystyle=\left(\frac{g_{\mathrm{YM}}^{6}2^{8}\pi^{5}N}{U^{6}}\right)^{1/2},\quad A_{0}=-\frac{1}{2}\left(\frac{g_{\mathrm{YM}}^{6}2^{8}\pi^{5}N}{U^{6}}\right)^{1/2}$

(25)

In the IR, the dilaton grows and we should use S-duality. Show that this gives the extremal F1 solution:

$\displaystyle\mathrm{d}s^{2}_{F1}$ $\displaystyle=\frac{\alpha^{\prime}}{(2\pi)g_{\mathrm{YM}}^{2}}\big[\,h(U)\,(-\mathrm{d}t^{2}+\mathrm{d}x_{1}^{2})+\mathrm{d}U^{2}+U^{2}\mathrm{d}\Omega_{7}^{2}\,\big],$

(26)

$\displaystyle B_{t1}$ $\displaystyle=h(U),\qquad e^{\phi}=\frac{1}{(2\pi)\,g_{\mathrm{YM}}^{2}}\,\frac{U^{3}}{\sqrt{d_{1}\,g_{\mathrm{YM}}^{2}\,N}}\,.$

(27)

Now we compactify the SYM theory on a spatial circle $x_{1}\sim x_{1}+2\pi R$ . We may introduce $\theta$ -coordinates, so that $g_{\theta\theta}=R^{2},g_{11}=\dfrac{\alpha^{\prime}}{(2\pi)g_{\mathrm{YM}}^{2}}\,R^{2}h(U)$ and $B_{t\theta}=R\,h(U)$ . We see that the proper siznowe of the circle is small in the IR. For a translation-invariant background in $\theta$ , T-duality relates a Type IIb solution to a Type IIA solution via [40]:

$\displaystyle\tilde{g}_{\theta\theta}=\frac{1}{g_{\theta\theta}},\quad\tilde{g}_{\theta\mu}=\frac{B_{\theta\mu}}{g_{\theta\theta}},\quad\tilde{g}_{\mu\nu}=g_{\mu\nu}-\frac{g_{\theta\mu}\,g_{\theta\nu}-B_{\theta\mu}\,B_{\theta\nu}}{g_{\theta\theta}},$

$\displaystyle\tilde{B}_{\theta\mu}=\frac{g_{\theta\mu}}{g_{\theta\theta}},\quad\tilde{B}_{\mu\nu}=B_{\mu\nu}-\frac{g_{\theta\mu}\,B_{\theta\nu}-g_{\theta\nu}\,B_{\theta\mu}}{g_{\theta\theta}},\quad e^{\tilde{\phi}}=e^{\phi}\,g_{\theta\theta}^{-1/2}.$

(28)

Apply (28) to (26)–(27), and introduce the dual coordinate $\tilde{x}_{1}=\tilde{R}\,\tilde{\theta}$ with $\tilde{R}=\alpha^{\prime}/R$ . Then the type IIA metric becomes

$\displaystyle\frac{\mathrm{d}s^{2}}{\alpha^{\prime}}$ $\displaystyle=\frac{1}{(2\pi)g_{\mathrm{YM}}^{2}}\Big(\mathrm{d}U^{2}+U^{2}\mathrm{d}\Omega_{7}^{2}\Big)-\frac{1}{(2\pi)g_{\mathrm{YM}}^{4}}\;\frac{U^{6}}{d_{1}\,N}\;\mathrm{d}t^{2}$

(29)

$\displaystyle\quad+\;(2\pi)\,d_{1}\,(g_{\mathrm{YM}}^{2})^{2}\,\frac{N}{U^{6}}\,\Bigg(\mathrm{d}\tilde{x}_{1}-\frac{U^{6}}{\alpha^{\prime}\,d_{1}\,g_{\mathrm{YM}}^{2}\,N}\,\mathrm{d}t\Bigg)^{\!2}\!,$

(30)

$\displaystyle\tilde{B}$ $\displaystyle=0,\qquad e^{\tilde{\phi}}=\frac{1}{\sqrt{2\pi}\,g_{\mathrm{YM}}\,R}\,.$

(31)

The dual radius is $\tilde{R}=\alpha^{\prime}/R$ . Somewhat surprisingly, we have found a pure-metric solution which carries momentum in the $\tilde{x}_{1}$ direction.

2.3 Differentiate dictionary

We now review a schematic derivation of the “differentiate” dictionary that is familiar in the AdS/CFT context (for a review in the AdS context, see section 4 of [41]).

Before taking the decoupling limit, we consider a background which instead of being asymptotically flat instead contains some supergravity waves. For simplicity, we could consider some (weak-field) dilaton waves incident on the D0-branes. Then we repeat the decoupling logic. From the gravity point of view, we are left with a perturbation of flat space $\otimes$ the throat-like region given by (35) but now with a non-trivial dilaton profile. The profile in the throat can be thought of as arising from non-trivial boundary conditions of the dilaton, which is an imprint from the joining of the throat to flat space.

On the boundary side, the (schematic) non-Abelian generalization of the DBI action contains a coupling $e^{-\phi({t,X})}F^{2}$ where $F^{2}$ is the Yang-Mills field strength and $X^{I}$ are the transverse coordinates of the D[math]-brane888This is schematic because we are ignoring the fermions, and we are also being a bit sloppy about the precise non-Abelian generalization of the DBI action.:

[TABLE]

These scalars transform under the global $R$ -symmetry of the SYM as SO( $9$ ) fundamentals. The coupling to the dilaton wave induces a perturbation which leads to a deformation of the worldvolume action[42, 43] by terms schematically of the form:

[TABLE]

Here the indices $I_{1}\cdots I_{k}$ are symmetrized with traces removed999The indices should be symmetrized since the order of the derivatives acting on $\phi$ does not matter.. The precise form of the operator (including the fermionic terms) can be obtained by acting with 4 supercharges on the operator

[TABLE]

where the parentheses on the indices indicates that the operator is in the traceless symmetric representation of SO(9).

Reasoning in this way, we learn that deforming the SYM theory by adding various terms to the Lagrangian is equivalent to changing the boundary conditions of various supergravity modes in the throat-like region. By differentiating with respect to a source term for these operators, we can compute correlation functions in the boundary theory using gravity. a priori, the gravity computation in this background seems complicated, but in fact we will see in the next section that the gravity problem is closely related to AdS. This implies for instance that 2-pt functions of the operator (34) will have simple power law correlation functions in the low temperature/large Euclidean-time Type IIA regime.

2.4 Relation to AdS

By introducing the metric $z=R_{\mathrm{AdS}}\rho^{-5/2}$ (do not confuse this $z$ with the previous M-theory $z$ ), we may write the solution in an instructive form:

[TABLE]

It is instructive to consider the action associated with small fluctuations of the dilaton, e.g., $\phi=\phi_{\text{s}}+\varphi$ , where the classical solution $\phi_{\text{s}}$ is given by (37). (The treatment below is somewhat heuristic101010Among other concerns, we have not argued that $\varphi$ does not mix with other modes.; for a detailed treatment, see [44] and also [45].) Then expanding to quadratic order in $\varphi$ , (1) gives

[TABLE]

Decomposing $\varphi$ into eigenfunctions of the scalar Laplacian on the sphere, and using that the eigenvalues are $\nabla_{S^{n}}=-k(k+n-1)$ , we have:

[TABLE]

We see that these modes behave like massive fields in the AdSd+1 black brane [46, 45], with an effective mass $m_{k}^{2}=k(k+7)$ . Note that we have introduced an additional $d-1$ spatial dimensions to account for the factors of $z$ . (We implicitly assume that these extra dimensions are toroidal $T^{d-1}$ and that all fields are homogeneous on $T^{d-1}$ , see [45].) One can then compute the scaling dimensions of these fields using the usual relation

[TABLE]

Note here the scaling dimension is defined so that (in the zero-temperature limit)

[TABLE]

Here the additional factor of $d-1$ arises since $\mathcal{O}$ is a zero mode in the “extra” AdS dimensions.

Further reading 2: Scaling symmetry

The solution (35)-(37) has an important scaling symmetry [46, 45]:

$\displaystyle\tau\rightarrow\gamma\tau,\quad z\rightarrow\gamma z,$

(43)

We keep $N$ and $g^{2}_{\mathrm{YM}}$ fixed under the scaling. Under this transformation, $g,\phi$ and $A_{0}$ change in such a way that the Type IIA action is not invariant. However, the action changes in a simple way:

$\displaystyle I\rightarrow\gamma^{-9/5}I.$

(44)

This scaling “symmetry” is responsible for the simple dependence of the free energy on temperature. We can also use this scaling symmetry to characterize some perturbations to the solution. In particular, consider more general boundary conditions on some supergravity field $\chi\sim\chi_{r}(\tau)z^{\Delta}$ as $z\to 0$ . Then $\chi_{r}\to\chi_{r}\gamma^{\Delta}$ . On the other hand, the change in the gravity action (in the free field approximation) is

$\displaystyle I\sim\int\mathrm{d}\tau\,\mathrm{d}\tau^{\prime}\chi_{r}(\tau)\chi_{r}(\tau^{\prime})G(\tau-\tau^{\prime}).$

(45)

According to the differentiate dictionary, $G$ is the boundary 2-pt function. Then demanding $I\to\gamma^{-9/5}I=\gamma^{-(d-1)}I$ we conclude that $G\to G\gamma^{-2\Delta-(d-1)}$ or

$\displaystyle G\sim|\tau|^{-2\Delta-(d-1)}.$

(46)

Please note that this argument does not work for massive fields, e.g., ones corresponding to massive stringy states, see [45] for more. For recent work using this scaling symmetry, see [33, 47, 48].

This gravity analysis shows there should be boundary operators that have conformal 2-pt functions [44, 49, 46]. These operators should be BPS operators. Let’s recall the supersymmetric properties of the operator $\mathcal{O}_{k}$ that we obtained from the DBI analysis (34). In the above $\mathcal{O}_{k}$ denotes any one of a number of operators that spans the symmetric traceless irrep of SO( $9$ ). To analyze the action of the supercharge on this operator, it is convenient to pick one particular element of the irrep. We define the complex matrix

[TABLE]

Clearly $\mathcal{Z}^{\otimes k}$ transforms under SO( $9-p$ ) as a symmetric tensor of rank $k$ ; it is also traceless since $\mathcal{Z}=\vec{y}\cdot\vec{X}$ and $\vec{y}$ is a null vector $\vec{y}\cdot\vec{y}=0$ . Note that $[Q_{\alpha},\mathcal{Z}]=(\gamma^{1}+\mathrm{i}\gamma^{2})_{\alpha\beta}\psi^{\beta}$ . The matrix $(\gamma^{1}+\mathrm{i}\gamma^{2})$ has a $\mathcal{N}/2$ dimensional null space. (Here $\mathcal{N}$ is the dimension of the spinor irrep of SO( $9-p$ ), e.g., $\mathcal{N}=16$ for the D0 brane theory and famously $\mathcal{N}=4$ for $p=3$ .) So any operator made out of $\mathcal{Z}$ is 1/2-BPS. Since acting with a supercharge changes the dimension of an operator by $1/2$ , together with (41), this implies that $\Tr\mathcal{Z}^{k}$ or any traceless symmetric operator has dimension

[TABLE]

Here we have used that $R_{\text{AdS}}=2/5.$ This is the analog of the famous fact that in $\mathcal{N}=4$ SYM the scaling dimensions of similar operators $\Tr\mathcal{Z}^{J}$ are given by $\Delta=J$ .

This relation to AdS is useful beyond just computing the correlation functions at zero temperature111111More precisely, by zero temperature we mean a regime where the temperature is arbitrarily low in ’t Hooft units but are not suppressed by powers of $N$ so that we are within the Type IIA regime. Similarly, Euclidean time separations are large in ’t Hooft units but do not scale with $N$ .; in particular the relation to the AdS black brane is useful for computing the thermal 2-pt function (and in particular the quasi-normal modes, see [45]).

2.5 Black hole thermodynamics

Let us review what is known about the corrections to the black hole thermodynamics beyond pure supergravity. We expect that the relevant corrections to Type IIA effective action to have the form

[TABLE]

Here $R^{4}+\cdots$ is meant to indicate a non-linear correction that involves 8 derivatives. Note that it is insufficient to consider just the 8-derivative terms involving the metric; for this background we in principle need to know the full non-linear 8-derivative correction that involves the RR 1-form $A_{\mu}$ , e.g., terms $\sim F^{8}$ which are at present unknown.

On the other hand, the 1-loop term $\sim\tilde{\gamma}$ (which also involves the Ramond-Ramond field, etc.) arises from dimensional reduction of the 8-derivative pure metric term in M-theory [50]:

[TABLE]

This is a tidy way to specify the 1-loop term in the Type IIA effective action because from the 11d point of view, the solution is pure metric.

Now let us discuss the implications of this for the black hole thermodynamics. To first order, we simply evaluate the effective action on the solution. To evaluate the 1-loop term, one can simply use the 11d black string solution (9), see [51, 52] and the Appendix of [45]. Although we do not know the full tree-level correction, knowing that the leading correction $\sim(\alpha^{\prime})^{3}$ together with dimensional analysis allows us to estimate the temperature dependence of the correction.

To summarize the results, it is convenient to measure energy and temperature in ’t Hooft units $\tilde{E}=E/(g_{\mathrm{YM}}^{2}N)^{1/3},\tilde{T}=T/(g_{\mathrm{YM}}^{2}N)^{1/3}$ . Then we have a double expansion in low temperature and large $N$ :

[TABLE]

Only $a_{0}$ and $b_{0}$ are known analytically. In exercise 2.2, you were asked to show that $a_{0}=\frac{9}{14}4^{13/5}15^{2/5}(\pi/7)^{14/5}$ . To understand the exponent $23/5$ associated with $a_{1}$ , note that the curvature at the horizon scales $\alpha^{\prime}R\propto\tilde{T}^{3/5}$ , so a term like $\sim R^{4}$ will give an additional factor of $T^{9/5}$ relative to the Einstein-Hilbert term $\sim R$ . Furthermore, the sub-leading $a_{2}$ comes from a 12-derivative term. So the subleading exponent associated with $a_{2}$ is given by

[TABLE]

The $b_{0}$ term can be obtained by integrating the $R^{4}$ correction to M-theory on the background (9), see [51, 52] and the Appendix of [45]. The $b_{1}$ term comes from $\sim D^{6}R^{4}$ , which again leads to a $T^{9/5}$ suppression relative to the $R^{4}$ M-theory term.

2.6 Monte Carlo

An extremely important tool for studying this model is lattice Monte Carlo [12, 13, 14, 15, 16, 17, 18, 19]. Most of these studies attempt to approach the strongly coupled ’t Hooft regime by putting the theory on a Euclidean circle and working in the path integral formalism. To do so, one discretizes Euclidean time, and then integrates out the fermions to derive a measure for the bosonic matrices and the gauge field.

There are two important subtleties when doing these simulations. First, after integrating out the fermions there is no guarantee that the resulting measure is positive, e.g., can be interpreted as a probability measure that can then be efficiently sampled from. In practice, one simply samples from the absolute value of the measure121212This is sometimes referred to as the “quenched” approximation in the literature.; a priori this seems like a dramatic modification to the problem; however, various authors [14, 15, 16] give evidence that the phase of the measure is sharply peaked at some values of temperature and $N$ , although it is unclear whether fluctuations in the phase are parametrically suppressed in the strongly coupled ’t Hooft regime. Second, since the black hole phase is only metastable at finite $N$ , one must regulate the problem by effectively putting the system in a box. In the modern approach, one does this by turning on the BMN mass deformation term which lifts all the flat directions [7].

There is also a Monte Carlo prediction [18] for $a_{1}=-9.90\pm 0.31$ and even $a_{2}=5.78\pm 0.38$ , see also [53]. As we have already discussed, just predicting $a_{1}$ using string theory is an interesting challenge; in principle, it could be extracted from a tree-level 8-pt amplitude, but one could also hope for a less cumbersome method. Impressively, [18] also claim to reproduce the 1-loop value of $b_{1}$ , which is the first hint of 11d M-theory physics. See box (4.2) for some discussion on other approaches to computing these M-theory corrections.

3 The matrix model

The D0 brane matrix quantum mechanics consists of 9 bosonic matrices $X_{I}$ and 16 fermionic matrices $\psi_{\alpha}$ , which transform under an SO(9) $R$ -symmetry in the fundamental and spinor representations. The matrix model was introduced and studied before the BFSS conjecture [54, 55, 56]. All matrices are taken to be Hermitian and traceless; they satisfy the canonical commutation relations:

[TABLE]

In this review, there is no distinction between upper and lower indices. The Hamiltonian is

[TABLE]

In the above expression, there is an implicit sum over $I,J$ . With these conventions, $X$ has units of energy and $g^{2}_{\mathrm{YM}}$ has units of $E^{3}$ . We can take the SO(9) gamma matrices $\gamma^{I}$ to be real, traceless, and symmetric. To compare with the BFSS literature, it is convenient to perform the canonical transformation $X=\tilde{X}/(2\pi\ell_{s}^{2}),P=(2\pi\ell_{s}^{2})\tilde{P}$ so that $\tilde{X}$ has units of length and

[TABLE]

Here we take the matrices to be traceless and Hermitian so that they transform under the gauge group SU( $N$ ). We could also include the zero mode, which would just be a free non-relativistic particle131313In these units the U(1) mode is governed by $H=\frac{R}{2N}p^{2}$ . If we fix $p^{2}\ell_{p}^{2}$ , then $H\ell_{p}=\frac{R}{2\ell_{p}N}(p^{2}\ell_{p}^{2})$ . This model has 16 supercharges which transform as spinors under the SO(9) global symmetry. The 16 Hermitian supercharges are

[TABLE]

They satisfy the supersymmetry algebra

[TABLE]

In the above equation, $C_{ij}$ is a generator of SU $(N)$ symmetry, where each matrix transforms in the adjoint representation. By choosing the matrices to transform in SU $(N)$ as opposed to U( $N$ ), we have removed the center of mass degree of freedom. We have the option to gauge or ungauge the model. For a quantum mechanical system this just means whether we take the Hilbert space to be the SU $(N)$ invariant sector or whether we include arbitrary states that transform non-trivially under SU( $N$ ). Note that $C\neq 0$ violates SUSY. For more, see [57].

Exercise 6: Dimensional reduction

Dimensionally reduce flat space 10D SYM, $\mathcal{N}=1$ SYM or 4D $\mathcal{N}=4$ SYM and check that it gives BFSS. Consider $\mathcal{N}=4$ SYM on $S^{3}\times\mathbb{R}$ . The theory on $S^{3}$ has an SO(4) $\simeq\mathrm{SU}(2)_{L}\times\mathrm{SU}(2)_{R}$ symmetry. Show that at the classical level, truncating to the modes that preserve $\mathrm{SU}(2)_{R}$ leads to the massive Berenstein-Maldacena-Nastase (BMN) matrix model [7]; see [58].

3.1 The spectrum at finite $N$

Here is what is believed to be true about the model at finite $N$ :

There exists a unique, normalizable zero energy ground state (preserves $Q_{\alpha}$ and is rotationally invariant). The evidence for this is a rather subtle index computation, see [59, 60] for $N=2$ and [61, 62] for results for more general $N$ . Furthermore, [63] argued that all ground states must be SO(9) singlets. This implies that all ground states are bosonic $(-1)^{F}=1$ and combined with the index result this implies that there is a unique normalizable zero energy state for $N=2$ . Naively, one would think that the Witten index $\Tr(-1)^{F}e^{-\beta H}$ as $\beta\to 0$ can be computed using the IKKT matrix integral [61], but this is not quite right due to the flat directions. 2. 2.

There are power law tails of the wave function; these can be studied in a $1/r$ expansion. These tails are associated with splitting the $N\times N$ matrix into approximately block diagonal configurations, with $N_{1}\times N_{1}$ and $N_{2}\times N_{2}$ sub-matrices in the ground state [64, 65, 66, 67, 34]. Splitting into two sub-matrices is associated with a $1/r^{9}$ power law tail, where $r=|x_{1}-x_{2}|$ is the relative separation between the blocks. The ground state wavefunction enjoys a further “factorization” property where each block hierarchically splits into smaller sub-blocks [67, 34]. 3. 3.

All other states are scattering states, with the continuum starting at $E>0$ . There are no bound states with energy $E>0$ (although at large $N$ it is believed that there are metastable states, see Exercise 2.2). The in/out scattering states have the following block diagonal form:

[TABLE]

Here $\psi_{0}$ is the ground state wavefunction of a smaller BFSS matrix model with $N=N_{i}$ . $X_{i}$ is an $N_{i}\times N_{i}$ traceless Hermitian matrix, and $x_{i}$ is the trace of the block, e.g., the center of mass. The zeros indicate that the wavefunction is peaked around zero (in some gauge), e.g. that the overall wavefunction is peaked around matrices that are simultaneously block-diagonalizable. We have depicted a situation where there are 3 blocks, but in general there could be any number of blocks $2\leq n_{\text{blocks}}\leq N$ .

We have also suppressed the fermions in the above notation; the ground state wavefunction should really be written $\psi_{0}(X^{I}_{i},\psi^{\alpha}_{i})$ where $\psi^{\alpha}_{i}$ are all traceless matrices. The bosonic U(1) mode comes with 16 superpartners $\psi_{\alpha}$ where $\psi$ is a Majorana fermion. We may think of $x_{i}$ as Goldstone bosons where we have spontaneously broken the SU( $N$ ) symmetry by separating the branes; $\psi_{\alpha}$ may then be viewed as Goldstinos. This defines a Hilbert space of dimension

[TABLE]

To fully specify the asymptotic state, we should also specify the state in this Hilbert space. We have written the decomposition of 256 in anticipation of the BFSS conjecture (81).

Since each SU( $N$ ) block is in the ground state, the total energy of the scattering state comes just from the U(1) factors. It is given by

[TABLE]

and is independent of the state of the Goldstino fermions.

4 Scattering and the BFSS conjecture

We would like to understand the gravity dual of the scattering setup described in the previous section.

4.1 Scattering in the ’t Hooft limit

A generalization of the extremal solution (2) is to multi-center solutions:

[TABLE]

The BPS condition $Q=M$ implies that the gravitational force cancels the Coulomb force between the black holes and therefore one can place these black holes anywhere with respect to each other.

By repeating the decoupling arguments, we conclude that this configuration is represented by a state in the matrix quantum mechanics where the $X_{i}$ have vevs given by $r_{i}$ . The charge of each black hole is represented by $N_{i}$ , the size of each block diagonal matrix.

Since the multi-center solutions exist for any choice of $r_{i}$ , it is natural to consider a generalization of these solutions where the black holes are moving slowly with respect to each other. In particular, by the decoupling logic, these configurations would be relevant for the scattering problem in the matrix quantum mechanics. There is a general procedure for generalizing these multi-center solutions to slowly moving solutions [68]. We promote $\vec{r}_{i}\to\vec{r}_{i}(t)$ . Then in the small velocity expansion,

[TABLE]

Here $A_{i}$ and $N^{i}$ are determined by solving the constraints. Actually this calculation was already done by Shiraishi [69]; in the notation of [69] the values relevant141414The paper claims that $a=1$ for string theory, but this is a typo. In string frame RR fields should not be coupled to the dilaton. for the D0 brane computation are $a^{2}=N=9$ . In principle,

[TABLE]

We have seen that the cancellation of electric and gravitational forces implies $V=0$ . Actually SUSY implies that the $g=1$ , consistent with the results of [69]. It would be interesting to compute $F(r)$ by going to higher orders in the low velocity expansion. (This is similar to a post-Newtonian approximation, except that we do not assume that the metric is close to flat.)

A slightly different regime is to consider scattering with a large cluster of D0’s and a small number of “probe” D0’s. In the ’t Hooft limit, we can view this process as being governed by the probe D0 brane action in the extremal D0 black hole background [70]. The DBI action for a probe D0 brane in this background is

[TABLE]

We see that we recover the above facts, namely that $V=0,g=1$ and that $F\propto 1/\rho^{7}$ . A trivial generalization of this computation is to replace $N\to N_{1}N_{2}$ if we have multiple D0 branes in the gravity background with $N_{1}\gg 1$ .

Exercise 7: Probe D0 brane scattering

Study the solutions151515I thank Gauri Batra for discussions about these trajectories. of (71). Show that there are “scattering” trajectories in Lorentzian signature where the probe D0 brane approaches $\rho\to\pm\infty$ when $\tau\to\pm\infty$ .

The result (72) can be reproduced from the BFSS matrix model. To compare with the literature, we should convert back to $r,t$ coordinates using $\rho=(r/\ell_{s})/(60\pi^{3}g_{s}N_{1})^{1/3}$ and $\tau=(60\pi^{3}g_{s}N_{1})^{1/3}t/\ell_{s}$

[TABLE]

The $v^{4}$ and $v^{6}$ terms have been matched (including the precise numerical coefficient161616To compare with [1, 70] one should shift $\ell_{p}^{3}\to\ell_{p}^{3}/(2\pi)$ .) to a 1-loop and 2-loop computation in the matrix theory [1, 71, 70]. The idea is that we separate the degrees of freedom into slow and fast modes and use a Born-Oppenheimer approximation. The fast modes are harmonic oscillators with $\omega=|X_{1}-X_{2}|/\alpha^{\prime}$ , which can be interpreted as arising from open strings connecting the separated D0 branes. For this calculation, it is easy to work in the Hamiltonian formalism (see [72] for a recent discussion).

Let us comment on the loop counting parameter in this approach. We focus on two matrices to illustrate the approach. For small $w_{x}$ we have

[TABLE]

If the separation between matrices is large $X^{I}X^{I}$ is large, then we see that the $w$ mode is very heavy. The commutator square interaction gives us an oscillator term

[TABLE]

This oscillator has a characteristic frequency given by the energy of a stretched string:

[TABLE]

So the loop counting parameter in the scattering theory is

[TABLE]

For example, the 1-loop matrix theory computation which reproduces the $\dot{r}^{4}/r^{7}$ term in (74) a priori should be corrected by a function $f_{4}(\lambda_{\text{eff}})=1+c_{1}\lambda_{\text{eff}}+\cdots$ . The perturbative computation only guarantees that $f_{4}=1$ at small $\lambda_{\text{eff}}$ but to compare with Type IIA gravity, we need to know $f_{4}$ as $\lambda_{\text{eff}}\to\infty$ , e.g., $N^{1/7}\ll r/\ell_{p}\ll N^{1/3}$ , see exercise 2.2. This corresponds to strong ’t Hooft coupling $1\ll\lambda_{\text{eff}}\ll N^{4/7}$ . How are we supposed to extrapolate from weak coupling to strong coupling? Here one appeals to extended supersymmetry [73, 74], which forbids certain corrections to the effective potential. This is enough to show that $V=0$ and $g=1$ in (69) and that $f_{4}$ is constant and therefore compare with the strong coupling (gravity) prediction. Similarly, the $\dot{r}^{6}$ term is protected and in fact the coefficient of the $\dot{r}^{6}$ term is determined171717The $v^{6}$ terms are determined in the SU(2) effective theory which is relevant for $2\to 2$ scattering. For higher-pt scattering amplitudes, one needs to consider larger gauge groups, where there are unfixed couplings [75] for the $v^{6}$ terms but the $v^{4}$ terms remain protected [76, 75]. I thank Savdeep Sethi for discussions. by lower derivative terms in the effective action [74].

4.2 Scattering beyond the ’t Hooft limit

In the previous section, we studied scattering at strong ’t Hooft coupling. Perhaps even more interesting is to consider the ultra-strongly coupled limit

[TABLE]

Here we are thinking of $r$ as the impact parameter $b$ in the scattering process. This is far beyond the conventional ’t Hooft regime; it is more analogous to the limit $N\to\infty$ , $g^{2}_{\mathrm{YM}}=\text{fixed}$ in $\mathcal{N}=4$ SYM [77, 78]. Based on the gravity picture, one should expect this to probe 11-d scattering. This is indeed what BFSS conjectured. From the modern point of view, their conjecture is actually even bolder, as they assume that the amplitude is not contaminated by any effects that occur in the weak coupling/stringy region.

Let us now state precisely the BFSS conjecture [1]:

[TABLE]

Here the RHS is the M-theory amplitude in asymptotically flat 11d spacetime. For this conjecture to be true, it is necessary that the only stable particles in M-theory are massless. The state of the goldstino fermions encodes the particular polarization states of the particles. In particular, we interpret the RHS of (62) as follows: 44 is the dimension of rank 2 traceless symmetric tensors of SO(9) (the little group in 11d) and 84 is the dimension of rank 3 fully anti-symmetric tensors. 128 is a fermionic irrep of spin(9). These are precisely the states associated with the graviton, the 3-form gauge field $A_{\mu\nu\rho}$ , and the gravitinos in M-theory.

The conjecture is usually stated as $R\to\infty$ , but we can state it in terms of dimensionless quantities. First, if all momenta are fixed in Planck units, the typical impact parameter (say in the $2\to 2$ scattering) will be of order the Planck scale, so we will satisfy (80). This implies that the dimensionless ’t Hooft coupling $\lambda_{\text{eff}}\sim N$ according to (79). For another way of understanding the scaling $\ell_{p}/R\sim 1/N$ , let us consider the Hamiltonian (which is identified with $p_{+}$ ) in units of the Yang Mills coupling. The BFSS scaling implies

[TABLE]

If we go back to the holographic analysis, when $E/(g^{2}_{\mathrm{YM}})^{1/3}\sim 1/N$ the dual is an 11d Schwarzschild black hole whose Schwarzschild radius is fixed in Planck units [25]. In other words, an intermediate state in the graviton scattering could be an 11d M-theory black hole. Once again $\lambda_{\text{eff}}\sim g^{2}_{\mathrm{YM}}N/E^{3}\sim$ We may compare the energy in the ’t Hooft regime where the dimensionless velocity $v^{2}$ appearing in (72) is fixed, $H/(g^{2}_{\mathrm{YM}}N_{1})^{1/3}\sim N_{1}N_{2}v^{2}$ . As expected, the BFSS conjecture energies that are much smaller than the scattering energies in the ’t Hooft regime.

Exercise 8: Gravitational scattering

Recall from Alexander Zhiboedov’s lectures that in $d>4$ , Einstein gravity in the limit $s=\text{fixed},t\to 0$ gives the amplitude

$\displaystyle\mathcal{A}_{\text{tree}}(s,t)\sim-\frac{8\pi G_{N}s^{2}}{t}$

(85)

Here $16\pi G_{N}=(2\pi)^{8}\ell_{p}^{9}$ . Work out the 11d kinematics and show that $s=-N_{1}N_{2}\dot{r}^{2},t=-\vec{k}^{2}$ . Then

$\displaystyle V$ $\displaystyle=\frac{1}{2\pi R}\prod_{i=1}^{4}\frac{1}{\sqrt{2E_{i}}}\int\frac{\mathrm{d}^{9}k}{(2\pi)^{9}}e^{\mathrm{i}\vec{k}\cdot\vec{r}}\mathcal{A}_{\text{tree}}$

(86)

$\displaystyle=\frac{15N_{1}N_{2}(2\pi)^{3}\ell_{p}^{9}\dot{r}^{4}}{16R^{3}r^{7}}$

(87)

where you may use $\int\frac{\mathrm{d}^{9}\vec{k}}{(2\pi)^{9}}e^{\mathrm{i}\vec{k}\cdot\vec{r}}|\vec{k}|^{-2}=\frac{15}{2(2\pi)^{4}}r^{-7}$ . This agrees with the second term in (74). For help and/or the generalization to the $v^{4}/r^{16}$ term see [79].

**Further reading 3: Other approaches to M-theory. **

We can also take the flat space limit of AdS/CFT [78, 77, 80] and extract flat space amplitudes. Note that this also involves a large $N$ , ultra-strongly coupled limit.

M2 branes: the low energy limit of M2 branes is given by a 3D superconformal field theory, the ABJM theory [81]. This theory is dual to AdS ${}_{4}\times S_{7}/\mathbb{Z}_{k}$ . Using numerical conformal bootstrap (with the help of inputs from localization), one can compute stress-energy tensor correlation functions and study the flat space limit [82, 83, 84]. The numerical CFT bootstrap seems promising for computing even non-protected terms to the M-theory effective action.

M5 branes: the low energy limit of M5 branes is governed by the 6D (2,0) theory. The holographic dual is AdS ${}_{7}\times S_{4}$ [24]. The conformal anomaly181818The $A_{N-1}$ theory has central charge $c(A_{N-1})=4N^{3}-3N-1$ . The $-3N$ term should be reproducible from the bulk $R^{4}$ term, according to [85]. of the 6D CFT at finite $N$ has been computed [86, 87, 88, 89, 90] and the sub-leading $N$ dependence should be reproduced by the $R^{4}$ term in M-theory (50), see [85]. The $R^{4}$ term was carefully reproduced in [91] by considering 4-pt functions of 1/2-BPS operators. See also [92, 93] for bootstrapping this theory.

Although this has never been worked out in detail, one could in principle extract the M-theory S-matrix from correlation functions of the supergravity modes in BFSS using a similar logic to the AdS/CFT flat space limit [78, 77, 80].

Finally, a tantalizing possibility is that the M-theory amplitude is an extremal amplitude in the S-matrix bootstrap [94]. If so, general principles of unitarity, crossing, etc., would be enough to determine the amplitude.

It would be nice to compare these different approaches, in particular to compute some non-protected quantity from two or more different approaches.

It is sometimes said that we do not have an independent, non-perturbative bulk definition of M-theory. Nevertheless, a variant of the BFSS conjecture that seems non-perturbatively well-defined is: the amplitudes of the matrix model in the BFSS limit agree with the flat space limit of the amplitudes obtained from the ABJM or the 6D (2,0) CFT. This conjecture is still shocking as the different quantum systems a priori have very little in common.

5 Other techniques for analyzing the BFSS model

Here we give a superficial survey of various techniques that have been used to analyze the BFSS model analytically.

5.1 High temperature/weak ’t Hooft coupling expansion

In the high temperature limit, the effective ’t Hooft coupling $\lambda\beta^{3}\ll 1$ and we can develop an expansion in $\lambda_{\text{eff}}$ . The $\beta\to 0$ limit is not quite a free theory because we are left with a non-trivial matrix integral over the Matsubara zero modes. Note that if we impose periodic boundary conditions for the fermions we obtain the IKKT matrix integral, whereas if we impose anti-periodic boundary conditions, there are no fermion zero modes. The latter case is relevant for the usual thermodynamics, see [95].

On a similar conceptual footing, one can consider the BMN matrix model at very large mass $\mu$ parameter. Then the effective ’t Hooft coupling $\lambda/\mu^{3}\ll 1$ and one can do ordinary Hamiltonian perturbation theory [96] to compute the energy levels, etc.

5.2 SUSY techniques

The computation of the index in BFSS is perhaps the oldest case where supersymmetric techniques were applicable. We have already mentioned this in section 3.1; it is a subtle computation due to the flat directions. In the BMN matrix model, the flat directions of the BFSS matrix model are lifted due to the presence of mass terms. The index of the BMN model was recently computed [97].

One can also compute other quantities in the BMN matrix model using supersymmetric localization [98]. This technique can be used to compute correlation functions like $\expectationvalue{\Tr Z^{k}}$ where $Z=X_{a}+\mathrm{i}X_{m}$ where $a$ is an SO(6) index and $m$ is an SO(3) index. (One might ask whether these results give something non-trivial in the massless limit, but such correlation functions vanish in the BFSS vacuum due to SO(9) symmetry.) Recently, a mass-deformed version of IKKT has also been studied [4, 5, 6]; localization can also be used to compute the partition function and some correlation functions in this model. Both the mass-deformed IKKT and the mass-deformed BFSS model (the BMN model) have a rich set of vacua which unfortunately is beyond the scope of this review.

Beyond correlation functions, one may ask about scattering. As we have mentioned, the effective action on the moduli space is heavily constrained by non-renormalization theorems that rely on supersymmetry [73, 74, 76, 75]. This constrains the 4-pt amplitude (and higher-pt amplitudes), especially in the limit of zero momentum transfer in the longitudinal direction. Another recent development is the computation of the 3-pt amplitude. The 3-pt amplitude was recently computed in [99]. The 3-pt amplitude in M-theory is fixed (up to a constant) by kinematics191919The 3-pt kinematics involving massless particles cannot be satisfied with real momentum. However, it can be satisfied by going to (2,9) signature, or equivalently considering the usual (1,10) signature and taking one of the spatial momentum to be imaginary. . However, the symmetries of 11-d M-theory are not manifest in the BFSS model. Hence it is a non-trivial computation (and a test of the BFSS conjecture) that the 3-pt amplitude is reproduced by the matrix model. The idea in [99] is to uplift to a problem in 1+1D SYM compactified on a circle of circumference $2\pi\tilde{R}_{9}$ . When $\tilde{R}_{9}$ is small, we expect this problem to reduce to the BFSS problem. The boundary conditions on the cylinder are chosen in such a way to be related to the initial and finite states in the 3-pt amplitude.

Now converting the scattering problem in the quantum mechanics to a field theory problem seems like making the problem harder, but actually the field theory computation has an advantage. The trick is to note that the cylinder can be interpreted as a trace in the “open string” channel, we are computing a supersymmetric index $\Tr((-1)^{F}e^{-2\pi\tilde{R}_{9}H}\cdots)$ :

[TABLE]

Then one can compute the index semi-classically. On the other hand, in the closed string channel, one views the same quantity as an overlap between supersymmetric boundary states that $\bra{B_{L}}$ and $\ket{B_{R}}$ which are chosen to be D1 cousins of the block diagonal states described around (61). The result of the computation agrees with the BFSS prediction.

6 Matrix Bootstrap

6.1 Bootstrapping equations of motion

We will now turn to a particular non-perturbative approach to solving large $N$ matrix systems, including the BFSS matrix model. This approach is sometimes called the “matrix bootstrap”; its main features are

It is non-perturbative in the coupling; it does not rely on any weak-coupling expansion. 2. 2.

It works directly in the ’t Hooft large $N$ limit. 3. 3.

It can be used to solve quantum systems, including those that have a sign problem in Euclidean signature. 4. 4.

It works even for systems that are metastable; e.g., systems which are strictly speaking only well-defined at infinite $N$ .

Some potential downsides of the method are that it only produces inequalities on interesting observables. In some simple contexts it has been argued that these inequalities will be enough to give islands that converge to the exact solution [100]; but for most situations there is no guarantee of convergence. Of course, according to the bootstrap philosophy, rigorous inequalities might still be interesting, especially if there are “kinks” in the allowed region. Another potential downside is that for a generic system with $D$ matrices, the number of single trace operators of length $L$ grows $\sim D^{L}$ , which can quickly become computationally expensive.

The simplest context is the single matrix integral [101, 100]. The goal is to compute moments

[TABLE]

Here $V(X)$ is a polynomial in $X$ . We have normalized $\tr 1=1$ , e.g., $\tr=\frac{1}{N}\Tr$ . At infinite $N$ (in the ’t Hooft limit), we can use large $N$ factorization to reduce multi-trace observables to single traces. To bootstrap, we need a set of consistency conditions and a set of positivity constraints. For consistency conditions, we will use the Schwinger-Dyson or loop equations:

[TABLE]

In writing the double trace as a product of two single traces, we have assumed large $N$ factorization, and hence we have input infinite $N$ into the bootstrap. For a polynomial potential $V$ , we may view this equation as a recursion relation between lower moments and higher moments. In (89) we have depicted the Schwinger-Dyson equations for $V=\tfrac{1}{2}X^{2}+\frac{1}{3}gX^{3}$ .

Then we combine this with positivity of the Hankel matrix

[TABLE]

The notation $\mathcal{M}\succeq 0$ indicates that $\mathcal{M}$ is a positive semi-definite matrix, e.g., that all of its eigenvalues are non-negative. An equivalent characterization is that $p^{\dagger}\mathcal{M}p\geq 0$ for all vectors $p$ . This positivity follows from the fact that we may consider a general polynomial in the matrices

[TABLE]

where we have used the fact that $X$ is a Hermitian matrix.

Exercise 9: 1-matrix model bootstrap

Consider $V=\tfrac{1}{2}X^{2}+\frac{1}{3}gX^{3}$ . Construct the Hankel matrix $\mathcal{M}_{ij}$ up to length $i+j\leq L$ . Use the loop equations (89) to express all elements as a function of $\tr X$ and $g$ . Then use SemidefiniteOptimization[] in Mathematica to generate a plot of the allowed region for the $\tr X$ as a function of $g$ . At large values of $L$ , one should be able to estimate the range of $g$ where the problem is well-defined in the $N\to\infty$ limit.

Exercise 10: Single variable integral bootstrap

For an even easier exercise, derive the Schwinger-Dyson equation for the measure over a single bosonic variable:

$\displaystyle\langle x^{n}\rangle={Z}^{-1}\int\mathrm{d}x\,x^{n}e^{-V(x)}$

(93)

where $V$ can be your favorite bounded polynomial, e.g., $V(x)=\tfrac{1}{2}x^{2}+\frac{1}{4}gx^{4}$ . Define again the Hankel matrix $\mathcal{M}_{i,j}=\langle{x^{i+j}}\rangle$ and use SemidefiniteOptimization[] to bound the correlation functions. Can you get a high precision estimate of $\langle x^{2}\rangle$ , say at $g=1$ ?

In principle, large $N$ factorization means that different correlators are related by a set of quadratic equations202020For finite $N$ , one can instead impose trace relations [102].. This means that the semi-definite program is non-linear. For some simple problems like the 1-matrix model, one can simply scan over a small number of variables. However, more generally, one needs to use the method of non-linear relaxation [103]. The idea is to replace $q=x^{2}$ with $q\geq x^{2}$ . This can be encoded in a positive semi-definite matrix:

[TABLE]

More generally, let $\vec{x}$ be a set of single trace variables, e.g., $x_{k}=\expectationvalue{\tr X^{k}}$ . Then we define the matrix of variables $Q_{ij}$ . To perform non-linear relaxation, we replace all occurrences of $x_{i}x_{j}$ in the loop equations with $x_{i}x_{j}\to Q_{ij}$ . Then we enforce

[TABLE]

One can also impose a slightly more refined constraint [104] $\mathcal{M}\succeq Q,$ which follows from positivity of the covariance matrix $\langle\operatorname{tr}\left(\mathcal{O}_{i}-\left\langle\mathcal{O}_{i}\right\rangle\right)^{\dagger}\left(\mathcal{O}_{j}-\left\langle\mathcal{O}_{j}\right\rangle\right)\rangle=\mathcal{M}_{ij}-Q_{ij}\succeq 0.$

One can generalize the bootstrap approach to consider multi-matrix integrals, see [100] and [105]. One can consider all observables that are single trace words made up of $L$ letters. The number of such observables grows rapidly, $\sim D^{L}$ . However, the number of independent loop equations also grows exponentially. Furthermore, one should generalize $\mathcal{M}$ to be the inner product matrix of arbitrary words. This leads to a positivity matrix that is also exponentially large. Despite these challenges, [105] achieved high precision estimates for simple correlation functions in a certain 2-matrix model with an interaction of the form $V=V(X)+V(Y)-g^{2}\,\tr[X,Y]^{2}$ that is not believed to be solvable by analytic means. To achieve these impressive high precision estimates, it is important to use the non-linear relaxation method. Unlike the 1-matrix model, as one considers larger and larger values of $L$ , the number of unknown variables grows and it is impractical to scan over all such variables.

6.2 Quantum mechanical bootstrap

Now we turn to the generalization from the matrix integral to matrix quantum mechanics. The key insight [106] is to work in the Hilbert space/Hamiltonian formalism. One replaces the Schwinger-Dyson equations with the equations of motion:

[TABLE]

where $\mathcal{O}$ is any operator and the expectation value is with respect to an energy eigenstate (or more generally a density matrix which commutes with the Hamiltonian.).

The positivity constraints are again of the form $\mathcal{M}_{ij}=\langle O^{\dagger}_{i}O_{j}\rangle,\;\mathcal{M}\succeq 0$ . This positivity constraint only relies on the positivity of the Hilbert space inner product; it holds for any quantum-mechanical system. For Euclidean problems, we are using reflection positivity and not positivity of the Euclidean measure; fermionic degrees of freedom are not a problem for this quantum mechanical bootstrap.

One can also scan over $\expectationvalue{H}=E$ . For a simple quantum system like a non-relativistic particle in a potential $V(x)$ , one can scan over $E$ while minimizing/maximizing over some simple 1-pt function like $\expectationvalue{x^{2}}$ . If one plots $E$ vs $\expectationvalue{x^{2}}$ , typically one finds an “archipelago” where there are a few islands corresponding lowest eigenvalues of the Hamiltonian; at larger values of energy there is a peninsula where individual eigenstates cannot be resolved.

Exercise 11: Bootstrapping simple quantum systems

One can apply the quantum mechanical bootstrap to simple quantum systems, like a particle in a potential (see e.g. [107, 108]). Compute the ground state energy of the quantum anharmonic oscillator using these ideas.

A more interesting problem is to consider the “toy supermembrane.” This is a non-relativistic particle in 2 dimensions with potential $V=x^{2}y^{2}$ . Classically this model has some flat directions and was considered a toy model for D0 brane quantum mechanics [56, 72].

For the application to large $N$ matrix quantum mechanics, we should consider the equations of motion obtained from single trace operators $\expectationvalue{[H,\Tr\mathcal{O}]}=0$ and the corresponding positivity matrix of adjoints $\expectationvalue{\Tr O^{\dagger}_{i}O_{j}}\geq 0$ . Note that the commutator of two single traces gives a single trace.

In addition we use cyclicity of the trace $+$ canonical commutation relations212121Here $\tr$ is over the SU( $N$ ) indices and should not be confused with the trace over the quantum Hilbert space. This is why we cannot simply write $\tr(AB)=\tr(BA)$ ., e.g.,

[TABLE]

In the Hamiltonian approach, these cyclicity relations are the only ingredient which generate double trace relations, and therefore the only place where we input $N=\infty$ .

6.3 Finite temperature

The above constraints allow us to bootstrap systems in an energy eigenstate, or in the large $N$ limit, the microcanonical ensemble. However, we would also like to bootstrap systems in the canonical ensemble. To do so, one leverages an inequality that is sometimes called the energy-entropy balance (EEB) inequality (Araki and Sewell [109], see also [110]). The EEB inequality states that for any operator $O$ ,

[TABLE]

where expectation values are taken with respect to the thermal state $\rho=\frac{1}{Z}e^{-\beta H}$ . Together with time translation $\expectationvalue{[O,H]}=0$ and positivity $\expectationvalue{O^{\dagger}O}\geq 0$ and normalization of the trace $\expectationvalue{1}=1$ . Imposing this inequality for all operators $O$ is equivalent to KMS. Recently this has been used to bootstrap the thermodynamics of the ungauged 1-matrix model and the 2-matrix model with Yang-Mills like interaction, see [111]. For this application, one chooses $O$ to be an operator that transforms in the adjoint of SU( $N$ ) or U( $N$ ). A derivation of this inequality using log-convexity of the thermal 2-pt function may be found in [112], see also Appendix A of [113] and Theorem 5.3.15 of [114].

A simple case of the EEB inequality is to take $\beta\to\infty$ ; then we derive the ground state positivity condition:

[TABLE]

This inequality says that perturbing the ground state by acting with an operator $O$ may only increase the energy of system. A slight generalization of this statement is that the matrix

[TABLE]

This follows from the requirement that a generic superposition of perturbations $c_{i}O_{i}$ acting on the ground state $\ket{\Omega}$ raises the energy. This inequality can be used to obtain very precise bounds on properties of the ground state [115]. It is a bit simpler to work with in practice, since we do not have to deal with the non-linear nature of (102), although this can be dealt with using convex relaxation [110, 111].

Exercise 12: Energy-entropy inequality

Consider the harmonic oscillator at finite temperature222222I thank everyone at an SITP lunch for suggesting/discussing this example.. Consider the inequality (102) with $O=a^{n}$ . Then separately consider $O=(a^{\dagger})^{n}$ . Argue that for both sets of inequalities to be true, we must have saturation

$\displaystyle\log\frac{\expectationvalue{O^{\dagger}O}}{{\expectationvalue{OO^{\dagger}}}}=\beta\frac{\expectationvalue{O^{\dagger}[H,O]}}{\expectationvalue{O^{\dagger}O}}$

(105)

Use this to argue that the density matrix $\rho\propto\exp(-\beta H)$ .

An interesting recent development is that the bootstrap can be generalized to time-dependent problems:

Further reading 4: Time dependent bootstrap

Here we summarize the LMN bootstrap method for time-dependent 1-pt functions [116]. The primal problem is:

$\displaystyle\operatorname{minimize}$ $\displaystyle\operatorname{Tr}\mathcal{O}\mathcal{M}(T)$

(106)

subject to $\displaystyle\mathcal{M}(t)\succeq 0$

(107)

$\displaystyle\operatorname{Tr}A^{(i)}\mathcal{M}(t)=a^{(i)}$

(108)

$\displaystyle\operatorname{Tr}\left(D^{(k)}-C^{(k)}\frac{\mathrm{d}}{\mathrm{d}t}\right)\mathcal{M}(t)=0$

(109)

$\displaystyle\mathcal{M}(T=0)=\mathcal{M}_{0}$

(110)

Here $\mathcal{M}(t)$ is a matrix that encodes a set of Lorentzian 1-pt functions. The constraint (109) encodes the Heisenberg equations of motion and (110) is the initial condition (where information about the initial state is input). (107) inputs Hilbert space positivity, and (108) enforces relations between 1-pt functions due to the canonical commutation relations.

The dual problem is

maximize $\displaystyle\lambda_{D}^{(k)}(0)\operatorname{Tr}C^{(k)}\mathcal{M}_{0}$

(111)

subject to $\displaystyle\lambda_{D}^{(k)}(T)C^{(k)}=\mathcal{O}$

(112)

$\displaystyle\lambda_{A}^{(i)}(t)A^{(i)}+\left(D^{(k)}+C^{(k)}\frac{\mathrm{d}}{\mathrm{d}t}\right)\lambda_{D}^{(k)}\succeq 0.$

(113)

To derive the dual problem, one introduces an action for the primal problem, with Lagrange multipliers $\lambda_{A}(t),\lambda_{D}(t)$ which enforce (108) and (109). One also introduces a Lagrange multiplier $\Lambda(t)\succeq 0$ that enforces (107). Then one can integrate out (e.g. enforce the equations of motion) for the $\mathcal{M}(t)$ variable, which leads to (112) and (113). The resulting action (111) will then just involve the Lagrange multiplier evaluate at $t=0$ .

The main point is that in the dual problem, we do not need to solve the Heisenberg equations of motion, which is generically impossible in a strongly coupled system. We have managed to convert the equations into a set of inequalities on the dual variables $\lambda(t)$ . Roughly speaking, one can expand $\lambda(t)=\sum_{i}\lambda_{i}e_{i}(t)$ over sum finite basis of functions $e_{i}(t)$ and try to optimize over the choice of coefficients $\lambda_{i}$ subject to the constraints. As long as we find a solution to the dual problem (a feasible function that satisfies (113)), we immediately derive a bound on the primal problem. In particular, we do not need to search over an infinite-dimensional basis of functions $e_{i}(t)$ to derive a bound!

One can generalize this method to solve a variety of time-dependent problems, including the 2-pt function in Euclidean signature [112]. This allows one to also study thermal properties by imposing the KMS condition on 2-pt functions without using the EEB inequalities .

6.4 Application to BFSS

We now discuss the application of these bootstrap ideas to BFSS. Optimistically, using the above ideas, one should be able to compute a wide variety of quantities that probe the rich physics of this model that we have reviewed. For example, one could hope to compute the thermodynamics of the model, which would test the Bekenstein-Hawking area formula and go beyond into the stringy black hole regime, as we discussed in (2.5). For now, however, we will focus on a basic but surprisingly non-trivial question: what is the size of the bound state wavefunction? This is non-trivial since this is a low-energy (and therefore strong coupling) property of the BFSS system that cannot be computed using any known weak-coupling method. In the original BFSS paper [1], two estimates were given. One estimate was $r/\ell_{p}\sim N^{1/9}$ , which is smaller than the size of the M-theory bubble $r/\ell_{p}\lesssim N^{1/7}$ . Another estimate, later confirmed by Polchinski [9], is the much larger value $r/\ell_{p}\gtrsim N^{1/3}$ , which is the answer expected from ’t Hooft scaling232323To see this, note that the engineering dimension $[X]=\text{mass}$ , so $1/N\expectationvalue{\Tr X^{2}}\lambda^{-2/3}\sim r^{2}/N^{-2/3}\sim O(1)$ . . From a modern perspective [117], we can recognize [9]’s arguments as a bootstrap proof. (For an intuitive account of Polchinski’s argument, see [77].) It is based on exactly the same constraints and positivity ideas that were listed in [106] and outlined above. These are:

Constraints:

$\displaystyle\expectationvalue{[H,\Tr X^{2}]}=0\Rightarrow\expectationvalue{\Tr X^{I}P_{I}}+\expectationvalue{\Tr P^{I}X_{I}}=0.$

(114)

$\displaystyle\expectationvalue{\Tr[X^{I},P^{I}]}=9\mathrm{i}N^{2}.$

(115)

$\displaystyle\expectationvalue{[H,\Tr XP]}=0\Rightarrow-2\expectationvalue{K}+4\expectationvalue{V}+\expectationvalue{F}=0.$

(116)

$\displaystyle\expectationvalue{H}=E\Rightarrow\expectationvalue{K}+\expectationvalue{V}+\expectationvalue{F}=E.$

(117)

Combining (114) and (115) gives $\expectationvalue{\Tr X^{I}P^{I}}=9\mathrm{i}N^{2}/2$ . Combining (116) and (117), we may eliminate $\expectationvalue{F}$ :

[TABLE]

Now we list some relevant positivity constraints:

Positivity

$\displaystyle\left(\begin{array}[]{cc}\expectationvalue{\Tr X^{2}}&\expectationvalue{\Tr XP}\\ \expectationvalue{\Tr PX}&\expectationvalue{\Tr P^{2}}\\ \end{array}\right)\succeq 0,\quad\Rightarrow\sum_{I}\expectationvalue{\operatorname{Tr}X^{2}}\expectationvalue{\operatorname{Tr}\left(P^{I}P^{I}\right)}\geq\frac{9}{4}N^{4}.$

(121)

$\displaystyle\left(\begin{array}[]{cc}\Tr X^{4}&\Tr X^{2}Y^{2}\\ \Tr X^{2}Y^{2}&\Tr X^{4}\\ \end{array}\right)\succeq 0,\quad\left(\begin{array}[]{cc}\Tr X^{2}Y^{2}&\Tr XYXY\\ \Tr XYXY&\Tr X^{2}Y^{2}\\ \end{array}\right)\succeq 0,$

(127)

$\displaystyle\left(\begin{array}[]{cc}\langle\operatorname{Tr}1\rangle&\left\langle\operatorname{Tr}X^{2}\right\rangle\\ \left\langle\operatorname{Tr}X^{2}\right\rangle&\left\langle\operatorname{Tr}X^{4}\right\rangle\end{array}\right)\succeq 0,$

(130)

We start with the RHS of (121), which is essentially the uncertainty principle for matrices. We use (118) to replace $\expectationvalue{\Tr P^{I}P^{I}}$ (kinetic energy) with potential energy:

[TABLE]

We have used SO(9) symmetry to focus on just 2 of the matrices, which we call $X$ and $Y$ . We can then relate both terms to $\expectationvalue{\Tr X^{4}}$ using the positivity relations (LABEL:xxyy). Finally, we can replace $\expectationvalue{\Tr X^{2}}$ with $\expectationvalue{\Tr X^{4}}$ using (130). This allows us to write the inequality in terms of just $\expectationvalue{\Tr X^{4}}$ :

[TABLE]

Let us make a few comments. Setting $E=0$ recovers Polchinski point. Assuming parametric saturation of the bound implies that “typical eigenvalue” $r\sim\lambda^{1/3}$ , which is roughly the size of the Type IIA supergravity region. The scale at which the bound varies is $E/N^{2}\sim\lambda^{1/3}$ . Recall that the regime of validity of supergravity is $E/N^{2}\ll\lambda^{1/3}$ , see e.g. (52). It is interesting that these simple arguments give us non-trivial dynamical information even in the strong coupling regime.

Recognizing the Polchinski virial theorem as a bootstrap bound immediately allows us to improve it. In particular, we can derive a bound on $\expectationvalue{\tr X^{2}}$ which cannot be obtained from the above arguments alone. To do so, we should include information about the fermionic term $F$ . The idea is to consider a $3\times 3$ positivity matrix corresponding to the operators $O_{I},P_{I},X_{I}$ where we have written the fermionic term as $\Tr O_{I}X_{I}$ .

[TABLE]

In the next section, we show that 242424The bound in [117] was improved in [118] by treating the fermionic constraints more systematically. that $1/9\expectationvalue{\Tr O_{I}O_{I}}\leq 64N^{2}/3$ , see (154). Then demanding that $\det\mathcal{M}_{3}\geq 0$ gives

[TABLE]

Improving Polchinski’s bound

With a little more work [118], one can derive the slightly better level 6 analytic bound

$\displaystyle\frac{1}{N(g^{2}_{\mathrm{YM}}N)^{2/3}}\left\langle\operatorname{Tr}{X}^{2}\right\rangle=\langle\tr\tilde{X}^{2}\rangle\geq\frac{3}{4}\left(\frac{3}{50}\right)^{1/3}\approx 0.2936$

(139)

This should be compared to the Monte Carlo value of 0.378 [18]. It is remarkable that these simple analytical bounds can recover $\geq 0.8$ the Monte Carlo result.

This result implies that $r/\ell_{p}\gtrsim N^{1/3}$ , which is the size of the supergravity region, see exercise (2.2).

Further reading 5: Intuition and anti-intuition about the bound state size

There is an intuitive, non-rigorous argument [77] that the wavefunction has a size $r/\ell_{p}\sim N^{1/3}$ that is closely related to the above ideas. Consider some matrix $X_{1}$ , it should have a typical size $\tr X_{1}^{2}\sim Nr^{2}$ , where $r$ is a typical eigenvalue of $X_{1}$ . Then consider a different matrix $X_{2}$ . The idea is to view the off-diagonal elements of the matrix $X_{2}$ as harmonic oscillators with frequency given by our previous estimate (78) $\omega\sim Rr/\ell_{p}^{3}$ . If we assume that each matrix element is in the oscillator ground state, the typical size of an off-diagonal matrix element $(\Delta X_{ij})^{2}\sim\ell_{p}^{3}/r$ . Hence $\langle{\Tr(X_{2})^{2}\rangle}\sim N^{2}\ell_{p}^{3}/r$ . But rotational invariance implies $\langle{\Tr(X_{1})^{2}\rangle}=\langle{\Tr(X_{2})^{2}\rangle}\Rightarrow r^{3}\sim N\ell_{p}^{3}$ . This gives the desired estimate. Of course, the ground state wavefunction of the BFSS theory is the wavefunction of a strongly coupled many body system; to replace $\sim$ with precise inequalities, we must use the bootstrap arguments.

While the above argument provides some intuition for why the wavefunction is big, the large size of the bound state is counterintuitive from the point of view of the BFSS scattering conjecture. Here we quote252525The quote has been lightly edited to correct typos and to adopt our notation. from Susskind [77] who puts its colorfully:

[T]he history of the scattering process has two very different but equivalent descriptions. In the usual spacetime supergravity description two small particles come in from infinity and remain essentially non-interacting until they come within a distance of order $\ell_{p}$ . They interact for a short time and then separate into final particles which cease to interact as soon as they are separated by $\ell_{p}$ . In light cone units the interaction lasts for a time $\ell_{p}N/(|\vec{p}|R)$ . The holographic matrix description also begins with asymptotically distant non-interacting objects. In this description the constituents begin to merge and interact when their separation is of order $N^{1/3}\ell_{p}$ . As they approach, the many body wave function begins to more and more resemble the ground state. The system remains in this entangled state for a light cone time of order $\ell_{p}N^{4/3}/(|\vec{p}|R)$ and then separate into non-interacting final clusters. The situation is particularly perplexing if the energy is not very large and the impact parameter is much larger than $\ell_{p}$ . [Then in the] gravity description the particles miss each other and just continue without significant deflection. Exactly how this miracle happens from the SYM description is still a mystery.

A related question is: to what extent can one isolate the low energy degrees of freedom in the matrix model that encodes the $M$ -theory region?

Before moving on, let us note that for any supersymmetric quantum mechanical system $\{Q_{\alpha},Q_{\beta}\}\propto\delta_{\alpha\beta}H$ one can directly bootstrap a SUSY invariant state $Q_{\alpha}\ket{\Omega}=0$ . This is done by writing $\expectationvalue{Q_{\alpha}O}=\expectationvalue{OQ_{\alpha}}=0$ . Since a SUSY invariant state is automatically a ground state of $H$ , we may further assume that it preserves any global symmetries, including the $R$ -symmetry under which $Q$ transforms. At large $N$ , the useful condition becomes:

SUSY ground state bootstrap

For a supersymmetric quantum system with at least one ground state that preserves SUSY, one can instead use the supercharge equations of motion:

$\displaystyle\bra{\Omega}\{Q_{\alpha},O_{\alpha}\}\ket{\Omega}=0.$

(140)

Here $O_{\alpha}$ is any fermionic operator. (Since the ground state preserves the $R$ -symmetry, we get useful equations by forming singlets under the $R$ -symmetry.) Together with the usual positivity matrices, it can be shown [118] that ground state positivity (104) is automatically implied by the supercharge constraints.

6.4.1 Decomposing into SO(9) blocks

SO(9) symmetry implies that the only ground state correlation functions are SO(9) invariants. However, to derive positivity constraints, we must consider operators that transform non-trivial under SO(9) in intermediate steps. For example, to prove that $\expectationvalue{\tr X^{I}X^{I}}\geq 0$ , we observe that it is the norm-squared of the operator $X^{I}$ , which is an SO(9) vector.

More generally, the positivity matrix $\mathcal{M}$ can be written as a direct sum of block-diagonal matrices, with each block associated to some irrep of SO(9). Given that the BFSS matrix model has 9 bosonic matrices and 16 fermionic matrices, we would like to avoid explicitly enumerating the operators. The situation is somewhat similar to how one performs the conformal block decomposition in the conformal bootstrap.

To illustrate the general procedure, let us consider the 4-fermion correlator

[TABLE]

We can view $\mathcal{M}$ as a giant $16^{2}\times 16^{2}$ positivity matrix indexed by $i=(\alpha,\beta),j=(\eta,\epsilon)$ . We would like to avoid explicitly constructing such a large matrix. To this end, note that we can fuse $\alpha,\beta$ into some irrep $R$ . Since the ground state is SO(9) invariant, we must also fuse $\eta,\epsilon$ into the irrep $\bar{R}=R$ to get a singlet. This means:

[TABLE]

We have written down the 5 possible irreps that can be obtained by fusing two spinors. Thus we have already reduced the $16^{2}\times 16^{2}$ matrix to just 5 unknown variables. Now consider cyclicity of the trace (together with the anti-commutation relations for the fermions) $\tr(\psi_{\alpha}\psi_{\beta}\psi_{\eta}\psi_{\epsilon})=-\tr(\psi_{\beta}\psi_{\eta}\psi_{\epsilon}\psi_{\alpha})+\frac{1}{2}\delta_{\alpha\beta}\delta_{\eta\epsilon}+\frac{1}{2}\delta_{\alpha\epsilon}\delta_{\eta\beta}$ . This gives the alternative expression

[TABLE]

To solve this equation, we need the crossing relations for the SO(9) blocks. We have introduced a graphical notation, where each vertex corresponds to a Clebsch-Gordan symbol, and internal lines represent sums over common indices. We want to expand an $s$ -channel block in a basis of $t$ -channel blocks:

[TABLE]

Here $\mathbb{F}_{R_{s},R_{t}}$ is the SO(9) crossing kernel. It is a 6j symbol, see [118] for more details. For the case above, all external operators in the 16 irrep, e.g., the spinor irrep. Concretely, since there are 5 irreps that appear in (143), $\mathbb{F}_{R_{s},R_{t}}$ is just a $5\times 5$ matrix with rational numbers as coefficients, see [118] equation (35) for the matrix. With this crossing kernel, we can solve (143). This reduces the set of 5 variables $\{a_{1},a_{9},\cdots,a_{126}\}$ to just two variables, which for convenience we take to be $\{a_{1},a_{9}\}$ . Furthermore, positivity262626Since some of the $O$ ’s are actually anti-Hermitian, $a_{84}$ and $a_{36}$ are negative-definite. of these 5 variables restricts the possible values of $\{a_{1},a_{9}\}$ to a subset of the $\mathbb{R}^{2}$ :

[TABLE]

In the matrix inequality (153), we used the fact that the identity is also an SO(9) singlet, and that $\expectationvalue{\tr\psi_{\alpha}\psi_{\alpha}}=8$ . Using these constraints, we can maximize/minimize the value of $a_{9}$ to find that

[TABLE]

This value was used in deriving (137). Similarly, we can derive bounds on $a_{1}$ which leads to $1\leq\expectationvalue{\tr OO}\leq 2$ .

The main point is that after using group theory, we do not need to work with the explicit matrix $\mathcal{M}$ with its many possible values of indices272727To be precise, we also considered constraints coming from considering the identity operator and the 2-fermion correlator. More generally, for each irrep $R$ , we will need to consider all operators up to some level that transform in that irrep. For example, if $R$ corresponds to rank-2 anti-symmetric tensors, we will need to consider $\{X^{[I}X^{J]},X^{[I}P^{J]},P^{[I}X^{J]},\psi\gamma^{IJ}\psi\}$ up to level 3, where level is defined by (155).. Group theory boils down all the constraints from this giant matrix to just some simple constraints on 2 unknowns which control the values of all singlet correlators.

6.4.2 Numerics and other related models

To go beyond simple analytic bounds, one can systematically automate the computation [118]. To organize the bootstrap, it is convenient to assign a level

[TABLE]

Then the level of a single trace operator is the sum of the level of each letter, e.g., $\Tr X^{I}X^{J}X^{I}X^{J}$ is a level 4 operator. This assignment has the nice property that anti-commuting an operator with the supercharge (58) (at most) increases its level by 1/2. It also plays nicely with cyclicity, which either preserves the level or generates an operator with level $\ell-3$ .

It was reported that up to level 9 the allowed region has the shape of a peninsula, see Figure 5. An obvious question is whether this peninsula will collapse into an island at higher levels. [104] studied a purely bosonic variant282828Without supersymmetry, the classically flat directions of the potential where $[X_{I},X_{J}]=0$ are lifted quantum mechanically and the resulting theories are gapped at finite $N$ . Therefore, there are no scattering states in these models. It would be interesting to study models with less than 16 SUSYs that would still have flat directions. of BFSS:

[TABLE]

For both $D=2$ and $D=9$ , a bootstrap island was found at level 10. The island then shrinks rapidly at higher levels. A plot for $D=9$ (sometimes called “bosonic BFSS”) have been reproduced here; one should compare the orange peninsula in Figure 6 with the peninsulas shown in Figure 5. Optimistically one will find precise estimates by going to level 10 and beyond in the BFSS case.

7 Conclusion

The BFSS conjecture and, more generally, D0 brane holography is a very rich subject; we have only managed to survey a handful of developments. We have emphasized the 10d and 11d black hole in this review, but the model should also shed light on other non-perturbative objects in string/M-theory such as M2 branes [1], M5 branes [121, 122, 123, 124] and Kaluza-Klein monopoles [33]. It is an important window into M-theory, and perhaps has lessons for flat space holography more generally [125, 126, 127]. In the future, it seems important to understand new techniques for doing strongly coupled computations in this model, especially quantities that are not protected by supersymmetry.

Acknowledgments

I thank Gauri Batra, Shai Chester, Yiming Chen, Victor Ivo, Silviu Pufu, Juan Maldacena, Savdeep Sethi, Stephen Shenker, Douglas Stanford, Lenny Susskind, Gustavo Joaquin Turiaci and Zechuan Zheng for useful discussions. I thank all the TASI participants for their questions and for correcting some typos in an earlier version of these notes. I am supported by a Bloch Fellowship and by NSF Grant PHY-2310429.

Appendix A M-theory reminder

M-theory in asymptotically flat space is an 11d theory of quantum gravity. It has only one (dimensionful) scale, the 11d Planck scale $\ell_{p}$ . At energies much lower than the Planck scale $E\ll 1/\ell_{p}$ the theory is well-approximated by supergravity in 11d. In addition to the metric, there is a 3-form gauge field $A_{\mu\nu\rho}$ . The electric sources of this gauge field are called M2 branes and the magnetic sources are M5 branes. (The 4-form field strength $F$ is dual to a 7-form $\tilde{F}=d\tilde{A}$ which couples to the 6-dimensional worldvolume.)

Let us now recall the relationship between M-theory and Type IIA. Compactifying M-theory on a small spatial circle of size $R$ gives Type IIA with

[TABLE]

This follows from the fact that D0 branes have mass $M=1/(\ell_{s}g_{s})$ in Type IIA. On the other hand, from the 11d viewpoint they are just gravitons with Kaluza-Klein momentum $M=1/R$ . We can derive another relation between the M-theory/Type IIA parameters by considering the relation between an M2 brane and a string. The Type IIA string has a tension that is (by definition) $1/(2\pi\alpha^{\prime})=1/(2\pi\ell_{s}^{2})$ . On the other hand, M-theory has membranes (M2 branes) with tension $1/(2\pi)^{2}\ell_{p}^{3}$ . (Here $\ell_{p}$ is the 11d Planck scale.) To get a string (a 1-brane), we wrap the M2 on the spatial circle of radius $R$ so the resulting string has tension:

[TABLE]

The relations (157) and (158) allow us to convert the two M-theory parameters $R,\ell_{p}$ to the Type IIA parameters $\ell_{s}$ and $g_{s}$ .

For our purposes, the low energy Lagrangian for M-theory is

[TABLE]

The relation between the M-theory 11d metric and the string frame fields in Type IIA is given by

[TABLE]

Plugging in this ansatz, and using the relations (157) and (158) we recover the Type IIA action (1).

Bibliography127

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] T. Banks, W. Fischler, S.H. Shenker and L. Susskind, M theory as a matrix model: A conjecture , Phys. Rev. D 55 (1997) 5112 [ hep-th/9610043 ]. · doi ↗
2[2] J. Maldacena, A simple quantum system that describes a black hole , 2303.11534 .
3[3] Y. Imamura, Born-Infeld action and Chern-Simons term from Kaluza-Klein monopole in M theory , Phys. Lett. B 414 (1997) 242 [ hep-th/9706144 ]. · doi ↗
4[4] S.A. Hartnoll and J. Liu, The Polarised IKKT Matrix Model , 2409.18706 .
5[5] S. Komatsu, A. Martina, J.a. Penedones, A. Vuignier and X. Zhao, Einstein gravity from a matrix integral – Part I , 2410.18173 .
6[6] S. Komatsu, A. Martina, J. Penedones, A. Vuignier and X. Zhao, Einstein gravity from a matrix integral – Part II , 2411.18678 .
7[7] D.E. Berenstein, J.M. Maldacena and H.S. Nastase, Strings in flat space and pp waves from N=4 super Yang-Mills , JHEP 04 (2002) 013 [ hep-th/0202021 ]. · doi ↗
8[8] D. Bigatti and L. Susskind, Review of matrix theory , 1997.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Abstract

Contents

1 Introduction

2 Review of D0-brane holography

2.1 D0 black hole

2.2 Decoupling limit

2.3 Differentiate dictionary

2.4 Relation to AdS

2.5 Black hole thermodynamics

2.6 Monte Carlo

3 The matrix model

3.1 The spectrum at finite NNN

4 Scattering and the BFSS conjecture

4.1 Scattering in the ’t Hooft limit

4.2 Scattering beyond the ’t Hooft limit

5 Other techniques for analyzing the BFSS model

5.1 High temperature/weak ’t Hooft coupling expansion

5.2 SUSY techniques

6 Matrix Bootstrap

6.1 Bootstrapping equations of motion

6.2 Quantum mechanical bootstrap

6.3 Finite temperature

6.4 Application to BFSS

6.4.1 Decomposing into SO(9) blocks

6.4.2 Numerics and other related models

7 Conclusion

Acknowledgments

Appendix A M-theory reminder

3.1 The spectrum at finite $N$