Moments of the Riemann zeta function on short intervals of the critical line

Louis-Pierre Arguin; Frédéric Ouimet; Maksym Radziwiłł

arXiv:1901.04061·math.NT·May 25, 2022

TL;DR

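This paper determines the leading-order size of the moments and of the maximum of the Riemann zeta function over intervals of size $\log^{\theta}T$, $\theta>-1$, on the critical line. The moments undergo a phase transition at a critical exponent, and the exponent takes a different form on mesoscopic ($-1<\theta<0$) and macroscopic ($\theta>0$) intervals, which reflects an approximate tree structure in the correlations of zeta.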

Contribution

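It proves an extended version of a conjecture of Fyodorov and Keating (2014) on the moments over short intervals and generalizes the $\theta=0$ results of Najnudel (2018) and Arguin et al. (2019). The proofs are unconditional, except for the upper bounds when $\theta>3$, which assume the Riemann hypothesis.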

Findings

01

Moments exhibit a phase transition at a critical exponent $\beta_{c}(\theta)$

02

Different behavior of moments between mesoscopic and macroscopic intervals

03

Maximal size of zeta on short intervals quantified as $(\log T)^{m(\theta)+\mathrm{o}(1)}$

Abstract

We show that as $T\to\infty$, for all $t\in[T,2T]$ outside of a set of measure $\mathrm{o}(T)$, $\int_{-(\log T)^{\theta}}^{(\log T)^{\theta}}|\zeta(\tfrac{1}{2}+\mathrm{i}t+\mathrm{i}h)|^{\beta}\,{\rm d}h=(\log T)^{f_{\theta}(\beta)+\mathrm{o}(1)},$ for some explicit exponent $f_{\theta}(\beta)$, where $\theta>-1$ and $\beta>0$. This proves an extended version of a conjecture of Fyodorov and Keating (2014). In particular, it shows that, for all $\theta>-1$, the moments exhibit a phase transition at a critical exponent $\beta_{c}(\theta)$, below which $f_{\theta}(\beta)$ is quadratic and above which $f_{\theta}(\beta)$ is linear. The form of the exponent $f_{\theta}$ also differs between mesoscopic intervals ($-1<\theta<0$) and macroscopic intervals ($\theta>0$), a phenomenon that stems from an approximate tree structure for the correlations of zeta. We also prove that,…

Equations

\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}t+\mathrm{i}h)|^{\beta}\,{\rm d}h=(\log T)^{f_{\theta}(\beta)+\mathrm{o}(1)},

\max_{|h|\leq\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}t+\mathrm{i}h)|=(\log T)^{m(\theta)+\mathrm{o}(1)},

|\zeta(\tfrac{1}{2}+\mathrm{i}t)|=\mathcal{O}\left(\exp\bigg(\Big(\frac{\log 2}{2}+\mathrm{o}(1)\Big)\frac{\log t}{\log\log t}\bigg)\right),\quad\text{as }t\to\infty,

\max_{t\in[0,T]}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|\geq\exp\bigg((\sqrt{2}+\mathrm{o}(1))\sqrt{\frac{\log T\log\log\log T}{\log\log T}}\bigg),\quad\text{as }T\to\infty,

\max_{t\in[0,T]}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|=\exp\left(\Big(\frac{1}{\sqrt{2}}+\mathrm{o}(1)\Big)\sqrt{\log T\cdot\log\log T}\right),\quad\text{as }T\to\infty.

\frac{1}{T}\int_{T}^{2T}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|^{\beta}\,{\rm d}t,\quad\beta>0.

\frac{1}{T}\int_{T}^{2T}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|^{\beta}\,{\rm d}t\sim C_{\beta}(\log T)^{\beta^{2}/4},\quad\text{as }T\to\infty,

\max_{h\in[-1,1]}\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|-\big(\log\log T-\frac{3}{4}\log\log\log T\big)\in[-C,C].

\int_{[-1,1]}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h=\begin{cases}(\log T)^{\beta^{2}/4+\mathrm{o}(1)},&\text{if }\beta\leq 2,\\ (\log T)^{\beta-1+\mathrm{o}(1)},&\text{if }\beta>2,\end{cases}

\theta\leq 0

\theta>0

\mathbb{P}\Big(\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h<(\log T)^{f_{\theta}(\beta)-\varepsilon}\Big)=\mathrm{o}(1).

\mathbb{P}\Big(\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h>(\log T)^{f_{\theta}(\beta)+\varepsilon}\Big)=\mathrm{o}(1).

\mathbb{P}\Big(\max_{|h|\leq\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|<(\log T)^{m(\theta)-\varepsilon}\Big)=\mathrm{o}(1).

\mathbb{P}\Big(\max_{|h|\leq\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|>(\log T)^{m(\theta)+\varepsilon}\Big)=\mathrm{o}(1).

\mathbb{P}\Bigg(\frac{\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau)|}{\sqrt{\frac{1}{2}\log\log T}}\in(a,b)\Bigg)\xrightarrow{T\to\infty}\int_{a}^{b}\frac{e^{-u^{2}/2}}{\sqrt{2\pi}}\,{\rm d}u.

\max_{|h|\leq\log^{\theta}T}\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|-\big(m(\theta)\log\log T-r(\theta)\log\log\log T\big)\in[-C,C],

r(\theta)=\frac{3}{4}

\max_{|h|\leq\log^{\theta}T}\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|=m(\theta)\log\log T-\frac{3}{4}\log\log\log T+\sqrt{\frac{|\theta|}{2}\log\log T}\cdot\mathcal{Z}+\mathcal{O}_{\mathbb{P}}(1),

\mathbb{E}\bigg[\Big(\frac{1}{2\pi}\int_{0}^{2\pi}|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|^{2\beta}\,{\rm d}h\Big)^{k}\bigg],\quad k>0,~\beta>0.

\frac{|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|^{2\beta}}{\mathbb{E}\big[|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|^{2\beta}\big]}\,\frac{{\rm d}h}{2\pi},

\mathbb{P}(A_{T})=\frac{1}{T}\,\mathrm{Leb}(A_{T})\quad\text{and}\quad\mathbb{E}[X_{T}]=\frac{1}{T}\int_{T}^{2T}X_{T}(t)\,{\rm d}t.

\mathbb{E}\Big[|\zeta(\tfrac{1}{2}+\mathrm{i}\tau)|^{\beta}\Big]\ll(\log T)^{\beta^{2}/4+\varepsilon},

\max_{|h|\leq\log^{\theta}T}|D(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\ll\sum_{|k|\leq\log^{1+\theta}T}\big|D\big(\tfrac{1}{2}+\mathrm{i}\tau+\tfrac{2\pi\mathrm{i}k}{\log T}\big)\big|^{\beta}.

(\zeta\cdot e^{-\mathcal{P}_{|\theta|}})(\tfrac{1}{2}+\mathrm{i}\tau),\quad\text{where }\mathcal{P}_{\alpha}(s)=\sum_{\log p\leq\log^{\alpha}T}\frac{1}{p^{s}}\text{ for }\alpha>0,

\max_{|h|\leq\log^{\theta}T}\big|\mathcal{P}_{|\theta|}(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)\big|=\mathrm{o}(\log\log T).

\mathbb{E}\Big[\big|(\zeta\cdot e^{-\mathcal{P}_{|\theta|}})(\tfrac{1}{2}+\mathrm{i}\tau)\big|^{\beta}\Big]\ll(\log T)^{(\beta^{2}/4)\cdot(1+\theta)+\varepsilon},

\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\sigma+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h\ll\int_{-3\log^{\theta}T}^{3\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h+\frac{1}{(\log T)^{96}}.

\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\sigma_{0}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h,\quad\text{with }\sigma_{0}=\frac{1}{2}+\frac{1}{(\log T)^{1-\delta}},

\int_{-\log^{\theta}T}^{\log^{\theta}T}\exp\big(\beta\,\mathrm{Re}\,\mathcal{P}_{1-\delta}(\sigma_{0}+\mathrm{i}\tau+\mathrm{i}h)\big)\,{\rm d}h.

Full text

Louis-Pierre Arguin, Frédéric Ouimet, Maksym Radziwiłł

Baruch College and Graduate Center (CUNY), California Institute of Technology

Abstract

We show that as $T\to\infty$ , for all $t\in[T,2T]$ outside of a set of measure $\mathrm{o}(T)$ ,

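$$\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}t+\mathrm{i}h)|^{\beta}\,{\rm d}h=(\log T)^{f_{\theta}(\beta)+\mathrm{o}(1)},$$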

for some explicit exponent $f_{\theta}(\beta)$ , where $\theta>-1$ and $\beta>0$ . This proves an extended version of a conjecture of Fyodorov and Keating (2014). In particular, it shows that, for all $\theta>-1$ , the moments exhibit a phase transition at a critical exponent $\beta_{c}(\theta)$ , below which $f_{\theta}(\beta)$ is quadratic and above which $f_{\theta}(\beta)$ is linear. The form of the exponent $f_{\theta}$ also differs between mesoscopic intervals ( $-1<\theta<0$ ) and macroscopic intervals ( $\theta>0$ ), a phenomenon that stems from an approximate tree structure for the correlations of zeta. We also prove that, for all $t\in[T,2T]$ outside a set of measure $\mathrm{o}(T)$ ,

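$$\max_{|h|\leq\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}t+\mathrm{i}h)|=(\log T)^{m(\theta)+\mathrm{o}(1)},$$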

for some explicit $m(\theta)$ . This generalizes earlier results of Najnudel (2018) and Arguin et al. (2019) for $\theta=0$ . The proofs are unconditional, except for the upper bounds when $\theta>3$ , where the Riemann hypothesis is assumed.

MSC2020 subject classifications: 11M06, 60F10, 60G60, 60G70.

Keywords: extreme value theory, Riemann zeta function, moments.

L.-P. A. is supported in part by NSF Grant DMS-1513441 and by NSF CAREER DMS-1653602. F. O. is supported by a postdoctoral fellowship from the NSERC (PDF) and the FRQNT (B3X). M. R. acknowledges support of a Sloan fellowship and NSF grant DMS-1902063.

1 Introduction
1.1 Maxima and moments over large intervals
1.2 Maxima and moments over short intervals
1.3 Relations to other models
1.4 Outline of the proof
2 Upper bounds
2.1 Moment estimates
2.2 Discretization
2.3 Proofs of the upper bounds
2.3.1 The case $\theta\geq 0$
2.3.2 The case $\theta<0$
3 Lower bounds
3.1 Reduction off-axis
3.2 Mollification
3.3 Approximation of the mollifier
3.4 Proofs of the lower bounds
A Useful estimates

1 Introduction

1.1 Maxima and moments over large intervals

Understanding the growth of the Riemann zeta function $\zeta(s)$ on the critical line $\mathrm{Re}\,s=\tfrac{1}{2}$ is a central problem in number theory due, among other things, to its relationship with the distribution of the zeros of $\zeta(s)$ (see e.g. Theorem 9.3 in Titchmarsh (1986)) and with the more general subconvexity problem (see e.g. Michel and Venkatesh (2010); Venkatesh (2010), and Iwaniec and Sarnak (2000) for a general discussion).

The Lindelöf hypothesis predicts that, for any $\varepsilon>0$ and all $t\in\mathbb{R}$ , we have $|\zeta(\tfrac{1}{2}+\mathrm{i}t)|=\mathcal{O}((1+|t|)^{\varepsilon})$ , whereas it follows from the Riemann hypothesis that

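$$|\zeta(\tfrac{1}{2}+\mathrm{i}t)|=\mathcal{O}\left(\exp\bigg(\Big(\frac{\log 2}{2}+\mathrm{o}(1)\Big)\frac{\log t}{\log\log t}\bigg)\right),\quad\text{as }t\to\infty,$$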

see Chandee and Soundararajan (2011).

Unfortunately, there is a large gap between these conditional results and the best unconditional upper bounds, such as Bourgain (2017), which shows that $|\zeta(\tfrac{1}{2}+\mathrm{i}t)|=\mathcal{O}((1+|t|)^{13/84+\varepsilon})$ for any given $\varepsilon>0$ and all $t\in\mathbb{R}$ . Currently, the best unconditional lower bound,

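$$\max_{t\in[0,T]}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|\geq\exp\bigg((\sqrt{2}+\mathrm{o}(1))\sqrt{\frac{\log T\log\log\log T}{\log\log T}}\bigg),\quad\text{as }T\to\infty,$$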

is established in de la Bretèche and Tenenbaum (2019) building on a method from Bondarenko and Seip (2017).

The true order of the maximum of $|\zeta(\tfrac{1}{2}+\mathrm{i}t)|$ remains elusive to this day. A conjecture that we find plausible is stated in Farmer, Gonek and Hughes (2007), where it is conjectured based on probabilistic models that

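$$\max_{t\in[0,T]}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|=\exp\left(\Big(\frac{1}{\sqrt{2}}+\mathrm{o}(1)\Big)\sqrt{\log T\cdot\log\log T}\right),\quad\text{as }T\to\infty.$$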

Another set of central objects in the theory of the Riemann zeta function are the moments

$$\frac{1}{T}\int_{T}^{2T}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|^{\beta}\,{\rm d}t,\quad\beta>0.$$

Their importance comes from their relationship to the size and zero-distribution of $\zeta(s)$. However, unlike the problem of understanding the size of the global maximum of $|\zeta(\tfrac{1}{2}+\mathrm{i}t)|$, we are in possession of widely believed conjectures regarding the behavior of moments. Following the work of Keating and Snaith (2000), it is expected that, for all $\beta>0$,

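$$\frac{1}{T}\int_{T}^{2T}|\zeta(\tfrac{1}{2}+\mathrm{i}t)|^{\beta}\,{\rm d}t\sim C_{\beta}(\log T)^{\beta^{2}/4},\quad\text{as }T\to\infty,\tag{1.5}$$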

and that the constant $C_{\beta}>0$ factors into a product of two constants: one is computed from the moments of the characteristic polynomial of random unitary matrices, and the other is an arithmetic factor coming from the small primes.

There are a few results supporting (1.5). First, the conjecture (1.5) is known for $\beta=2$ and $\beta=4$ following the classical work of Hardy-Littlewood and Ingham. Upper bounds of the correct order of magnitude are established in Heap, Radziwiłł and Soundararajan (2019) for $0<\beta\leq 4$ . Meanwhile, lower bounds of the correct order of magnitude have been established for all $\beta\geq 2$ in Radziwiłł and Soundararajan (2013). Conditionally on the Riemann hypothesis, the correct order of magnitude of (1.5) is known for all $\beta>0$ (see Soundararajan (2009); Harper (2013a) for the upper bounds and Heath-Brown (1981) for the lower bounds).

1.2 Maxima and moments over short intervals

Motivated by the problem of understanding the global maximum, Fyodorov, Hiary and Keating (2012); Fyodorov and Keating (2014) initiated the question of understanding the true size of the local maximum of $\zeta(\tfrac{1}{2}+\mathrm{i}t)$ by establishing a connection with log-correlated processes. If $\tau$ is sampled uniformly on $[T,2T]$ , they conjectured that for any $0<\delta<1$ , there exists $C=C(\delta)>0$ large enough and independent of $T$ , such that with probability $1-\delta$ ,

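$$\max_{h\in[-1,1]}\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|-\big(\log\log T-\frac{3}{4}\log\log\log T\big)\in[-C,C].\tag{1.6}$$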

They also conjectured weak convergence, with a limiting tail of the form $Cye^{-2y}$ . The leading order $\log\log T$ was proved in Najnudel (2018) (conditionally on the Riemann hypothesis for the lower bound) and in Arguin et al. (2019) unconditionally. The sharp upper bound was recently established in Arguin, Bourgade and Radziwiłł (2020).

It is also conjectured in Fyodorov, Hiary and Keating (2012); Fyodorov and Keating (2014) (see Equations (14) and (2.30), respectively) that the moments in a short interval undergo a freezing phase transition, that is, the event

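$$\int_{[-1,1]}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h=\begin{cases}(\log T)^{\beta^{2}/4+\mathrm{o}(1)},&\text{if }\beta\leq 2,\\ (\log T)^{\beta-1+\mathrm{o}(1)},&\text{if }\beta>2,\end{cases}\tag{1.7}$$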

has probability $1-\mathrm{o}(1)$ as $T\to\infty$ . Fyodorov and Keating (2014) also state corresponding conjectures for mesoscopic intervals of length $\log^{\theta}T$ when $\theta\in(-1,0)$ , as well as finer asymptotics for the moments.

In view of Equations (1.5) and (1.7), an obvious question is to determine up to which interval size the freezing phase transition persists. In this paper, we establish that freezing transitions occur exactly for interval sizes of order $\log^{\theta}T$ with $\theta>-1$ . We also obtain the corresponding results for local maxima over such intervals. The following functions will be crucial to our analysis:

[TABLE]
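The display defining these functions is not reproduced in this text. The following is a reconstruction offered as an assumption, not a quotation. It is chosen to be consistent with the rest of the section: it reduces to (1.7) at $\theta=0$, it matches the exponent $(\beta^{2}/4)(1+\theta)$ and the maximum $(1+\theta)\log\log T$ quoted for $\theta<0$ in Section 1.4, it agrees with the REM heuristic for $\theta>0$, and it gives $\beta_{c}(\theta)\to\infty$ as $\theta\to\infty$.

```latex
\beta_{c}(\theta) :=
\begin{cases}
2, & \text{if } \theta\leq 0,\\
2\sqrt{1+\theta}, & \text{if } \theta>0,
\end{cases}
\qquad
m(\theta) :=
\begin{cases}
1+\theta, & \text{if } \theta\leq 0,\\
\sqrt{1+\theta}, & \text{if } \theta>0,
\end{cases}

f_{\theta}(\beta) :=
\begin{cases}
\frac{\beta^{2}}{4}(1+\theta)+\theta, & \text{if } \theta\leq 0,\ \beta\leq\beta_{c}(\theta),\\
\beta(1+\theta)-1, & \text{if } \theta\leq 0,\ \beta>\beta_{c}(\theta),\\
\frac{\beta^{2}}{4}+\theta, & \text{if } \theta>0,\ \beta\leq\beta_{c}(\theta),\\
\beta\sqrt{1+\theta}-1, & \text{if } \theta>0,\ \beta>\beta_{c}(\theta).
\end{cases}
```

With these formulas $f_{\theta}$ is continuous in $\beta$: at $\beta=\beta_{c}(\theta)$ both branches equal $1+2\theta$, and above the threshold $f_{\theta}(\beta)=\beta\,m(\theta)-1$.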

Theorem 1.1 (Moments).

Let $\theta>-1$ , $\beta>0$ and $\varepsilon>0$ be given. Let $\tau$ be a random variable uniformly distributed on $[T,2T]$ . Then, as $T\to\infty$ , we have

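$$\mathbb{P}\Big(\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h<(\log T)^{f_{\theta}(\beta)-\varepsilon}\Big)=\mathrm{o}(1).$$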

Moreover, if $\theta\leq 3$ or if the Riemann hypothesis holds, then as $T\rightarrow\infty$ ,

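$$\mathbb{P}\Big(\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h>(\log T)^{f_{\theta}(\beta)+\varepsilon}\Big)=\mathrm{o}(1).\tag{1.10}$$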

Proof.

For the upper bound, see Section 2.3, and for the lower bound, see Proposition 3.2. ∎

When $\beta>\beta_{c}(\theta)$ , the moments exhibit freezing, i.e. they are dominated by a few large values at the level of the local maximum of $|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|$ , $|h|\leq\log^{\theta}T$ . Theorem 1.1 also suggests that freezing does not occur for intervals larger than any fixed power of $\log T$ , since $\beta_{c}(\theta)\to\infty$ as $\theta\to\infty$ . We note that recently a sharp upper bound in the case $(\theta=0,\beta=2)$ has been established in Harper (2019), thus refining the $(\log T)^{\varepsilon}$ factor appearing in (1.10) when $\theta=0$ and $\beta=2$ .

Theorem 1.2 (Local maximum).

Let $\theta>-1$ and $\varepsilon>0$ be given. Let $\tau$ be a random variable uniformly distributed on $[T,2T]$ . Then, as $T\to\infty$ , we have

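$$\mathbb{P}\Big(\max_{|h|\leq\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|<(\log T)^{m(\theta)-\varepsilon}\Big)=\mathrm{o}(1).$$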

Moreover, if $\theta\leq 3$ or if the Riemann hypothesis holds, then as $T\rightarrow\infty$ ,

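$$\mathbb{P}\Big(\max_{|h|\leq\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|>(\log T)^{m(\theta)+\varepsilon}\Big)=\mathrm{o}(1).$$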

Proof.

For the upper bound, see Section 2.3, and for the lower bound, see Proposition 3.1. ∎

It is instructive to put these results in the context of two well-known facts on $\zeta$ . First, Selberg’s central limit theorem, see for example Selberg (1946, 1992) or the simple proof in Radziwiłł and Soundararajan (2017), states that, for any given $a<b$ ,

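$$\mathbb{P}\Bigg(\frac{\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau)|}{\sqrt{\frac{1}{2}\log\log T}}\in(a,b)\Bigg)\xrightarrow{T\to\infty}\int_{a}^{b}\frac{e^{-u^{2}/2}}{\sqrt{2\pi}}\,{\rm d}u.$$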

In other words, a typical value of $\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau)|$ is a Gaussian random variable of variance $\frac{1}{2}\log\log T$ . This is consistent with the moment conjecture (1.5) which gives a precise expression for the Laplace transform of $\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau)|$ . Second, since $\zeta(\tfrac{1}{2}+\mathrm{i}t)$ varies on the scale of $(\log T)^{-1}$ for $T\leq t\leq 2T$ , the analysis of large values should be reducible to a discrete set of $(\log T)^{1+\theta}$ points. Putting these two facts together, one expects that the statistics of extreme values of $\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|$ , $|h|\leq\log^{\theta}T$ , should be similar to the ones of $(\log T)^{1+\theta}$ Gaussian random variables of variance $\frac{1}{2}\log\log T$ . If the random variables were independent, this is the so-called Random Energy Model (REM) in statistical mechanics introduced in Derrida (1981). For $\theta\geq 0$ , it is not hard to check, using basic Gaussian tail estimates, that the expression (1.8) corresponds to the free energy of the model, and the results of Theorem 1.2, to the maximum of the REM. For more on this, we refer to Kistler (2015), where many techniques from REM were introduced to analyze log-correlated processes.
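The REM computation referred to here can be illustrated numerically. The sketch below is only an illustration of the heuristic for $\theta\geq 0$, under the assumption that the $(\log T)^{1+\theta}$ grid values are independent Gaussians of variance $\tfrac{1}{2}\log\log T$, so that about $(\log T)^{1+\theta-\alpha^{2}}$ of them reach height $\alpha\log\log T$ when $\alpha\leq\sqrt{1+\theta}$. The names `rem_moment_exponent` and `rem_closed_form` are hypothetical, and the closed form is the outcome of this Gaussian tail computation rather than a quotation of (1.8).

```python
import math


def rem_moment_exponent(theta, beta, steps=20000):
    """Grid-search the REM prediction for the exponent of the moment.

    Heights are alpha * loglog T with 0 <= alpha <= sqrt(1 + theta).
    About (log T)^(1 + theta - alpha^2) grid points reach that height,
    each contributes (log T)^(beta * alpha), and each grid cell has
    width 1 / log T, hence the final "- 1".
    """
    alpha_max = math.sqrt(1.0 + theta)
    best = -math.inf
    for i in range(steps + 1):
        alpha = alpha_max * i / steps
        best = max(best, (1.0 + theta - alpha**2) + beta * alpha - 1.0)
    return best


def rem_closed_form(theta, beta):
    """Closed form of the same maximization (quadratic, then linear)."""
    beta_c = 2.0 * math.sqrt(1.0 + theta)  # where the maximizer hits alpha_max
    if beta <= beta_c:
        return beta**2 / 4.0 + theta
    return beta * math.sqrt(1.0 + theta) - 1.0


for theta, beta in [(0.0, 1.0), (0.0, 3.0), (1.0, 2.0), (1.0, 4.0)]:
    print(theta, beta, rem_moment_exponent(theta, beta), rem_closed_form(theta, beta))
```

For $\theta=0$ the two branches reduce to $\beta^{2}/4$ and $\beta-1$, in agreement with (1.7), and the largest admissible height $\sqrt{1+\theta}\,\log\log T$ is the REM prediction for the maximum.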

The REM heuristic is of course limited as the values of $\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|$, $|h|\leq\log^{\theta}T$, are correlated. In fact, they are log-correlated if $|h-h^{\prime}|\leq 1$, as first noticed in Bourgade (2010). A good probabilistic model for the extreme values in the case $\theta=0$ is therefore a branching random walk. This is explained in more detail in Section 1.4 and illustrated in Figure 1. For $\theta>0$, our results show that the correlations do not affect large values at leading order (though the proofs must take them into account). As argued in Section 1.4, we believe that the correct probabilistic model for large values in this case is $\log^{\theta}T$ independent branching random walks. One implication is that the REM heuristic should persist to subleading order (but fail at the level of fluctuations). In view of this, we believe that conjecture (1.6) needs to be expanded as follows to include large intervals:

Conjecture 1.3.

Let $\theta\geq 0$ be given and let $m(\theta)$ be as in (1.8). Let $\tau$ be a random variable uniformly distributed on $[T,2T]$ . For any $0<\delta<1$ , there exists $C=C(\delta)>0$ large enough and independent of $T$ , such that with probability $1-\delta$ ,

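$$\max_{|h|\leq\log^{\theta}T}\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|-\big(m(\theta)\log\log T-r(\theta)\log\log\log T\big)\in[-C,C],$$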

where

$$r(\theta)=\begin{cases}\frac{3}{4},&\text{if }\theta=0,\\ \frac{1}{4},&\text{if }\theta>0.\end{cases}$$

In particular, we expect a discontinuity of $r(\theta)$ as $\theta\downarrow 0$. An analysis of a model of the Riemann zeta function shows that the discontinuity can be resolved by approaching $0$ at a suitable rate. Namely, if $\theta\sim(\log\log T)^{-\alpha}$, it is expected that $r(\theta)=\frac{1+2\alpha}{4}$, interpolating between $1/4$ and $3/4$ for $0<\alpha<1$, see Arguin, Dubach and Hartung (2021). Such hybrid statistics have been studied in the context of branching random walks, see Kistler and Schmidt (2015) and Bovier and Hartung (2020).

For $\theta<0$ , our analysis suggests that the correct model consists of a single random walk up to time $|\theta|\log\log T$ followed by a branching random walk. The maximum on such intervals would then be consistent with the level proposed in Section 2 (c)(ii) of Fyodorov and Keating (2014),

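$$\max_{|h|\leq\log^{\theta}T}\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|=m(\theta)\log\log T-\frac{3}{4}\log\log\log T+\sqrt{\frac{|\theta|}{2}\log\log T}\cdot\mathcal{Z}+\mathcal{O}_{\mathbb{P}}(1),$$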

where $\mathcal{Z}$ is a standard Gaussian random variable. As explained in Section 1.4, the additional fluctuation would represent the contribution of the Dirichlet polynomial $\sum_{\log p\leq\log^{|\theta|}T}\mathrm{Re}\,p^{-1/2-\mathrm{i}(\tau+h)}$, which is essentially the same random variable for all $h$'s in the interval $|h|\leq\log^{\theta}T$.

1.3 Relations to other models

When $-1<\theta\leq 0$ , Conjecture 1.3 is based on modelling $\zeta$ by the characteristic polynomial of a random unitary matrix (CUE). More precisely, if $M_{N}$ is a random matrix sampled from the Haar measure on the unitary group $\mathcal{U}(N)$ , one can consider the moments

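$$\mathbb{E}\bigg[\Big(\frac{1}{2\pi}\int_{0}^{2\pi}|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|^{2\beta}\,{\rm d}h\Big)^{k}\bigg],\quad k>0,~\beta>0.$$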

These can be computed in the limit $N\to\infty$ , at least heuristically, using Selberg integrals and the Fisher-Hartwig formula, cf. Fyodorov and Keating (2014). Exact expressions were recently obtained in Bailey and Keating (2019) in the regime $k,\beta\in\mathbb{N}$ . The statistics of $\log\int_{0}^{2\pi}|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|^{2\beta}{\rm d}h$ and of $\max_{h\in[0,2\pi]}|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|$ in the limit $N\to\infty$ can be inferred from the asymptotics of the moments by comparison with log-correlated processes, cf. Fyodorov, Gnutzmann and Keating (2018) for a numerical study. In the CUE setting, the freezing analogue of (1.7) and the leading order as in (1.6) were proved in Arguin, Belius and Bourgade (2017). The subleading order of the maximum was proved in Paquette and Zeitouni (2018), and up to constant $C$ in Chhaibi, Madaule and Najnudel (2018).

From the analysis of a particular variant of the log-correlated REM model, Fyodorov and Bouchaud (2008) conjectured an exact formula for the density of the total mass of the sub-critical Gaussian multiplicative chaos (GMC) measure associated to the Gaussian free field (GFF) on the unit circle, cf. Rhodes and Vargas (2014). In the critical case, they conjectured that the fluctuations of the maximum can be captured by a sum of two Gumbel variables. Both results were proved in Remy (2020). Naturally, these results are expected to hold in the CUE setting, where the GMC measure is the limit of

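$$\frac{|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|^{2\beta}}{\mathbb{E}\big[|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|^{2\beta}\big]}\,\frac{{\rm d}h}{2\pi},$$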

as proved by Webb (2015) when $-1/4<\beta<1/\sqrt{2}$ , and by Nikula, Saksman and Webb (2020) when $1/\sqrt{2}\leq\beta<1$ . Such a random measure can also be considered in the context of the Riemann zeta function for mesoscopic intervals of length $\log^{\theta}T$ , $-1<\theta\leq 0$ , with $|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|$ in place of $|\det(\mathbb{I}-e^{-\mathrm{i}h}M_{N})|$ . (There does not seem to be any obvious equivalent for macroscopic intervals, $\theta>0$ , in the CUE model.) A step in this direction was made in Saksman and Webb (2020) where $\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)$ , $h\in\mathbb{R}$ , was shown to converge, as $T\to\infty$ , when considered as a random variable on the space of tempered distributions.

Another model for the large values of $\log|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|$, $h\in[-1,1]$, is to consider a random Dirichlet polynomial $X_{h}=\mathrm{Re}\sum_{p\leq T}p^{-1/2-\mathrm{i}h}U_{p}$, where $(U_{p},\,p\text{ primes})$ are i.i.d. uniform random variables on the unit circle, cf. Harper (2013b); Arguin, Belius and Harper (2017); Arguin and Ouimet (2019). The analogue of conjecture (1.6) for this model was proved up to second-order corrections in Arguin, Belius and Harper (2017), and large deviations and continuity estimates for the derivative were found in Arguin and Ouimet (2019). The limit of the corresponding multiplicative chaos measure was obtained in Saksman and Webb (2016, 2020). A proof of the freezing phase transition was given in Arguin and Tai (2019). In the latter, the limit of the Gibbs measure $\exp(\beta X_{h}){\rm d}h$ is also studied in the supercritical regime $\beta>2$, showing that it is supported on $h$'s that are at a relative distance of order one or order $(\log T)^{-1}$ of each other. This result was used in Ouimet (2018) to prove that the normalized Gibbs weights converge to a Poisson-Dirichlet distribution.
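The random Dirichlet polynomial $X_{h}$ is easy to simulate, which may help in building intuition for the model. The following is a minimal sketch, not code from the cited papers: the helper names `primes_up_to` and `random_dirichlet_polynomial`, the cutoff $T=10^{4}$, the seed and the grid of 401 points are all arbitrary illustrative choices. It uses the identity $\mathrm{Re}\,(p^{-1/2-\mathrm{i}h}e^{\mathrm{i}\varphi_{p}})=p^{-1/2}\cos(\varphi_{p}-h\log p)$ with $U_{p}=e^{\mathrm{i}\varphi_{p}}$.

```python
import math
import random


def primes_up_to(n):
    """Sieve of Eratosthenes: list of all primes p <= n (n >= 1)."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, math.isqrt(n) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(range(p * p, n + 1, p)))
    return [i for i, flag in enumerate(sieve) if flag]


def random_dirichlet_polynomial(h, primes, phases):
    """X_h = Re sum_p p^(-1/2 - i h) U_p, with U_p = exp(i * phase_p)."""
    return sum(
        math.cos(phase - h * math.log(p)) / math.sqrt(p)
        for p, phase in zip(primes, phases)
    )


rng = random.Random(0)  # fixed seed so that the run is reproducible
primes = primes_up_to(10_000)  # illustrative cutoff T = 10^4
phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in primes]

# Evaluate X_h on a grid of [-1, 1] finer than the natural scale 1 / log T.
grid = [-1.0 + 2.0 * k / 400 for k in range(401)]
values = [random_dirichlet_polynomial(h, primes, phases) for h in grid]
print("max of X_h on the grid:", max(values))
```

Each $X_{h}$ has variance $\tfrac{1}{2}\sum_{p\leq T}p^{-1}$, which is about $\tfrac{1}{2}\log\log T$, so the model reproduces the variance in Selberg's central limit theorem.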

Notation.

For the rest of the paper, $\tau$ denotes a uniform random variable on $[T,2T]$ . For any event $A_{T}\subseteq[T,2T]$ and a random variable $X_{T}:[T,2T]\rightarrow\mathbb{C}$ , we write

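$$\mathbb{P}(A_{T})=\frac{1}{T}\,\mathrm{Leb}(A_{T})\quad\text{and}\quad\mathbb{E}[X_{T}]=\frac{1}{T}\int_{T}^{2T}X_{T}(t)\,{\rm d}t.$$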

We also use the standard $\mathrm{o}$ and $\mathcal{O}$ notations: thus, $f(T)=\mathrm{o}(g(T))$ if $|f(T)/g(T)|$ tends to $0$ as $T\to\infty$ when the parameters $\theta$, $\beta$ and $\varepsilon$ are fixed. Similarly, we write $f(T)=\mathcal{O}(g(T))$ if $\limsup|f(T)/g(T)|$ is bounded for $\theta$, $\beta$ and $\varepsilon$ fixed. We sometimes write for conciseness $f(T)\ll g(T)$ if $f(T)=\mathcal{O}(g(T))$, and also $f(T)\asymp g(T)$ if both $f(T)\ll g(T)$ and $g(T)\ll f(T)$ hold. In some statements, we write $f(T)\ll_{A}g(T)$ or $f(T)=\mathcal{O}_{A}(g(T))$ to highlight the dependence on a specific parameter $A$ in the implicit constant. In some of the proofs, we use the common convention that $\varepsilon$ denotes an arbitrarily small positive quantity that may vary from line to line. We will also encounter some arithmetical functions familiar in number theory. These include: $\omega(n)$ (which counts the number of distinct primes dividing $n$), $\Omega(n)$ (which counts with multiplicity the number of primes dividing $n$), and the Möbius function $\mu(n)$ (which equals $0$ if $n$ is divisible by the square of a prime, and equals $(-1)^{\omega(n)}$ if $n$ is square-free). Throughout the paper, $x\vee y$ and $x\wedge y$ refer to $\max\{x,y\}$ and $\min\{x,y\}$, respectively.
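For concreteness, the three arithmetical functions can be computed by trial division. This is a small illustrative sketch; the function names `factorize`, `omega`, `big_omega` and `mobius` are chosen here and do not come from the paper.

```python
def factorize(n):
    """Prime factorization of an integer n >= 1 as {prime: exponent}."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    factors = {}
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:  # whatever is left is a prime factor larger than sqrt(n)
        factors[n] = factors.get(n, 0) + 1
    return factors


def omega(n):
    """Number of distinct primes dividing n."""
    return len(factorize(n))


def big_omega(n):
    """Number of primes dividing n, counted with multiplicity."""
    return sum(factorize(n).values())


def mobius(n):
    """0 if a square of a prime divides n, otherwise (-1)^omega(n)."""
    factors = factorize(n)
    if any(exponent > 1 for exponent in factors.values()):
        return 0
    return (-1) ** len(factors)


for n in (1, 12, 30):
    print(n, omega(n), big_omega(n), mobius(n))
```

For example, $12=2^{2}\cdot 3$ has $\omega(12)=2$, $\Omega(12)=3$ and $\mu(12)=0$, while $30=2\cdot 3\cdot 5$ is square-free with $\mu(30)=(-1)^{3}=-1$.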

1.4 Outline of the proof

For $\theta>0$ , the upper bound part of Theorem 1.1 and Theorem 1.2 follows from the moment estimates

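$$\mathbb{E}\Big[|\zeta(\tfrac{1}{2}+\mathrm{i}\tau)|^{\beta}\Big]\ll(\log T)^{\beta^{2}/4+\varepsilon},\tag{1.18}$$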

and from a discretization result which roughly shows that for a Dirichlet polynomial $D$ that approximates zeta, and for $\beta\geq 1$ , we have

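$$\max_{|h|\leq\log^{\theta}T}|D(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\ll\sum_{|k|\leq\log^{1+\theta}T}\big|D\big(\tfrac{1}{2}+\mathrm{i}\tau+\tfrac{2\pi\mathrm{i}k}{\log T}\big)\big|^{\beta}.\tag{1.19}$$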

Equation (1.19) tells us that the process $(\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h),\,|h|\leq\log^{\theta}T)$ varies on a $(\log T)^{-1}$ scale, so that the maximum and moments of $\log|\zeta|$ on an interval of length $\mathcal{O}(\log^{\theta}T)$ behave as those of $\mathcal{O}(\log^{1+\theta}T)$ i.i.d. Gaussian random variables of variance $\tfrac{1}{2}\log\log T$ (see Footnote 1). The limitation to $\theta\leq 3$ comes from the fact that the upper bounds (1.18) are not known unconditionally for $\beta>4$.

Footnote 1. As in the branching random walk setting, the log-correlations are important in the proof of the first-order asymptotics of the maximum, high points and moments, but they do not influence the results. When comparing Gaussian fields, Slepian's lemma tells us that, at equal variance, the field with no correlations will have, on average, the highest maximum and the highest number of points above any fixed proportion of the maximum (the asymptotics of the moments are derived directly from these two quantities). Therefore, the asymptotics of the maximum and moments for i.i.d. Gaussians are always an upper bound for those of log-correlated Gaussian fields. It turns out that we get a matching lower bound by a coarse-graining of the scales following Kistler (2015). This is why our heuristic here is phrased in terms of i.i.d. Gaussians: the correlations ultimately only matter for the proof, not the actual results.

When $\theta<0$ , the upper bounds in Theorem 1.1 and Theorem 1.2 are a bit more delicate. We follow essentially the same strategy, but we apply it to the function

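$$(\zeta\cdot e^{-\mathcal{P}_{|\theta|}})(\tfrac{1}{2}+\mathrm{i}\tau),\quad\text{where }\mathcal{P}_{\alpha}(s)=\sum_{\log p\leq\log^{\alpha}T}\frac{1}{p^{s}}\text{ for }\alpha>0,\tag{1.20}$$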

instead of $\zeta(\tfrac{1}{2}+\mathrm{i}\tau)$ . The reason is that, when $\theta<0$ , the contribution of the primes up to scale $|\theta|$ is negligible with high probability. Namely, with probability $1-\mathrm{o}(1)$ ,

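$$\max_{|h|\leq\log^{\theta}T}\big|\mathcal{P}_{|\theta|}(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)\big|=\mathrm{o}(\log\log T).$$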

When $\tau$ is restricted to a specific event $\mathcal{A}(T)$ on which (1.20) can be discretized as in (1.19), we can show that

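$$\mathbb{E}\Big[\big|(\zeta\cdot e^{-\mathcal{P}_{|\theta|}})(\tfrac{1}{2}+\mathrm{i}\tau)\big|^{\beta}\Big]\ll(\log T)^{(\beta^{2}/4)\cdot(1+\theta)+\varepsilon},$$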

for $\beta\leq 2$ . This explains the additional factor $(\beta^{2}/4)\theta$ in $f_{\theta}(\beta)$ when $-1<\theta<0$ and $\beta\leq 2$ .

We then turn to the lower bound part of Theorem 1.1 and Theorem 1.2. The lower bounds in Theorem 1.2 follow directly from Theorem 1.1 (see (3.74)), so it is enough to discuss Theorem 1.1.

The problem is first reduced to obtaining lower bounds for moments off the critical line. In particular, it is shown, uniformly in $\tfrac{1}{2}\leq\sigma\leq\tfrac{1}{2}+(\log T)^{\theta-\varepsilon}$ and for any given $\varepsilon>0$ , that with probability $1-\mathrm{o}(1)$ ,

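$$\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\sigma+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h\ll\int_{-3\log^{\theta}T}^{3\log^{\theta}T}|\zeta(\tfrac{1}{2}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h+\frac{1}{(\log T)^{96}}.$$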

This is accomplished using a result of Gabriel (1927) for subharmonic functions, and the construction of an explicit entire function which is a good approximation to the indicator function of the rectangle $\mathcal{R}=\{\sigma+\mathrm{i}u:|u|\leq(\log T)^{\theta},\,\tfrac{1}{2}\leq\sigma\leq\tfrac{1}{2}+(\log T)^{\theta-\varepsilon}\}$ in the whole strip $\tfrac{1}{2}\leq\mathrm{Re}\,s$. The fact that the interval can be very small when $\theta<0$ makes this part rather technical. We believe that this result might be useful in other applications as well.

The problem is therefore reduced to obtaining a good lower bound for

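$$\int_{-\log^{\theta}T}^{\log^{\theta}T}|\zeta(\sigma_{0}+\mathrm{i}\tau+\mathrm{i}h)|^{\beta}\,{\rm d}h,\quad\text{with }\sigma_{0}=\frac{1}{2}+\frac{1}{(\log T)^{1-\delta}},$$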

for some sufficiently small $\delta>0$ . We adapt mollification results from Arguin et al. (2019) to show that, outside of an event of probability $\mathrm{o}(1)$ , the problem can be reduced to understanding

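$$\int_{-\log^{\theta}T}^{\log^{\theta}T}\exp\big(\beta\,\mathrm{Re}\,\mathcal{P}_{1-\delta}(\sigma_{0}+\mathrm{i}\tau+\mathrm{i}h)\big)\,{\rm d}h.$$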

The proof of the lower bound is now restricted to the problem of understanding the correlation structure of the process

[TABLE]

The remaining part of the argument is done in Section 3.4 by a multiscale second moment method introduced in Kistler (2015). The covariance of the process (1.26) can be computed using Lemma A.3 with $a(p)=p^{-\sigma_{0}}(p^{-\mathrm{i}h}+p^{-\mathrm{i}h^{\prime}})$ :

[TABLE]

The cosine factor implies that primes smaller than $\exp(|h-h^{\prime}|^{-1})$ are almost perfectly correlated, whereas primes greater than $\exp(|h-h^{\prime}|^{-1})$ decorrelate quickly. In fact, the covariance can be evaluated precisely using the prime number theorem and equals $\frac{1}{2}\log|h-h^{\prime}|^{-1}+\mathcal{O}(1)$. This shows that the process is approximately a log-correlated Gaussian process. (This is also true for $\log|\zeta|$ in the sense of finite-dimensional distributions as shown in Bourgade (2010).)
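The logarithmic decay of the covariance can be observed numerically on a truncated prime sum. The sketch below makes several simplifying assumptions that are not taken from the paper: it works on the critical line (so $\sigma_{0}$ is replaced by $\tfrac{1}{2}$), it uses the model covariance $\tfrac{1}{2}\sum_{p\leq X}\cos(|h-h^{\prime}|\log p)/p$, and the cutoff $X=10^{5}$ and the names `primes_up_to` and `model_covariance` are illustrative choices. With this cutoff the logarithmic regime is only visible for $|h-h^{\prime}|$ larger than roughly $1/\log X$.

```python
import math


def primes_up_to(n):
    """Sieve of Eratosthenes: list of all primes p <= n (n >= 1)."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, math.isqrt(n) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(range(p * p, n + 1, p)))
    return [i for i, flag in enumerate(sieve) if flag]


def model_covariance(delta, primes):
    """(1/2) * sum over p of cos(delta * log p) / p, with delta = |h - h'|."""
    return 0.5 * sum(math.cos(delta * math.log(p)) / p for p in primes)


primes = primes_up_to(100_000)  # illustrative cutoff X = 10^5
for delta in (1.0, 0.5, 0.2, 0.1):
    # Second column: truncated covariance; third column: (1/2) log(1/delta).
    print(delta, model_covariance(delta, primes), 0.5 * math.log(1.0 / delta))
```

At $|h-h^{\prime}|=0$ the sum is the variance $\tfrac{1}{2}\sum_{p\leq X}p^{-1}$, which by Mertens' theorem is close to $\tfrac{1}{2}(\log\log X+0.2615)$.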

The identification with a log-correlated process is useful as it suggests that the Dirichlet polynomials have an underlying tree structure. To see this, consider the increments

[TABLE]

The range of primes is chosen so that each $P_{k}$ has variance $\tfrac{1}{2}+\mathrm{o}(1)$ . In this framework, the Dirichlet polynomial at $h$ can be seen as a random walk with independent and identically distributed increments. However, the random walks for different $h$ ’s are not independent by (1.27). In fact, the walks are almost perfectly correlated until they branch out around the prime $p\approx\exp(|h-h^{\prime}|^{-1})$ , corresponding to the increment $k(h,h^{\prime})=\log|h-h^{\prime}|^{-1}$ . Since $k$ goes to essentially $\log\log T$ , the analysis can be restricted to $h$ ’s on a grid with mesh $(\log T)^{-1}$ . Furthermore, the $h$ ’s in an interval of size $(\log T)^{-\alpha}$ , for $0<\alpha<1$ , will share the same increments up to $k\approx\alpha\log\log T$ .

The above observations have important consequences for the probabilistic analysis. For $\theta=0$ , this means that the process (1.26) on an interval of order one is well approximated by a Gaussian process indexed by a tree of average degree $e=2.718\dots$ , where the independent increments $P_{k}(h)$ are identified with the edges of the tree. Note that the number of leaves on the interval $[-1,1]$ is then $\approx e^{\log\log T}=\log T$ . Equivalently, the walks $\sum_{k}P_{k}(h)$ , $h\in[-1,1]$ , can be seen as a branching random walk on a Galton-Watson tree with an average number of offspring $e$ , cf. Figure 1.

When $\theta<0$ , the tree structure suggests that the primes up to $\exp(\log^{|\theta|}T)$ do not contribute to large values, since they should be essentially the same for all $h$ ’s in the interval. Therefore these primes can be cut off at a low cost, cf. Corollary 2.12. This is equivalent to restricting to a subtree of the one on $[-1,1]$ with $(1+\theta)\log\log T$ increments and $\log^{1+\theta}T$ leaves, yielding a maximum at leading order of $(1+\theta)\log\log T$ by the REM heuristic.

The case $\theta>0$ stands out as the analogy with branching random walks fails. This is because the random walks for $h$ and $h^{\prime}$ are essentially independent when ${|h-h^{\prime}|>1}$ . Therefore the right probabilistic model seems to consist of $\asymp\log^{\theta}T$ independent branching random walks corresponding to different intervals of order one, see Figure 1. A large class of similar models (called CREM’s for Continuous Random Energy Models) have been studied in Bovier and Kurkova (2004), see Bovier (2006, 2017) for a review. It turns out that the large values at leading order correspond to the ones of a REM with $\log^{1+\theta}T$ variables of variance $\frac{1}{2}\log\log T$ . This yields a maximum of $\sqrt{1+\theta}\log\log T$ at leading order. In fact, in view of the extreme value statistics of CREM’s, we expect that the REM heuristic holds for subleading corrections. This is the motivation for Conjecture 1.3.
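The REM heuristic used in the last three paragraphs can be summarized in a few lines of Python. The sketch below assumes the first-order rule that the maximum of $N$ independent centered Gaussians of variance $\sigma^{2}$ is about $\sqrt{2\sigma^{2}\log N}$ , with the number of leaves and the variances taken from the discussion above. The function name `rem_max_coefficient` is hypothetical.

```python
import math


def rem_max_coefficient(theta):
    """Leading-order maximum divided by loglog T, from sqrt(2 * variance * log(#leaves)).

    Both inputs are measured in units of loglog T:
      log(#leaves) = 1 + theta            (there are log^{1+theta} T leaves),
      variance     = min(1, 1 + theta)/2  (small primes are cut off when theta < 0).
    """
    if theta <= -1:
        raise ValueError("theta must be greater than -1")
    log_leaves = 1.0 + theta
    variance = 0.5 * min(1.0, 1.0 + theta)
    return math.sqrt(2.0 * variance * log_leaves)


if __name__ == "__main__":
    for theta in (-0.5, 0.0, 3.0):
        print(theta, rem_max_coefficient(theta))
```

For $\theta<0$ the function returns $1+\theta$ , and for $\theta\geq 0$ it returns $\sqrt{1+\theta}$ , matching the two leading-order maxima stated above.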

2 Upper bounds

2.1 Moment estimates

We will need a number of moment estimates which we state below.

Proposition 2.1.

Assume the Riemann hypothesis. Let $\beta>0$ and $\varepsilon>0$ be given. Then,

[TABLE]

Proof.

See Corollary A in Soundararajan (2009). ∎

Proposition 2.2.

Let $0<\beta\leq 4$ be given. Then,

[TABLE]

Proof.

See Theorem 1 in Heap, Radziwiłł and Soundararajan (2019). ∎

The proof of Proposition 2.1 is based on the following deterministic upper bound for $\zeta$ : Suppose that $T$ is large. Let $T\leq t\leq 2T$ , and let $2\leq x\leq T^{2}$ . Then, as $T\to\infty$ , we have

[TABLE]

see Proposition and Lemma 2 in Soundararajan (2009). On the Riemann hypothesis, the upper bounds in Theorem 1.1 and Theorem 1.2 could be proved in a simpler way by using this deterministic bound, and by proving the corresponding results for the Dirichlet polynomials. For unconditional results, such a deterministic upper bound is not available. We need to work on average to discard the contribution of large primes. This is the purpose of Lemmas 2.3, 2.4, 2.5 and Proposition 2.6 below.

In order to compute the moments of $\zeta\cdot e^{-\mathcal{P}_{|\theta|}}$ , we will need to express $e^{-\mathcal{P}_{|\theta|}}$ as a finite Dirichlet polynomial. To this end, notice that if $|z|\leq\nu/10$ for some $\nu\in\mathbb{N}$ , we have $\big|e^{z}-\sum_{j=0}^{\nu}\frac{z^{j}}{j!}\big|\leq e^{-\nu}$ . Consider more generally $e^{\lambda\mathcal{P}(s)}$ with $\lambda\in\mathbb{C}$ and $\mathcal{P}(s)=\sum_{p\leq X}a(p)p^{-s}$ for some completely multiplicative function $a$ . We have by the above, assuming $|\lambda\mathcal{P}(s)|\leq\nu/10$ for some $\nu\in\mathbb{N}$ , and by the multinomial formula, that

[TABLE]

where $\Omega(n)$ is the number of prime factors of $n$ with multiplicity. Here, $\mathfrak{g}$ is the multiplicative function defined by $\mathfrak{g}(p^{k})=1/k!$ for all integers $k$ and primes $p$ .
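The two ingredients of this expansion can be checked numerically. The Python sketch below (i) sums the Taylor tail $\sum_{j>\nu}r^{j}/j!$ with $r=\nu/10$ , which dominates $|e^{z}-\sum_{j\leq\nu}z^{j}/j!|$ for $|z|\leq r$ , and (ii) compares $e^{\lambda\mathcal{P}(s)}$ with the sum of $\lambda^{\Omega(n)}a(n)\mathfrak{g}(n)n^{-s}$ over integers $n$ whose prime factors lie in a small set. The choice $a\equiv 1$ , the set of primes, the cap on exponents, and the function names are assumptions made for illustration.

```python
import cmath
import math
from itertools import product


def taylor_tail(r, nu, extra_terms=400):
    """Sum of r**j / j! for nu < j <= nu + extra_terms (truncated Taylor tail of e^r)."""
    term = 1.0
    for j in range(1, nu + 1):
        term *= r / j  # after the loop, term = r**nu / nu!
    tail = 0.0
    for j in range(nu + 1, nu + 1 + extra_terms):
        term *= r / j
        tail += term
    return tail


def compare_exponential_expansion(primes, s, lam, max_exponent=25):
    """Return exp(lam * sum_p p**-s) and its expansion over smooth integers (a = 1)."""
    direct = cmath.exp(lam * sum(p ** (-s) for p in primes))
    expanded = 0j
    # Each tuple of exponents (k_p) encodes one integer n = prod p**k_p,
    # with coefficient lam**Omega(n) * g(n) * n**-s = prod (lam * p**-s)**k_p / k_p!.
    for exponents in product(range(max_exponent + 1), repeat=len(primes)):
        coefficient = 1 + 0j
        for p, k in zip(primes, exponents):
            coefficient *= (lam * p ** (-s)) ** k / math.factorial(k)
        expanded += coefficient
    return direct, expanded


if __name__ == "__main__":
    print(taylor_tail(2.0, 20), math.exp(-20))
    print(compare_exponential_expansion([2, 3, 5], 0.5 + 1.0j, -1.0))
```

Restricting the sum to $\Omega(n)\leq\nu$ corresponds exactly to truncating the Taylor series at order $\nu$ , since $(\lambda\mathcal{P}(s))^{j}/j!$ collects the terms with $\Omega(n)=j$ .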

The relevant function $a$ for $e^{-\mathcal{P}_{|\theta|}}$ will be of the following form: Given $\alpha,\beta\in\mathbb{R}$ and $\theta>-1$ , let $\mathfrak{F}_{\alpha,\beta,\theta}(n)$ denote a completely multiplicative function such that

[TABLE]

In the next three lemmas, we control various terms with the aim of proving the moment estimate in Proposition 2.6, which we will need in the case of short intervals.

Lemma 2.3.

Let $-1<\theta<0$ , $\beta>0$ and $\varepsilon>0$ be given. Then,

[TABLE]

Proof.

Notice that the Dirichlet polynomial in (2.6) has length $\ll T^{\delta}$ for any fixed $\delta>0$ . In particular, by the mean-value formula (Lemma A.2),

[TABLE]

Dropping the restriction on $\Omega(n)$ and expressing the sum as an Euler product yield

[TABLE]

The logarithm of the right-hand side is easily evaluated using the prime number theorem (see Lemma A.1) and is $(\beta^{2}(1+\theta)/4)\log\log T+\mathcal{O}(1)$ . This proves the claimed bound. ∎

Lemma 2.4.

Let $-1<\theta<0$ , $0<\beta\leq 2$ and $\varepsilon>0$ be given. Then,

[TABLE]

Proof.

By Theorem 1 in Bettin, Chandee and Radziwiłł (2017), the left-hand side of (2.8) is

[TABLE]

where $\Phi$ is a smooth non-negative function such that $\Phi(x)\geq 1$ for all $1\leq x\leq 2$ , with support contained in say $[0,3]$ , and $(n,m)$ and $[n,m]$ stand for the greatest common divisor and the least common multiple, respectively.

We first note that if $n,m$ have the prime factorization $n=\prod_{i=1}^{r}p_{i}^{\alpha_{i}}$ and $m=\prod_{i=1}^{r}p_{i}^{\beta_{i}}$ , where the $\alpha_{i}$ ’s and $\beta_{i}$ ’s are possibly zero, then $[n,m]=\prod_{i=1}^{r}p_{i}^{\alpha_{i}\vee\beta_{i}}$ . This means that if $a(n)$ and $b(m)$ are two bounded multiplicative functions, we have

[TABLE]
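The factorization $[n,m]=\prod_{i}p_{i}^{\alpha_{i}\vee\beta_{i}}$ behind this identity can be verified on small integers. The following Python sketch is only a sanity check; the helper names `prime_exponents` and `lcm_from_exponents` are hypothetical.

```python
import math


def prime_exponents(n):
    """Return {prime: exponent} for n >= 1 by trial division."""
    exponents = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            exponents[p] = exponents.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        exponents[n] = exponents.get(n, 0) + 1
    return exponents


def lcm_from_exponents(n, m):
    """Build [n, m] as the product of p ** max(alpha_p, beta_p)."""
    alpha, beta = prime_exponents(n), prime_exponents(m)
    result = 1
    for p in set(alpha) | set(beta):
        result *= p ** max(alpha.get(p, 0), beta.get(p, 0))
    return result


if __name__ == "__main__":
    # Compare with the classical formula [n, m] = n * m / (n, m).
    print(lcm_from_exponents(12, 18), 12 * 18 // math.gcd(12, 18))
```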

Using Chernoff’s bound, we can get rid of the restriction $\Omega(n)\leq 100\lfloor\log\log T\rfloor$ in (2.9). It suffices to notice that the contribution of each sum over $n$ with $\Omega(n)>100\lfloor\log\log T\rfloor$ is

[TABLE]

where we used (2.10) with $a(n)=|\mathfrak{F}_{-1,\beta/2-1,\theta}(n)|\mathfrak{g}(n)e^{\Omega(n)}$ , $b(m)=|\mathfrak{F}_{-1,\beta/2-1,\theta}(m)|\mathfrak{g}(m)$ . The contribution of each sum over $m$ with $\Omega(m)>100\lfloor\log\log T\rfloor$ can be removed in the same manner.

Considering the sums in (2.9) without the restriction on $\Omega(n)$ and $\Omega(m)$ , we get by (2.10) and Lemma A.1,

[TABLE]

In particular, this means that the second integral in (2.9) is $\ll(\log T)^{\beta^{2}(1+\theta)/4+\varepsilon}$ .

To evaluate the first integral in (2.9) , write

[TABLE]

Then, we end up having to evaluate

[TABLE]

As above, the sum over $m$ and $n$ factors into an Euler product which is

[TABLE]

For $|z|=1/\log T$ , note that

[TABLE]

and a Taylor expansion yields

[TABLE]

Since the error term in (2.17) is $\mathrm{o}(1)$ by Lemma A.1, the Euler product in (2.15) is

[TABLE]

By putting this estimate back in the contour integral and using a trivial bound on $z^{-2}$ , Equation (2.14) is $\ll(\log T)^{\beta^{2}(1+\theta)/4+\varepsilon}$ as required. ∎

Lemma 2.5.

Let $\varepsilon>0$ be given. For $\ell=2\lfloor\log\log T\rfloor$ , we have

[TABLE]

and

[TABLE]

Proof.

First, we apply a moment estimate (Lemma A.4) followed by a prime number theorem estimate (Lemma A.1) to obtain

[TABLE]

The estimate (2.19) then follows by applying the Cauchy-Schwarz inequality, the fourth moment bound $\mathbb{E}[|\zeta(\tfrac{1}{2}+\mathrm{i}\tau)|^{4}]\ll(\log T)^{4}$ , see e.g. Ingham (1928), and (2.21). For (2.20), the same reasoning as in (2.21) yields the estimate $\ll e^{-2\ell}\ll(\log T)^{-4}$ . ∎

The last three lemmas show a moment bound of the right order for $\zeta\cdot e^{-\mathcal{P}_{|\theta|}}$ .

Proposition 2.6.

Let $-1<\theta<0$ , $0<\beta\leq 2$ and $\varepsilon>0$ be given. Then, as $T\rightarrow\infty$ ,

[TABLE]

with the event

[TABLE]

Proof.

Let $0<\beta<2$ . By Young’s inequality with $p=2/\beta$ and $q=2/(2-\beta)$ ,

[TABLE]

Note that (2.24) holds trivially for $\beta=2$ . Hence, for $0<\beta\leq 2$ ,

[TABLE]

On the event $\mathcal{A}(T)\cap\{|\mathcal{P}_{1-\varepsilon}(\tfrac{1}{2}+\mathrm{i}\tau)|\leq 5\log\log T\}$ , we get, by the truncation (2.4) with $\nu=100\lfloor\log\log T\rfloor$ and the inequality $|z+w|^{2}\leq 2(|z|^{2}+|w|^{2})$ , that

[TABLE]

where $\mathfrak{F}_{\alpha,\beta,\theta}(n)$ is the completely multiplicative function defined in (2.5). Likewise, on the same event, we have

[TABLE]

Finally, on the event $\mathcal{A}(T)\cap\{|\mathcal{P}_{1-\varepsilon}(\tfrac{1}{2}+\mathrm{i}\tau)|>5\log\log T\}$ , we get, for any $\ell\geq 1$ ,

[TABLE]

since for $\beta\leq 2$ , $|\zeta|^{\beta}$ is bounded by $(1+|\zeta|^{2})$ and $|e^{-\mathcal{P}_{|\theta|}}|^{\beta}$ is bounded by $(\log T)^{4}$ on $\mathcal{A}(T)$ . We choose $\ell=2\lfloor\log\log T\rfloor$ . Now, take the expectation with $\tau$ restricted to $\mathcal{A}(T)$ in (2.25), then split the terms on the right-hand side over the associated events in (2.26), (2.27) and (2.28). We use Lemmas 2.3, 2.4 and 2.5 to bound the expectations. ∎

2.2 Discretization

The analysis of the maximum of zeta on an interval can often be restricted to $h$ ’s on a grid with mesh of order $(\log T)^{-1}$ . This can be proved for the maximum using the functional equation for zeta, see for example Lemma 2.2 in Farmer, Gonek and Hughes (2007). We will need a more elaborate variant for general Dirichlet polynomials.

Proposition 2.7.

Let $\theta>-1$ , $\beta\geq 1$ and $\varepsilon>0$ be given. Let $D(s)=\sum_{n\leq T^{1+\varepsilon}}a(n)n^{-s}$ be a Dirichlet polynomial of length $T^{1+\varepsilon}$ where $\sup_{n\leq T^{1+\varepsilon}}|a(n)|\leq B$ for some $B>0$ possibly depending on $\varepsilon$ and $T$ . Then, for all $A>10(1+\varepsilon)\beta$ , $T\leq t\leq 2T$ , and $\sigma\geq 1/2$ ,

[TABLE]

Proof.

Let $V$ be a smooth function with $V(x)=1$ for $x\in[-(1+\varepsilon),0]$ and compactly supported in $[-(1+2\varepsilon),\varepsilon]$ . We show

[TABLE]

By taking the complex norm and applying Hölder’s inequality with $\beta\geq 1$ , this yields

[TABLE]

This proves (2.29) after taking the supremum over $h$ , using the rapid decay of $\widehat{V}$ , and noticing that $\sup_{n\leq T^{1+\varepsilon}}|a(n)|\leq B$ and our assumption on $A$ imply that

[TABLE]

Since $D(s)$ is of the form $\sum_{n\leq T^{1+\varepsilon}}a(n)n^{-s}$ , it suffices by linearity to establish (2.30) for a single $n\leq T^{1+\varepsilon}$ , i.e.,

[TABLE]

Using the Poisson summation formula, the right-hand side can be rewritten as

[TABLE]

where we made the change of variable $y=(1+2\varepsilon)(u+\tfrac{h\log T}{2\pi})$ . The term $\ell=0$ is equal to $n^{-\mathrm{i}h}$ since $V(-\tfrac{\log n}{\log T})=1$ for $1\leq n\leq T^{1+\varepsilon}$ by the choice of $V$ . The other terms ( $\ell\neq 0$ ) are all equal to $0$ since $-\ell(1+2\varepsilon)-\tfrac{\log n}{\log T}$ falls outside the support of $V$ for $1\leq n\leq T^{1+\varepsilon}$ . This proves (2.33) and the proposition. ∎

Proposition 2.7 implies five important corollaries to tackle the maximum of $\zeta$ and of Dirichlet polynomials. We first observe that the discretization applies to $\zeta$ in Corollary 2.9. This is a consequence of the following approximation.

Lemma 2.8 (Approximation of $\zeta$ ).

Let $\varepsilon>0$ and $\sigma\geq 1/2$ be given, and let $k>\max\{5,10/\varepsilon\}$ be an integer. Then, as $T\to\infty$ and for $t\asymp T$ , we have

[TABLE]

where the smoothing $w_{k}$ is defined by setting

[TABLE]

where $(y)_{+}:=\max\{y,0\}$ . Examples of graphs for $w_{k}(x)$ are provided in Figure 2.

Proof.

The case $\sigma>2$ is a trivial consequence of the fact that $|\sum_{n>T}n^{-\sigma-\mathrm{i}t}|\leq|\sum_{n>T}n^{-2}|\leq T^{-1}$ . Therefore, assume $1/2\leq\sigma\leq 2$ . We claim that,

[TABLE]

First it is easy to check that this formula holds for $x<0$ and $x>k$ : if $x<0$ then we shift the contour towards $\mathrm{Re}\,z=-\infty$ and collect a single pole with residue $1$ at $z=0$ , while if $x>k$ then we shift the contour towards $\mathrm{Re}\,z=\infty$ and we see that the integral is zero. In the remaining intermediate range $0\leq x<k$ we expand

[TABLE]

and we use the fact that,

[TABLE]

Therefore,

[TABLE]

We now shift the contour to the line $\mathrm{Re}\,z=-(k-2)$ . We collect a pole at $z=0$ with residue $\zeta(\sigma+\mathrm{i}t)$ . On the line $\mathrm{Re}\,z=-(k-2)$ , we bound the integral using the estimate $|\zeta(r+\mathrm{i}t)|\ll(1+|t|)^{1/2-r}$ , which is valid for any fixed $r<-\tfrac{1}{100}$ and all $t\in\mathbb{R}$ . (This estimate follows from applying the functional equation for $\zeta(r+\mathrm{i}t)$ , bounding the ratio of Gamma factors using Stirling’s formula and bounding $\zeta(1-r-\mathrm{i}t)$ trivially by $O(1)$ .) Specifically, the contribution of the line $\mathrm{Re}\,z=-(k-2)$ is bounded by

[TABLE]

This proves that

[TABLE]

The conclusion follows by a simple rescaling. ∎

From Lemma 2.8, we derive the following discretization result.

Corollary 2.9.

Let $\theta>-1$ , $\beta\geq 1$ and $\varepsilon>0$ be given. For any $A>10(\varepsilon^{-1}+1)\beta$ and all $T\leq t\leq 2T$ ,

[TABLE]

Proof.

This is a consequence of the $\zeta$ approximation in Lemma 2.8 with $\sigma=1/2$ and $k=\lfloor(A+1)/(\beta\varepsilon)\rfloor$ , and the discretization in Proposition 2.7 with $B=1$ . ∎

As a consequence, we get a suboptimal upper bound for $\theta>-1$ using the second moment. Note that this bound also works for $\theta$ dependent on $T$ .

Corollary 2.10.

Let $0<\varepsilon\leq 1$ be given and let $k>10/\varepsilon$ be an integer. Then, for any $\theta>-1$ , possibly dependent on $T$ , we have

[TABLE]

Proof.

The Dirichlet polynomial in (2.44) is $\ll\sum_{n\leq T^{1+\varepsilon}}n^{-1/2}\ll T^{(1+\varepsilon)/2}\ll T$ , so the probability is just zero when $\theta>\log T/\log\log T$ . Therefore, we assume that $\theta\leq\log T/\log\log T$ . By the $\zeta$ approximation in Lemma 2.8, it suffices to prove

[TABLE]

By applying Markov’s inequality and Corollary 2.9 with $A=100(\varepsilon^{-1}+1)$ , the probability in (2.45) is

[TABLE]

Using a standard second moment bound, see e.g. (Titchmarsh, 1986, p.141), the last two expectations are $\ll\log T$ . We conclude that the right-hand side of (2.46) is

[TABLE]

since $\theta>-1$ . ∎

A similar reasoning using Markov’s inequality can be applied to get an upper bound for the maximum of $\mathcal{P}_{\alpha}$ , $0<\alpha<1$ . The bound below is suboptimal for $\theta<0$ and optimal for $\theta\geq 0$ .

Corollary 2.11.

Let $\theta>-1$ , $\varepsilon>0$ and $\sigma\geq 1/2$ be given. Then,

[TABLE]

Proof.

We apply Markov’s inequality with exponent $2\ell$ , and discretize as in (2.46) using Proposition 2.7 with $B=1$ . We then use moment estimates from Lemma A.4, with $\ell=\lfloor(1+\theta)\log\log T\rfloor$ , to bound the expectations. ∎

When $\theta<0$ and $\alpha>|\theta|$ , the bound (2.48) (and its analogue for $\zeta$ ) needs to be refined by discarding the contribution of small primes. The result below directly implies that for $\theta<0$ and $\alpha>|\theta|$ , the sharp upper bound for $\mathrm{Re}\,\mathcal{P}_{\alpha}$ is $\sqrt{(\alpha+\theta)(1+\theta)}\log\log T$ since the effective variance is $\tfrac{(\alpha+\theta)}{2}\log\log T$ .

Corollary 2.12.

Let $-1<\theta<0$ and $\sigma\geq 1/2$ be given. Then, for any $0<\varepsilon<C$ and $V=V(T)$ that satisfies $\varepsilon\log\log T\leq V\leq C\log\log T$ , we have

[TABLE]

for some constant $c=c(\varepsilon,C)>0$ .

Proof.

For a lighter notation, write $S(h)=\mathcal{P}_{|\theta|}(\sigma+\mathrm{i}\tau+\mathrm{i}h)$ . (We keep the dependence on $\tau$ implicit, consistent with the probabilistic notation for random variables.) We have

[TABLE]

Let $\ell$ denote a generic positive integer. By Markov’s inequality, a moment estimate (Lemma A.4) and a prime number theorem estimate (Lemma A.1), we have

[TABLE]

With the choice $\ell=\lfloor\frac{\varepsilon^{2}}{8}\log\log T\rfloor$ , this probability is $\ll\exp(-aV)$ for some constant $a=a(\varepsilon,C)>0$ .

It remains to control the first probability on the right-hand side of (2.50). Let $\ell$ denote another positive integer to be chosen later. By applying Proposition 2.7, we get

[TABLE]

A short calculation, using moment estimates (Lemma A.4) followed by prime number theorem estimates (Lemma A.1), yields

[TABLE]

for some constant $d>0$ (to obtain the last inequality, note that $|h|\cdot\log^{|\theta|}T\leq 1$ ).

Then, by Markov’s inequality and the choice $\ell=\lfloor\frac{\varepsilon^{2}}{8d}\log\log T\rfloor$ , we deduce

[TABLE]

for some constant $b=b(\varepsilon,C)>0$ . ∎

As before, the maximum of $\zeta\cdot e^{-\mathcal{P}_{|\theta|}}$ can be discretized by truncating the exponential.

Corollary 2.13.

Let $-1<\theta\leq 0$ and $\varepsilon>0$ be given. Then, there exists a constant $C=C(\theta,\varepsilon)>0$ such that the event

[TABLE]

has probability $1-\mathrm{o}(1)$ .

Proof.

Define the event

[TABLE]

By Corollary 2.12, we have $\mathbb{P}(\widetilde{\mathcal{A}}(T))=1-\mathrm{o}(1)$ . By (2.4), for all $\tau\in\widetilde{\mathcal{A}}(T)$ , we also have

[TABLE]

Combining this with the $\zeta$ approximation in Lemma 2.8 with $\sigma=1/2$ and $k=102/\varepsilon$ , we conclude that, for all $\tau\in\widetilde{\mathcal{A}}(T)$ and uniformly for $y\asymp T$ ,

[TABLE]

where $D$ is a Dirichlet polynomial of length $T^{1+2\varepsilon}$ . Proposition 2.7 implies

[TABLE]

Together with (2.58), this concludes the proof. ∎

2.3 Proofs of the upper bounds

2.3.1 The case $\theta\geq 0$

Proof of Theorem 1.2 for $\theta\geq 0$ .

By Markov’s inequality with exponent $\beta>0$ , we have

[TABLE]

If we choose $\beta=2m(\theta)\geq 2$ , we get, by picking $A$ large enough in Corollary 2.9, that the right-hand side of the above equation is

[TABLE]

By applying Proposition 2.2 if $\beta\leq 4$ (i.e., if $\theta\leq 3$ ) and Proposition 2.1 if $\beta>4$ (i.e., if $\theta>3$ ), the expectation is bounded by $(\log T)^{m(\theta)^{2}+\varepsilon}$ . Therefore, the claim follows. ∎

Proof of Theorem 1.1 for $\theta\geq 0$ .

For all $\beta>0$ , Markov’s inequality yields

[TABLE]

When $\beta\leq 2\sqrt{1+\theta}$ , we have $f_{\theta}(\beta)=\beta^{2}/4+\theta$ , so the right-hand side of (2.62) is $\ll(\log T)^{-\varepsilon/2}$ by Proposition 2.2 for $\theta\leq 3$ and by Proposition 2.1 for $\theta>3$ .

It remains to sharpen the bound in the case $\beta>2\sqrt{1+\theta}$ . We use the Lebesgue measure of high points. Let $a,b>0$ . Two successive applications of Markov’s inequality yield

[TABLE]

Again, the optimal bound is at $b=2a$ . Using Proposition 2.2 for $\theta\leq 3$ and Proposition 2.1 for $\theta>3$ and choosing $b=2a$ , we conclude that this is $\ll(\log T)^{-\varepsilon/2}$ for $0<a\leq m(\theta)$ .

We now partition the integral according to the value of the integrand. Let $M\geq 1$ be an integer and $0\leq j\leq M$ . Theorem 1.2 (for $\theta\geq 0$ ) and the above imply that, with probability $1-\mathrm{o}(1)$ ,

[TABLE]

For $\beta>2\sqrt{1+\theta}\geq 2m(\theta)$ , the last term $j=M$ dominates and, in particular, the above is bounded by

[TABLE]

provided that $M$ is chosen sufficiently large with respect to $\theta$ , $\beta$ and $\varepsilon$ . ∎

Remark.

In the above proof, we could have handled all $\beta$ ’s using the Lebesgue measure of high points in the spirit of a Gibbs variational principle. We chose to prove the case $\beta\leq 2\sqrt{1+\theta}$ directly as the proof is straightforward.
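The Gibbs-type variational principle mentioned in this remark can be sketched numerically for $\theta\geq 0$ . The sketch assumes, as suggested by the Markov bound with $b=2a$ above, that the set of $h$ with $|\zeta|\approx(\log T)^{a}$ has Lebesgue measure at most $(\log T)^{\theta-a^{2}+o(1)}$ for $0\leq a\leq m(\theta)=\sqrt{1+\theta}$ , so that the moment exponent is $\max_{a}(\beta a+\theta-a^{2})$ . The function names and the grid search are illustrative, and the closed form used for $\beta>2\sqrt{1+\theta}$ is derived from this heuristic rather than quoted from the text.

```python
import math


def m(theta):
    """Leading-order maximum exponent for theta >= 0 (assumed to be sqrt(1 + theta))."""
    return math.sqrt(1.0 + theta)


def exponent_variational(theta, beta, steps=20000):
    """Grid maximum of beta * a + theta - a**2 over 0 <= a <= m(theta)."""
    top = m(theta)
    best = -math.inf
    for i in range(steps + 1):
        a = top * i / steps
        best = max(best, beta * a + theta - a * a)
    return best


def exponent_closed_form(theta, beta):
    """Quadratic below the critical beta = 2 * m(theta), linear above it."""
    if beta <= 2.0 * m(theta):
        return beta * beta / 4.0 + theta  # optimum at a = beta / 2
    return beta * m(theta) - 1.0          # optimum frozen at a = m(theta)


if __name__ == "__main__":
    for beta in (1.0, 2.0, 4.0):
        print(beta, exponent_variational(0.0, beta), exponent_closed_form(0.0, beta))
```

The transition at $\beta=2\sqrt{1+\theta}$ appears when the unconstrained optimizer $a=\beta/2$ leaves the admissible range $[0,m(\theta)]$ .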

2.3.2 The case $\theta<0$

Proof of Theorem 1.2 for $\theta<0$ .

We notice that

[TABLE]

By Corollary 2.12, the last term is $\mathrm{o}(1)$ as $T\rightarrow\infty$ . As in (2.56), let

[TABLE]

By Corollary 2.12 again, the probability of $\widetilde{\mathcal{A}}(T)$ is $1-\mathrm{o}(1)$ . We let $\mathcal{A}_{0}(T)$ denote the subset of $\widetilde{\mathcal{A}}(T)$ for which the conclusion of Corollary 2.13 holds. The probability of $\mathcal{A}_{0}(T)$ is $1-\mathrm{o}(1)$ . Then, by Markov’s inequality, we have

[TABLE]

By Corollary 2.13, and since $m(\theta)=1+\theta$ , this is

[TABLE]

By Proposition 2.6, this is

[TABLE]

as needed. ∎

Proof of Theorem 1.1 for $\theta<0$ .

Similarly to (2.66), we can restrict the integrand to $\zeta\cdot e^{-\mathcal{P}_{|\theta|}}$ as follows

[TABLE]

As in (2.67), the probability is $\mathbb{P}(\widetilde{\mathcal{A}}(T))=1-\mathrm{o}(1)$ , and by Markov’s inequality, we have

[TABLE]

By Proposition 2.6, the above is

[TABLE]

This bound proves the claim for $\beta\leq 2$ .

It remains to refine the bound for the case $\beta>2$ . This proceeds in the same way as in the proof of Theorem 1.1 in the case $\theta\geq 0$ , with $\zeta$ replaced by $\zeta\cdot e^{-\mathcal{P}_{|\theta|}}$ restricted on the event $\widetilde{\mathcal{A}}(T)$ . Namely, we have, for $0<a\leq m(\theta)$ ,

[TABLE]

This is $\mathrm{o}(1)$ by Proposition 2.6 with the optimal choice $b=2a/(1+\theta)\leq 2$ . The remainder is done exactly as in the proof of Theorem 1.1 in the case $\theta\geq 0$ , by partitioning the integral over values of the integrand in the range $[0,m(\theta)+\varepsilon]$ . ∎

3 Lower bounds

In this section, we prove:

Proposition 3.1.

Let $\theta>-1$ and $\varepsilon>0$ be given. Then,

[TABLE]

Proposition 3.2.

Let $\theta>-1$ , $\beta>0$ and $\varepsilon>0$ be given. Then,

[TABLE]

The lower bound for the maximum will be an easy consequence of the lower bound for the moments. The idea is to approximate zeta by an appropriate Dirichlet polynomial. This can be done with good precision off-axis, cf. Section 3.1. The approximation to a Dirichlet polynomial is then shown in Section 3.2. The lower bound for the moments of the Dirichlet polynomials is proved in Section 3.3 using Kistler’s multiscale second moment method. Finally, the two propositions above are proved in Section 3.4.

3.1 Reduction off-axis

In Arguin et al. (2019), the maximum on a short interval of the critical line was compared to the one on a short interval away from the critical line by exploiting the analyticity of $\zeta$ away from its pole. More precisely, a value off-axis can be seen as an average of zeta over the critical line weighted by the corresponding Poisson kernel. This approach could also be used in the case of the moments by using the subharmonicity of the function $z\mapsto|z|^{\beta}$ . We choose to apply a different method based on the following convexity theorem of Gabriel, which handles error terms more efficiently.

Proposition 3.3 (Theorem 2 of Gabriel (1927) in the special case $a=b=1$ ).

Let $F$ be a complex valued function which is analytic in the strip $\alpha\leq\mathrm{Re}\,z\leq\beta$ . Suppose that $|F(z)|$ tends to zero as $|\mathrm{Im}\,z|\rightarrow\infty$ , uniformly for $\alpha\leq\mathrm{Re}\,z\leq\beta$ . Then, for any $\gamma\in[\alpha,\beta]$ and any $p>0$ ,

[TABLE]

where

[TABLE]

This theorem has the following useful consequence.

Corollary 3.4.

Let $F$ be a complex valued function which is analytic in the strip $\tfrac{1}{2}\leq\mathrm{Re}\,z$ . Suppose that $|F(z)|$ tends to zero as $|\mathrm{Im}\,z|\rightarrow\infty$ , uniformly for $\tfrac{1}{2}\leq\mathrm{Re}\,z$ . Suppose also that $I(\sigma)\rightarrow 0$ as $\sigma\rightarrow\infty$ . Then, for any $\sigma>\tfrac{1}{2}$ and any $p>0$ ,

[TABLE]

Proof.

Let $\sigma^{\star}$ be such that

[TABLE]

Note that because of the assumption that $I(\sigma)\rightarrow 0$ as $\sigma\rightarrow\infty$ , the above $\sigma^{\star}$ has a finite value. Let $\varepsilon>0$ be given. If $\sigma^{\star}=\tfrac{1}{2}$ , then we are done. If $\sigma^{\star}\neq\tfrac{1}{2}$ , then by Proposition 3.3 applied with $\gamma=\sigma^{\star}$ , $\alpha=\frac{1}{2}$ and $\beta=\sigma^{\star}+\varepsilon$ , we get

[TABLE]

for some appropriate $\lambda,\mu>0$ that satisfy $\lambda+\mu=1$ .

Therefore, by definition of $\sigma^{\star}$ in (3.6),

[TABLE]

and hence $I(\sigma^{\star})^{\lambda}\leq I(\tfrac{1}{2})^{\lambda}$ . Since $\lambda>0$ , we get $I(\sigma^{\star})\leq I(\tfrac{1}{2})$ . The claim follows from (3.6). ∎

We now construct a special analytic approximation for the indicator function of the rectangle $\mathcal{R}=\{\sigma+\mathrm{i}v:\tfrac{1}{2}\leq\sigma\leq\tfrac{1}{2}+K,|v|\leq L\}$ for $K,L>0$ . The effective width of the indicator function will be $K\approx L/\Delta$ in the statement below.

Lemma 3.5.

Let $b_{1}\in(0,1)$ and $\Delta,L,A,b_{2}>0$ be given. There exists an entire function $\Phi_{\Delta,L}(z)$ such that, for $z=\sigma+\mathrm{i}v$ with $\sigma\geq\tfrac{1}{2}$ and $v\in\mathbb{R}$ ,

(i) For $|v|\geq(1+b_{2})L$ , uniformly in $\sigma\geq\frac{1}{2}$ , $\Phi_{\Delta,L}(z)\ll_{A}b_{2}^{-A}\Delta^{1-A}.$

(ii) For any $|v|\leq(1-b_{1})L$ , $|\Phi_{\Delta,L}(z)|=1+\mathcal{O}_{b_{1},A}(\Delta^{-A})+\mathcal{O}((\sigma-\tfrac{1}{2})\tfrac{\Delta^{2}}{L}).$

(iii) For any $|v|\leq(1+b_{2})L$ , $|\Phi_{\Delta,L}(z)|\ll 1+(\sigma-\tfrac{1}{2})\tfrac{\Delta^{2}}{L}.$

(iv) $\Phi_{\Delta,L}(z)\rightarrow 0$ uniformly in $v$ as $\sigma\rightarrow\infty$ .

Proof.

Let $V$ be a smooth function, compactly supported in $(0,\infty)$ and such that $V(1)=1$ . Given a parameter $\eta>0$ and given $z\in\mathbb{C}$ with $\mathrm{Re}\,z\geq\tfrac{1}{2}$ and $u\in\mathbb{R}$ , consider the following function:

[TABLE]

Then $\delta_{\eta}(z)$ defines an entire function of exponential type. By integration by parts, we see that

[TABLE]

for any $A>0$ and uniformly in $\mathrm{Re}\,z\geq\tfrac{1}{2}$ . Therefore, we may think of $\delta_{\eta}(z)$ as localizing to $z=\tfrac{1}{2}+\mathcal{O}(\eta)$ . Furthermore, notice that if $z=\tfrac{1}{2}+\mathrm{i}v$ and $u\in\mathbb{R}$ , then

[TABLE]

and for $z=\sigma+\mathrm{i}v$ , we have by a Taylor expansion of the exponential,

[TABLE]

Finally, for $z=\sigma+\mathrm{i}v$ with $\sigma\geq\tfrac{1}{2}$ , we have from (3.10) that

[TABLE]

The candidate function is for $\eta=L/\Delta$ ,

[TABLE]

We will now describe some of the features of this function. Write $z=\sigma+\mathrm{i}v$ with $\sigma\geq\tfrac{1}{2}$ . Using the bound (3.13), we see that, if $|v|>(1+b_{2})L$ with $b_{2}>0$ , then

[TABLE]

This gives the first claim.

If $|v|\leq(1-b_{1})L$ , then by (3.14) and (3.12), we have

[TABLE]

It follows that if $\tfrac{1}{2}\leq\sigma$ and $|v|\leq(1-b_{1})L$ , then due to the rapid decay of $\widehat{V}$ , we have

[TABLE]

by Fourier inversion and the assumption that $V(1)=1$ . This proves the second claim. If $\tfrac{1}{2}\leq\sigma\ll 1$ and $|v|\leq(1+b_{2})L$ , then we have the bound

[TABLE]

which proves the third claim.

Finally, notice that $\delta_{L/\Delta}(z-\mathrm{i}u)\rightarrow 0$ uniformly as $\sigma\rightarrow\infty$ by (3.10), which implies the last claim that $\Phi_{\Delta,L}(z)\rightarrow 0$ uniformly in $v\in\mathbb{R}$ as $\sigma\rightarrow\infty$ . ∎

The following proposition relates the moments off and on axis.

Proposition 3.6.

Let $\theta>-1$ , $\beta>0$ , $0<\varepsilon\leq 1$ and $T\geq 10^{9}$ be given. Then, for all $\tfrac{1}{2}\leq\sigma\leq\tfrac{1}{2}+(\log T)^{\theta-3\varepsilon}$ , the event

[TABLE]

has probability $1-\mathrm{o}(1)$ .

Proof.

Let

[TABLE]

with $0<\varepsilon\leq 1$ and $k>10/\varepsilon$ a fixed integer. Using the $\zeta$ approximation in Lemma 2.8, we have, for $T\leq\tau\leq 2T$ and $\tfrac{1}{2}\leq\sigma\leq\tfrac{1}{2}+(\log T)^{\theta-3\varepsilon}$ ,

[TABLE]

Therefore, it suffices to establish (3.19) for $\zeta$ replaced by $D$ :

[TABLE]

Consider

[TABLE]

with $\Delta=\log^{\varepsilon}T$ and $L=1.5\log^{\theta}T$ . Then, by Lemma 3.5 (i) and (iv), Corollary 3.4 can be applied and yields

[TABLE]

Now, it remains to un-smooth both sides of this expression. Lemma 3.5 (ii) (with $b_{1}=1/3$ ) implies that $\Phi_{\Delta,L}(\sigma+\mathrm{i}u)\gg 1$ for $|u|\leq\log^{\theta}T$ . We thus have

[TABLE]

settling the left-hand side of (3.22). For the right-hand side, note that the choice $\Delta=\log^{\varepsilon}T$ and $L=1.5\log^{\theta}T$ ensures that the error term $(\sigma-\tfrac{1}{2})\tfrac{\Delta^{2}}{L}$ in Lemma 3.5 is $(\log T)^{-\varepsilon}$ for $\sigma-\tfrac{1}{2}\leq(\log T)^{\theta-3\varepsilon}$ . Lemma 3.5 (iii) (with $b_{2}=1$ ) shows that the right-hand side of (3.24) is

[TABLE]

where $\mathcal{U}_{\ell}=\{3(\log T)^{\theta+\ell}\leq|u|\leq 3(\log T)^{\theta+\ell+1}\}$ . By Corollary 2.10 and a union bound, the event

[TABLE]

has probability $1-\mathrm{o}(1)$ . Moreover, by Lemma 3.5 (i) with $A=1+\frac{100}{\varepsilon}(\lceil\theta\rceil+1)(1+1/\beta)$ and $b_{2}=2(\log T)^{\ell}-1$ , we have, for all $3(\log T)^{\theta+\ell}\leq|u|$ ,

[TABLE]

Therefore, on the event $\mathcal{S}(T)$ , and for every integer $\ell\geq 0$ , the following holds

[TABLE]

Thus, on $\mathcal{S}(T)$ , the contribution of the sum on the right-hand side of (3.26) is negligible. The claim follows by combining Equations (3.24), (3.25) and (3.26). ∎
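The parameter choice in this proof rests on a short piece of exponent arithmetic: with $\Delta=\log^{\varepsilon}T$ , $L=1.5\log^{\theta}T$ and $\sigma-\tfrac{1}{2}\leq(\log T)^{\theta-3\varepsilon}$ , the error term $(\sigma-\tfrac{1}{2})\Delta^{2}/L$ is at most $(\log T)^{-\varepsilon}/1.5$ . The Python sketch below evaluates it for a few sample values; the function name `off_axis_error` and the sample values of $T$ , $\theta$ and $\varepsilon$ are chosen only for illustration.

```python
import math


def off_axis_error(T, theta, eps):
    """Value of (sigma - 1/2) * Delta**2 / L at the largest admissible sigma."""
    log_t = math.log(T)
    delta = log_t ** eps                      # Delta = log^eps T
    length = 1.5 * log_t ** theta             # L = 1.5 log^theta T
    sigma_shift = log_t ** (theta - 3 * eps)  # largest allowed sigma - 1/2
    return sigma_shift * delta ** 2 / length


if __name__ == "__main__":
    T = 1e50
    for theta, eps in ((-0.5, 0.1), (0.0, 0.5), (2.0, 1.0)):
        # Compare the error term with the target (log T)^(-eps).
        print(theta, eps, off_axis_error(T, theta, eps), math.log(T) ** (-eps))
```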

3.2 Mollification

This step is an adaptation of Section 4.2 of Arguin et al. (2019), which is itself based on the work of Radziwiłł and Soundararajan (2017). The treatment is slightly different as the width of the interval needs to be taken into account. Also, we choose to use the discretization in Proposition 2.7 to obtain a uniform control on the interval as opposed to a Sobolev inequality.

The main idea is to define a mollifier for the zeta function

[TABLE]

where

[TABLE]

Here $\mu$ denotes the Möbius function $\mu(n)=(-1)^{\omega(n)}$ if $n$ is square-free, where $\omega(n)$ is the number of distinct prime factors, and $\mu(n)=0$ if $n$ is non-square-free. The estimate will be done slightly off-axis:

[TABLE]

The parameter $K$ will eventually be assumed to be large enough depending on $\theta$ , $\beta$ and $\varepsilon$ .
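For concreteness, the arithmetic functions entering the mollifier can be computed as follows. This Python sketch implements $\mu(n)$ exactly as defined above, together with $\Omega(n)$ , the number of prime factors counted with multiplicity; the helper names and the use of trial division are illustrative choices.

```python
def prime_factorization(n):
    """Return {prime: exponent} for an integer n >= 1, by trial division."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    factors = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def mobius(n):
    """Moebius function: (-1)**omega(n) if n is square-free, and 0 otherwise."""
    factors = prime_factorization(n)
    if any(exponent > 1 for exponent in factors.values()):
        return 0
    return (-1) ** len(factors)


def big_omega(n):
    """Number of prime factors of n counted with multiplicity."""
    return sum(prime_factorization(n).values())


if __name__ == "__main__":
    print([mobius(n) for n in range(1, 11)])
    print([big_omega(n) for n in range(1, 11)])
```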

The goal of this section is to prove that $M$ is an approximate inverse of $\zeta$ :

Lemma 3.7.

Let $\theta>-1$ and $\varepsilon>0$ be given. Then,

[TABLE]

This was proved in the case $\theta=0$ in Lemma 4.2 of Arguin et al. (2019). In particular, it also holds verbatim for $-1<\theta<0$ since the interval is just smaller. The proof of Lemma 3.7 also holds in the case $\theta>0$ with slight modifications that we highlight. The key idea is the following $L^{2}$ -control:

Lemma 3.8.

Let $\theta>0$ be given. Then,

[TABLE]

Proof.

The proof follows Arguin et al. (2019) with a new error term due to the choice of $\nu_{\theta}$ . (The manipulations are very similar to the ones in Lemma 2.4.) The error appears after Equation (4.10) in Arguin et al. (2019) and is given by

[TABLE]

The Euler product is $\ll(\log T)^{7}$ using Lemma A.1. Using this and the definition of $\nu_{\theta}$ in (3.31) yields

[TABLE]

Since $K\geq 2$ , this gives the correct estimate. Note that the expression $\sum_{p>X}\log(1-p^{-2\sigma_{0}})^{-1}$ entering in the remainder of the proof of Lemma 4.2 in Arguin et al. (2019) is

[TABLE]

This ends the proof. ∎

Proof of Lemma 3.7 for $\theta>0$ .

By Lemma 2.8, $\zeta$ is well approximated by a Dirichlet polynomial of length $T^{1+\varepsilon}$ for any given $\varepsilon>0$ . Moreover, $M$ is a Dirichlet polynomial of length less than $T^{\varepsilon}$ for any given $\varepsilon>0$ . Therefore, Markov’s inequality combined with Proposition 2.7 shows that the probability in (3.33) is

[TABLE]

The conclusion follows from Lemma 3.8. ∎

3.3 Approximation of the mollifier

We now approximate the mollifier $M$ by the exponential of a Dirichlet polynomial. If we let

[TABLE]

then the following relation between $\exp(-\widetilde{\mathcal{P}}_{1-K^{-1}}(s))$ and $M(s)$ holds for all $\mathrm{Re}\,s\geq 1/2$ :

[TABLE]

In particular, we see that $\exp(-\widetilde{\mathcal{P}}_{1-K^{-1}}(s))$ and $M(s)$ only differ for integers $n$ with more than $\nu_{\theta}$ prime factors ( $\Omega(n)>\nu_{\theta}$ ) and all their prime factors $\leq X$ . The following lemma makes use of this fact to estimate how close they are when $s=\sigma_{0}+\mathrm{i}\tau+\mathrm{i}h$ .

Lemma 3.9.

Let $\theta>-1$ be given. Then, for any $K\geq 2$ , we have

[TABLE]

Proof.

The discretization in Proposition 2.7 together with the mean value theorem in Lemma A.2 yield

[TABLE]

The right-hand side is $\ll(\log T)^{-100}$ by Rankin’s trick and Lemma A.1:

[TABLE]

The result follows by Markov’s inequality. ∎
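Rankin's trick, used in the proof above, bounds the indicator of $\{\Omega(n)>\nu\}$ by $r^{\Omega(n)-\nu}$ for some $r>1$, which turns a constrained sum into an Euler product. The sketch below checks a toy version of this idea: for integers whose prime factors all lie in a fixed finite set, $\sum_{\Omega(n)>\nu}1/n\leq r^{-\nu}\prod_{p}(1-r/p)^{-1}$ when $1<r<2$. The weights $1/n$, the parameter values and the helper names are assumptions chosen for illustration; this is not the exact sum estimated in the proof of Lemma 3.9.

```python
def big_omega(n, primes):
    """Return Omega(n) if n factors completely over `primes`, else None."""
    count = 0
    for p in primes:
        while n % p == 0:
            n //= p
            count += 1
    return count if n == 1 else None


def rankin_sides(primes, nu, r, cutoff):
    """Compare a truncated constrained sum with the Rankin upper bound.

    Left side: sum of 1/n over n <= cutoff with all prime factors in
    `primes` and Omega(n) > nu (a truncation of the full infinite sum).
    Right side: r**(-nu) * prod_p (1 - r/p)**(-1), valid for 1 < r < min(primes).
    """
    if not 1 < r < min(primes):
        raise ValueError("need 1 < r < smallest prime for convergence")
    lhs = 0.0
    for n in range(1, cutoff + 1):
        omega = big_omega(n, primes)
        if omega is not None and omega > nu:
            lhs += 1.0 / n
    euler_product = 1.0
    for p in primes:
        euler_product /= 1.0 - r / p
    return lhs, r ** (-nu) * euler_product
```

Taking $\nu$ larger makes the factor $r^{-\nu}$ smaller while the Euler product stays fixed, which is the mechanism that produces a saving once $\nu_{\theta}$ is large.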

3.4 Proofs of the lower bounds

Consider, for $0\leq j\leq K-2$ , the Dirichlet polynomials

[TABLE]

We use probabilistic notation for the increments $P_{j}$ , viewed as random variables, and omit the dependence on the random $\tau$ . We first prove a lower bound for the moments of Dirichlet polynomials.

Proposition 3.10.

Let $\theta>-1$ and $\varepsilon>0$ be given. Then,

[TABLE]

The polynomial $P_{K-2}$ is not included in the sum to ensure that the variances of the $P_{j}$ ’s are almost equal. Indeed, for all $|h|\leq\log^{\theta}T$ and $j\leq K-3$ , an application of (A.6) yields

[TABLE]

since $\sigma_{0}-\tfrac{1}{2}=(\log T)^{-1+3/(2K)}$ . The polynomial $P_{0}$ is ignored to ensure that the polynomials $\sum_{j=1}^{K-3}P_{j}(h)$ are almost independent for $h$ ’s that are far apart, which will be crucial for the second-moment method to go through; see below (3.63) in the proof of Proposition 3.10.

Proof of Proposition 3.10.

This is similar to the upper bound proof of Theorem 1.1. We first relate the moments to the measure of high points. Let $\varepsilon>0$ and $M\in\mathbb{N}$ , and set

[TABLE]

Consider $\gamma_{j}=\frac{j}{M}m(\theta)+\varepsilon$ for $1\leq j\leq M$ , and the good event

[TABLE]

We will show below that $\mathbb{P}(E)$ is $1-\mathrm{o}(1)$ . Before that, we prove the lower bound on the moments on the event $E$ . We have

[TABLE]

By the continuity of the function $\gamma\mapsto\beta\gamma+\mathcal{E}_{\theta}(\gamma)$ , Equation (3.49) implies that, on the event $E$ and for $M$ large enough with respect to $\varepsilon$ and $\beta$ ,

[TABLE]

When $0<\beta\leq 2m(\theta)/(1+(\theta\wedge 0))$ , take $\varepsilon>0$ small enough so that $\beta>2\varepsilon/(1+(\theta\wedge 0))$ . The maximum is attained at $\gamma=\tfrac{\beta}{2}(1+(\theta\wedge 0))$ , in which case the right-hand side of (3.50) is equal to $\tfrac{\beta^{2}}{4}(1+(\theta\wedge 0))+\theta-\varepsilon$ . When $\beta>2m(\theta)/(1+(\theta\wedge 0))$ , the maximum is attained at $\gamma=m(\theta)$ , in which case the right-hand side of (3.50) is equal to $(\beta m(\theta)-1)-\varepsilon$ . Thus, on the event $E$ and for $M$ large enough, the lower bound in (3.45) is satisfied.
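The two cases above can be checked numerically. The sketch below maximizes $\gamma\mapsto\beta\gamma+\mathcal{E}_{\theta}(\gamma)$ over a grid of $\gamma\in[0,m(\theta)]$ and compares the result with the closed form, with $\varepsilon$ set to $0$. It assumes $\mathcal{E}_{\theta}(\gamma)=\theta-\gamma^{2}/(1+(\theta\wedge 0))$, a form inferred here from the two stated maximizers rather than quoted from the definition, and uses $m(\theta)^{2}=(1+\theta)(1+(\theta\wedge 0))$. All function names are hypothetical.

```python
import math


def m(theta):
    """Maximum exponent: m(theta)^2 = (1 + theta) * (1 + min(theta, 0))."""
    return math.sqrt((1.0 + theta) * (1.0 + min(theta, 0.0)))


def entropy(theta, gamma):
    """Assumed form of E_theta(gamma); inferred, not quoted from the paper."""
    return theta - gamma ** 2 / (1.0 + min(theta, 0.0))


def exponent_by_grid(theta, beta, steps=20000):
    """Maximize beta*gamma + E_theta(gamma) over a grid of [0, m(theta)]."""
    top = m(theta)
    return max(
        beta * g + entropy(theta, g)
        for g in (top * i / steps for i in range(steps + 1))
    )


def exponent_closed_form(theta, beta):
    """Quadratic below the threshold 2*m/(1 + min(theta, 0)), linear above."""
    a = 1.0 + min(theta, 0.0)
    if beta <= 2.0 * m(theta) / a:
        return beta ** 2 / 4.0 * a + theta
    return beta * m(theta) - 1.0
```

For example, with $\theta=-1/2$ and $\beta=3$ the threshold is $2$, the maximum sits at the endpoint $\gamma=m(\theta)=1/2$, and both functions return $\beta m(\theta)-1=1/2$ up to grid error.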

To conclude the proof of the proposition, it remains to show that $\mathbb{P}(E)\to 1$ as $T\to\infty$ . By the upper bound on the maximum of $\sum_{j=1}^{K-3}P_{j}(h)$ in (2.48) (and the remark below it for $\theta<0$ ), it is sufficient to prove that, for all $\eta>0$ and all $0<\gamma<m(\theta)$ , the event

[TABLE]

has probability $1-\mathrm{o}(1)$ .

Consider

[TABLE]

For $\theta<0$ , Corollary 2.12 ensures that the primes up to $\exp(\log^{|\theta|}T)$ only make a very small contribution, namely the event

[TABLE]

has probability $1-\mathrm{o}(1)$ . We consider the random variable

[TABLE]

where

[TABLE]

By summing the $x_{j}$ ’s, it is not hard to check that the intersection of the events $\{\mathcal{N}\geq(\log T)^{\mathcal{E}_{\theta}(\gamma)-\eta}\}$ and the one in (3.53) is included in the event in (3.51). Therefore, the proof of the proposition reduces to showing

[TABLE]

This is established by the Paley-Zygmund inequality.

To this aim, we shall need one-point and two-point large deviation estimates for the event

[TABLE]

The next two propositions are stated as Propositions 5.4 and 5.5 in Arguin et al. (2019). They are consequences of the Gaussian moments in Lemma A.3.

Proposition 3.11 (One-point large deviation estimates).

Consider the event $A(h)$ in (3.57). For any choices of $\sqrt{\log\log T}\ll_{K}x_{j}\leq\log\log T$ where $1\leq j\leq K-3$ , and uniformly for $h,h^{\prime}\in[-\log^{\theta}T,\log^{\theta}T]$ , we have

[TABLE]

In the case of two points $h,h^{\prime}$ , the primes are essentially correlated up to $\exp(|h-h^{\prime}|^{-1})$ and quickly decorrelate afterwards. For $\theta\geq 0$ , this means that the $P_{j}$ ’s are essentially independent whenever $|h-h^{\prime}|>(\log T)^{-\frac{1}{2K}}$ , since $j=0$ is excluded. For $\theta<0$ , we must exclude the $j$ ’s up to $\mathcal{J}(\theta)-1$ . Therefore, the $P_{j}$ ’s are essentially independent whenever $|h-h^{\prime}|>(\log T)^{\theta-\frac{1}{2K}}$ . We get:

Proposition 3.12 (Two-point large deviation estimates).

Consider the event $A(h)$ in (3.57). For any choices of $0<x_{j}\leq\log\log T$ , and uniformly for $h,h^{\prime}\in[-\log^{\theta}T,\log^{\theta}T]$ such that $|h-h^{\prime}|>(\log T)^{-\frac{\mathcal{J}(\theta)}{K}+\frac{1}{2K}}$ , we have

[TABLE]

Furthermore, let $0\leq\ell\leq K-3$ . Then, uniformly for $h,h^{\prime}\in[-\log^{\theta}T,\log^{\theta}T]$ such that $|h-h^{\prime}|\leq(\log T)^{-\ell/K}$ , we have

[TABLE]

Now, in order to prove (3.56), we start by finding a lower bound on $\mathbb{E}[\mathcal{N}]$ . By (3.58), the $x_{j}$ ’s in (3.55) and the $s_{j}$ ’s in (3.46), we have

[TABLE]

assuming that $K$ is large enough with respect to $\theta$ , $\gamma$ and $\eta$ . By the Paley-Zygmund inequality, this implies

[TABLE]
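For reference, the Paley-Zygmund inequality in its standard form states that a non-negative random variable $\mathcal{N}$ with finite second moment satisfies $\mathbb{P}(\mathcal{N}>\delta\,\mathbb{E}[\mathcal{N}])\geq(1-\delta)^{2}(\mathbb{E}[\mathcal{N}])^{2}/\mathbb{E}[\mathcal{N}^{2}]$ for $0\leq\delta\leq 1$. The sketch below evaluates both sides exactly for a finite discrete distribution. The toy distribution and the function name are assumptions for illustration and are unrelated to the actual random variable $\mathcal{N}$ of the proof.

```python
from fractions import Fraction


def paley_zygmund_sides(values, probs, delta):
    """Return (P(N > delta*E[N]), (1-delta)^2 * E[N]^2 / E[N^2]) exactly.

    `values` are the non-negative values of N, `probs` their probabilities
    (summing to 1), and `delta` a number in [0, 1].
    """
    if sum(probs) != 1:
        raise ValueError("probabilities must sum to 1")
    mean = sum(v * p for v, p in zip(values, probs))
    second_moment = sum(v * v * p for v, p in zip(values, probs))
    lhs = sum(p for v, p in zip(values, probs) if v > delta * mean)
    rhs = (1 - delta) ** 2 * mean ** 2 / second_moment
    return lhs, rhs


# Toy example: N takes the values 0, 1, 10 with probabilities 1/2, 2/5, 1/10.
example = paley_zygmund_sides(
    [Fraction(0), Fraction(1), Fraction(10)],
    [Fraction(1, 2), Fraction(2, 5), Fraction(1, 10)],
    Fraction(1, 2),
)
```

The right-hand side is close to $(1-\delta)^{2}$ exactly when $\mathbb{E}[\mathcal{N}^{2}]$ is close to $(\mathbb{E}[\mathcal{N}])^{2}$, which is why the second moment has to be controlled next.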

It remains to show $\mathbb{E}[\mathcal{N}^{2}]=(1+\mathrm{o}(1))(\mathbb{E}[\mathcal{N}])^{2}$ . With $I=[-\log^{\theta}T,\log^{\theta}T]$ , Fubini’s theorem yields

[TABLE]

The integral can be divided into $(K-\mathcal{J}(\theta)+1)$ parts:

[TABLE]

The dominant term will be the one on $B$ . Note that $\mathrm{Leb}(B)=\mathrm{Leb}(I)^{2}(1+\mathrm{o}(1))$ . Hence, by (3.59), we have

[TABLE]

By (3.60) and the estimate (3.61), the integral on $B_{0}$ is

[TABLE]

assuming that $K$ is large enough with respect to $\theta$ and $\gamma$ . For $\ell=\mathcal{J}(\theta),\dots,K-3$ , the integral on $B_{\ell}$ is, by (3.60) and the estimate (3.61),

[TABLE]

assuming again that $K$ is large enough with respect to $\theta$ , $\gamma$ and $\eta$ . Since $\gamma^{2}<m(\theta)^{2}=(1+\theta)(1+(\theta\wedge 0))$ , the right-hand side of (3.67) is $\mathrm{o}\big((\mathbb{E}[\mathcal{N}])^{2}\big)$ if we fix $\eta>0$ small enough with respect to $\theta$ and $\gamma$ . Similarly, by (3.58) and the estimate (3.61), the integral on $B_{K-2}$ is

[TABLE]

provided that $\eta$ is small enough with respect to $\theta$ and $\gamma$ , and $K$ is large enough with respect to $\theta$ , $\gamma$ and $\eta$ . This concludes the proof of Proposition 3.10. ∎

Putting all the work of Section 3 together, we can prove the lower bound in Theorem 1.1.

Proof of Proposition 3.2.

By Proposition 3.6, the probability in (3.2) is

[TABLE]

By Lemma 3.7 and Lemma 3.9, the above is

[TABLE]

Now, notice that the (double) sum for $k\geq 3$ in $\widetilde{\mathcal{P}}_{1-K^{-1}}(\sigma_{0}+\mathrm{i}\tau+\mathrm{i}h)$ is of order one (uniformly for $|h|\leq\log^{\theta}T$ ), and that the sum for $k=2$ is of negligible order:

[TABLE]

where we use the discretization from Proposition 2.7 and the moment estimates from Lemma A.4. Indeed, the right-hand side of (3.71) is $\mathrm{o}(1)$ with the choice $A=\sqrt{\nu_{\theta}}$ and $\ell=\lfloor(1+\theta)\log\log T\rfloor$ . Hence, $\widetilde{\mathcal{P}}_{1-K^{-1}}$ can be replaced by $\mathcal{P}_{1-K^{-1}}$ with an error less than $\log^{\varepsilon}T$ with probability $1-\mathrm{o}(1)$ , meaning that the right-hand side of (3.70) is

[TABLE]

By (2.48), we may discard the terms with $j=0$ and $j=K-2$ with a similar error. For $K$ large enough with respect to $\varepsilon$ , $\beta$ and $\theta$ , the probability in (3.72) is therefore

[TABLE]

Finally, the probability in (3.73) tends to $1$ as $T\to\infty$ by Proposition 3.10. ∎

We now prove the lower bound in Theorem 1.2.

Proof of Proposition 3.1.

From (1.8), we have that $f_{\theta}(\beta)=\beta m(\theta)-1$ when $\beta>\beta_{c}(\theta)=2m(\theta)/(1+(\theta\wedge 0))=2\sqrt{1+(\theta\vee 0)}$ . Thus, on the event in the statement of Proposition 3.2 (which has probability $1-\mathrm{o}(1)$ ), and for $\beta$ large enough with respect to $\varepsilon$ and $\theta$ , we have

[TABLE]

This ends the proof. ∎

Appendix A Useful estimates

The prime number theorem yields estimates on sums over primes with a good error term.

Lemma A.1.

Let $1\leq P\leq Q$ , then

[TABLE]

Also, for $|\eta\log Q|\leq 1$ ,

[TABLE]

Proof.

For (A.1), see Lemma A.1 in Arguin and Ouimet (2019) and Lemma 2.1 in Arguin, Belius and Harper (2017). For (A.2), see p.20 in Harper (2013b). ∎
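Estimates of this kind are in the spirit of Mertens' theorem, whose classical leading term is $\sum_{P<p\leq Q}p^{-1}\approx\log\log Q-\log\log P$. The sketch below checks this leading term numerically with a simple sieve. It illustrates only the classical statement and should not be read as the precise form or error term of (A.1); the helper names and the chosen range are assumptions.

```python
import math


def primes_up_to(limit):
    """Sieve of Eratosthenes returning all primes <= limit."""
    if limit < 2:
        return []
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            # Cross out the multiples of p starting at p*p.
            is_prime[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    return [n for n in range(2, limit + 1) if is_prime[n]]


def prime_reciprocal_sum(P, Q):
    """Sum of 1/p over primes P < p <= Q."""
    return sum(1.0 / p for p in primes_up_to(Q) if p > P)


def mertens_leading_term(P, Q):
    """Leading term log log Q - log log P of the sum above."""
    return math.log(math.log(Q)) - math.log(math.log(P))
```

Comparing `prime_reciprocal_sum(100, 10**5)` with `mertens_leading_term(100, 10**5)` shows agreement to within a few hundredths, with the discrepancy coming mostly from the lower endpoint.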

The next three results yield moment estimates for Dirichlet polynomials. The first one is an elementary bound. The second ensures that moments of Dirichlet polynomials that are not too high are approximately Gaussian.

Lemma A.2 (Lemma 3.3 in Arguin et al. (2019)).

For any complex numbers $a(n)$ and $b(n)$ , and for $N\leq T$ , we have

[TABLE]

Lemma A.3 (Lemma 3.4 in Arguin et al. (2019)).

Let $x\geq 2$ be a real number, and suppose that for primes $p\leq x$ , $a(p)$ is a complex number with $|a(p)|\leq 1$ . Then, for any $k\in\mathbb{N}$ ,

[TABLE]

where $I_{0}(z)=\sum_{n\geq 0}z^{2n}/(2^{2n}(n!)^{2})$ denotes the modified Bessel function of the first kind of order $0$ . In particular, the expression is $\mathcal{O}\left(x^{2k}/T\right)$ for odd $k$ .

The relation with Gaussian moments in the case where $a(p)=p^{-\sigma-\mathrm{i}h}$ is obtained by expanding the product to get

[TABLE]

where $F(z)$ is analytic in a neighborhood of $0$ with $F(0)=1$ and any derivative of a fixed order is bounded by $\sum_{p\leq x}p^{-4\sigma}$ uniformly in $z$ . In particular, this implies that, for $\sigma\geq 1/2$ and $k$ small enough so that $x^{2k}/T=\mathrm{o}(1)$ ,

[TABLE]

The above also holds if $a(p)=0$ for $p\leq y$ (say) with the sum over primes restricted to $y<p\leq x$ . In particular, the error $\sum_{y<p\leq x}p^{-4\sigma}$ can be made $\mathrm{o}(1)$ by taking $y$ large. We note that the moments yield a Gaussian tail

[TABLE]

by picking the moment $k=\lfloor V^{2}/2v^{2}\rfloor$ with $v^{2}=\frac{1}{2}\sum_{p\leq x}p^{-2\sigma}$ , for $V$ not too large.
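The choice $k=\lfloor V^{2}/2v^{2}\rfloor$ can be motivated by a model computation. The sketch below assumes an exactly Gaussian variable with variance $v^{2}$, whose $2k$-th moment is $(2k-1)!!\,v^{2k}$, and evaluates the Markov bound $(2k-1)!!\,v^{2k}/V^{2k}$ at the suggested $k$. By Stirling's formula this is of order $e^{-V^{2}/(2v^{2})}$. The exact Gaussian model and the function names are simplifying assumptions; the Dirichlet polynomial moments are only approximately Gaussian.

```python
import math


def odd_double_factorial(k):
    """(2k-1)!! = (2k)! / (2^k k!), the 2k-th moment of a standard Gaussian."""
    return math.factorial(2 * k) // (2 ** k * math.factorial(k))


def markov_moment_bound(V, v2, k):
    """Markov bound on P(|X| > V) from the 2k-th moment of X ~ N(0, v2)."""
    return odd_double_factorial(k) * v2 ** k / V ** (2 * k)


def tail_bound_at_suggested_k(V, v2):
    """Use k = floor(V^2 / (2 v2)), at least 1, as suggested in the text."""
    k = max(1, math.floor(V * V / (2.0 * v2)))
    return k, markov_moment_bound(V, v2, k)


def gaussian_tail_shape(V, v2):
    """The target shape exp(-V^2 / (2 v2)) of the Gaussian tail."""
    return math.exp(-V * V / (2.0 * v2))
```

With $V=4$ and $v^{2}=1$ the suggested moment is $k=8$, and the Markov bound exceeds $e^{-8}$ only by a constant factor below $3/2$.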

Finally, the third estimate is a cruder version of the Gaussian moment estimates that yields quick upper bounds on moments.

Lemma A.4 (Lemma 3 in Soundararajan (2009)).

Let $T$ be large, and let $2\leq x\leq T$ . Let $\ell$ be a natural number such that $x^{\ell}\ll T/\log T$ . For any complex numbers $a(p)$ , we have

[TABLE]

Bibliography

1. Arguin, L. P., Belius, D. and Bourgade, P. (2017). Maximum of the characteristic polynomial of random unitary matrices. Comm. Math. Phys. 349, 703–751. MR3594368.
2. Arguin, L. P., Belius, D. and Harper, A. J. (2017). Maxima of a randomized Riemann zeta function, and branching random walks. Ann. Appl. Probab. 27, 178–215. MR3619786.
3. Arguin, L. P., Bourgade, P. and Radziwiłł, M. (2020). The Fyodorov-Hiary-Keating conjecture I. Preprint, 1–49. arXiv:2007.00988.
4. Arguin, L. P., Dubach, G. and Hartung, L. (2021). Maxima of a random model of the Riemann zeta function over intervals of varying length. Preprint, 1–26. arXiv:2103.04817.
5. Arguin, L. P. and Ouimet, F. (2019). Large deviations and continuity estimates for the derivative of a random model of $\log|\zeta|$ on the critical line. J. Math. Anal. Appl. 472, 687–695. MR3906393.
6. Arguin, L. P. and Tai, W. (2019). Is the Riemann zeta function in a short interval a 1-RSB spin glass? In Sojourns in Probability Theory and Statistical Physics - I. Springer Proceedings in Mathematics & Statistics, 63–88. Springer Singapore. doi:10.1007/978-981-15-0294-1.
7. Arguin, L. P., Belius, D., Bourgade, P., Radziwiłł, M. and Soundararajan, K. (2019). Maximum of the Riemann zeta function on a short interval of the critical line. Comm. Pure Appl. Math. 72, 500–535. MR3911893.
8. Bailey, E. C. and Keating, J. P. (2019). On the moments of the moments of the characteristic polynomials of random unitary matrices. Comm. Math. Phys. 371, 689–726. MR4019917.
