Moments of the Riemann zeta function on short intervals of the critical line
Louis-Pierre Arguin, Fr\'ed\'eric Ouimet, Maksym Radziwi\l\l

TL;DR
This paper analyzes the moments of the Riemann zeta function on short intervals along the critical line, revealing phase transitions and differences between mesoscopic and macroscopic scales, with implications for understanding zeta correlations.
Contribution
It extends the understanding of zeta moments on short intervals, proving phase transitions and interval-dependent behaviors, and generalizes previous results with unconditional proofs.
Findings
Moments exhibit phase transition at critical exponent _ heta(eta)
Different behavior of moments between mesoscopic and macroscopic intervals
Maximal size of zeta on short intervals quantified as (\u2212 T)^{m( heta)+o(1)}
Abstract
We show that as , for all outside of a set of measure , for some explicit exponent , where and . This proves an extended version of a conjecture of Fyodorov and Keating (2014). In particular, it shows that, for all , the moments exhibit a phase transition at a critical exponent , below which is quadratic and above which is linear. The form of the exponent also differs between mesoscopic intervals () and macroscopic intervals (), a phenomenon that stems from an approximate tree structure for the correlations of zeta. We also prove that,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Moments of the Riemann zeta function on short intervals of the critical line
Louis-Pierre Arguinlabel=e1][email protected] [
Frédéric Ouimetlabel=e2 [
mark][email protected]
Maksym Radziwiłłlabel=e3 [
mark][email protected]
Baruch College and Graduate Center (CUNY),
California Institute of Technology,
Abstract
We show that as , for all outside of a set of measure ,
[TABLE]
for some explicit exponent , where and . This proves an extended version of a conjecture of Fyodorov and Keating (2014). In particular, it shows that, for all , the moments exhibit a phase transition at a critical exponent , below which is quadratic and above which is linear. The form of the exponent also differs between mesoscopic intervals () and macroscopic intervals (), a phenomenon that stems from an approximate tree structure for the correlations of zeta. We also prove that, for all outside a set of measure ,
[TABLE]
for some explicit . This generalizes earlier results of Najnudel (2018) and Arguin et al. (2019) for . The proofs are unconditional, except for the upper bounds when , where the Riemann hypothesis is assumed.
60G70,
11M06 \sep60F10 \sep60G60,
extreme value theory,
Riemann zeta function,
moments,
keywords:
[class=MSC2020]
keywords:
\startlocaldefs\endlocaldefs
,
and t1L.-P. A. is supported in part by NSF Grant DMS-1513441 and by NSF CAREER DMS-1653602.t2F. O. is supported by postdoctoral fellowship from the NSERC (PDF) and the FRQNT (B3X).t3M. R. acknowledges support of a Sloan fellowship and NSF grant DMS-1902063.
Contents
1 Introduction
1.1 Maxima and moments over large intervals
Understanding the growth of the Riemann zeta function on the critical line is a central problem in number theory due, among other things, to its relationship with the distribution of the zeros of , see e.g. Theorem 9.3 in Titchmarsh (1986), and the more general subconvexity problem, see e.g. Michel and Venkatesh (2010); Venkatesh (2010), and see Iwaniec and Sarnak (2000) for a general discussion.
The Lindelöf hypothesis predicts that, for any and all , we have , whereas it follows from the Riemann hypothesis that
[TABLE]
see Chandee and Soundararajan (2011).
Unfortunately, there is a large gap between these conditional results and the best unconditional upper bounds, such as Bourgain (2017), which shows that for any given and all . Currently, the best unconditional lower bound,
[TABLE]
is established in de la Bretèche and Tenenbaum (2019) building on a method from Bondarenko and Seip (2017).
The true order of the maximum of remains elusive to this day. A conjecture that we find plausible is stated in Farmer, Gonek and Hughes (2007), where it is conjectured based on probabilistic models that
[TABLE]
Another set of central objects in the theory of the Riemann zeta function are the moments
[TABLE]
Their importance comes from their relationship to the size and zero-distribution of . However, unlike the problem of understanding the size of the global maximum of , we are in possession of widely believed conjectures regarding the behavior of moments. Following the work Keating and Snaith (2000), it is expected that, for all ,
[TABLE]
and that the constant factors into a product of two constants: one is computed from the moments of the characteristic polynomial of random unitary matrices, and the other is an arithmetic factor coming from the small primes.
There are a few results supporting (1.5). First, the conjecture (1.5) is known for and following the classical work of Hardy-Littlewood and Ingham. Upper bounds of the correct order of magnitude are established in Heap, Radziwiłł and Soundararajan (2019) for . Meanwhile, lower bounds of the correct order of magnitude have been established for all in Radziwiłł and Soundararajan (2013). Conditionally on the Riemann hypothesis, the correct order of magnitude of (1.5) is known for all (see Soundararajan (2009); Harper (2013a) for the upper bounds and Heath-Brown (1981) for the lower bounds).
1.2 Maxima and moments over short intervals
Motivated by the problem of understanding the global maximum, Fyodorov, Hiary and Keating (2012); Fyodorov and Keating (2014) initiated the question of understanding the true size of the local maximum of by establishing a connection with log-correlated processes. If is sampled uniformly on , they conjectured that for any , there exists large enough and independent of , such that with probability ,
[TABLE]
They also conjectured weak convergence, with a limiting tail of the form . The leading order was proved in Najnudel (2018) (conditionally on the Riemann hypothesis for the lower bound) and in Arguin et al. (2019) unconditionally. The sharp upper bound was recently established in Arguin, Bourgade and Radziwiłł (2020).
It is also conjectured in Fyodorov, Hiary and Keating (2012); Fyodorov and Keating (2014) (see Equations (14) and (2.30), respectively) that the moments in a short interval undergo a freezing phase transition, that is, the event
[TABLE]
has probability as . Fyodorov and Keating (2014) also state corresponding conjectures for mesoscopic intervals of length when , as well as finer asymptotics for the moments.
In view of Equations (1.5) and (1.7), an obvious question is to determine up to which interval size the freezing phase transition persists. In this paper, we establish that freezing transitions occur exactly for interval sizes of order with . We also obtain the corresponding results for local maxima over such intervals. The following functions will be crucial to our analysis:
[TABLE]
Theorem 1.1** (Moments).**
Let , and be given. Let be a random variable uniformly distributed on . Then, as , we have
[TABLE]
Moreover, if or if the Riemann hypothesis holds, then as ,
[TABLE]
Proof.
For the upper bound, see Section 2.3, and for the lower bound, see Proposition 3.2. ∎
When , the moments exhibit freezing, i.e. they are dominated by a few large values at the level of the local maximum of , . Theorem 1.1 also suggests that freezing does not occur for intervals larger than any fixed power of , since as . We note that recently a sharp upper bound in the case has been established in Harper (2019), thus refining the factor appearing in (1.10) when and .
Theorem 1.2** (Local maximum).**
Let and be given. Let be a random variable uniformly distributed on . Then, as , we have
[TABLE]
Moreover, if or if the Riemann hypothesis holds, then as ,
[TABLE]
Proof.
For the upper bound, see Section 2.3, and for the lower bound, see Proposition 3.1. ∎
It is instructive to put these results in the context of two well-known facts on . First, Selberg’s central limit theorem, see for example Selberg (1946, 1992) or the simple proof in Radziwiłł and Soundararajan (2017), states that, for any given ,
[TABLE]
In other words, a typical value of is a Gaussian random variable of variance . This is consistent with the moment conjecture (1.5) which gives a precise expression for the Laplace transform of . Second, since varies on the scale of for , the analysis of large values should be reducible to a discrete set of points. Putting these two facts together, one expects that the statistics of extreme values of , , should be similar to the ones of Gaussian random variables of variance . If the random variables were independent, this is the so-called Random Energy Model (REM) in statistical mechanics introduced in Derrida (1981). For , it is not hard to check, using basic Gaussian tail estimates, that the expression (1.8) corresponds to the free energy of the model, and the results of Theorem 1.2, to the maximum of the REM. For more on this, we refer to Kistler (2015), where many techniques from REM were introduced to analyze log-correlated processes.
The REM heuristic is of course limited as the values of , , are correlated. In fact, they are log-correlated if , as first noticed Bourgade (2010). A good probabilistic model for the extreme values in the case is therefore a branching random walk. This is explained in more details in Section 1.4 and illustrated in Figure 1. For , our results show that the correlations do not affect large values at leading order (though the proofs must take them into account). As argued in Section 1.4, we believe that the correct probabilistic model for large values in this case is independent branching random walks. One implication is that the REM heuristic should persist to subleading order (but fail at the level of fluctuations). In view of this, we believe that conjecture (1.6) needs to be expanded as follows to include large intervals:
Conjecture 1.3**.**
Let be given and let be as in (1.8). Let be a random variable uniformly distributed on . For any , there exists large enough and independent of , such that with probability ,
[TABLE]
where
[TABLE]
In particular, we expect a discontinuity of as . An analysis of a model of the Riemann zeta function shows that the discontinuity can be resolved by approaching [math] at a suitable rate. Namely if , it is expected that , interpolating between and for , see Arguin, Dubach and Hartung (2021). Such hybrid statistics have been studied in the context of branching random walks, see Kistler and Schmidt (2015) and Bovier and Hartung (2020).
For , our analysis suggests that the correct model consists of a single random walk up to time followed by a branching random walk. The maximum on such intervals would then be consistent with the level proposed in Section 2 (c)(ii) of Fyodorov and Keating (2014),
[TABLE]
where is a standard Gaussian random variable. As explained in Section 1.4, the additional fluctuation would represent the contribution of the Dirichlet polynomial which is essentially the same random variable for all ’s in the interval .
1.3 Relations to other models
When , Conjecture 1.3 is based on modelling by the characteristic polynomial of a random unitary matrix (CUE). More precisely, if is a random matrix sampled from the Haar measure on the unitary group , one can consider the moments
[TABLE]
These can be computed in the limit , at least heuristically, using Selberg integrals and the Fisher-Hartwig formula, cf. Fyodorov and Keating (2014). Exact expressions were recently obtained in Bailey and Keating (2019) in the regime . The statistics of and of in the limit can be inferred from the asymptotics of the moments by comparison with log-correlated processes, cf. Fyodorov, Gnutzmann and Keating (2018) for a numerical study. In the CUE setting, the freezing analogue of (1.7) and the leading order as in (1.6) were proved in Arguin, Belius and Bourgade (2017). The subleading order of the maximum was proved in Paquette and Zeitouni (2018), and up to constant in Chhaibi, Madaule and Najnudel (2018).
From the analysis of a particular variant of the log-correlated REM model, Fyodorov and Bouchaud (2008) conjectured an exact formula for the density of the total mass of the sub-critical Gaussian multiplicative chaos (GMC) measure associated to the Gaussian free field (GFF) on the unit circle, cf. Rhodes and Vargas (2014). In the critical case, they conjectured that the fluctuations of the maximum can be captured by a sum of two Gumbel variables. Both results were proved in Remy (2020). Naturally, these results are expected to hold in the CUE setting, where the GMC measure is the limit of
[TABLE]
as proved by Webb (2015) when , and by Nikula, Saksman and Webb (2020) when . Such a random measure can also be considered in the context of the Riemann zeta function for mesoscopic intervals of length , , with in place of . (There does not seem to be any obvious equivalent for macroscopic intervals, , in the CUE model.) A step in this direction was made in Saksman and Webb (2020) where , , was shown to converge, as , when considered as a random variable on the space of tempered distributions.
Another model for the large values of , , is to consider a random Dirichlet polynomial , where are i.i.d. uniform random variables on the unit circle, cf. Harper (2013b); Arguin, Belius and Harper (2017); Arguin and Ouimet (2019). The analogue of conjecture (1.6) for this model was proved up to second-order corrections in Arguin, Belius and Harper (2017), and large deviations and continuity estimates for the derivative were found in Arguin and Ouimet (2019). The limit of the corresponding multiplicative chaos measure was obtained in Saksman and Webb (2016, 2020). A proof of the freezing phase transition was given in Arguin and Tai (2019). In the latter, the limit of the Gibbs measure is also studied in the supercritical regime , showing that it is supported on ’s that are at a relative distance of order one or order of each other. This result was used in Ouimet (2018) to prove that the normalized Gibbs weights converge to a Poisson-Dirichlet distribution.
Notation**.**
For the rest of the paper, denotes a uniform random variable on . For any event and a random variable , we write
[TABLE]
We also use the standard and notations: thus, if tends to [math] as when the parameters , and are fixed. Similarly, we write if is bounded for , and fixed. We sometimes write for conciseness if , and also if both and hold. In some statements, we write or to highlight the dependence on a specific parameter in the implicit constant. In some of the proofs, we use the common convention that denotes an arbitrarily small positive quantity that may vary from line to line. We will also encounter some arithmetical functions familiar in number theory. These include: (which counts the number of distinct primes dividing ), (which counts with multiplicity the number of primes dividing ), and the Möbius function (which equals [math] if is divisible by the square of a prime, and equals if is square-free). Throughout the paper, and refer to and , respectively.
1.4 Outline of the proof
For , the upper bound part of Theorem 1.1 and Theorem 1.2 follows from the moment estimates
[TABLE]
and from a discretization result which roughly shows that for a Dirichlet polynomial that approximates zeta, and for , we have
[TABLE]
Equation (1.19) tells us that the process varies on a scale, so that the maximum and moments of on an interval of length behave as those of i.i.d. Gaussian random variables of variance .111As in the branching random walk setting, the log-correlations are important in the proof of the first-order asymptotics of the maximum, high points and moments, but they do not influence the results. When comparing Gaussian fields, Slepian’s lemma tells us that, at equal variance, the field with no correlations will have, on average, the highest maximum and the highest number of points above any fixed proportion of the maximum (the asymptotics of the moments are derived directly from these two quantities). Therefore, the asymptotics of the maximum and moments for i.i.d. Gaussians are always an upper bound for those of log-correlated Gaussian fields. It turns out that we get a matching lower bound by a coarse-graining of the scales following Kistler (2015). This is why our heuristic here is phrased in terms of i.i.d. Gaussians, because the correlations ultimately only matters for the proof, not the actual results. The limitation to comes from the fact that the upper bounds (1.18) are not known unconditionally for .
When , the upper bounds in Theorem 1.1 and Theorem 1.2 are a bit more delicate. We follow essentially the same strategy, but we apply it to the function
[TABLE]
instead of . The reason is that, when , the contribution of the primes up to scale is negligible with high probability. Namely, with probability ,
[TABLE]
When is restricted to a specific event on which (1.20) can be discretized as in (1.19), we can show that
[TABLE]
for . This explains the additional factor in when and .
We then turn to the lower bound part of Theorem 1.1 and Theorem 1.2. The lower bounds in Theorem 1.2 follow directly from Theorem 1.1 (see (3.74)), so it is enough to discuss Theorem 1.1.
The problem is first reduced to obtaining lower bounds for moments off the critical line. In particular, it is shown, uniformly in and for any given , that with probability ,
[TABLE]
This is accomplished using a result of Gabriel (1927) for subharmonic functions, and the construction of an explicit entire function which is a good approximation to the indicator function of the rectangle in the whole strip . The fact that the interval can be very small when makes this part rather technical. We believe that this result might be useful in other applications as well.
The problem is therefore reduced to obtaining a good lower bound for
[TABLE]
for some sufficiently small . We adapt mollification results from Arguin et al. (2019) to show that, outside of an event of probability , the problem can be reduced to understanding
[TABLE]
The proof of the lower bound is now restricted to the problem of understanding the correlation structure of the process
[TABLE]
The remaining part of the argument is done in Section 3.4 by a multiscale second moment method introduced in Kistler (2015). The covariance of the process (1.26) can be computed using Lemma A.3 with :
[TABLE]
The cosine factor implies that primes smaller than are almost perfectly correlated, whereas primes greater than decorrelate quickly. In fact, the covariance can be evaluated precisely using the prime number theorem and equals . This shows that the process is approximatively a log-correlated Gaussian process. (This is also true for in the sense of finite-dimensional distributions as shown in Bourgade (2010).)
The identification with a log-correlated process is useful as it suggests that the Dirichlet polynomials have an underlying tree structure. To see this, consider the increments
[TABLE]
The range of primes is chosen so that each has variance . In this framework, the Dirichlet polynomial at can be seen as a random walk with independent and identically distributed increments. However, the random walks for different ’s are not independent by (1.27). In fact, the walks are almost perfectly correlated until they branch out around the prime , corresponding to the increment . Since goes to essentially , the analysis can be restricted to ’s on a grid with mesh . Furthermore, the ’s in an interval of size , for , will share the same increments up to .
The above observations have important consequences for the probabilistic analysis. For , this means that the process (1.26) on an interval of order one is well approximated by a Gaussian process indexed by a tree of average degree , where the independent increments are identified with the edges of the tree. Note that the number of leaves on the interval is then . Equivalently, the walks , , can be seen as a branching random walk on a Galton-Watson tree with an average number of offspring , cf. Figure 1.
When , the tree structure suggests that the primes up to do not contribute to large values, since they should be essentially the same for all ’s in the interval . Therefore these primes can be cutoff at a low cost, cf. Corollary 2.12. This is equivalent to restricting to a subtree of the one on with increments and leaves, yielding a maximum at leading order of by the REM heuristic.
The case stands out as the analogy with branching random walks fails. This is because the random walks for and are essentially independent when . Therefore the right probabilistic model seems to consist of independent branching random walks corresponding to different intervals of order one, see Figure 1. A large class of similar models (called CREM’s for Continuous Random Energy Models) have been studied in Bovier and Kurkova (2004), see Bovier (2006, 2017) for a review. It turns out that the large values at leading order correspond to the ones of a REM with variables of variance . This yields a maximum of at leading order. In fact, in view of the extreme value statistics of CREM’s, we expect that the REM heuristic holds for subleading corrections. This is the motivation for Conjecture 1.3.
2 Upper bounds
2.1 Moment estimates
We will need a number of moment estimates which we state below.
Proposition 2.1**.**
Assume the Riemann hypothesis. Let and be given. Then,
[TABLE]
Proof.
See Corollary A in Soundararajan (2009). ∎
Proposition 2.2**.**
Let be given. Then,
[TABLE]
Proof.
See Theorem 1 in Heap, Radziwiłł and Soundararajan (2019). ∎
The proof of Proposition 2.1 is based on the following deterministic upper bound for : Suppose that is large. Let , and let . Then, as , we have
[TABLE]
see Proposition and Lemma 2 in Soundararajan (2009). On the Riemann hypothesis, the upper bounds in Theorem 1.1 and Theorem 1.2 could be proved in a simpler way by using this deterministic bound, and by proving the corresponding results for the Dirichlet polynomials. For unconditional results, such a deterministic upper bound is not available. We need to work on average to discard the contribution of large primes. This is the purpose of Lemmas 2.3, 2.4, 2.5 and Proposition 2.6 below.
In order to compute the moments of , we will need to express as a finite Dirichlet polynomial. To this aim, notice that if for some , we have \big{|}e^{z}-\sum_{j=0}^{\nu}\frac{z^{j}}{j!}\big{|}\leq e^{-\nu}. Consider more generally with and for some completely multiplicative function . We have by the above, assuming for some , and by the multinomial formula, that
[TABLE]
where is the number of prime factors of with multiplicity. Here, is the multiplicative function defined by for all integers and primes .
The relevant function for will be of the following form: Given and , let denote a completely multiplicative function such that
[TABLE]
In the next three lemmas, we control various terms with the aim of proving the moment estimate in Proposition 2.6, which we will need in the case of short intervals.
Lemma 2.3**.**
Let , and be given. Then,
[TABLE]
Proof.
Notice that the Dirichlet polynomial in (2.6) has length for any fixed . In particular, by the mean-value formula (Lemma A.2),
[TABLE]
Dropping the restriction on and expressing the sum as an Euler product yield
[TABLE]
The logarithm of the right-hand side is easily evaluated using the prime number theorem (see Lemma A.1) and is . This proves the claimed bound. ∎
Lemma 2.4**.**
Let , and be given. Then,
[TABLE]
Proof.
By Theorem 1 in Bettin, Chandee and Radziwiłł (2017), the left-hand side of (2.8) is
[TABLE]
where is a smooth non-negative function such that for all , with support contained in say , and and stand for the greatest common divisor and the least common multiple, respectively.
We first note that if have the prime factorization and , where the ’s and ’s are possibly [math], then . This means that if and are two bounded multiplicative functions, we have
[TABLE]
Using Chernoff’s bound, we can get rid of the restriction in (2.9). It suffices to notice that the contribution of each sum over with is
[TABLE]
where we used (2.10) with , . The contribution of each sum over with can be removed in the same manner.
Considering the sums in (2.9) without the restriction on and , we get by (2.10) and Lemma A.1,
[TABLE]
In particular, this means that the second integral in (2.9) is .
To evaluate the first integral in (2.9) , write
[TABLE]
Then, we end up having to evaluate
[TABLE]
As above, the sum over and factors into an Euler product which is
[TABLE]
For , note that
[TABLE]
and a Taylor expansion yields
[TABLE]
Since the error term in (2.17) is by Lemma A.1, the Euler product in (2.15) is
[TABLE]
By putting this estimate back in the contour integral and using a trivial bound on , Equation (2.14) is as required. ∎
Lemma 2.5**.**
Let be given. For , we have
[TABLE]
and
[TABLE]
Proof.
First, we apply a moment estimate (Lemma A.4) followed by a prime number theorem estimate (Lemma A.1) to obtain
[TABLE]
The estimate (2.19) then follows by applying the Cauchy-Schwarz inequality, the fourth moment bound , see e.g. Ingham (1928), and (2.21). For (2.20), the same reasoning as in (2.21) yields the estimate . ∎
The last three lemmas show a moment bound of the right order for .
Proposition 2.6**.**
Let , and be given. Then, as ,
[TABLE]
with the event
[TABLE]
Proof.
Let . By Young’s inequality with and ,
[TABLE]
Note that (2.24) holds trivially for . Hence, for ,
[TABLE]
On the event , we get, by the truncation (2.4) with and the identity , that
[TABLE]
where is the completely multiplicative function defined in (2.5). Likewise, on the same event, we have
[TABLE]
Finally, on the event , we get, for any ,
[TABLE]
since for , is bounded by and is bounded by on . We choose . Now, take the expectation with restricted to in (2.25), then split the terms on the right-hand side over the associated events in (2.26), (2.27) and (2.28). We use Lemmas 2.3, 2.4 and 2.5 to bound the expectations. ∎
2.2 Discretization
The analysis of the maximum of zeta on an interval can often be restricted to ’s on a grid with mesh of order . This can be proved for the maximum using the functional equation for zeta, see for example Lemma 2.2 in Farmer, Gonek and Hughes (2007). We will need a more elaborate variant for general Dirichlet polynomials.
Proposition 2.7**.**
Let , and be given. Let be a Dirichlet polynomial of length where for some possibly depending on and . Then, for all , , and ,
[TABLE]
Proof.
Let be a smooth function with for and compactly supported in . We show
[TABLE]
By taking the complex norm and applying Hölder’s inequality with , this yields
[TABLE]
This proves (2.29) after taking the supremum over , using the rapid decay of , and noticing that and our assumption on imply that
[TABLE]
Since is of the form , it suffices by linearity to establish (2.30) for a single , i.e.,
[TABLE]
Using the Poisson summation formula, the right-hand side can be rewritten as
[TABLE]
where we made the change of variable . The term is equal to since for by the choice of . The other terms () are all equal to [math] since falls outside the support of for . This proves (2.33) and the proposition. ∎
Proposition 2.7 implies five important corollaries to tackle the maximum of and of Dirichlet polynomials. We first observe that the discretization applies to in Corollary 2.9. This is a consequence of the following approximation.
Lemma 2.8** (Approximation of ).**
Let and be given, and let be an integer. Then, as and for , we have
[TABLE]
where the smoothing is defined by setting
[TABLE]
where . Examples of graphs for are provided in Figure 2.
Proof.
The case is a trivial consequence of the fact that . Therefore, assume . We claim that,
[TABLE]
First it is easy to check that this formula holds for and : if then we shift the contour towards and collect a single pole with residue at , while if then we shift the contour towards and we see that the integral is zero. In the remaining intermediate range we expand
[TABLE]
and we use the fact that,
[TABLE]
Therefore,
[TABLE]
We now shift the contour to the line . We collect a pole at with residue . On the line , we bound the integral using the estimate , which is valid for any fixed and all .222This estimate follows from applying the functional equation for , bounding the ratio of Gamma factors using Stirling’s formula and bounding trivially by . Specifically, the contribution of the line is bounded by
[TABLE]
This proves that
[TABLE]
The conclusion follows by a simple rescaling. ∎
From Lemma 2.8, we derive the following discretization result.
Corollary 2.9**.**
Let , and be given. For any and all ,
[TABLE]
Proof.
This is a consequence of the approximation in Lemma 2.8 with and , and the discretization in Proposition 2.7 with . ∎
As a consequence, we get a suboptimal upper bound for using the second moment. Note that this bound also works for dependent on .
Corollary 2.10**.**
Let be given and let be an integer. Then, for any , possibly dependent on , we have
[TABLE]
Proof.
The Dirichlet polynomial in (2.44) is , so the probability is just zero when . Therefore, we assume that . By the approximation in Lemma 2.8, it suffices to prove
[TABLE]
By applying Markov’s inequality and Corollary 2.9 with , the probability in (2.45) is
[TABLE]
Using a standard second moment bound, see e.g. (Titchmarsh, 1986, p.141), the last two expectations are . We conclude that the right-hand side of (2.46) is
[TABLE]
since . ∎
A similar reasoning using Markov’s inequality can be applied to get an upper bound for the maximum of , . The bound below is suboptimal for and optimal for .
Corollary 2.11**.**
Let , and be given. Then,
[TABLE]
Proof.
We apply Markov’s inequality with exponent , and discretize as in (2.46) using Proposition 2.7 with . We then use moment estimates from Lemma A.4, with , to bound the expectations. ∎
When and , the bound (2.48) (and its analogue for ) needs to be refined by discarding the contribution of small primes. The result below directly implies that for and , the sharp upper bound for is since the effective variance is .
Corollary 2.12**.**
Let and be given. Then, for any and that satisfies , we have
[TABLE]
for some constant .
Proof.
For a lighter notation, write . (We keep the dependence on implicit, consistent with the probabilistic notation for random variables.) We have
[TABLE]
Let denote a generic natural integer. By Markov’s inequality, a moment estimate (Lemma A.4) and a prime number theorem estimate (Lemma A.1), we have
[TABLE]
With the choice , this probability is for some constant .
It remains to control the first probability on the right-hand side of (2.50). Let denote another natural integer to be chosen later. By applying Proposition 2.7, we get
[TABLE]
A short calculation, using moment estimates (Lemma A.4) followed by prime number theorem estimates (Lemma A.1), yields
[TABLE]
for some constant (to obtain the last inequality, note that ).
Then, by Markov’s inequality and the choice , we deduce
[TABLE]
for some constant . ∎
As before, the maximum of can be discretized by truncating the exponential.
Corollary 2.13**.**
Let and be given. Then, there exists a constant such that the event
[TABLE]
has probability .
Proof.
Define the event
[TABLE]
By Corollary 2.12, we have . By (2.4), for all , we also have
[TABLE]
Combining this with the approximation in Lemma 2.8 with and , we conclude that, for all and uniformly for ,
[TABLE]
where is a Dirichlet polynomial of length . Proposition 2.7 implies
[TABLE]
Together with (2.58), this concludes the proof. ∎
2.3 Proofs of the upper bounds
2.3.1 The case
Proof of Theorem 1.2 for .
By Markov’s inequality with exponent , we have
[TABLE]
If we choose , we get, by picking large enough in Corollary 2.9, that the right-hand side of the above equation is
[TABLE]
By applying Proposition 2.2 if (i.e., if ) and Proposition 2.1 if (i.e., if ), the expectation is bounded by . Therefore, the claim follows. ∎
Proof of Theorem 1.1 for .
For all , Markov’s inequality yields
[TABLE]
When , we have , so the right-hand side of (2.62) is by Proposition 2.2 for and by Proposition 2.1 for .
It remains to sharpen the bound in the case . We use the Lebesgue measure of high points. Let . Two successive applications of Markov’s inequality yield
[TABLE]
Again, the optimal bound is at . Using Proposition 2.2 for and Proposition 2.1 for and choosing , we conclude that this is for .
We now partition the integral according to the value of the integrand. Let be an integer and . Theorem 1.2 (for ) and the above imply that, with probability ,
[TABLE]
For , the last term dominates and, in particular, the above is bounded by
[TABLE]
provided that is chosen sufficiently large with respect to , and . ∎
Remark**.**
In the above proof, we could have handled all ’s using the Lebesgue measure of high points in the spirit of a Gibbs variational principle. We chose to prove the case directly as the proof is straightforward.
2.3.2 The case
Proof of Theorem 1.2 for .
We notice that
[TABLE]
By Corollary 2.12, the last term is as . As in (2.56), let
[TABLE]
By Corollary 2.12 again, the probability of is . We let denote the subset of for which the conclusion of Corollary 2.13 holds. The probability of is . Then, by Markov’s inequality, we have
[TABLE]
By Corollary 2.13, and since , this is
[TABLE]
By Proposition 2.6, this is
[TABLE]
as needed. ∎
Proof of Theorem 1.1 for .
Similarly to (2.66), we can restrict the integrand to as follows
[TABLE]
As in (2.67), the probability is , and by Markov’s inequality, we have
[TABLE]
By Proposition 2.6, the above is
[TABLE]
This bound proves the claim for .
It remains to refine the bound for the case . This proceeds in the same way as in the proof of Theorem 1.1 in the case , with replaced by restricted on the event . Namely, we have, for ,
[TABLE]
This is by Proposition 2.6 with the optimal choice . The remainder is done exactly as in the proof of Theorem 1.1 in the case , by partitioning the integral over values of the integrand in the range . ∎
3 Lower bounds
In this section, we prove:
Proposition 3.1**.**
Let and be given. Then,
[TABLE]
Proposition 3.2**.**
Let , and be given. Then,
[TABLE]
The lower bound for the maximum will be an easy consequence of the lower bound for the moments. The idea is to approximate zeta by an appropriate Dirichlet polynomial. This can be done with good precision off-axis, cf. Section 3.1. The approximation to a Dirichlet polynomial is then shown in Section 3.2. The lower bound for the moments of the Dirichlet polynomials is proved in Section 3.3 using Kistler’s multiscale second moment method. Finally, the two propositions above are proved in Section 3.4.
3.1 Reduction off-axis
In Arguin et al. (2019), the maximum on a short interval of the critical line was compared to the one on a short interval away from the critical line by exploiting the analyticity of away from its pole. More precisely, a value off-axis can be seen as an average of zeta over the critical line weighed by the corresponding Poisson kernel. This approach could also be used in the case of the moments by using the subharmonicity of the function . We choose to apply a different method based on the following convexity theorem of Gabriel, which handles error terms more efficiently.
Proposition 3.3** (Theorem 2 of Gabriel (1927) in the special case ).**
Let be a complex valued function which is analytic in the strip . Suppose that tends to zero as , uniformly for . Then, for any and any ,
[TABLE]
where
[TABLE]
This theorem has the following useful consequence.
Corollary 3.4**.**
Let be a complex valued function which is analytic in the strip . Suppose that tends to zero as , uniformly for . Suppose also that as . Then, for any and any ,
[TABLE]
Proof.
Let be such that
[TABLE]
Note that because of the assumption that as , the above has a finite value. Let be given. If , then we are done. If , then by Proposition 3.3 applied with , and , we get
[TABLE]
for some appropriate that satisfy .
Therefore, by definition of in (3.6),
[TABLE]
and hence . Since , we get . The claim follows from (3.6). ∎
We now construct a special analytic approximation for the indicator function of the rectangle for . The effective width of the indicator function will be in the statement below.
Lemma 3.5**.**
Let and be given. There exists an entire function such that, for with and ,
- (i)
For , uniformly in , 2. (ii)
For any , 3. (iii)
For any , 4. (iv)
* uniformly in as .*
Proof.
Let be a smooth function, compactly supported in and such that . Given a parameter and given with and , consider the following function:
[TABLE]
Then defines an entire function of exponential type. By integration by parts, we see that
[TABLE]
for any and uniformly in . Therefore, we may think of as localizing to . Furthermore, notice that if and , then
[TABLE]
and for , we have by a Taylor expansion of the exponential,
[TABLE]
Finally, for with , we have from (3.10) that
[TABLE]
The candidate function is for ,
[TABLE]
We will now describe some of the features of this function. Write with . Using the bound (3.13), we see that, if with , then
[TABLE]
This gives the first claim.
If , then by (3.14) and (3.12), we have
[TABLE]
It follows that if and , then due to the rapid decay of , we have
[TABLE]
by Fourier inversion and the assumption that . This proves the second claim. If and , then we have the bound
[TABLE]
which proves the third claim.
Finally, notice that uniformly as by (3.10), which implies the last claim that uniformly in as . ∎
The following proposition relates the moments off and on axis.
Proposition 3.6**.**
Let , , and be given. Then, for all , the event
[TABLE]
has probability .
Proof.
Let
[TABLE]
with and a fixed integer. Using the approximation in Lemma 2.8, we have, for and ,
[TABLE]
Therefore, it suffices to establish (3.19) for replaced by :
[TABLE]
Consider
[TABLE]
with and . Then, by Lemma 3.5 (i) and (iv), Corollary 3.4 can be applied and yields
[TABLE]
Now, it remains to un-smooth both sides of this expression. Lemma 3.5 (ii) (with ) implies that for . We thus have
[TABLE]
settling the left-hand side of (3.22). For the right-hand side, note that the choice and ensures that the error term in Lemma 3.5 is for . Lemma 3.5 (iii) (with ) shows that the right-hand side of (3.24) is
[TABLE]
where . By Corollary 2.10 and a union bound, the event
[TABLE]
has probability . Moreover, by Lemma 3.5 (i) with and , we have, for all ,
[TABLE]
Therefore, on the event , and for every integer , the following holds
[TABLE]
Thus, on , the contribution of the sum on the right-hand side of (3.26) is negligible. The claim follows by combining Equations (3.24), (3.25) and (3.26). ∎
3.2 Mollification
This step is an adaptation of Section 4.2 of Arguin et al. (2019), which is itself based on the work of Radziwiłł and Soundararajan (2017). The treatment is slightly different as the width of the interval needs to be taken into account. Also, we choose to use the discretization in Proposition 2.7 to obtain a uniform control on the interval as opposed to a Sobolev inequality.
The main idea is to define a mollifier for the zeta function
[TABLE]
where
[TABLE]
Here denotes the Möbius function if is square-free, where is the number of distinct prime factors, and if is non-square-free. The estimate will be done slightly off-axis:
[TABLE]
The parameter will eventually be assumed to be large enough depending on , and .
The goal of this section is to prove that is an approximate inverse of :
Lemma 3.7**.**
Let and be given. Then,
[TABLE]
This was proved in the case in Lemma 4.2 of Arguin et al. (2019). In particular, it also holds verbatim for since the interval is just smaller. The proof of Lemma 3.7 also holds in the case with slight modifications that we highlight. The key idea is the following -control:
Lemma 3.8**.**
Let be given. Then,
[TABLE]
Proof.
The proof follows Arguin et al. (2019) with a new error term due to the choice of . (The manipulations are very similar to the ones in Lemma 2.4.) The error appears after Equation (4.10) in Arguin et al. (2019) and is given by
[TABLE]
The Euler product is using Lemma A.1. Using this and the definition of in (3.31) yields
[TABLE]
Since , this gives the correct estimate. Note that the expression entering in the remainder of the proof of Lemma 4.2 in Arguin et al. (2019) is
[TABLE]
This ends the proof. ∎
Proof of Lemma 3.7 for .
By Lemma 2.8, is well approximated by a Dirichlet polynomial of length for any given . Moreover, is a Dirichlet polynomial of length less than for any given . Therefore, an application of Markov’s inequality and Proposition 2.7 yield that the probability in (3.33) is
[TABLE]
The conclusion follows from Lemma 3.8. ∎
3.3 Approximation of the mollifier
We now approximate the mollifier by the exponential of a Dirichlet polynomial. If we let
[TABLE]
then the following relation between and holds for all :
[TABLE]
In particular, we see that and only differ for integers with more than prime factors () and all their prime factors . The following lemma make use of this fact to estimate how close they are when .
Lemma 3.9**.**
Let be given. Then, for any , we have
[TABLE]
Proof.
The discretization in Proposition 2.7 together with the mean value theorem in Lemma A.2 yield
[TABLE]
The right-hand side is by Rankin’s trick and Lemma A.1:
[TABLE]
The result follows by Markov’s inequality. ∎
3.4 Proofs of the lower bounds
Consider, for , the Dirichlet polynomials
[TABLE]
We choose a probabilistic notation for the increments ’s seen as random variable, omitting the dependence on the random . We first prove a lower bound for the moments of Dirichlet polynomials.
Proposition 3.10**.**
Let and be given. Then,
[TABLE]
The polynomial is not included in the sum to ensure that the variances of the ’s are almost equal. Indeed, for all and , an application of (A.6) yields
[TABLE]
since . The polynomial is ignored to ensure that the polynomials are almost independent for ’s that are far apart, which will be crucial for the second-moment method to go through; see below (3.63) in the proof of Proposition 3.10.
Proof of Proposition 3.10.
This is similar to the upper bound proof of Theorem 1.1. We first relate the moments to the measure of high points. Let and , and set
[TABLE]
Consider for , and the good event
[TABLE]
We will show below that is . Before, we prove the lower bound on the moments on the event . We have
[TABLE]
By the continuity of the function , Equation (3.49) implies that, on the event and for large enough with respect to and ,
[TABLE]
When , take small enough so that . The maximum is attained at , in which case the right-hand side of (3.50) is equal to . When , the maximum is attained at , in which case the right-hand side of (3.50) is equal to . Thus, on the event and for large enough, the lower bound in (3.45) is satisfied.
To conclude the proof of the proposition, it remains to show that as . By the upper bound on the maximum of in (2.48) (and the remark below it for ), it is sufficient to prove that, for all and all , the event
[TABLE]
has probability .
Consider
[TABLE]
For , Corollary 2.12 ensures that the primes up to only make a very small contribution, namely the event
[TABLE]
has probability . We consider the random variable
[TABLE]
where
[TABLE]
By summing the ’s, it is not hard to check that the intersection of the events and the one in (3.53) is included in the event in (3.51). Therefore, the proof of the proposition is reduced to show
[TABLE]
This is established by the Paley-Zygmund inequality.
To this aim, we shall need one-point and two-point large deviation estimates for the event
[TABLE]
The next two propositions are stated as Propositions 5.4 and 5.5 in Arguin et al. (2019). They are consequences of the Gaussian moments in Lemma A.3.
Proposition 3.11** (One-point large deviation estimates).**
Consider the event in (3.57). For any choices of where , and uniformly for , we have
[TABLE]
In the case of two points , the primes are essentially correlated up to and quickly decorrelate afterwards. For , this means that the ’s are essentially independent whenever , since is excluded. For , we must exclude the ’s up to . Therefore, the ’s are essentially independent whenever . We get:
Proposition 3.12** (Two-point large deviation estimates).**
Consider the event in (3.57). For any choices of , and uniformly for such that , we have
[TABLE]
Furthermore, let . Then, uniformly for such that , we have
[TABLE]
Now, in order to prove (3.56), we start by finding a lower bound on . By (3.58), the ’s in (3.55) and the ’s in (3.46), we have
[TABLE]
assuming that is large enough with respect to , and . By the Paley-Zygmund inequality, this implies
[TABLE]
It remains to show . With , Fubini’s theorem yields
[TABLE]
The integral can be divided into parts:
[TABLE]
The dominant term will be the one on . Note that . Hence, by (3.59), we have
[TABLE]
By (3.60) and the estimate (3.61), the integral on is
[TABLE]
assuming that is large enough with respect to and . For , the integral on is, by (3.60) and the estimate (3.61),
[TABLE]
assuming again that is large enough with respect to , and . Since , the right-hand side of (3.67) is \mathrm{o}\big{(}(\mathbb{E}[\mathcal{N}])^{2}\big{)} if we fix small enough with respect to and . Similarly, by (3.58) and the estimate (3.61), the integral on is
[TABLE]
provided that is small enough with respect to and , and is large enough with respect to , and . This concludes the proof of Proposition 3.10. ∎
Putting all the work of Section 3 together, we can prove the lower bound in Theorem 1.1.
Proof of Proposition 3.2.
By Proposition 3.6, the probability in (3.2) is
[TABLE]
By Lemma 3.7 and Lemma 3.9, the above is
[TABLE]
Now, notice that the (double) sum for in is of order one (uniformly for ), and that the sum for is of negligible order:
[TABLE]
where we use the discretization from Proposition 2.7 and the moment estimates from Lemma A.4. Indeed, the right-hand side of (3.71) is with the choice and . Hence, can be replaced by with an error less than with probability , meaning that the right-hand side of (3.70) is
[TABLE]
By (2.48), we may discard the terms with and with a similar error. For large enough with respect to , and , the probability in (3.72) is therefore
[TABLE]
Finally, the probability in (3.73) tends to as by Proposition 3.10. ∎
We now prove the lower bound in Theorem 1.2.
Proof of Proposition 3.1.
From (1.8), we have that when . Thus, on the event in the statement of Proposition 3.2 (which has probability ), and for large enough with respect to and , we have
[TABLE]
This ends the proof. ∎
Appendix A Useful estimates
The prime number theorem yields estimates on the sum of primes with a good error.
Lemma A.1**.**
Let , then
[TABLE]
Also, for ,
[TABLE]
Proof.
For (A.1), see Lemma A.1 in Arguin and Ouimet (2019) and Lemma 2.1 in Arguin, Belius and Harper (2017). For (A.2), see p.20 in Harper (2013b). ∎
The next three results yield moment estimates for Dirichlet polynomials. The first one is an elementary bound. The second ensures that moments of Dirichlet polynomials that are not too high are approximately Gaussian.
Lemma A.2** (Lemma 3.3 in Arguin et al. (2019)).**
For any complex numbers and , and for , we have
[TABLE]
Lemma A.3** (Lemma 3.4 in Arguin et al. (2019)).**
Let be a real number, and suppose that for primes , is a complex number with . Then, for any ,
[TABLE]
where denotes the modified Bessel function of the first kind of order [math]. In particular, the expression is for odd .
The relation with Gaussian moments in the case where is obtained by expanding the product to get
[TABLE]
where is analytic in a neighborhood of [math] with and any derivative of a fixed order is bounded by uniformly in . In particular, this implies that, for and small enough so that ,
[TABLE]
The above also holds if for (say) with the sum over primes restricted to . In particular, the error can be made by taking large. We note that the moments yield a Gaussian tail
[TABLE]
by picking the moment with , for not too large.
Finally the third estimate is a cruder version of the Gaussian moment estimates that yields quick upper bounds on moments.
Lemma A.4** (Lemma 3 in Soundararajan (2009)).**
Let be large, and let . Let be a natural number such that . For any complex numbers , we have
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Arguin, Belius and Bourgade (2017) {barticle} [author] \bauthor \bsnm Arguin, \bfnm L. P. \binits L. P., \bauthor \bsnm Belius, \bfnm D. \binits D. and \bauthor \bsnm Bourgade, \bfnm P. \binits P. ( \byear 2017). \btitle Maximum of the characteristic polynomial of random unitary matrices. \bjournal Comm. Math. Phys. \bvolume 349 \bpages 703–751. \bmrnumber 3594368 \endbibitem
- 2Arguin, Belius and Harper (2017) {barticle} [author] \bauthor \bsnm Arguin, \bfnm L. P. \binits L. P., \bauthor \bsnm Belius, \bfnm D. \binits D. and \bauthor \bsnm Harper, \bfnm A. J. \binits A. J. ( \byear 2017). \btitle Maxima of a randomized Riemann zeta function, and branching random walks. \bjournal Ann. Appl. Probab. \bvolume 27 \bpages 178–215. \bmrnumber 3619786 \endbibitem
- 3Arguin, Bourgade and Radziwiłł (2020) {barticle} [author] \bauthor \bsnm Arguin, \bfnm L. P. \binits L. P., \bauthor \bsnm Bourgade, \bfnm P. \binits P. and \bauthor \bsnm Radziwiłł, \bfnm M. \binits M. ( \byear 2020). \btitle The Fyodorov-Hiary-Keating conjecture I. \bjournal Preprint \bpages 1–49. \bnote ar Xiv:2007.00988 . \endbibitem
- 4Arguin, Dubach and Hartung (2021) {barticle} [author] \bauthor \bsnm Arguin, \bfnm L. P. \binits L. P., \bauthor \bsnm Dubach, \bfnm G. \binits G. and \bauthor \bsnm Hartung, \bfnm L. \binits L. ( \byear 2021). \btitle Maxima of a random model of the Riemann zeta function over intervals of varying length. \bjournal Preprint \bpages 1–26. \bnote ar Xiv:2103.04817 . \endbibitem
- 5Arguin and Ouimet (2019) {barticle} [author] \bauthor \bsnm Arguin, \bfnm L. P. \binits L. P. and \bauthor \bsnm Ouimet, \bfnm F. \binits F. ( \byear 2019). \btitle Large deviations and continuity estimates for the derivative of a random model of log | ζ | 𝜁 \log|\zeta| on the critical line. \bjournal J. Math. Anal. Appl. \bvolume 472 \bpages 687–695. \bmrnumber 3906393 \endbibitem
- 6Arguin and Tai (2019) {bincollection} [author] \bauthor \bsnm Arguin, \bfnm L. P. \binits L. P. and \bauthor \bsnm Tai, \bfnm W. \binits W. ( \byear 2019). \btitle Is the Riemann zeta function in a short interval a 1-RSB spin glass ? In \bbooktitle Sojourns in Probability Theory and Statistical Physics - I. \bseries Springer Proceedings in Mathematics & Statistics \bpages 63–88. \bpublisher Springer Singapore. \bnote doi:10.1007/978-981-15-0294-1 . \endbibitem · doi ↗
- 7Arguin et al. (2019) {barticle} [author] \bauthor \bsnm Arguin, \bfnm L. P. \binits L. P., \bauthor \bsnm Belius, \bfnm D. \binits D., \bauthor \bsnm Bourgade, \bfnm P. \binits P., \bauthor \bsnm Radziwiłł, \bfnm M. \binits M. and \bauthor \bsnm Soundararajan, \bfnm K. \binits K. ( \byear 2019). \btitle Maximum of the Riemann zeta function on a short interval of the critical line. \bjournal Comm. Pure Appl. Math. \bvolume 72 \bpages 500–535. \bmrnumber 3911893 \endbibitem
- 8Bailey and Keating (2019) {barticle} [author] \bauthor \bsnm Bailey, \bfnm E. C. \binits E. C. and \bauthor \bsnm Keating, \bfnm J. P. \binits J. P. ( \byear 2019). \btitle On the moments of the moments of the characteristic polynomials of random unitary matrices. \bjournal Comm. Math. Phys. \bvolume 371 \bpages 689–726. \bmrnumber 4019917 \endbibitem
