Moments of zeta and correlations of divisor-sums: V

Brian Conrey; Jonathan P. Keating

arXiv:1701.06651·math.NT·September 26, 2018

TL;DR

This paper advances the understanding of the moments of the Riemann zeta-function by analyzing Type II sums using circle method techniques to derive precise asymptotics for divisor sum correlations.

Contribution

It completes the analysis of Type II sums, providing a comprehensive framework for calculating lower order terms in zeta moments using divisor correlations.

Findings

01. Derived asymptotic formulas for divisor sum correlations

02. Completed the analysis of Type II sums in zeta moments

03. Enhanced methods for calculating moments of the zeta-function

Abstract

In this series of papers we examine the calculation of the $2k$th moment and shifted moments of the Riemann zeta-function on the critical line using long Dirichlet polynomials and divisor correlations. The present paper completes the general study of what we call Type II sums, which utilize a circle method framework and a convolution of shifted convolution sums to obtain all of the lower order terms in the asymptotic formula for the mean square along $[T,2T]$ of a Dirichlet polynomial of arbitrary length with divisor functions as coefficients.

Equations

D_{A}(s) := \prod_{\alpha \in A} \zeta(s + \alpha) = \sum_{n=1}^{\infty} \frac{\tau_{A}(n)}{n^{s}};

I_{A,B}^{\psi}(T) = \int_{0}^{\infty} \psi\left(\frac{t}{T}\right) D_{A}(s) D_{B}(1-s) \, dt

D_{A}(s; X) = \sum_{n \leq X} \frac{\tau_{A}(n)}{n^{s}}

D_{A}(s; X)

R_{A}(s; X)

I^{\psi}_{A,B}(T;X) := \int_{0}^{\infty} \psi\left(\frac{t}{T}\right) \big(D_{A}(s;X) - R_{A}(s;X)\big) \big(D_{B}(1-s;X) - R_{B}(1-s;X)\big) \, dt

D_{A}(s; X) - R_{A}(s; X) = \frac{1}{2\pi i} \int_{(\epsilon)} \frac{X^{w}}{w} D_{A_{w}}(s) \, dw

I_{A,B}^{\psi}(T; X) = \frac{1}{(2\pi i)^{2}} \iint_{(\epsilon),(\epsilon)} \frac{X^{z+w}}{zw} I_{A_{w},B_{z}}^{\psi}(T) \, dw \, dz

A_{w} := \{\alpha + w : \alpha \in A\},

I_{A,B}^{\psi}(T) = T \int_{0}^{\infty} \psi(t) \sum_{\substack{U \subset A,\, V \subset B \\ |U| = |V|}} \left(\frac{tT}{2\pi}\right)^{-\sum_{\substack{\hat{\alpha} \in U \\ \hat{\beta} \in V}} (\hat{\alpha} + \hat{\beta})} \mathcal{B}(A - U + V^{-}, B - V + U^{-}, 1) \, dt + o(T)

\mathcal{B}(A, B, s) := \sum_{n=1}^{\infty} \frac{\tau_{A}(n) \tau_{B}(n)}{n^{s}}.

I_{A,B}^{\psi}(T; X)

T^{\ell} \leq X < T^{\ell+1}.

I_{A,B}^{\psi}(T; X)

\frac{1}{T} \int_{0}^{\infty} \psi\left(\frac{t}{T}\right) D_{A}(s; X) D_{B}(1-s; X) \, dt

\mathcal{O}_{A_{1},\dots,A_{\ell};B_{1},\dots,B_{\ell}}(T; X) := \sum_{\substack{m_{1} m_{2} \dots m_{\ell} \leq X \\ n_{1} n_{2} \dots n_{\ell} \leq X}} \frac{\tau_{A_{1}}(m_{1}) \dots \tau_{A_{\ell}}(m_{\ell}) \tau_{B_{1}}(n_{1}) \dots \tau_{B_{\ell}}(n_{\ell})}{m_{1} \dots m_{\ell} n_{1} \dots n_{\ell}} \hat{\psi}\left(\frac{T}{2\pi} \log \frac{m_{1} \dots m_{\ell}}{n_{1} \dots n_{\ell}}\right).

M_{1} \dots M_{\ell} = N_{1} \dots N_{\ell}.

h_{j} := m_{j} N_{j} - n_{j} M_{j}.

n_{1} \dots n_{\ell} M_{1} \dots M_{\ell} = (m_{1} N_{1} - h_{1}) \dots (m_{\ell} N_{\ell} - h_{\ell})

\frac{n_{1} \dots n_{\ell}}{m_{1} \dots m_{\ell}} = \left(1 - \frac{h_{1}}{m_{1} N_{1}}\right) \dots \left(1 - \frac{h_{\ell}}{m_{\ell} N_{\ell}}\right)

\log \frac{n_{1} \dots n_{\ell}}{m_{1} \dots m_{\ell}} = -\frac{h_{1}}{m_{1} N_{1}} - \dots - \frac{h_{\ell}}{m_{\ell} N_{\ell}} + O\left(\frac{h_{i} h_{j}}{m_{1} \dots m_{\ell} N_{1} \dots N_{\ell}}\right).

\sum_{\substack{M_{1} \dots M_{\ell} = N_{1} \dots N_{\ell} \\ (M_{j}, N_{j}) = 1}} \sum_{h_{1},\dots,h_{\ell}} \sum_{\substack{m_{1} \dots m_{\ell} \leq X \\ (*_{1}),\dots,(*_{\ell})}} \frac{\tau_{A_{1}}(m_{1}) \dots \tau_{A_{\ell}}(m_{\ell}) \tau_{B_{1}}(n_{1}) \dots \tau_{B_{\ell}}(n_{\ell})}{m_{1} \dots m_{\ell} n_{1} \dots n_{\ell}} \hat{\psi}\left(\frac{T h_{1}}{2\pi m_{1} N_{1}} + \dots + \frac{T h_{\ell}}{2\pi m_{\ell} N_{\ell}}\right)

(*_{j}) : m_{j} N_{j} - n_{j} M_{j} = h_{j}.

\mathcal{O}_{A_{1},\dots,A_{\ell};B_{1},\dots,B_{\ell}}^{\dagger}(T; X) :=

\mathcal{O}_{\ell}(T; X) :=

w_{\ell} = \ell!^{2} \ell^{2k - 2\ell}

I_{A,B}^{\psi}(T; X) = T \mathcal{O}_{\ell}(T; X) + O(T^{1-\delta}).

\langle \tau_{A}(m) \tau_{B}(n) \rangle_{m=u}^{(*)} \sim \frac{1}{M} \sum_{q=1}^{\infty} r_{q}(h) \langle \tau_{A}(m) e(mN/q) \rangle_{m=u} \langle \tau_{B}(n) e(nM/q) \rangle_{n=\frac{uN}{M}}

\langle \tau_{A}(m) e(mN/q) \rangle_{m=u} = \frac{1}{2\pi i} \int_{|w-1|=\epsilon} D_{A}\left(w, e\left(\frac{N}{q}\right)\right) u^{w-1} \, dw

D_{A}\left(w, e\left(\frac{N}{q}\right)\right) = \sum_{n=1}^{\infty} \frac{\tau_{A}(n) e(nN/q)}{n^{w}}.

Full text

Brian Conrey

American Institute of Mathematics, 360 Portage Ave, Palo Alto, CA 94306, USA and School of Mathematics, University of Bristol, Bristol BS8 1TW, UK

and

Jonathan P. Keating

School of Mathematics, University of Bristol, Bristol BS8 1TW, UK

We gratefully acknowledge support under EPSRC Programme Grant EP/K034383/1 LMF: $L$ -Functions and Modular Forms. Research of the first author was also supported by the American Institute of Mathematics and by a grant from the National Science Foundation. JPK is grateful for the following additional support: a Royal Society Wolfson Research Merit Award, a Royal Society Leverhulme Senior Research Fellowship, a grant from the Air Force Office of Scientific Research, Air Force Material Command, USAF (number FA8655-10-1-3088), and ERC Advanced Grant 740900 (LogCorRM). He is also pleased to thank the American Institute of Mathematics for hospitality during two visits when work on this project was conducted. Finally, we are grateful to the referees for their careful reading of the manuscript and for extremely helpful comments and suggestions.

1. Introduction

This paper is part V of a sequence devoted to understanding how to conjecture all of the integral moments of the Riemann zeta-function from a number theoretic perspective. The method is to approximate $\zeta(s)^{k}$ by a long Dirichlet polynomial and then to compute the mean square of the Dirichlet polynomial (cf. [GG]). There are many off-diagonal terms, and the careful treatment of these is the concern of these papers. In particular it is necessary to treat the off-diagonal terms by a method invented by Bogomolny and Keating [BK1, BK2]. Our perspective on this method is that it is most properly viewed as a multi-dimensional Hardy-Littlewood circle method.

In previous papers [CK1, CK3] we have developed a general method to calculate what were called type-I off-diagonal contributions in [BK1, BK2]; these are the off-diagonal terms usually considered in number-theoretic computations, e.g. in [GG]. In parts II [CK2] and IV [CK4] we considered the simplest of the type-II off-diagonal terms (in the terminology of [BK1, BK2]). These, somewhat unexpectedly, give a significant contribution in certain cases. They have not previously been analysed systematically. Our purpose here is to develop a general method for computing all type-II off-diagonal terms.

The formula we obtain is in complete agreement with all of the main terms predicted by the recipe of [CFKRS] (and in particular, with the leading order term conjectured in [KS]).

2. Shifted moments

Let $A$ and $B$ be two sets of cardinality $k$ containing “shifts” which may be thought of as parameters of size $\ll\frac{1}{\log T}$ where $T$ is the basic large parameter near where we want to know the average of a product of $\zeta$-functions. Let

$$D_{A}(s):=\prod_{\alpha\in A}\zeta(s+\alpha)=\sum_{n=1}^{\infty}\frac{\tau_{A}(n)}{n^{s}};$$

this implicitly defines the arithmetic functions $\tau_{A}(n)$. A basic question (the moment problem) is to evaluate

$$I_{A,B}^{\psi}(T)=\int_{0}^{\infty}\psi\left(\frac{t}{T}\right)D_{A}(s)D_{B}(1-s)\,dt$$

where $\psi$ is a smooth function with compact support, say $\psi\in C^{\infty}[1,2]$, and $s=1/2+it$. The technique we are developing in this sequence of papers is to approach the moment problem through long Dirichlet polynomials. To this end we let

$$D_{A}(s;X)=\sum_{n\leq X}\frac{\tau_{A}(n)}{n^{s}}$$

be the truncated Dirichlet series for $D_{A}(s)$. By Perron’s formula for $\Re s>1/4$ we have

[TABLE]

where

[TABLE]

In this sequence of papers we consider

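$$I^{\psi}_{A,B}(T;X):=\int_{0}^{\infty}\psi\left(\frac{t}{T}\right)\big(D_{A}(s;X)-R_{A}(s;X)\big)\big(D_{B}(1-s;X)-R_{B}(1-s;X)\big)\,dt$$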

for various ranges of $X$. The expectation is that if $X>T^{k}$ then this will be asymptotically equal to $I_{A,B}^{\psi}(T)$. (Footnote 1: In the earlier papers we did not include the terms $R_{A}$ and $R_{B}$ because their contributions were negligible since $X$ was not large. In general these terms do not create extra difficulties and so they will be ignored here as well.)

We calculate this average in two different (conjectural) ways: the first is via the recipe and the second is via the delta method applied to the correlations of shifted divisor functions. We show that these two methods produce identical detailed main terms.

To use the recipe of [CFKRS] to conjecture a formula for $I_{A,B}^{\psi}(T;X)$, we start with

$$D_{A}(s;X)-R_{A}(s;X)=\frac{1}{2\pi i}\int_{(\epsilon)}\frac{X^{w}}{w}D_{A_{w}}(s)\,dw$$

so that

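$$I_{A,B}^{\psi}(T;X)=\frac{1}{(2\pi i)^{2}}\iint_{(\epsilon),(\epsilon)}\frac{X^{z+w}}{zw}I_{A_{w},B_{z}}^{\psi}(T)\,dw\,dz$$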

where

$$A_{w}:=\{\alpha+w:\alpha\in A\},$$

i.e. the set $A$ but with all its elements shifted by $w$. From the recipe [CFKRS] we expect that

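$$I_{A,B}^{\psi}(T)=T\int_{0}^{\infty}\psi(t)\sum_{\substack{U\subset A,\,V\subset B\\ |U|=|V|}}\left(\frac{tT}{2\pi}\right)^{-\sum_{\substack{\hat{\alpha}\in U\\ \hat{\beta}\in V}}(\hat{\alpha}+\hat{\beta})}\mathcal{B}(A-U+V^{-},B-V+U^{-},1)\,dt+o(T)$$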

where

$$\mathcal{B}(A,B,s):=\sum_{n=1}^{\infty}\frac{\tau_{A}(n)\tau_{B}(n)}{n^{s}}.$$

We have used an unconventional notation here; by $A-U+V^{-}$ we mean the following: start with the set $A$, remove the elements of $U$, and then include the negatives of the elements of $V$. We think of the process as “swapping” equal numbers of elements between $A$ and $B$; when elements are removed from $A$ and put into $B$ they first get multiplied by $-1$. We keep track of these swaps with our equal-sized subsets $U$ and $V$ of $A$ and $B$; and when we refer to the “number of swaps” in a term we mean the cardinality $|U|$ of $U$ (or, since they are of equal size, of $V$). We insert this conjecture and expect that

[TABLE]

We have done a little simplification here: instead of writing $U\subset A_{w}$ we have written $U\subset A$ and changed the exponent of $(tT/2\pi)$ accordingly.
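The swap notation $A-U+V^{-}$ can be made concrete with a small sketch. The function names `swap` and `all_swaps` below are hypothetical and not taken from the paper; shifts are stored in lists so that repeated values are handled as in a multiset, and the enumeration is restricted to at most `max_swaps` swaps.

```python
from itertools import combinations

def swap(A, B, U, V):
    """Return (A - U + V^-, B - V + U^-) for equal-sized U from A and V from B."""
    if len(U) != len(V):
        raise ValueError("U and V must have the same cardinality")
    new_A = list(A)
    for u in U:
        new_A.remove(u)                 # remove the elements of U from A
    new_A += [-v for v in V]            # include the negatives of the elements of V
    new_B = list(B)
    for v in V:
        new_B.remove(v)                 # remove the elements of V from B
    new_B += [-u for u in U]            # include the negatives of the elements of U
    return new_A, new_B

def all_swaps(A, B, max_swaps):
    """Enumerate every swap with |U| = |V| <= max_swaps."""
    results = []
    for size in range(max_swaps + 1):
        for U in combinations(A, size):
            for V in combinations(B, size):
                results.append(swap(A, B, U, V))
    return results
```

For two sets of cardinality 2, allowing up to two swaps gives 1 + 4 + 1 = 6 terms, while allowing at most one swap gives 5.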

Notice that there is a factor $(X/T^{|U|})^{w+z}$ in the previous equation. As mentioned above we refer to $|U|$ as the number of “swaps” in the recipe, and now we see more clearly the role it plays; in the terms above for which $X<T^{|U|}$ we move the path of integration in $w$ or $z$ to $+\infty$ so that the factor $(X/T^{|U|})^{w+z}\to 0$ and the contribution of such a term is 0. Thus, the size of $X$ determines how many “swaps” we must keep track of. To account for this we introduce a parameter $\ell$ defined by

$$T^{\ell}\leq X<T^{\ell+1}.$$

Then the above may be rewritten as

[TABLE]

where we have restricted the sum to at most $\ell$ swaps.

Now we turn to the second approach via divisor correlations with the goal of obtaining this formula in a completely different way. In [CK1] and [CK3] we accomplished this in the situations where there were 0 or 1 swaps (i.e. when $X<T^{2}$ ). In [CK2] we considered two swaps but in a special case. In [CK4] we looked at the general case of two swaps. In this paper, which is the final paper of the sequence, we look at the general case with any number of swaps.

As an extension of the ideas in these papers, we have also begun to explore the analogous calculations for averages of ratios of the zeta function, specifically in the context of zero correlations [CK5, CK6].

The second method to obtain a conjecture for $I^{\psi}_{A,B}(T)$ will involve an intricate study of convolutions of shifted divisor problems and will occupy the rest of this paper. We begin that calculation by integrating term-by-term to obtain

[TABLE]

where $\hat{\psi}(x)=\int_{-\infty}^{\infty}\psi(t)e(-xt)\,dt$ (which equals $\int_{0}^{\infty}\psi(t)e(-xt)\,dt$ because of the support of $\psi$). Now let us assume that $\ell\leq k$, where $\ell$ is defined above. We partition $A$ and $B$ into $\ell$ non-empty sets $A=A_{1}\cup A_{2}\cup\dots\cup A_{\ell}$ and $B=B_{1}\cup B_{2}\cup\dots\cup B_{\ell}$. Then $\tau_{A}$ and $\tau_{B}$ are convolutions: $\tau_{A}=\tau_{A_{1}}*\tau_{A_{2}}*\dots*\tau_{A_{\ell}}$ and $\tau_{B}=\tau_{B_{1}}*\tau_{B_{2}}*\dots*\tau_{B_{\ell}}$. For any such partition, the right hand side of (2) is equal to

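$$\mathcal{O}_{A_{1},\dots,A_{\ell};B_{1},\dots,B_{\ell}}(T;X):=\sum_{\substack{m_{1}m_{2}\dots m_{\ell}\leq X\\ n_{1}n_{2}\dots n_{\ell}\leq X}}\frac{\tau_{A_{1}}(m_{1})\dots\tau_{A_{\ell}}(m_{\ell})\tau_{B_{1}}(n_{1})\dots\tau_{B_{\ell}}(n_{\ell})}{m_{1}\dots m_{\ell}n_{1}\dots n_{\ell}}\hat{\psi}\left(\frac{T}{2\pi}\log\frac{m_{1}\dots m_{\ell}}{n_{1}\dots n_{\ell}}\right).$$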

In other words $\mathcal{O}_{A;B}(T;X)=\mathcal{O}_{A_{1},\dots,A_{\ell};B_{1},\dots,B_{\ell}}(T;X)$ as long as $A$ and $B$ are the disjoint unions of the $A_{i}$ and $B_{j}$. Now we want to define a refinement of this sum. We impose a pairing of $A_{j}$ with $B_{j}$ and analyze this sum according to rational approximations to $m_{j}/n_{j}$. In this way, the ordering of the sets $A_{i},B_{j}$ now matters. The eventual evaluation of $\mathcal{O}_{A;B}(T;X)$ will involve a sum of these pairings, which we describe in detail in the next section.
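As an illustration of the convolution structure used above, the following sketch computes $\tau_{A}(n)$ directly from the Dirichlet series $\prod_{\alpha\in A}\zeta(s+\alpha)$ and checks numerically that $\tau_{A}=\tau_{A_{1}}*\tau_{A_{2}}$ for a partition of $A$. The names `tau` and `dirichlet_convolve` are hypothetical, and the recursive divisor loop is chosen for clarity rather than speed.

```python
def tau(shifts, n):
    """tau_A(n): the coefficient of n^(-s) in prod_{alpha in A} zeta(s + alpha)."""
    shifts = list(shifts)
    if not shifts:
        # Empty product: the Dirichlet series is identically 1.
        return 1.0 if n == 1 else 0.0
    alpha, rest = shifts[0], shifts[1:]
    # zeta(s + alpha) has coefficients d^(-alpha); peel off one factor at a time.
    return sum(d ** (-alpha) * tau(rest, n // d)
               for d in range(1, n + 1) if n % d == 0)

def dirichlet_convolve(f, g, n):
    """(f * g)(n) = sum over d | n of f(d) g(n / d)."""
    return sum(f(d) * g(n // d) for d in range(1, n + 1) if n % d == 0)

# Partition A into A_1 and A_2 and compare tau_A with tau_{A_1} * tau_{A_2}.
A = [0.1, -0.2, 0.05]
A1, A2 = [0.1], [-0.2, 0.05]
max_gap = max(
    abs(tau(A, n) - dirichlet_convolve(lambda d: tau(A1, d), lambda d: tau(A2, d), n))
    for n in range(1, 31)
)
```

With all shifts equal to zero, `tau([0, 0], n)` reduces to the ordinary divisor function $d(n)$.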

3. Type II convolution sums

There are various ways to decompose $A$ and $B$ and various ways to “pair” divisor functions $\tau_{A_{i}}$ and $\tau_{B_{j}}$ in preparation for the delta method.

More importantly, however, it turns out that there are various stratifications that also present themselves; basically one for each rational “direction.” If we ignore these then a simple application of the expected main terms from the delta-method analysis will lead us to the wrong main terms.

At first sight it seems that when we do this we are counting the same terms repeatedly. However, we believe that our situation is an example of Manin’s stratified subvarieties wherein counting solutions to high dimensional diophantine equations often involves identifying a collection of subvarieties on each of which the solutions are counted separately (by the delta method for example). The point is that the main terms of the delta method do not always count all of the solutions. This phenomenon was first identified in [FMT]; see, for example, [B] and [LT] for reviews of the subject.

Given $A_{1},\dots,A_{\ell}$ and $B_{1},\dots,B_{\ell}$ , the number of ways to pair each $A_{n}$ with a $B_{m}$ so that all are paired off is $\ell!$ . Let us consider the pairing of $A_{j}$ with $B_{j}$ . Now we think of $m_{j}/n_{j}$ as being approximated by a rational number $M_{j}/N_{j}$ with a small denominator for each of $j=1,2,\dots,\ell$ where $(M_{j},N_{j})=1$ . In this way we get subvarieties indexed by the rational directions $M_{j}/N_{j}$ with $1\leq j\leq\ell$ . We will use all directions $M_{j}/N_{j}$ subject to the natural conditions $(M_{j},N_{j})=1$ and

$$M_{1}\dots M_{\ell}=N_{1}\dots N_{\ell}.$$

We sum over all of the terms with $m_{j}/n_{j}$ close to $M_{j}/N_{j}$ . We introduce variables $h_{j}$ where, for a given $m_{j},n_{j},M_{j}$ and $N_{j}$ , we define

$$h_{j}:=m_{j}N_{j}-n_{j}M_{j}.$$

The rapid decay of $\hat{\psi}$ governs the ranges of all of the variables; see below.

We have

$$n_{1}\dots n_{\ell}M_{1}\dots M_{\ell}=(m_{1}N_{1}-h_{1})\dots(m_{\ell}N_{\ell}-h_{\ell})$$

so that for $M_{1}\dots M_{\ell}=N_{1}\dots N_{\ell}$ we have

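$$\frac{n_{1}\dots n_{\ell}}{m_{1}\dots m_{\ell}}=\left(1-\frac{h_{1}}{m_{1}N_{1}}\right)\dots\left(1-\frac{h_{\ell}}{m_{\ell}N_{\ell}}\right)$$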

and

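$$\log\frac{n_{1}\dots n_{\ell}}{m_{1}\dots m_{\ell}}=-\frac{h_{1}}{m_{1}N_{1}}-\dots-\frac{h_{\ell}}{m_{\ell}N_{\ell}}+O\left(\frac{h_{i}h_{j}}{m_{1}\dots m_{\ell}N_{1}\dots N_{\ell}}\right).$$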

The error term is negligible so we can arrange the sum as

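$$\sum_{\substack{M_{1}\dots M_{\ell}=N_{1}\dots N_{\ell}\\ (M_{j},N_{j})=1}}\sum_{h_{1},\dots,h_{\ell}}\sum_{\substack{m_{1}\dots m_{\ell}\leq X\\ (*_{1}),\dots,(*_{\ell})}}\frac{\tau_{A_{1}}(m_{1})\dots\tau_{A_{\ell}}(m_{\ell})\tau_{B_{1}}(n_{1})\dots\tau_{B_{\ell}}(n_{\ell})}{m_{1}\dots m_{\ell}n_{1}\dots n_{\ell}}\hat{\psi}\left(\frac{Th_{1}}{2\pi m_{1}N_{1}}+\dots+\frac{Th_{\ell}}{2\pi m_{\ell}N_{\ell}}\right)$$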

where

$$(*_{j}):\ m_{j}N_{j}-n_{j}M_{j}=h_{j}.$$
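The passage from $\log\frac{n_{1}\dots n_{\ell}}{m_{1}\dots m_{\ell}}$ to the linear combination of the $h_{j}/(m_{j}N_{j})$ can be checked numerically. The sketch below uses hypothetical names and illustrative values with $\ell=2$ and directions $2/3$ and $3/2$ (so that $M_{1}M_{2}=N_{1}N_{2}$); it verifies the product identity exactly with rational arithmetic and then compares the logarithm with its linearization.

```python
from fractions import Fraction
from math import log, prod

def ratio_and_linearization(directions, ms, hs):
    """directions: coprime pairs (M_j, N_j) with prod M_j = prod N_j.

    Returns the n_j solving m_j N_j - n_j M_j = h_j, the exact ratio
    prod n_j / prod m_j, the product prod (1 - h_j / (m_j N_j)), and the
    linear approximation -sum h_j / (m_j N_j) to the logarithm of the ratio.
    """
    ns = []
    for (M, N), m, h in zip(directions, ms, hs):
        numerator = m * N - h
        if numerator % M != 0:
            raise ValueError("h_j must make n_j an integer")
        ns.append(numerator // M)
    exact_ratio = Fraction(prod(ns), prod(ms))
    product_form = prod(Fraction(m * N - h, m * N)
                        for (M, N), m, h in zip(directions, ms, hs))
    linear = -sum(h / (m * N) for (M, N), m, h in zip(directions, ms, hs))
    return ns, exact_ratio, product_form, linear

# Illustrative values only: m_1 = 1000, m_2 = 1500, h_1 = -2, h_2 = -3.
ns, exact_ratio, product_form, linear = ratio_and_linearization(
    [(2, 3), (3, 2)], [1000, 1500], [-2, -3]
)
error = abs(log(exact_ratio) - linear)
```

In this example the two rational expressions agree exactly, and the linearization error is of the size of the squared terms $h_{j}^{2}/(m_{j}N_{j})^{2}$, which is far smaller than the linear term itself.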

We can replace $n_{j}$ in the denominator by $\frac{m_{j}N_{j}}{M_{j}}$ . Thus we are led to define

[TABLE]

Also, we define

[TABLE]

where the weight factor

$$w_{\ell}=\ell!^{2}\ell^{2k-2\ell}$$

in $\mathcal{O}_{\ell}(T;X)$ will be explained in a later section. Note that $\ell$ is defined in terms of $T$ and $X$ , so its inclusion in the notation is redundant. Now we can state

Conjecture 1.

Suppose that $k\leq\ell+1$ and $T^{\ell}\leq X<T^{\ell+1}$ . Then for some $\delta>0$ ,

$$I_{A,B}^{\psi}(T;X)=T\,\mathcal{O}_{\ell}(T;X)+O(T^{1-\delta}).$$

One way to view this paper is that it gives evidence for this conjecture. In particular, in the next few sections we will conjecturally understand $\mathcal{O}_{A_{1},\dots,A_{\ell};B_{1},\dots,B_{\ell}}^{\dagger}(T;X)$ by replacing the shifted divisor sums by what the delta-method leads us to expect for them. Then we evaluate the result and prove the rigorous theorem that our evaluation is precisely the quantity on the right-hand side of (2).

4. The case where $h_{1}\dots h_{\ell}\neq 0$

Let us first look at the situation where none of the $h_{j}$ is 0. The idea is to evaluate $\mathcal{O}_{A_{1},\dots,A_{\ell};B_{1},\dots,B_{\ell}}^{\dagger}(T;X)$ by replacing the $m_{j}$ by real variables $u_{j}$ while the $M_{j},N_{j}$ and $h_{j}$ remain integer-valued variables (and the $n_{j}$ are determined by the equations $*_{j}$ ).

To do this we will replace the convolution sums by their averages, i.e.

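$$\langle\tau_{A}(m)\tau_{B}(n)\rangle_{m=u}^{(*)}\sim\frac{1}{M}\sum_{q=1}^{\infty}r_{q}(h)\langle\tau_{A}(m)e(mN/q)\rangle_{m=u}\langle\tau_{B}(n)e(nM/q)\rangle_{n=\frac{uN}{M}}$$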

where $r_{q}(h)$ is the Ramanujan sum (usually denoted $c_{q}(h)$ ) and

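$$\langle\tau_{A}(m)e(mN/q)\rangle_{m=u}=\frac{1}{2\pi i}\int_{|w-1|=\epsilon}D_{A}\left(w,e\left(\frac{N}{q}\right)\right)u^{w-1}\,dw$$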

where

$$D_{A}\left(w,e\left(\frac{N}{q}\right)\right)=\sum_{n=1}^{\infty}\frac{\tau_{A}(n)e(nN/q)}{n^{w}}.$$
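For readers who want to experiment with the Ramanujan sum $r_{q}(h)$ appearing above, here is a minimal sketch. It uses the standard closed form $r_{q}(h)=\sum_{d\mid (q,h)}\mu(q/d)\,d$ and, as a cross-check, the defining exponential sum over residues coprime to $q$. The function names are hypothetical.

```python
from math import cos, gcd, pi

def mobius(n):
    """Moebius function mu(n), computed by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0            # a squared prime factor forces mu = 0
            result = -result
        p += 1
    if n > 1:
        result = -result            # one remaining prime factor
    return result

def ramanujan_sum(q, h):
    """r_q(h) = sum over d dividing both q and h of mu(q / d) * d."""
    return sum(d * mobius(q // d)
               for d in range(1, q + 1) if q % d == 0 and h % d == 0)

def ramanujan_sum_direct(q, h):
    """Defining sum of e(a h / q) over 1 <= a <= q with (a, q) = 1 (real part)."""
    return sum(cos(2 * pi * a * h / q) for a in range(1, q + 1) if gcd(a, q) == 1)
```

Two familiar special cases are $r_{q}(1)=\mu(q)$ and $r_{q}(0)=\varphi(q)$.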

(In the above few lines we have replaced a sum $\sum_{n\leq x}a_{n}f(n)$ by an integral $\int_{1}^{x}f(u)\langle a_{n}\rangle_{n=u}~{}du$ where (in the handy physics notation) $\langle a_{n}\rangle_{n=u}$ denotes the average of $a_{n}$ when $n=u$ (the instantaneous rate of change of a good approximation to $\sum_{n\leq u}a_{n}$ with respect to $u$ ). In our context this may be expressed using $A(s)=\sum_{n=1}^{\infty}a_{n}n^{-s}$ and defining $\langle a_{n}\rangle_{n=u}=\operatornamewithlimits{Res}_{|s-1|<\epsilon}u^{s-1}A(s)$ where we sum the residues at all of the poles of $A(s)$ near $s=1$ . )
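The averaging notation can be illustrated in the simplest case $a_{n}=d(n)$, which corresponds to $A=\{0,0\}$ and $A(s)=\zeta(s)^{2}$. The residue of $u^{s-1}\zeta(s)^{2}$ at $s=1$ is $\log u+2\gamma$, a standard computation from the Laurent expansion of $\zeta$. The sketch below, with hypothetical function names, compares $\sum_{n\leq x}d(n)$ with $\int_{1}^{x}(\log u+2\gamma)\,du$.

```python
from math import log

EULER_GAMMA = 0.5772156649015329

def divisor_summatory(x):
    """Exact value of sum_{n <= x} d(n), computed as sum_{d <= x} floor(x / d)."""
    return sum(x // d for d in range(1, x + 1))

def local_average(u):
    """<d(n)>_{n=u}: the residue of u^(s-1) * zeta(s)^2 at s = 1."""
    return log(u) + 2 * EULER_GAMMA

def integrated_average(x):
    """Closed form of the integral of local_average(u) over 1 <= u <= x."""
    return x * log(x) - x + 1 + 2 * EULER_GAMMA * (x - 1)

x = 100_000
exact = divisor_summatory(x)
smooth = integrated_average(x)
relative_error = abs(exact - smooth) / exact
```

The relative error is small because the discrepancy is the error term of the Dirichlet divisor problem, which is of lower order than the main term $x\log x$.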

Thus, we believe that

[TABLE]

is, up to a power savings, equal to

[TABLE]

To further analyze this quantity, we make the changes of variable $v_{j}=\frac{T|h_{j}|}{2\pi u_{j}N_{j}}$ and bring the sums over the $h_{j}$ to the inside; $u_{1}\dots u_{\ell}<X$ implies that

[TABLE]

We detect this condition using Perron’s formula in an integral over $s$ . Then the above is

[TABLE]

where $\epsilon_{j}=\mbox{sgn}(h_{j})$ . We simplify this a bit. We combine the middle two lines into a single product over $j$ and gather together all of the like variables (note that the sums over $h_{j}$ below are now restricted to the positive integers) :

[TABLE]

At this point we can rigorously identify $\mbox{LHS}_{\ell}$ with the terms on the right of (2), through our key identity:

Theorem 1.

[TABLE]

where $U(\ell)$ denotes a set of cardinality $\ell$ with precisely one element from each of $A_{1},\dots,A_{\ell}$ and similarly $V(\ell)$ denotes a set of cardinality $\ell$ with precisely one element from each of $B_{1},\dots,B_{\ell}$ .

5. Preliminary reductions

Lemma 1.

[TABLE]

Proof.

The case $\ell=1$ of this identity may be found in [CK1]. We may prove the general case by working our way from the inside out and using the technique of that proof. For example, with fixed $v_{1},\dots,v_{\ell-1},\epsilon_{1},\dots,\epsilon_{\ell-1}$ we have that the integral over $v_{\ell}$ is

[TABLE]

We split this into two double integrals, one with $e(tv_{\ell})$ and the other with $e(-tv_{\ell})$. For the first we rotate the $v_{\ell}$-path onto the positive imaginary axis, and for the second we rotate the $v_{\ell}$-path onto the negative imaginary axis. By absolute convergence, we may now interchange the order of integration to arrive at a sum of two $v_{\ell}$-integrals inside a $t$-integral. We evaluate the $v_{\ell}$-integrals using the definition of the gamma-function. Then we repeat the process to evaluate the sum over $\epsilon_{\ell-1}$ of the integral over $v_{\ell-1}$ for fixed $v_{1},\dots,v_{\ell-2},\epsilon_{1},\dots,\epsilon_{\ell-2}$, and so on. ∎

6. Poles

We have

[TABLE]

where $G$ is a multiplicative function for which

[TABLE]

with $A^{\prime}=A-\{\alpha\}$ . With $*:mN-nM=h$ , this leads to

[TABLE]

where

[TABLE]

Inserting this into $\mbox{LHS}_{\ell}$ we have

[TABLE]

where $U(\ell)=\{\alpha_{1},\dots,\alpha_{\ell}\}$ with $\alpha_{j}\in A_{j}$ and $V(\ell)=\{\beta_{1},\dots,\beta_{\ell}\}$ with $\beta_{j}\in B_{j}$ . Now we sum over the $h_{j}$ to get factors $\zeta(s+\alpha_{j}+\beta_{j})$ . Thus,

[TABLE]

If we move the path of integration in $s$ to the line with $\Re s=\epsilon$ , then we cross the poles of the $\zeta(s+\alpha_{j}+\beta_{j})$ at $s=1-\alpha_{j}-\beta_{j}$ . These contribute an amount that cancels the contribution of the $R_{A;B}(T;X)$ .

Next, we apply the lemma of Section 5 to evaluate the integral over the $v_{j}$ and obtain a factor of $\chi(1-\alpha_{j}-\beta_{j}-s)$. Then using the functional equation for $\zeta$ we have $\zeta(1-s-\alpha_{j}-\beta_{j})$. Thus, the $s$-integrand without the $\frac{X^{s}}{s}$ in $\mbox{LHS}_{\ell}$ becomes

[TABLE]

our goal is to prove that this is equal to

[TABLE]

This further reduces to proving for each $U(\ell),V(\ell)$ that

[TABLE]

7. Local considerations

We shall find it convenient to state our main theorem as an identity of the Euler factor at a prime $p$ . We begin by introducing a set-theoretic notation. First of all, since $p$ is fixed for this discussion we will often suppress it. In fact we write $X$ for $1/p$ and mostly consider power series in $X$ . We take the unusual step of suppressing not only the prime $p$ but the divisor function and so we write $A(n)$ in place of $\tau_{A}(p^{n})$ . Also, for a set $A$ we let

[TABLE]

A further piece of notation: $A^{+}=A\cup\{0\}$ . We have two important identities. The first is

[TABLE]

This is a special case of

[TABLE]

The other identity is

[TABLE]

which follows by repeated application of the first identity.

For arbitrary sets $A$ , $B$ we let

[TABLE]

Also, we let

[TABLE]

We begin with sets $A_{j},B_{j}$ and numbers $\alpha_{j},\beta_{j}$ for $j=1,2,\dots,\ell$ . We consider

[TABLE]

where

[TABLE]

Our identity is

Theorem 2.

[TABLE]

By the results of the previous section, Theorem 1 follows from Theorem 2 with $(A_{j}\setminus\{\alpha_{j}\})_{s}$ in place of $A_{j}$ , $B_{j}\setminus\{\beta_{j}\}$ in place of $B_{j}$ , and $\alpha_{j}+s$ in place of $\alpha_{j}$ .

7.1. Some lemmas

Because of the condition $\min(M_{j},N_{j})=0$ we consider $\Sigma_{A,B,\alpha,\beta}(M,0)$ and $\Sigma_{A,B,\alpha,\beta}(0,N)$ . We have

Lemma 2.

[TABLE]

and

[TABLE]

We defer the proof to later.

The result of the lemma leads us to consider

[TABLE]

We will prove

Lemma 3.

We have

[TABLE]

The right-hand side of Theorem 2 may be expanded. This leads to

Lemma 4.

For $J\subset\{1,\dots,\ell\}$ let

[TABLE]

We have

[TABLE]

where $A=A_{1}\cup\dots\cup A_{\ell}$ and $B=B_{1}\cup\dots\cup B_{\ell}$ .

The combination of these three lemmas easily leads to a proof of Theorem 2.

7.2. Proof of Theorem 2

Proof.

By Lemma 2 the left side of the identity in Theorem 2 may be written as

[TABLE]

By Lemma 3 this is

[TABLE]

and by Lemma 4 this is

[TABLE]

which is the right side of the identity in Theorem 2.

∎

7.3. Proof of Lemma 2

Proof.

Expanding the $q$ -sum, we have

[TABLE]

We split this into the terms with $d<M$ and those with $d\geq M$ . We have

[TABLE]

The sum over $j$ telescopes so that this is

[TABLE]

Next we consider

[TABLE]

We replace $d$ by $d+M$ and have

[TABLE]

Now the sum over $j$ and $k$ telescopes and we have

[TABLE]

We recognize a convolution in the first term and rewrite this as

[TABLE]

The middle term here may be written as

[TABLE]

The second term of this cancels with $\Sigma^{-}(M,0)$ and so we have

[TABLE]

This may be rewritten as

[TABLE]

∎

By symmetry

[TABLE]

7.4. Proof of Lemma 3

Proof.

We prove more generally that

[TABLE]

where the $A_{i}$ , and $B_{i}$ are any functions on the natural numbers (i.e. sequences) and $*$ just means the usual Cauchy convolution one encounters when multiplying power series together. It suffices to prove

[TABLE]

as then our desired result follows upon integrating $\theta$ from 0 to 1 upon taking $X=Y^{2}$ . But now the left hand side is a product

[TABLE]

and the right hand side is a product

[TABLE]

Therefore, it suffices to prove that

[TABLE]

To do this, we consider the right hand side and order the double sum according to the minimum, call it $K$ , of $R$ and $S$ . The right hand side may be rewritten as

[TABLE]

Replacing $R$ by $M+K$ and $S$ by $N+K$ , we see that this is exactly the left hand side. ∎

7.5. Proof of Lemma 4

Proof.

Recall that

[TABLE]

Using this we see that

[TABLE]

In the last line we can replace $m+1$ and $n+1$ by $m$ and $n$ since $(A\cup\{-\beta\})(0)=A(0)=1$ and similarly for $B$ . Multiplying out the last line and combining it with the line above we have

[TABLE]

Now the idea is to apply this to each $A_{j},B_{j},\alpha_{j},\beta_{j}$ . We have

[TABLE]

We end up with

[TABLE]

which is equal to

[TABLE]

∎

8. Terms with some $h_{j}=0$

Suppose that we are in the situation where

[TABLE]

Then for each $j>\ell^{\prime}$ we have

[TABLE]

Since $(M_{j},N_{j})=1$ this implies that

[TABLE]

for some $\kappa_{j}$ . Then, our sum is

[TABLE]

where

[TABLE]

Now, as before, we replace the convolution sums (*) by their averages, i.e.

[TABLE]

where $X^{\prime}$ is defined by

[TABLE]

here $*:mN-nM=h$ . We expect by the delta-method [DFI] that

[TABLE]

with

[TABLE]

So, we are led to

[TABLE]

We make the changes of variable $v_{j}=\frac{T|h_{j}|}{2\pi u_{j}N_{j}}$ for $1\leq j\leq\ell^{\prime}$ and bring the sums over the $h_{j}$ to the inside; $u_{1}\dots u_{\ell^{\prime}}<X^{\prime}$ implies that

[TABLE]

We detect this condition using Perron’s formula in an integral over $s$ . Then the above is

[TABLE]

where the $\epsilon_{j}=\mbox{sgn}(h_{j})$ arise from taking account of the signs of the $h_{j}$ . We now simplify this a bit. We combine the middle two lines into a single product over $j$ and we gather together all of the like variables:

[TABLE]

Now we have another key identity:

Theorem 3.

[TABLE]

where $U(\ell^{\prime})$ denotes a set of cardinality $\ell^{\prime}$ with precisely one element from each of $A_{1},\dots,A_{\ell^{\prime}}$ and similarly $V(\ell^{\prime})$ denotes a set of cardinality $\ell^{\prime}$ with precisely one element from each of $B_{1},\dots,B_{\ell^{\prime}}$ .

9. Preliminary reductions, again

The result of Section 5 implies that

[TABLE]

10. Poles, again

As before we use

[TABLE]

Inserting this into $\mbox{LHS}_{\ell^{\prime}}$ we have

[TABLE]

Now we sum over the $h_{j}$ to get factors $\zeta(s+\alpha_{j}+\beta_{j})$ ; these pair up with the factors $\chi(w_{j}+z_{j}-s-1)$ which turned into $\chi(1-\alpha_{j}-\beta_{j}-s)$ after collecting the residues $w_{j}=1-\alpha_{j}$ and $z_{j}=1-\beta_{j}$ that arose from the integral over $v_{j}$ . Then using the functional equation for $\zeta$ we have $\zeta(1-s-\alpha_{j}-\beta_{j})$ . Thus, the $s$ -integrand without the $\frac{X^{s}}{s}$ in $\mbox{LHS}_{\ell^{\prime}}$ becomes

[TABLE]

where $U(\ell^{\prime})=\{\alpha_{1},\dots,\alpha_{\ell^{\prime}}\}$ and $V(\ell^{\prime})=\{\beta_{1},\dots,\beta_{\ell^{\prime}}\}$ . Our goal is to prove that the residue of this at $s^{\prime}=0$ is equal to

[TABLE]

This further reduces to proving that

[TABLE]

11. Local considerations, again

Again we convert the above to an identity about the Euler factor of each side at a prime $p$ .

With arbitrary sets $A_{j},B_{j}$ and numbers $\alpha_{j},\beta_{j}$ for $j=1,2,\dots,\ell^{\prime}$ , we consider

[TABLE]

where

[TABLE]

as before.

Our identity is

Theorem 4.

[TABLE]

By the results of the previous section, Theorem 3 follows from Theorem 4 with $(A_{i})_{s}$ in place of $A_{i}$ for $i=\ell^{\prime}+1,\dots,\ell$ and with $(A_{j}\setminus\{\alpha_{j}\})_{s}$ , $B_{j}\setminus\{\beta_{j}\}$ and $\alpha_{j}+s$ in place of $A_{j}$ , $B_{j}$ , and $\alpha_{j}$ , respectively, for $j=1,\dots,\ell^{\prime}$ .

11.1. Recall lemmas

Our earlier lemma implies that if $\min(M,N)=0$ then

[TABLE]

So, we can replace the $\Sigma$ s in the formula for $\mathcal{Q}^{\prime}$ by this expression.

Thus, we have

[TABLE]

Now, the critical observations are that

[TABLE]

as before, and

[TABLE]

where $A=A_{1}\cup\dots\cup A_{\ell}$ and $B=B_{1}\cup\dots\cup B_{\ell}$ .

These together imply Theorem 4.

12. Multiplicities

12.1. How many times is a given $\ell$ swap repeated?

Now we need to give an accounting of what we have so far. Each time we split $A$ and $B$ up into subsets $A=A_{1}\cup\dots\cup A_{\ell}$ and $B=B_{1}\cup\dots\cup B_{\ell}$ we accumulate terms that correspond to all swaps of $\alpha_{j}\in A_{j}$ and $\beta_{j}\in B_{j}$ . For a fixed decomposition of $A$ into $\ell$ subsets we clearly do not get ALL swaps of $\ell$ -sized subsets of $A$ and $B$ . Our solution to this dilemma is that we consider all decompositions of $A$ into $\ell$ disjoint non-empty subsets and similarly for $B$ . Then every pair of $\ell$ sized subsets will indeed appear in the swaps. However, now two different decompositions will often lead to the same swap. So how do we account for the overcounting?

How many times will a given $\ell$ -sized swap $S$ for $T$ occur? This is equivalent to asking how many ways can $A$ be split into $\ell$ subsets where $A_{j}$ contains $\alpha_{j}$ ? If $A$ has $k$ elements then there are $k-\ell$ elements that can be distributed arbitrarily into $\ell$ sets. This can happen in $\ell^{(k-\ell)}$ ways. Similarly for $B$ . Taking into account permutations we end up with a multiplicity of $\ell!^{2}\ell^{2(k-\ell)}$ .
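The count $\ell^{k-\ell}$ can be confirmed by brute force for small parameters. In the sketch below (hypothetical function names), the elements of $A$ are indexed $0,\dots,k-1$, the first $\ell$ of them play the role of $\alpha_{1},\dots,\alpha_{\ell}$, and a decomposition is an assignment of each element to one of $\ell$ labelled subsets with element $j$ forced into subset $j$.

```python
from itertools import product
from math import factorial

def count_decompositions(k, ell):
    """Count splittings of a k-element set into ell labelled subsets
    in which the j-th distinguished element lies in the j-th subset."""
    count = 0
    for assignment in product(range(ell), repeat=k):
        # Every subset is automatically non-empty, since subset j contains element j.
        if all(assignment[j] == j for j in range(ell)):
            count += 1
    return count

def swap_multiplicity(k, ell):
    """Multiplicity of a fixed ell-swap: decompositions of A and of B,
    times the ell! orderings on each side."""
    per_set = count_decompositions(k, ell)
    return factorial(ell) ** 2 * per_set ** 2
```

For example, $k=4$ and $\ell=2$ give $2^{2}=4$ decompositions of each set and a total multiplicity of $2!^{2}\cdot 2^{4}=64$.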

12.2. How many times does the same $(m,n)$ lead to a solution of a $(*)$-system?

Our original problem is to evaluate

[TABLE]

Note that if $A=\{\alpha_{1},\dots,\alpha_{k}\}$ and $B=\{\beta_{1},\dots,\beta_{k}\}$ then

[TABLE]
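For orientation, recall that the definition $D_{A}(s)=\prod_{\alpha\in A}\zeta(s+\alpha)=\sum_{n}\tau_{A}(n)n^{-s}$ gives $\tau_{A}(m)=\sum_{\mu_{1}\cdots\mu_{k}=m}\mu_{1}^{-\alpha_{1}}\cdots\mu_{k}^{-\alpha_{k}}$. The sketch below evaluates this sum over ordered factorisations and checks numerically that splitting $A$ into two subsets turns $\tau_{A}$ into a Dirichlet convolution. It is an illustration only: the function names and the real shift values are assumptions chosen for the example, not taken from the paper.

```python
from math import isclose


def divisors(n):
    """All positive divisors of n, by trial division (adequate for small n)."""
    return [d for d in range(1, n + 1) if n % d == 0]


def tau(shifts, n):
    """Sum of prod_i mu_i**(-alpha_i) over ordered factorisations n = mu_1 * ... * mu_k."""
    if not shifts:
        # An empty product of zeta factors is 1, supported only at n = 1.
        return 1.0 if n == 1 else 0.0
    alpha, rest = shifts[0], shifts[1:]
    return sum(d ** (-alpha) * tau(rest, n // d) for d in divisors(n))


def tau_split(shifts_1, shifts_2, n):
    """Dirichlet convolution of tau_{A_1} and tau_{A_2} for a splitting A = A_1 u A_2."""
    return sum(tau(shifts_1, d) * tau(shifts_2, n // d) for d in divisors(n))


A = [0.1, -0.2, 0.3]  # hypothetical real shifts, for illustration only
for n in range(1, 61):
    assert isclose(tau(A, n), tau_split(A[:1], A[1:], n))
    assert isclose(tau(A, n), tau_split(A[:2], A[2:], n))
```

With all shifts equal to zero the function reduces to the classical $k$-fold divisor function, so `tau([0, 0], 12)` counts the divisors of 12.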

We split $A$ into $A_{1}\cup\dots\cup A_{\ell}$ and $B$ into $B_{1}\cup\dots\cup B_{\ell}$; this is equivalent to splitting $\{1,2,\dots,k\}$ into $I_{1}\cup\dots\cup I_{\ell}$ where $A_{i}=\{\alpha_{i^{\prime}}:i^{\prime}\in I_{i}\}$ and also $\{1,2,\dots,k\}=J_{1}\cup\dots\cup J_{\ell}$ where $B_{j}=\{\beta_{j^{\prime}}:j^{\prime}\in J_{j}\}$. Then $\tau_{A_{i}}(m_{i})=\prod_{i^{\prime}\in I_{i}}\mu_{i^{\prime}}^{-\alpha_{i^{\prime}}}$ and $\tau_{B_{j}}(n_{j})=\prod_{j^{\prime}\in J_{j}}\mu_{j^{\prime}}^{-\beta_{j^{\prime}}}$. Now after this splitting we count the $m_{i}$ and $n_{j}$ according to our $(*)$-system:

[TABLE]

where $M_{1}\dots M_{\ell}=N_{1}\dots N_{\ell}$. Now let’s say we have a solution of the $(*)$-system as above and let’s take a collection of divisors of the $m_{i}$ and $n_{j}$. For simplicity, let’s suppose that $\mu_{i}\mid m_{i}$ and $\nu_{j}\mid n_{j}$ for $1\leq i,j\leq\ell$. Let’s write

[TABLE]

The question is: How many ways are there to do this? If we multiply the $j$th equation in our system by $\mu_{j}^{*}\nu_{j}^{*}$, where $\prod_{j=1}^{\ell}\mu_{j}\mu_{j}^{*}=m$ and $\prod_{j=1}^{\ell}\nu_{j}\nu_{j}^{*}=n$, then we have a new equation

[TABLE]

where

[TABLE]

If $(\tilde{M}_{j},\tilde{N}_{j})>1$ then the common factor can be divided out of both $\tilde{M}_{j}$ and $\tilde{N}_{j}$, and hence also out of $\tilde{h}_{j}$. Note that

[TABLE]

Thus, we have a new $(*)$-system but it corresponds to exactly the same $m=\mu_{1}\dots\mu_{k}$ and $n=\nu_{1}\dots\nu_{k}$ as in the old one. The number of ways to construct these $(*)$-systems is just the number of ways to compose the $\hat{\mu}_{j}$ as products of the available $\{\mu_{\ell+1},\dots,\mu_{k}\}$ and the $\hat{\nu}_{j}$ from the $\{\nu_{\ell+1},\dots,\nu_{k}\}$. But this is exactly $\ell^{k-\ell}$ for the $\hat{\mu}_{j}$, since each of the $k-\ell$ available factors can be placed in any one of the $\ell$ products, and the same for the $\hat{\nu}_{j}$. Then we take into account the ordering of the $\mu_{1},\dots,\mu_{\ell}$ and of the $\nu_{1},\dots,\nu_{\ell}$; this gives a factor of $\ell!^{2}$. In this way we arrive at a multiplicity $\ell!^{2}\ell^{2(k-\ell)}$ for each solution of our $(*)$-system, which is the same as the multiplicity counted in the swaps of $\ell$-sets.

Note that the same argument applies whether any of the $h_{j}$ are 0 or not. We need to divide out this multiplicity.

This explains the weight factor $w_{\ell}$ in (6).

12.3. Conclusion

We have found that $I_{A,B}^{\psi}(T;X)$ can be conjecturally evaluated by two different methods which produce the same answer. One way is to use the recipe of [CFKRS]. The other way is to let $\ell$ be defined by $T^{\ell}\leq X<T^{\ell+1}$. Then partition $A$ and $B$ into $\ell$ subsets and evaluate a convolution of $\ell$ shifted divisor sums

[TABLE]

by a conjectural approach that involves the delta-method of [DFI]. A rigorous theorem identifying two Euler products proves that the result of the above agrees with some of the terms arising from the recipe. The terms with all $h_{i}\neq 0$ correspond to $\ell$-swap terms from the recipe. The terms with $\ell^{\prime}$ of the $h_{i}$ non-zero and $\ell-\ell^{\prime}$ of the $h_{i}$ equal to 0 give $\ell^{\prime}$-swap terms. Finally, if we sum over all possible partitions of $A$ and $B$ into non-empty subsets and account for multiplicities we achieve the desired equality between the two approaches.

A natural direction for further research is to consider other families of L-functions, for example quadratic Dirichlet L-functions, and to determine an arithmetic basis for the relevant moment conjectures.

Bibliography


[BK 1] E.B. Bogomolny and J.P. Keating. Random matrix theory and the Riemann zeros I: three- and four-point correlations. Nonlinearity 8 (1995), 1115–1131.
[BK 2] E.B. Bogomolny and J.P. Keating. Random matrix theory and the Riemann zeros II: $n$-point correlations. Nonlinearity 9 (1996), 911–935.
[B] T.D. Browning. Quantitative arithmetic of projective varieties. Volume 277 of Progress in Mathematics. Birkhäuser Verlag, Basel, 2009.
[CFKRS] J.B. Conrey, D.W. Farmer, J.P. Keating, M.O. Rubinstein and N.C. Snaith. Integral moments of $L$-functions. Proc. Lond. Math. Soc. 91 (2005), 33–104.
[CK 1] J.B. Conrey and J.P. Keating. Moments of zeta and correlations of divisor-sums: I. Phil. Trans. R. Soc. A 373 (2015), 20140313; arXiv:1506.06842
[CK 2] J.B. Conrey and J.P. Keating. Moments of zeta and correlations of divisor-sums: II. In Advances in the Theory of Numbers – Proceedings of the Thirteenth Conference of the Canadian Number Theory Association, Fields Institute Communications (Editors: A. Alaca, S. Alaca & K.S. Williams), 75–85 (2015, Springer); arXiv:1506.06843
[CK 3] J.B. Conrey and J.P. Keating. Moments of zeta and correlations of divisor-sums: III. Indagationes Mathematicae 26 (2015), no. 5, 736–747; arXiv:1506.06844
[CK 4] J.B. Conrey and J.P. Keating. Moments of zeta and correlations of divisor-sums: IV. Res. Number Theory (2016) 2:24; arXiv:1506.06844
