Fourier uniformity of bounded multiplicative functions in short   intervals on average

Kaisa Matom\"aki; Maksym Radziwi{\l}{\l}; Terence Tao

arXiv:1812.01224·math.NT·December 5, 2018

Fourier uniformity of bounded multiplicative functions in short intervals on average

Kaisa Matom\"aki, Maksym Radziwi{\l}{\l}, Terence Tao

PDF

Open Access

TL;DR

This paper proves a new form of Fourier uniformity for the Liouville function and other multiplicative functions in short intervals on average, extending previous results to smaller interval lengths and demonstrating significant cancellations.

Contribution

It establishes the first non-trivial local Fourier uniformity results for the Liouville function in very short intervals, improving upon prior bounds and applying to non-pretentious multiplicative functions.

Findings

01

Proves Fourier uniformity for $ heta > 0$ arbitrarily small.

02

Shows cancellations in sums involving $ ext{Liouville}$ and von Mangoldt functions.

03

Extends previous results from $ heta > 5/8$ to smaller scales.

Abstract

Let $λ$ denote the Liouville function. We show that as $X \to \infty$ , $\int_{X}^{2 X} α sup x < n \leq x + H \sum λ (n) e (- α n) d x = o (X H)$ for all $H \geq X^{θ}$ with $θ > 0$ fixed but arbitrarily small. Previously, this was only known for $θ > 5/8$ . For smaller values of $θ$ this is the first `non-trivial' case of local Fourier uniformity on average at this scale. We also obtain the analogous statement for (non-pretentious) $1$ -bounded multiplicative functions. We illustrate the strength of the result by obtaining cancellations in the sum of $λ (n) Λ (n + h) Λ (n + 2 h)$ over the ranges $h < X^{θ}$ and $n < X$ , and where $Λ$ is the von Mangoldt function.

Equations499

\int_{X}^{2 X} α sup x < n \leq x + H \sum λ (n) e (- α n) d x = o (X H)

\int_{X}^{2 X} α sup x < n \leq x + H \sum λ (n) e (- α n) d x = o (X H)

n \leq x \sum λ (n) = o (x)

n \leq x \sum λ (n) = o (x)

n \leq x \sum λ (n) = O_{ε} (x^{1/2 + ε}) for all ε > 0.

n \leq x \sum λ (n) = O_{ε} (x^{1/2 + ε}) for all ε > 0.

n \leq x \sum λ (n + h_{1}) \dots λ (n + h_{k}) = o (x)

n \leq x \sum λ (n + h_{1}) \dots λ (n + h_{k}) = o (x)

\sum_{|h|\leq H}\Big{|}\sum_{n\leq x}\lambda(n)\lambda(n+h)\Big{|}=o(Hx)

\sum_{|h|\leq H}\Big{|}\sum_{n\leq x}\lambda(n)\lambda(n+h)\Big{|}=o(Hx)

\sup_{\alpha}\int_{X}^{2X}\Big{|}\sum_{x<n\leq x+H}\lambda(n)e(-\alpha n)\Big{|}dx=o(HX)

\sup_{\alpha}\int_{X}^{2X}\Big{|}\sum_{x<n\leq x+H}\lambda(n)e(-\alpha n)\Big{|}dx=o(HX)

n \leq x \sum \frac{λ ( n ) λ ( n + h )}{n} = o (lo g x)

n \leq x \sum \frac{λ ( n ) λ ( n + h )}{n} = o (lo g x)

n \leq x \sum \frac{λ ( n + h _{1} ) \dots λ ( n + h _{k} )}{n} = o (lo g x)

n \leq x \sum \frac{λ ( n + h _{1} ) \dots λ ( n + h _{k} )}{n} = o (lo g x)

\int_{X}^{2 X} g \in G sup x < n \leq x + H \sum λ (n) F (g^{n - ⌊ x ⌋} x_{0}) d x = o (H X)

\int_{X}^{2 X} g \in G sup x < n \leq x + H \sum λ (n) F (g^{n - ⌊ x ⌋} x_{0}) d x = o (H X)

\int_{X}^{2X}\sup_{\alpha}\Big{|}\sum_{x<n\leq x+H}\lambda(n)e(-\alpha n)\Big{|}dx=o(XH).

\int_{X}^{2X}\sup_{\alpha}\Big{|}\sum_{x<n\leq x+H}\lambda(n)e(-\alpha n)\Big{|}dx=o(XH).

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum λ (n) Λ (n + h) Λ (n + 2 h) = o (H X)

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum λ (n) Λ (n + h) Λ (n + 2 h) = o (H X)

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum Λ (n + h) Λ (n + 2 h)

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum Λ (n + h) Λ (n + 2 h)

\int_{X}^{2 X} α sup x < n \leq x + H \sum f (n) e (- α n) d x ≫ X H .

\int_{X}^{2 X} α sup x < n \leq x + H \sum f (n) e (- α n) d x ≫ X H .

\mathbb{D}(f;X;Q)\coloneqq\inf_{\begin{subarray}{c}\chi\mod{q}\\ q\leq Q\\ |t|\leq X\end{subarray}}\Big{(}\sum_{p\leq X}\frac{1-\mathrm{Re}(f(p)p^{it}\chi(p))}{p}\Big{)}^{1/2}.

\mathbb{D}(f;X;Q)\coloneqq\inf_{\begin{subarray}{c}\chi\mod{q}\\ q\leq Q\\ |t|\leq X\end{subarray}}\Big{(}\sum_{p\leq X}\frac{1-\mathrm{Re}(f(p)p^{it}\chi(p))}{p}\Big{)}^{1/2}.

\int_{X}^{2 X} α sup x < n \leq x + H \sum f (n) e (- α n) d x \geq η H X .

\int_{X}^{2 X} α sup x < n \leq x + H \sum f (n) e (- α n) d x \geq η H X .

D (f; X^{2} / H^{2 - ρ}; Q) ≪_{η, θ, ρ} 1

D (f; X^{2} / H^{2 - ρ}; Q) ≪_{η, θ, ρ} 1

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum f (n) a (n + h) b (n + 2 h) > η X H

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum f (n) a (n + h) b (n + 2 h) > η X H

D (f; X^{2} / H^{2 - ρ}; Q) ≪_{η, θ, ρ} 1

D (f; X^{2} / H^{2 - ρ}; Q) ≪_{η, θ, ρ} 1

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum f_{1} (n) f_{2} (n + h) f_{3} (n + 2 h)

∣ h ∣ \leq H \sum (1 - \frac{∣ h ∣}{H}) n \leq X \sum f_{1} (n) f_{2} (n + h) f_{3} (n + 2 h)

n \in I^{'} \sum f (n) e (- α_{I} n) > η H

n \in I^{'} \sum f (n) e (- α_{I} n) > η H

α_{I, 1} = α_{I, 2} + O (\frac{1}{H}) (mod 1) .

α_{I, 1} = α_{I, 2} + O (\frac{1}{H}) (mod 1) .

\frac{1}{p} n \in I \sum f (n) e (- α_{I} n) \approx n \in I / p \sum f (n) e (- α_{I} n p);

\frac{1}{p} n \in I \sum f (n) e (- α_{I} n) \approx n \in I / p \sum f (n) e (- α_{I} n p);

\int_{X}^{2 X} x < n \leq x + H \sum f (n) e (- α_{(x, x + H]} n) d x \geq η X H,

\int_{X}^{2 X} x < n \leq x + H \sum f (n) e (- α_{(x, x + H]} n) d x \geq η X H,

n \in I \sum f (n) e (- α_{I} n) ≫ H and n \in J \sum f (n) e (- α_{J} n) ≫ H

n \in I \sum f (n) e (- α_{I} n) ≫ H and n \in J \sum f (n) e (- α_{J} n) ≫ H

p α_{I} \equiv q α_{J} + O (P / H) (mod 1)

p α_{I} \equiv q α_{J} + O (P / H) (mod 1)

p α_{I} \equiv q α_{J} + O (\frac{P}{H}) (mod p)

p α_{I} \equiv q α_{J} + O (\frac{P}{H}) (mod p)

α_{I} \equiv \frac{q}{p} \cdot α_{J} + O (\frac{1}{H}) (mod 1)

α_{I} \equiv \frac{q}{p} \cdot α_{J} + O (\frac{1}{H}) (mod 1)

\frac{q _{1}}{p} α_{J_{1}} \equiv \frac{q _{2}}{p} α_{J_{2}} + O (\frac{1}{H}) (mod 1)

\frac{q _{1}}{p} α_{J_{1}} \equiv \frac{q _{2}}{p} α_{J_{2}} + O (\frac{1}{H}) (mod 1)

(\frac{P}{lo g P})^{2 k - 2} \geq (\frac{X}{H})^{2} .

(\frac{P}{lo g P})^{2 k - 2} \geq (\frac{X}{H})^{2} .

≫ \frac{H ^{2}}{X ^{2}} (\frac{P}{lo g P})^{4 k} ≫ 1

≫ \frac{H ^{2}}{X ^{2}} (\frac{P}{lo g P})^{4 k} ≫ 1

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematical Approximation and Integration · Advanced Harmonic Analysis Research · Analytic Number Theory Research

Full text

Fourier uniformity of bounded multiplicative functions in short intervals on average

Kaisa Matomäki

Department of Mathematics and Statistics

University of Turku, 20014 Turku

Finland

[email protected]

,

Maksym Radziwiłł

Department of Mathematics, Caltech, 1200 E California Blvd, Pasadena, CA, 91125

[email protected]

and

Terence Tao

Department of Mathematics, UCLA

405 Hilgard Ave

Los Angeles CA 90095

USA

[email protected]

Abstract.

Let $\lambda$ denote the Liouville function. We show that as $X\rightarrow\infty$ ,

[TABLE]

for all $H\geq X^{\theta}$ with $\theta>0$ fixed but arbitrarily small. Previously, this was only known for $\theta>5/8$ . For smaller values of $\theta$ this is the first “non-trivial” case of local Fourier uniformity on average at this scale. We also obtain the analogous statement for (non-pretentious) $1$ -bounded multiplicative functions.

We illustrate the strength of the result by obtaining cancellations in the sum of $\lambda(n)\Lambda(n+h)\Lambda(n+2h)$ over the ranges $h<X^{\theta}$ and $n<X$ , and where $\Lambda$ is the von Mangoldt function.

1. Introduction

Let $\lambda$ denote111All the results for $\lambda$ discussed here are also applicable to the Möbius function $\mu$ with only minor changes to the arguments; we leave the details to the interested reader. the Liouville function, that is, a completely multiplicative function with $\lambda(p)=-1$ at all primes $p$ . Among bounded multiplicative functions, $\lambda$ plays a distinguished role since the prime number theorem is equivalent to222Our conventions for asymptotic notation are given at the end of this introduction.

[TABLE]

as $x\rightarrow\infty$ , and the Riemann Hypothesis is equivalent to

[TABLE]

A far reaching generalization of (1) is Chowla’s conjecture [4], according to which, for any sequence of distinct integers $h_{1},h_{2},\ldots,h_{k}$ , one has

[TABLE]

as $x\rightarrow\infty$ , where we adopt the convention that $\lambda(n)=0$ for $n\leq 0$ . Because of the equivalence of (1) and the prime number theorem, Chowla’s conjecture is frequently viewed as a “higher order” prime number theorem.

In recent years there has been a substantial amount of progress on Chowla’s conjecture. Following the work of the first two authors [23] the authors established in [24] an averaged form333By applying Hölder’s inequality to (3), it is also possible to obtain an averaged version of (2) over all shifts $h_{1},\ldots,h_{k}$ ; see [24] for details. of this conjecture in the case $k=2$ , namely,

[TABLE]

provided that $H\rightarrow\infty$ as $x\rightarrow\infty$ ; see also [1, 18, 19, 26, 12, 7, 25] for some other averaged forms of Chowla’s conjecture (as well as the closely related Elliott and Hardy-Littlewood conjectures). An equivalent form of (3) (for related discussion, see [32]) states that

[TABLE]

provided that $H\rightarrow\infty$ as $X\rightarrow\infty$ . The estimate (4) along with the entropy decrement argument was used by the third author [31] to establish a logarithmically averaged version of Chowla’s conjecture, that is,

[TABLE]

as $x\rightarrow\infty$ , for any fixed integer $h\neq 0$ . Subsequently for odd $k$ , the third author and Teräväinen [34] used the entropy decrement argument and the Gowers uniformity of the ( $W$ -tricked) von Mangoldt function (but avoiding the use of (4)), to show that

[TABLE]

as $x\rightarrow\infty$ , for any distinct integers $h_{1},\ldots,h_{k}$ and $k$ odd. Their argument only partially generalizes to arbitrary multiplicative functions (see [33]); in the case of the Liouville function, it relies crucially on the assumption that $k$ is odd.

In order to establish (5) for all $k$ it is necessary to establish (the logarithmically averaged version of) what we call the local (higher order) Fourier uniformity conjecture (see [32]).

Conjecture 1.1 (Local higher order Fourier Uniformity).

Let $s\geq 0$ . Let $G\backslash\Gamma$ be an $s$ -step nilmanifold. Let $F:G\backslash\Gamma\rightarrow\mathbb{C}$ be Lipschitz continuous and let $x_{0}\in G\backslash\Gamma$ . Then

[TABLE]

as soon as $H\rightarrow\infty$ with $X\rightarrow\infty$ .

We refer to [14] for the definition of the terms above, however we will not need these notions in this paper. Informally, the conjecture asserts that on most short intervals, $\lambda$ does not exhibit significant correlation with any $s$ -step nilsequence (of bounded complexity). The estimate (4) proven in [24] essentially corresponds to the case $s=0$ of Conjecture 1.1; this is currently the only case of the conjecture that is completely settled.

In this paper we make a first step in going beyond the case of $s=0$ and establish the case $s=1$ of Conjecture 1.1 when $H=X^{\theta}$ with $\theta>0$ fixed but otherwise arbitrarily small. Let us first re-state our main result for the Liouville function in a more elementary fashion.

Theorem 1.2 (Local Fourier Uniformity for $s=1$ at scale $X^{\theta}$ ).

Let $\theta\in(0,1)$ be given and set $H=X^{\theta}$ . Then

[TABLE]

as $X\rightarrow\infty$ .

We restrict attention here to the regime $\theta\in(0,1)$ , since the case $\theta\geq 1$ follows from the classical work of Davenport [5] (and see [11], [13] for the $s=2$ and $s>2$ cases respectively of Conjecture 1.1 for this range of $\theta$ ). Informally, Theorem 1.2 asserts that on most intervals of the form $[x,x+x^{\theta}]$ , the Liouville function $\lambda(n)$ does not exhibit singificant correlation with linear phases $e(\alpha n)$ ; it can easily be shown to imply the $s=1$ case of Conjecture 1.1 in the range $H\geq X^{\theta}$ by approximating the $1$ -step nilsequence $n\mapsto F(g^{n-\lfloor x\rfloor}x_{0})$ by a Fourier series.

Previously, Theorem 1.2 was known unconditionally only for $\theta>5/8$ from the work of Zhan [36], who showed that as $X\rightarrow\infty$ the bound $\sum_{x<n\leq x+H}\lambda(n)e(-\alpha n)=o(XH)$ holds pointwise in $x\in[X,2X]$ for $H>X^{5/8+\varepsilon}$ . It is likely that our method can be pushed to reach $H=\exp((\log x)^{1-\delta})$ for some $\delta>0$ , and conditionally on the Riemann Hypothesis one should in principle be able to reach $H=(\log X)^{\psi(X)}$ for any function $\psi(X)$ going to infinity arbitrarily slowly with $X$ , although this may require a more careful reworking of the arguments here. It may be possible to extend the methods to this paper to also cover the $s>1$ case (again with $H=X^{\theta}$ for any fixed $\theta>0$ ); we plan to investigate this direction in future work.

Theorem 1.2 allows us to obtain cancellations in rather general triple correlations such as those of the form $\lambda(n)a(n+h)b(n+2h)$ , for sequences $a(\cdot)$ and $b(\cdot)$ for which sharp sieve majorants can be constructed. We illustrate the flavor of these results in the corollary below.

Corollary 1.3.

Let $\theta\in(0,1)$ be given. Let $H=X^{\theta}$ . Then

[TABLE]

as $X\rightarrow\infty$ .

Interestingly we are unable to obtain an asymptotic for

[TABLE]

for this range of $H$ , since this latter problem is essentially equivalent to evaluating asymptotically $\sum_{x\leq n<x+H}\Lambda(n)$ for almost all $x\leq X$ . The best result in this direction allows one to take $H>X^{1/6-\varepsilon(X)}$ with $\varepsilon(X)$ tending to zero arbitrarily slowly as $X\rightarrow\infty$ . This is due to Zaccagnini [35], building on ideas of Heath-Brown [15] and Huxley [16]. Thus, Corollary 1.3 gives a rare example of a sum involving the Liouville function that becomes harder to control when the Liouville function is removed!

In a subsequent paper we will obtain variants of Theorem 1.2 and Corollary 1.3 for unbounded multiplicative functions such as the divisor function or coefficients of automorphic forms. This will improve (in the $H$ aspect) earlier results of Blomer [3] that allowed one to take $H=X^{1/3+\varepsilon}$ in the triple correlations of the divisor function; however, in contrast to the results of [3], we will not obtain power-savings in the error terms.

Theorem 1.2 can in fact be generalized to almost all multiplicative functions $f:\mathbb{N}\rightarrow\mathbb{C}$ with $|f|\leq 1$ (we call such multiplicative functions $1$ -bounded). There is however one obstruction: if $f(n)=n^{it}\chi(n)$ with $|t|\leq\varepsilon X^{2}/H^{2}$ for a small absolute constant $\varepsilon>0$ and $\chi$ a Dirichlet character of bounded conductor $q$ , then one can check (using a Taylor expansion) that

[TABLE]

In fact for each $x\in[X,2X]$ one can set $\alpha$ equal to $\frac{t}{x}+\frac{a}{q}$ for some integer $a$ coprime to $q$ , and then $f(n)e(-\alpha n)\approx\chi(n)e(-an/q)x^{it}$ will typically have a mean of magnitude $\asymp 1/\sqrt{q}$ if $\chi$ is primitive.

Therefore the proper analogue of Theorem 1.2 can only hold for multiplicative functions $f$ that “do not pretend” to be any multiplicative function of the form $n\mapsto n^{it}\chi(n)$ with $|t|\leq X^{2}/H^{2}$ and $\chi$ of bounded conductor. To quantify this notion of “pretentiousness”, we follow Granville and Soundararajan [9] and introduce the distance function

[TABLE]

In particular $\mathbb{D}(f;X;Q)$ is small whenever $f$ is close to $n\mapsto n^{it}\chi(n)$ with444The role of the parameter $X$ here is mostly to control the size of $t$ . It is not important that the sum over $p$ runs up to $X$ ; it could run up to $X^{B}$ for any $B>0$ , since primes in $(X^{\alpha},X^{\beta}]$ contribute only $O_{\alpha,\beta}(1)$ to the distance. $|t|\leq X$ and $\chi$ of conductor $\leq Q$ .

Our main theorem, stated below, confirms that $n\mapsto n^{it}\chi(n)$ with $|t|\leq X^{2}/H^{2-o(1)}$ and $\chi$ of bounded conductor are essentially the only examples of $1$ -bounded multiplicative functions for which (6) can happen.

Theorem 1.4 (Main theorem).

Let $\theta\in(0,1)$ and $\eta>0$ . Let $f:\mathbb{N}\rightarrow\mathbb{C}$ be a multiplicative function with $|f|\leq 1$ . Suppose that, for $H=X^{\theta}$ , we have

[TABLE]

Then, for any $\rho\in(0,\frac{1}{8})$ ,

[TABLE]

for some $Q\ll_{\eta,\theta,\rho}1$ .

Theorem 1.4 yields an analogous result to Corollary 1.3 for general multiplicative functions. Without going into full generality we highlight that the result holds for correlations $f(n)a(n+h)b(n+2h)$ and sequences $a(n)$ , $b(n)$ that admit sharp sieve majorants. We illustrate this principle in the corollary below.

Corollary 1.5.

Let $\theta\in(0,1)$ . Let $f:\mathbb{N}\rightarrow\mathbb{C}$ be a $1$ -bounded multiplicative function. Suppose that $a(n),b(n)$ are sequences such that $a(n),b(n)\ll 1+\Lambda(n)$ for all $n\geq 1$ .

If

[TABLE]

with $H=X^{\theta}$ , then for any $\rho\in(0,\frac{1}{8})$ ,

[TABLE]

for some $Q\ll_{\eta,\theta,\rho}1$ .

The claim holds also when $f(n)a(n+h)b(n+2h)$ is replaced by $a(n)f(n+h)b(n+2h)$ or by $a(n)b(n+h)f(n+2h)$ .

We give the short derivation of Corollary 1.5 from Theorem 1.4 in Section 6. It is possible to extend Corollary 1.5 to sequences $b(n)$ or $a(n)$ equal to a multiplicative function $h:\mathbb{N}\rightarrow\mathbb{C}$ such that $|h(n)|\leq d_{k}(n)$ for all $n\geq 1$ and $k\geq 1$ a fixed integer. Since we will obtain a stronger result along these lines in a follow-up paper we do not include the details here.

It is immediate from Corollary 1.5 that given $1$ -bounded multiplicative functions $f_{1},f_{2},f_{3}$ , the correlations

[TABLE]

vanish asymptotically whenever at least one of the $f_{i}$ is non-pretentious in the sense that $\mathbb{D}(f_{i};X,Q)\to\infty$ as $X\to\infty$ for each $Q$ . In the remaining case that all of the $f_{i}$ are pretentious, an asymptotic for the correlations, without an average over $h$ , can be obtained using the method of [20] (see also the references therein).

1.1. An overview of the proof

We now describe in some detail the main ideas behind the proof of Theorem 1.4. Our presentation here is somewhat oversimplified to avoid technical issues; the actual rigorous argument will not quite follow the outline given here, but uses essentially the same ideas, despite being arranged slightly differently to resolve these technicalities.

First we notice that, by the “analytic” large sieve inequality (or more precisely, a maximal version of this inequality due to Montgomery [28]), given an interval $I=(x,x+H]$ , there are at most $\ll\eta^{-2}$ values $\alpha_{I}$ (modulo $1$ and up to perturbations by $O(1/H)$ ) for which

[TABLE]

for some $I^{\prime}\subset I$ ; see Lemma 2.2. For sake of this informal presentation, one can pretend that in fact there is only one such value $\alpha_{I}$ (modulo $1$ and perturbations by $O(1/H)$ ). Thus, if there are two subintervals $I^{\prime}_{1},I^{\prime}_{2}$ of $I$ (or of a slight dilate of $I$ ) and two frequencies $\alpha_{I,1},\alpha_{I,2}$ obeying (7), one can pretend that

[TABLE]

Informally, the estimate (7) asserts that $f$ exhibits significant oscillation at frequency $\alpha_{I}$ on the interval $I$ (or a large subinterval of this interval). We depict this situation schematically in Figure 1. In the schematic depictions we are pretending that if two such intervals $I_{1},I_{2}$ overlap (or are very near to each other), then their associated frequencies $\alpha_{I_{1}},\alpha_{I_{2}}$ are close modulo $1$ in the sense of (8).

At this point we point out a key example: if $f(n)=n^{it}$ for some $t=o(X^{2}/H^{2})$ , some Taylor expansion of the phase $n\mapsto t\log n$ of $f$ in $I$ reveals that one has the above inequality for some $\eta\gg 1$ and $\alpha_{I}=\frac{t}{x_{I}}$ , where $x_{I}$ denotes the starting point of $I$ . Thus, under the hypotheses of Theorem 1.4, we expect $\alpha_{I}$ to vary in $I$ in a manner which is “inversely proportional” to the location of $I$ in some sense. The bulk of our argument is devoted to rigorously verifying some version of this expectation; the main obstacle to overcome arises from the fact that $\alpha_{I}$ is only determined up modulo $1$ and up to perturbations by $O(1/H)$ .

Next, we recall an observation of Elliott [6] that by an application of the arithmetic large sieve inequality for a big set of primes $\mathcal{P}=\mathcal{P}_{I}\subset[2,H^{1/2}]$ , we have, for all $p\in\mathcal{P}$ ,

[TABLE]

see Proposition 2.5. To make things simpler we proceed in this outline as if the approximation (9) held for all primes $p\asymp P$ with $P\coloneqq H^{\varepsilon}$ and some small absolute constant $\varepsilon>0$ . Informally, (9) asserts that if $f(n)$ behaves like a constant multiple of $e(\alpha_{I}n)$ for $n\in I$ , then $f(m)$ behaves like a constant multiple of $e(\alpha_{I}mp)$ for $m\in I/p$ . Heuristically, this follows from the relationship $f(mp)=f(p)f(m)$ (at least when $m$ is coprime to $p$ ). We describe the estimate (9) schematically by the diagram in Figure 2. Note that this is consistent with the previous heuristic that $\alpha_{I}$ should be inversely proportional to the location of $I$ .

By the hypotheses of Theorem 1.4, we have some frequencies $\alpha_{(x,x+H]}$ for which

[TABLE]

and hence by a pigeonhole principle argument, we can find a large ( $\asymp X/H$ ) set of disjoint intervals $I$ of length $H$ in $[X,2X]$ for which (7) holds (after modifying $\eta$ slightly). From this, (9), and the Cauchy-Schwarz inequality, we will be able to locate a large set of quadruples $(I,J,p,q)$ with $I$ and $J$ disjoint intervals of length $H=X^{\varepsilon}$ for which

[TABLE]

and $p,q\asymp P=H^{\varepsilon}$ are primes for which (9) holds and such that $I/p\cap J/q\neq\emptyset$ ; see Figure 3.

Since the intervals $I/p$ and $J/q$ are nearby and the frequencies $p\alpha_{I}$ , $q\alpha_{J}$ lead to very large values of the short trigonometric polynomial supported respectively on $I/p$ and $J/q$ , we conclude from (8) that these frequencies lie (modulo $1$ and up to perturbations by $O(P/H)$ ) in a bounded set of $\ll 1$ frequencies. In particular by the pigeonhole principle it follows that, for a positive proportion of disjoint intervals $I,J$ of length $H$ and primes $p,q$ of size $P=H^{\varepsilon}$ with $I/p\cap J/q\neq\emptyset$ , we have the fundamental approximate equation

[TABLE]

relating the frequencies $\alpha_{I},\alpha_{J}$ associated to these intervals. The number of such quadruples $(I,J,p,q)$ is $\asymp(X/H)\cdot(P/\log P)^{2}$ , since once $I,p,q$ are chosen, $J$ is essentially determined by $I/p\cap J/q\neq\emptyset$ .

It would be nice if the congruence (11) held $\pmod{p}$ rather than just $\pmod{1}$ , as one could then profitably divide by $p$ . Fortunately, by the Chinese remainder theorem there exists a (potentially very large!) integer $k$ depending on $J$ and $q$ such that if we redefine $\alpha_{J}$ by shifting it by $k$ , then we do indeed have

[TABLE]

or equivalently

[TABLE]

for all $p\asymp P$ , with $p\neq q$ . Importantly, shifting $\alpha_{J}$ by $k\in\mathbb{Z}$ maintains the property (10), no matter how large $k$ is. The dependence of the integer $k$ on $q$ is a bit problematic; however let us suppose for sake of discussion that $k$ is independent of $q$ (we essentially end up achieving this through a different argument that involves two consecutive applications of the arithmetic large sieve). Then applying Cauchy-Schwarz we conclude that, for a positive proportion of intervals $J_{1},J_{2}$ and primes $q_{1},q_{2}\asymp P$ with555More precisely, $\frac{J_{1}}{q_{1}}$ and $\frac{J_{2}}{q_{2}}$ will both intersect a third interval $\frac{I}{p}$ , but this is almost the same as requiring that these intervals intersect each other, as they are all of comparable size; see Figure 4. For sake of this discussion, we ignore this technical distinction. $\frac{J_{1}}{q_{1}}\cap\frac{J_{2}}{q_{2}}\neq\emptyset$ , we have

[TABLE]

for many primes $p\asymp P$ . This is essentially the outcome of Section 3, though the argument there proceeds using a somewhat different arrangement of the above ingredients, most notably in that the prime $p$ ends up being at a different scale to the primes $q_{1},q_{2}$ , and the intervals $J_{1},J_{2}$ have length a bit less than $H$ (and are located at spatial scales a bit less than $X$ ). For sake of this discussion we assume that for the data $J_{1},J_{2},q_{1},q_{2}$ as above, the relation (12) holds for all $p\asymp P$ , not just for many such primes. We depict this relationship in graph theoretic language by connecting $J_{1}$ to $J_{2}$ by an edge which we label by the ratio $\frac{q_{2}}{q_{1}}$ of the primes needed to get from $J_{1}$ to (the vicinity of) $J_{2}$ by multiplication; see the dashed line in Figure 4. The resulting graph ${\mathcal{G}}$ is essentially undirected (except that if one wanted to get from $J_{2}$ to $J_{1}$ one would use the label $\frac{q_{1}}{q_{2}}$ rather than $\frac{q_{2}}{q_{1}}$ ) and multiplicity-free (the ratios $\frac{q_{2}}{q_{1}}$ for $q_{1}\neq q_{2}$ are all well separated from each other, so each pair $J_{1},J_{2}$ of distinct intervals may be connected by at most one such ratio).

Notice that the number of intervals $J_{1},J_{2}$ and primes $q_{1},q_{2}\asymp P$ constructed above is $\asymp(X/H)\cdot(P/\log P)^{2}$ ; thus the graph ${\mathcal{G}}$ described above has $\asymp X/H$ vertices and average degree $\asymp(P/\log P)^{2}$ . We begin Section 4 by applying Hölder’s inequality on ${\mathcal{G}}$ in a way that is motivated by Sidorenko’s conjecture (see [30]). We choose $k$ to be the first even integer for which

[TABLE]

Because of our hypotheses $H=X^{\theta}$ and $P=H^{\varepsilon}$ , we can take $k$ to be independent of $X$ . Roughly speaking, $k$ is the first integer at which we expect to see a very large number of non-trivial cycles of length $2k$ in the graph ${\mathcal{G}}$ . After many applications of Hölder’s ineqality, we can conclude that, for a positive proportion of disjoint intervals $I_{1},J_{1}\subset[X,2X]$ of length $H$ and primes $p_{1},q_{1}\asymp P$ with $I_{1}/p_{1}\cap J_{1}/q_{1}\neq\emptyset$ , there exist

[TABLE]

“chains” of intervals $I_{2}\ldots,I_{k},J_{2}\ldots,J_{k}\subset[X,2X]$ of length $H$ and primes

[TABLE]

such that, for all $\ell=1,2,\ldots,k$ ,

[TABLE]

and furthermore the approximate identities

[TABLE]

hold for all $p\asymp P$ , where we adopt the cyclic conventions $I_{k+1}=I_{1},J_{k+1}=J_{1}$ . The above set of relationships corresponds to two cycles of length $k$ in ${\mathcal{G}}$ connected by a further edge in ${\mathcal{G}}$ ; see Figure 5. The choice of $k$ is just large enough to ensure that the configuration in this figure will usually be non-degenerate in the sense that the primes $p_{1,1},\dots,q_{k,2},p_{1},q_{1}$ that arise are all distinct for most of the configurations. Since the primes $p$ in our case are of size $P=H^{\varepsilon}=X^{\varepsilon\theta}$ , it suffices to take $k$ bounded in terms of $\varepsilon,\theta$ to guarantee the existence of a large number of such chains.

Notice that we can interpret each of the relationships in (14) as holding $\pmod{p}$ instead of $\pmod{1}$ by multiplying by $p$ , thus obtaining the system of equations

[TABLE]

for all $p\asymp P$ . We can then use the Chinese remainder theorem to replace the $\pmod{p}$ congruences in (15) with $\pmod{Q}$ where $Q\coloneqq\prod_{p\asymp P}p$ . A key point for later analysis is that $Q$ is going to be extremely large (of size about $\exp(P)=\exp(X^{\varepsilon\theta})$ ), so much so that we will eventually be able to drop the congruence $\pmod{Q}$ altogether, once we obtain some more control on the location of the $\alpha_{I}$ .

After applying some algebra to (15) to eliminate all frequencies except $\alpha_{I_{1}},\alpha_{J_{1}}$ , we eventually conclude the estimates

[TABLE]

where $q_{1}^{\prime}\coloneqq\left|\prod_{\ell=1}^{k}p_{\ell,1}-\prod_{\ell=1}^{k}p_{\ell,2}\right|$ and $q_{2}^{\prime}\coloneqq\left|\prod_{\ell=1}^{k}q_{\ell,1}-\prod_{\ell=1}^{k}q_{\ell,2}\right|$ . The integers $q^{\prime}_{1},q^{\prime}_{2}$ are small; in fact the condition (13) will give the bound $q^{\prime}_{1},q^{\prime}_{2}\ll H^{O(\varepsilon)}$ . We can also assume that these integers are non-zero, because the number of intervals $I_{\ell},J_{\ell}$ and primes $p_{i,j},q_{i,j}$ for which $q_{j}^{\prime}$ could be zero is negligible. It follows then from (16), (17) that

[TABLE]

for some $a_{1},a_{2}\in\mathbb{Z}$ , $0<q_{1}^{\prime},q_{2}^{\prime}\ll H^{O(\varepsilon)}$ , and $T_{I_{1}},T_{J_{1}}\ll X^{2}/H^{2-\rho}$ , where $x_{I_{1}},x_{J_{1}}$ the starting points of the intervals $I_{1}$ , $J_{1}$ , respectively.

Suppose now for simplicity that $q_{1}^{\prime}=q_{2}^{\prime}=1$ , so that

[TABLE]

Notice that since $I_{1}\cap\frac{p_{1}}{q_{1}}J_{1}\neq\emptyset$ we have $x_{I_{1}}\approx\frac{p_{1}}{q_{1}}x_{J_{1}}$ . Combining (19), (20) with (18) we obtain the key relationship

[TABLE]

since $T_{I_{1}},T_{J_{1}}$ are much smaller in magnitude than $Q$ , we may now drop the congruence and conclude in fact that

[TABLE]

informally speaking, this means that the map $I\mapsto T_{I}$ is approximately locally constant on the graph ${\mathcal{G}}$ . Obtaining these quadruples $(I_{1},I_{2},p_{1},p_{2})$ with all the described properties is essentially the content of Section 4.

A Taylor expansion shows that if $\alpha_{I_{1}}$ is as in (19), then $e(-\alpha_{I_{1}}n)\approx e^{i\theta_{I_{1}}}n^{2\pi iT_{I_{1}}}$ with $\theta_{I_{1}}\in\mathbb{R}$ depending only on $I_{1}$ . Similarly for (20). Thus there exists a positive proportion set of disjoint intervals $I,J$ connected by an edge in ${\mathcal{G}}$ such that

[TABLE]

for some $T_{I},T_{J}\ll X^{2}/H^{2}$ with $T_{I}=T_{J}+O(PX/H)$ . To proceed further, we claim that the graph ${\mathcal{G}}$ is essentially an “expander graph” and in particular that it has one very large and highly connected component. This is the content of Section 5.

To see this claim, notice that taking a $O(PX/H)$ -spaced set of values $V$ in the range $\{T:T=O(X^{2}/H^{2-\rho})\}$ , we can group the intervals $I$ into subsets $\mathcal{A}(V)$ of those intervals $I$ for which $T_{I}=V+O(PX/H)$ . Then, because many pairs of intervals $I,J$ connected by an edge in $\mathcal{G}$ belong to the same $\mathcal{A}(V)$ , we obtain a large lower bound of the form

[TABLE]

where $P\coloneqq H^{\varepsilon}$ . That is we obtain a lower bound that corresponds to a positive proportion of disjoint intervals $I,J\subset[X,2X]$ of length $H$ and primes $p,q\asymp P$ such that $\frac{I}{p}\cap\frac{J}{q}\neq\emptyset$ . Now, since the exponential sum $\sum_{p\asymp H^{\varepsilon}}p^{it}$ exhibits cancellations, we can (using a bit of harmonic analysis) essentially bound the above by

[TABLE]

Noticing that $\sum_{V}\#\mathcal{A}(\mathcal{V})\ll X/H$ , we see that the above expression is in turn

[TABLE]

and therefore, combining (21) and (22), there exists a value $V$ for which $\#\mathcal{A}(V)\gg X/H$ . That is, there exists a universal $T\ll X^{2}/H^{2}$ (up to non-essential perturbations by $O(PX/H)$ that we can ignore) such that for a positive proportion of disjoint intervals $I$ of length $H$ we have,

[TABLE]

Averaging over such intervals it follows that, there exists $T\in\mathbb{R}$ such that $|T|\ll X^{2}/H^{2}$ and

[TABLE]

By the main theorem of [23] (or rather more precisely its extension to complex valued functions as in [24, Theorem A.3]) this implies that $f$ has to behave essentially as $n^{-iT}\chi(n)$ with $\chi$ a Dirichlet character of bounded conductor and $|T|\ll X^{2}/H^{2}$ , thus finishing the proof.

1.2. Some final remarks

It is very likely that it is possible, at the expense of additional technical difficulties, to push our argument down to $H=\exp((\log X)^{1-\delta})$ for some $\delta>0$ . However we start running into difficulties when $H$ hits $\exp((\log X)^{2/3+\varepsilon})$ and our argument appears to hit a hard limit when $H$ enters the neighborhood of powers of $\log X$ .

The first obstruction occurs because we require the set of primes $\mathcal{P}\subset[1,H]$ to be sufficiently dense so that at the very least $\prod_{p\in\mathcal{P}}p>X^{2}$ . This implies that $H$ needs to be larger than $\log X$ .

The second obstruction which prevents $H$ from going below $\exp((\log X)^{2/3})$ occurs because we require the exponential sum $\sum_{p\asymp H^{\varepsilon}}p^{it}$ to exhibit cancellations for $t$ of size $X$ . This is only known for $H>\exp((\log X)^{2/3+\varepsilon})$ following the work of Vinogradov-Korobov. This obstruction can be circumvented (in the case of the Liouville function, at least) by assuming the Riemann Hypothesis. In that case the exponential sum $\sum_{p\asymp H^{\varepsilon}}p^{it}$ will be non-trivially small provided that $H$ is a large power of the logarithm (specifically $H>(\log X)^{3/\varepsilon}$ ). However, we have not verified that the remaining portions of the argument extend to this range (among other things, one would need to make more precise the dependence of various implied constants on the parameter $k$ , which now must grow with $X$ instead of being fixed).

Notational conventions.

As usual $f\ll g$ , $g\gg f$ or $f=O(g)$ means that there is an absolute constant $C>0$ such that $|f|\leq Cg$ . If $C$ needs to depend on some parameters then we indicate this by subscripts, for instance $f\ll_{\eta}g$ denotes the estimate $|f|\leq C_{\eta}g$ for some $C_{\eta}$ depending on $g$ . If we write $f=o(g)$ as $X\to\infty$ this means that $|f|\leq c(X)g$ where $c(X)$ is a quantity that goes to zero as $X$ tends to infinity (which may make other quantities dependent on $X$ , such as $H$ , go to infinity also). We also write $f\asymp g$ for $f\ll g\ll f$ .

We set $e(x)\coloneqq e^{2\pi ix}$ . The symbol $p$ always denotes a prime, and so do $p^{\prime},p^{\prime\prime}$ . Given an interval $I=[a,b]$ we define $I/p\coloneqq[a/p,b/p]$ . Whenever we write $\alpha\equiv\beta+O(\eta)\pmod{1}$ we mean that there exists an absolute constant $C$ such that, $\|\alpha-\beta\|\leq C|\eta|$ where $\|x\|$ denotes the distance of $x$ from the nearest integer. Similarly whenever we write $\alpha\equiv\beta+O(\eta)\pmod{q}$ we mean $\alpha/q\equiv\beta/q+O(\eta/q)\pmod{1}$ . Given two intervals $I=[a,b]$ and $J=[c,d]$ with $b<c$ , whenever we write $\text{dist}(I,J)\leq\eta$ , we mean that $|c-b|\leq\eta$ . If $I=[a,b]$ and $c>0$ , we write $cI\coloneqq[ca,cb]$ , thus for instance $I/p=[a/p,b/p]$ .

Acknowledgments.

KM was supported by Academy of Finland grant no. 285894. MR was supported by an NSERC DG grant, the CRC program and a Sloan Fellowship. TT was supported by a Simons Investigator grant, the James and Carol Collins Chair, the Mathematical Analysis & Application Research Fund Endowment, and by NSF grant DMS-1266164. Part of this paper was written while the authors were in residence at MSRI in Spring 2017, which is supported by NSF grant DMS-1440140.

2. Auxiliary results

We collect here some standard results that will be used (mostly) in section 3.

In order to use some tools from graph theory, it is convenient666It should also be possible to work in a purely continuous setting, replacing various summations in our arguments with appropriately normalized integrals, using Fubini’s theorem in place of double counting arguments, allowing the intervals under consideration to overlap each other, and with various graph-theoretic inequalities replaced by their continuous counterparts. We leave the details of this alternate arrangement of the argument to the interested reader. to replace the continuous integral $\int_{X}^{2X}\ dx$ in Theorem 1.4 by something more discrete. Given $X,H$ , define a $(X,H)$ -family of intervals to be a finite collection $\mathcal{I}$ of intervals $I=[x_{I},x_{I}+H]$ of length $H$ contained in $[X/10,10X]$ , such that any pair of intervals in $\mathcal{I}$ are separated by a distance at least $500H$ ; in particular, the intervals in $\mathcal{I}$ are disjoint, and thus the cardinality of $\mathcal{I}$ cannot exceed $X/H$ .

We then have

Lemma 2.1 (Discretizing).

Let $a(n)$ be a sequence of complex numbers with $|a(n)|\leq 1$ for all integers $n\geq 1$ . Let $\eta>0$ and $X\geq H\geq 1$ . Suppose that

[TABLE]

Then there exist an $(X,H)$ -family of intervals $\mathcal{I}$ of cardinality $\geq\frac{\eta X}{1000H}$ and real numbers $\alpha_{I}$ associated to each $I\in\mathcal{I}$ such that, for all $I\in\mathcal{I}$ ,

[TABLE]

Proof.

It follows from (23) and the pigeonhole principle that there exists $y\in[0,H)$ such that

[TABLE]

Given $0\leq v<500$ , let $\mathcal{I}_{v}$ be the sub-collection of intervals $I=((500\ell+v)H+y,(500\ell+v+1)H+y]$ with $\frac{X}{500H}\leq\ell\leq\frac{X}{250H}$ for which

[TABLE]

Let $\mathcal{I}=\bigcup_{0\leq v<500}\mathcal{I}_{v}$ . It follows from (25) and the trivial bound $|a(n)|\leq 1$ , that

[TABLE]

Thus there exists an $0\leq v<500$ for which $\mathcal{I}_{v}$ is an $(X,H)$ -family of intervals of cardinality $\geq\frac{\eta X}{1000H}$ . Setting $\mathcal{I}=\mathcal{I}_{v}$ , we obtain the claim. ∎

The frequency $\alpha_{I}$ in the above proposition is not unique: one can shift it by any integer, and one can also perturb it by up to a small multiple of $\eta/H$ without significantly affecting (24). However, it turns out that modulo these freedoms, there are only a bounded number of choices for $\alpha_{I}$ (if one views $\eta$ as being fixed). More precisely, one has

Lemma 2.2 (Maximal large sieve).

Let $H\geq 1$ and let $I$ be an interval of length $10H$ . Let $\eta>0$ be given. Let $|a(n)|\leq 1$ be a sequence of complex numbers. Suppose that there exist $J\geq 1$ , frequencies $\alpha_{1},\alpha_{2},\ldots,\alpha_{J}\in\mathbb{R}$ and sub-intervals $I_{1},I_{2},\ldots,I_{J}\subset I$ of length at most $H$ such that

[TABLE]

for all $j=1,\ldots,J$ . Assume $H$ sufficiently large depending on $\eta$ . Then there exist a natural number $K\leq C\eta^{-2}$ with $C$ an absolute constant and frequencies $\beta_{1},\ldots,\beta_{K}$ depending only on $\eta>0$ , the sequence $\{a(\cdot)\}$ and the interval $I$ , such that, for each $1\leq j\leq J$ , there exists $k\in\{1,\dotsc,K\}$ with

[TABLE]

where we recall that $\|x\|=\text{dist}(x,\mathbb{Z})$ .

Proof.

Let $\gamma_{1}$ be the frequency $\gamma$ that maximizes the quantity

[TABLE]

with the supremum taken over all sub-intervals $L$ of $I$ . For $i\geq 2$ we define $\gamma_{i}$ inductively as the frequency that maximizes (26) in the region $[0,1]\backslash\bigcup_{j=1}^{i-1}[\gamma_{j}-\frac{1}{H},\gamma_{j}+\frac{1}{H}]$ . We thus obtain frequencies $\gamma_{1},\ldots,\gamma_{R}$ with $R$ a parameter to be chosen later, and moreover $\|\gamma_{i}-\gamma_{j}\|>\frac{1}{H}$ for $i\neq j$ .

Using the Carleson-Hunt theorem, it was proven by Montgomery [28, Theorem 2] that one has the maximal large sieve inequality777At the cost of worsening the dependence on $\eta$ slightly, one could also use the standard large sieve inequality [27] here, combined with Lemma 2.4 below.

[TABLE]

with $C$ an absolute constant. The right-hand side is $O(H(R+H))$ . Choosing $R$ to be a large multiple of $\eta^{-2}$ , it follows that there are at most $K\ll\eta^{-2}$ frequencies $\gamma_{i}$ for which

[TABLE]

Therefore for any $\alpha$ lying outside of

[TABLE]

we have

[TABLE]

Our assumption is that for each $\alpha_{j}$ with $1\leq j\leq J$ there exists an interval $I_{j}$ with $I_{j}\subset I$ for which

[TABLE]

Therefore $\alpha_{1},\ldots,\alpha_{J}\in\bigcup_{i=1}^{K}[\gamma_{i}-\frac{1}{H},\gamma_{i}+\frac{1}{H}]$ and the claim follows. ∎

We record also the following variant of the large sieve that we will need in Section 5.

Lemma 2.3 (Variant of large sieve).

Let $1\leq H\leq X$ and $R\in\mathbb{N}$ . Let $x_{1},\dotsc,x_{R}\in[1,X]$ be $H$ -separated (thus $|x_{i}-x_{j}|\geq H$ for all $1\leq i<j\leq R$ ). Then

[TABLE]

Proof.

Let $\Phi(t)$ be a smooth function such that $\Phi(t)\geq 1$ for $|t|\leq 1$ and with $\text{supp }\widehat{\Phi}\subset(-1,1)$ . Then the left-hand side of (27) is

[TABLE]

as claimed. ∎

We will also need the following tool from harmonic analysis.

Lemma 2.4 (Completion of sums).

There exists an absolute constant $\eta_{0}>0$ such that the following holds. Let $J$ be an interval of length $H$ and $a(n)$ complex coefficients with $|a(n)|\leq 1$ for all integers $n\geq 1$ . Let $I$ be an interval with $I\subset J$ . Suppose that $\eta\in(0,\eta_{0})$ and $\alpha\in\mathbb{R}$ are such that

[TABLE]

Then there exists $\theta\in\mathbb{R}$ such that $|\theta|\leq\frac{1}{\eta^{2}H}$ and

[TABLE]

Proof.

Let $y,z\in\mathbb{R}$ be chosen so that $I=[y,z]$ . Let $f$ be a smooth function with $f(n)=1$ for $n\in I$ , $|f(n)|\leq 1$ for all integers $n$ , and compactly supported in $[y-\frac{\eta}{100}\cdot H,z+\frac{\eta}{100}\cdot H]$ . Moreover we can ensure that $f$ is a Schwartz function with $|f^{(j)}(x)|\ll_{j}(\eta H)^{-j}$ for all $x\in\mathbb{R}$ and therefore with $|\widehat{f}(x)|=|\int_{\mathbb{R}}f(u)e(-xu)du|\ll_{A}H(1+\eta H|x|)^{-A}$ for all $A\in\mathbb{N}$ . Let

[TABLE]

Applying Poisson summation to $g(\beta)$ and using the above bound on $\widehat{f}$ we see that

[TABLE]

Moreover by construction of $g$ ,

[TABLE]

We split the integral on the right-hand side into two parts, namely $|\beta|\leq\frac{1}{\eta^{2}H}$ and the complement. We estimate the part over $|\beta|<\frac{1}{\eta^{2}H}$ trivially only using the bound $|g(\beta)|<2H$ . On the second part we apply Cauchy-Schwarz, Plancherel and (28) to see that it is bounded by $\ll\eta^{2}H$ . Collecting these estimates we conclude that

[TABLE]

Therefore there exists $\beta\in\mathbb{R}$ such that $|\beta|<\frac{1}{\eta^{2}H}$ and

[TABLE]

as needed. ∎

In section 3 we will frequently relate the Fourier behavior of $f$ on an interval $I$ with the behavior on dilated intervals $I/p$ for various primes $p$ . The key tool here is

Proposition 2.5 (Mean scales down).

Let $x\geq H\geq 1$ , and let $f:(x,x+H]\to\mathbb{C}$ obey the bound

[TABLE]

(thus $f=O(1)$ on average on $(x,x+H]$ in an $L^{2}$ sense). Then

[TABLE]

In particular, by Markov’s inequality, for any $\delta>0$ we have

[TABLE]

for all primes $p\leq H$ outside of an exceptional set ${\mathcal{P}}$ of primes with $\sum_{p\in{\mathcal{P}}}\frac{1}{p}\ll\delta^{-2}$ .

Proof.

See [6, Lemma 4.7]. ∎

We will also need the following number-theoretic estimate, in particular to dispose of some degenerate cases.

Lemma 2.6 (Counting nearby products of primes).

Let $k\in\mathbb{N}$ and $P^{\prime},N\geq 3$ be such that $(P^{\prime})^{k-1}\gg N$ . Write $d=P^{\prime 2}/(\log P^{\prime})^{2}$ . Then the number of $2k$ -tuples $(p^{\prime}_{1,1},\dots,p^{\prime}_{1,k},p^{\prime}_{2,1},\dots,p^{\prime}_{2,k})$ of primes in $[P^{\prime},2P^{\prime}]$ obeying the condition

[TABLE]

with $C>0$ a constant, is at most $O_{k,C}(\frac{(P^{\prime})^{2k}}{N\log^{2k}P^{\prime}})=O_{k,C}(\frac{d^{k}}{N})$ .

If we also impose the additional condition

[TABLE]

for some modulus $q\in\mathbb{N}$ , then the number of tuples is bounded by

[TABLE]

Proof.

Since the first claim follows from the second by specializing to $q=1$ it is enough to prove the second claim.

First notice that without loss of generality we can assume that $q\leq(\log N)^{3k}$ since otherwise the claim is trivial by replacing products of primes by integers (i.e., using the crude bound that every integer has at most $O_{k}(1)$ representations as a product of $k$ primes) and counting trivially.

Let $w$ be a smooth function such that $w(x)=1$ for $|x|\leq 100C$ . Then, the number of primes $p^{\prime}_{1,j},p^{\prime}_{2,\ell}$ for which (30) and (31) hold is

[TABLE]

Since $q<P^{\prime}$ and all of the $p^{\prime}_{1,j},p^{\prime}_{2,\ell}$ are primes, we can express the congruence condition using Dirichlet characters, thus

[TABLE]

where the sum is over all Dirichlet characters of period $q$ . Using this identity and the Fourier inversion formula $w(x)=\int_{\mathbb{R}}\widehat{w}(t)e^{2\pi ixt}dt$ , we see that the expression (32) is equal to

[TABLE]

Since $q\leq(\log N)^{3k}\ll_{k}(\log P^{\prime})^{3k}$ , the zero-free region for $L(s,\chi)$ gives

[TABLE]

see for instance [21, Lemma 2.4]. Using this pointwise estimate it follows that

[TABLE]

To bound the part of the integral with large $|t|$ we notice that for arbitrary coefficients $a(n)$ , we have the $L^{2}$ mean value theorem

[TABLE]

(see e.g., [17, Theorem 9.1]), while from the pointwise estimate we have

[TABLE]

Since

[TABLE]

where

[TABLE]

we may thus bound the part of the integral with $|t|>\exp((\log P^{\prime})^{1/100})$ using (33) by

[TABLE]

as required. Combining the two bounds, the claim follows. ∎

3. Intervals and frequencies

Assume we have the hypotheses of Theorem 1.4, thus there exists an $\eta>0$ such that

[TABLE]

Informally speaking, the main purpose of this section is to produce a large set $\mathcal{I}^{\prime\prime}$ of disjoint intervals $I^{\prime\prime}$ , each of length comparable to some quantity $L$ (which will be slightly shorter than $H$ ), as well as associated frequencies $\alpha^{\prime\prime}_{I^{\prime\prime}}$ with

[TABLE]

and a scale $P^{\prime}$ with the following property: For a positive proportion of quadruples $(I^{\prime\prime},J^{\prime\prime},p^{\prime},q^{\prime})\in\mathcal{I}^{2}\times[P^{\prime},2P^{\prime}]^{2}$ with $p^{\prime},q^{\prime}$ prime such that $I^{\prime\prime}$ is close to $\frac{p^{\prime}}{q^{\prime}}J^{\prime\prime}$ we have

[TABLE]

for a positive proportion of primes $p^{\prime\prime}$ in some range $[P^{\prime\prime}/2,P^{\prime\prime}]$ (compare with (12)). Moreover the ranges $P^{\prime\prime},P^{\prime},L$ are all related by $\log P^{\prime\prime}\asymp\log P^{\prime}\asymp\log L$ and $L\asymp H/P^{\prime}P^{\prime\prime}$ . This is the content of Proposition 3.2 below. We first need a preliminary proposition.

Proposition 3.1 (Scaling down).

Let $1\leq P\leq Q\leq H\leq X$ and $\eta>0$ , and let $f\colon\mathbb{N}\to\mathbb{C}$ be a $1$ -bounded multiplicative function. Assume that $P$ and $\frac{\log Q}{\log P}$ are sufficiently large depending on $\eta$ . Suppose that there exist an $(X,H)$ -family $\mathcal{I}$ of intervals of cardinality $\gg_{\eta}X/H$ and a real number $\alpha_{I}$ associated to each $I\in\mathcal{I}$ such that

[TABLE]

for all $I\in{\mathcal{I}}$ . Then there exist $P^{\prime}\in[P,Q/2]$ , an $(\frac{X}{P^{\prime}},\frac{H}{P^{\prime}})$ -family $\mathcal{I}^{\prime}$ of intervals of cardinality $\gg_{\eta}X/H$ , and a real number $\alpha^{\prime}_{I^{\prime}}$ associated to each $I^{\prime}\in\mathcal{I}^{\prime}$ , such that

[TABLE]

for all $I^{\prime}\in\mathcal{I}^{\prime}$ . Furthermore, for each $I^{\prime}\in\mathcal{I}^{\prime}$ , one can find $\gg_{\eta}\frac{P^{\prime}}{\log P^{\prime}}$ pairs $(I,p^{\prime})$ , where $I$ is an interval in $\mathcal{I}$ and $p^{\prime}$ is a prime in $[P^{\prime},2P^{\prime}]$ , such that $I/p^{\prime}$ lies within $3\frac{H}{P^{\prime}}$ of $I^{\prime}$ , and such that

[TABLE]

The conclusions of Proposition 3.1 are depicted schematically in Figure 6.

Proof.

For each $I\in\mathcal{I}$ , we apply Proposition 2.5 to the function $n\mapsto f(n)e(-\alpha_{I}n)$ on $I$ , and with $\delta$ sufficiently small depending on $\eta$ , to conclude that

[TABLE]

for all primes $p^{\prime}\in[P,Q]$ outside of an exceptional set ${\mathcal{P}}_{I}$ with

[TABLE]

Summing over all $I\in{\mathcal{I}}$ (recalling that this collection of intervals has cardinality at most $X/H$ ), we conclude

[TABLE]

From Mertens’ theorem and the pigeonhole principle, we may thus find $P^{\prime}\in[P,Q/2]$ such that

[TABLE]

Fix this quantity $P^{\prime}$ . If $\frac{\log Q}{\log P}$ is large enough, we conclude from the prime number theorem that

[TABLE]

and thus we have (35) for $\gg_{\eta}\frac{X}{H}\frac{P^{\prime}}{\log P^{\prime}}$ pairs $(I,p^{\prime})$ with $I\in{\mathcal{I}}$ and $p^{\prime}\in[P^{\prime},2P^{\prime}]$ .

As $f$ is multiplicative, we have $f(np^{\prime})=f(n)f(p^{\prime})$ unless $n$ is a multiple of $p^{\prime}$ . The latter contributes at most $O(\frac{H}{p^{\prime}P})$ to the left-hand side of (35), which is negligible compared to the right-hand side as $P$ (and hence $p^{\prime}$ ) is large. Thus we may freely replace $f(np^{\prime})$ by $f(n)f(p^{\prime})$ , and conclude that

[TABLE]

for $\gg_{\eta}\frac{X}{H}\frac{P^{\prime}}{\log P^{\prime}}$ pairs $(I,p^{\prime})$ . (Compare with Figure 2.)

Let ${\mathcal{S}}$ denote the collection of these pairs $(I,p^{\prime})$ , and let ${\mathcal{I}}_{1}$ denote the collection of all intervals of the form $I/p^{\prime}$ where $(I,p^{\prime})\in{\mathcal{S}}$ . These are intervals in $[0,10X/P^{\prime}]$ of length between $H/2P^{\prime}$ and $H/P^{\prime}$ . By a simple greedy algorithm, we may find a subfamily ${\mathcal{I}}_{2}$ of these intervals which are separated by distance at least $2H/P^{\prime}$ , with the property that every interval in ${\mathcal{I}}_{1}$ lies within a distance $3H/P^{\prime}$ of one of the intervals in ${\mathcal{I}}_{2}$ .

By (36) and Lemma 2.2, we can associate to each interval $I^{\prime}\in{\mathcal{I}}_{2}$ some real numbers $\beta_{I^{\prime},1},\dots,\beta_{I^{\prime},K(I^{\prime})}$ for some $K(I^{\prime})\ll_{\eta}1$ , with the property that, for each pair $(I,p^{\prime})\in{\mathcal{S}}$ with $I/p^{\prime}$ within $3H/P^{\prime}$ of $I^{\prime}$ , one has

[TABLE]

for some $1\leq k\leq K(I^{\prime})$ . By adding dummy values of $\beta$ if necessary we may take $K=K(I^{\prime})$ independent of $I^{\prime}$ . By the pigeonhole principle, we may find $1\leq k_{0}\leq K$ such that one has

[TABLE]

for $\gg_{\eta}\frac{X}{H}\frac{P^{\prime}}{\log P^{\prime}}$ triples $(I,p^{\prime},I^{\prime})$ with $(I,p^{\prime})\in{\mathcal{S}}$ and $I^{\prime}\in{\mathcal{I}}_{2}$ with $\frac{1}{p^{\prime}}I$ within distance $3\frac{H}{P^{\prime}}$ of $I^{\prime}$ . If we let ${\mathcal{T}}$ be the collection of such triples, then one can find a subset ${\mathcal{I}}_{3}$ of ${\mathcal{I}}_{2}$ of cardinality $\gg_{\eta}\frac{X}{H}$ with the property that for each $I^{\prime}\in{\mathcal{I}}_{3}$ , there are $\gg_{\eta}\frac{P^{\prime}}{\log P^{\prime}}$ pairs $(I,p^{\prime})\in{\mathcal{S}}$ with $(I,p^{\prime},I^{\prime})\in{\mathcal{T}}$ .

For $I^{\prime}\in{\mathcal{I}}_{3}$ , pick one of the pairs $(I(I^{\prime}),p^{\prime}(I^{\prime}))\in{\mathcal{S}}$ with $(I(I^{\prime}),p^{\prime}(I^{\prime}),I^{\prime})\in{\mathcal{T}}$ , then from (36) we have

[TABLE]

while from (37) we have

[TABLE]

whenever $(I,p^{\prime})\in{\mathcal{S}}$ with $(I,p^{\prime},I^{\prime})\in{\mathcal{T}}$ .

The interval $I(I^{\prime})/p^{\prime}(I^{\prime})$ lies in $[0,10X/P^{\prime}]$ with length between $H/2P^{\prime}$ and $H/P^{\prime}$ . Let $J(I^{\prime})$ be an interval in $[0,10X/P^{\prime}]$ of length exactly $H/P^{\prime}$ containing $I(I^{\prime})/p^{\prime}(I^{\prime})$ . By Lemma 2.4 and (38), we have

[TABLE]

for some real number

[TABLE]

In particular

[TABLE]

whenever $(I,p^{\prime})\in{\mathcal{S}}$ with $(I,p^{\prime},I^{\prime})\in{\mathcal{T}}$ .

Setting ${\mathcal{I}}^{\prime}$ to be a $500H/P^{\prime}$ -separated collection of $\gg X/H$ intervals of the form $J(I^{\prime})$ with $I^{\prime}\in{\mathcal{I}}_{3}$ , we obtain the claim. ∎

We are now ready to prove the main result of this section.

Proposition 3.2.

Let $X\geq 2$ , $\theta\in(0,1)$ , $\eta>0$ , and $\rho\in(0,1/8)$ . Let $f:\mathbb{N}\rightarrow\mathbb{C}$ be a multiplicative function with $|f|\leq 1$ . Suppose that, for $H=X^{\theta}$ , we have

[TABLE]

Let $\varepsilon\in(0,\rho/100)$ be sufficiently small depending on $\theta$ and $\eta$ , and assume $X$ is sufficiently large depending on $\theta,\eta$ , $\rho$ , and $\varepsilon$ . Then there exist $P^{\prime},P^{\prime\prime}\in[X^{\varepsilon^{2}},X^{\varepsilon}]$ , an $(\frac{X}{P^{\prime}P^{\prime\prime}},\frac{H}{P^{\prime}P^{\prime\prime}})$ -family $\mathcal{I}^{\prime\prime}$ of intervals of cardinality $\gg X/H$ , and a real number $\alpha^{\prime\prime}_{I^{\prime\prime}}$ associated to each $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ such that

[TABLE]

for all $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ . Furthermore, there exist $\gg_{\eta}(\frac{P^{\prime}}{\log P^{\prime}})^{2}\frac{X}{H}$ quadruples $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2})$ with $I^{\prime\prime}_{1},I^{\prime\prime}_{2}$ distinct intervals in $\mathcal{I}^{\prime\prime}$ and $p^{\prime}_{1},p^{\prime}_{2}$ distinct primes in $[P^{\prime},2P^{\prime}]$ , such that $I^{\prime\prime}_{1}$ lies within $50\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ , and such that

[TABLE]

for $\gg_{\eta}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ primes $p^{\prime\prime}\in[P^{\prime\prime}/2,P^{\prime\prime}]$ .

Proof.

By Lemma 2.1, one can find $(X,H)$ -family $\mathcal{I}$ of intervals of cardinality $\gg\eta X/H$ and a real number $\alpha_{I}$ associated to each $I\in\mathcal{I}$ such that

[TABLE]

for all $I\in{\mathcal{I}}$ . Applying Proposition 3.1, one can find $P^{\prime}\in[X^{\varepsilon^{2}},X^{\varepsilon}]$ , an $(\frac{X}{P^{\prime}},\frac{H}{P^{\prime}})$ -family $\mathcal{I}^{\prime}$ of intervals of cardinality $\gg_{\eta}X/H$ , and a real number $\alpha^{\prime}_{I^{\prime}}$ associated to each $I^{\prime}\in\mathcal{I}^{\prime}$ , such that

[TABLE]

for all $I^{\prime}\in\mathcal{I}^{\prime}$ . Furthermore, for each $I^{\prime}\in\mathcal{I}^{\prime}$ , one can find $\gg_{\eta}\frac{P^{\prime}}{\log P^{\prime}}$ pairs $(I,p^{\prime})$ , where $I$ is an interval in $\mathcal{I}$ and $p^{\prime}$ is a prime in $[P^{\prime},2P^{\prime}]$ , such that $I/p^{\prime}$ lies within $3\frac{H}{P^{\prime}}$ of $I^{\prime}$ and

[TABLE]

By a second application of Proposition 3.1, one can find $P^{\prime\prime}\in[(X/P^{\prime})^{\varepsilon^{2}},(X/P^{\prime})^{\varepsilon}]$ , an $(\frac{X}{P^{\prime}P^{\prime\prime}},\frac{H}{P^{\prime}P^{\prime\prime}})$ -family $\mathcal{I}^{\prime\prime}$ of intervals of cardinality $\gg_{\eta}X/H$ , and a real number $\alpha^{\prime\prime}_{I^{\prime\prime}}$ associated to each $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , such that

[TABLE]

for all $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ . Furthermore, for each $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , one can find $\gg_{\eta}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ pairs $(I^{\prime},p^{\prime\prime})$ , where $I^{\prime}$ is an interval in $\mathcal{I}^{\prime}$ and $p^{\prime\prime}$ is a prime in $[P^{\prime\prime}/2,P^{\prime\prime}]$ , such that $I^{\prime}/p^{\prime\prime}$ lies within $3\frac{H}{P^{\prime}P^{\prime\prime}}$ of $I^{\prime\prime}$ , and such that

[TABLE]

Also, since the $I^{\prime}$ are $500H$ -separated, we see that each prime $p^{\prime\prime}$ is associated to at most one $I^{\prime}$ in this fashion (for a fixed choice of $I^{\prime\prime}$ ). The above situation is depicted in Figure 7.

Note that if $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , then one can add an arbitrary integer to each real number $\alpha^{\prime\prime}_{I^{\prime\prime}}$ without affecting any of the above properties. In particular, if one adds an integer with an appropriate residue class mod $p^{\prime\prime}$ , one can upgrade (41) to

[TABLE]

for any pair $(I^{\prime},p^{\prime\prime})$ appearing previously. By the Chinese remainder theorem, we may thus select $\alpha^{\prime\prime}_{I^{\prime\prime}}$ so that (42) holds for all pairs $(I^{\prime},p^{\prime\prime})$ appearing previously.

Combining the above properties, we see that we can find $\gg_{\eta}\frac{P^{\prime}}{\log P^{\prime}}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}\frac{X}{H}$ quintuplets $(I,I^{\prime},I^{\prime\prime},p^{\prime},p^{\prime\prime})$ , where $I\in\mathcal{I}$ , $I^{\prime}\in\mathcal{I}^{\prime}$ , $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , $p^{\prime}$ is a prime in $[P^{\prime},2P^{\prime}]$ , $p^{\prime\prime}$ is a prime in $[P^{\prime\prime}/2,P^{\prime\prime}]$ , $\frac{1}{p^{\prime}}I$ lies within $3\frac{H}{P^{\prime}}$ of $I^{\prime}$ , $\frac{1}{p^{\prime\prime}}I^{\prime}$ lies within $3\frac{H}{P^{\prime}P^{\prime\prime}}$ of $I^{\prime\prime}$ , and one has the equations

[TABLE]

and

[TABLE]

Multiplying the first equation by $p^{\prime\prime}$ and combining with the second equation, we conclude in particular that

[TABLE]

The number of possible choices for $(I,p^{\prime\prime})$ is (trivially) at most $\frac{P^{\prime\prime}}{\log P^{\prime\prime}}\frac{X}{H}$ . Applying the Cauchy-Schwarz inequality, we conclude that we can find $\gg_{\eta}(\frac{P^{\prime}}{\log P^{\prime}})^{2}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}\frac{X}{H}$ octuplets $(I,I^{\prime}_{1},I^{\prime}_{2},I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2},p^{\prime\prime})$ , where

•

$I\in{\mathcal{I}}$ , $I^{\prime}_{1},I^{\prime}_{2}\in\mathcal{I}^{\prime}$ , $I^{\prime\prime}_{1},I^{\prime\prime}_{2}\in\mathcal{I}^{\prime\prime}$ ;

•

$p^{\prime}_{1},p^{\prime}_{2}$ are primes in $[P^{\prime},2P^{\prime}]$ , and $p^{\prime\prime}$ is a prime in $[P^{\prime\prime}/2,P^{\prime\prime}]$ ;

•

For $i=1,2$ , $\frac{1}{p^{\prime}_{i}}I$ lies within $3\frac{H}{P^{\prime}}$ of $I^{\prime}_{i}$ , and $\frac{1}{p^{\prime\prime}}I^{\prime}_{i}$ lies within $3\frac{H}{P^{\prime}P^{\prime\prime}}$ of $I^{\prime\prime}_{i}$ .

•

We have

[TABLE]

and

[TABLE]

See Figure 8.

Multiplying (43) by $p^{\prime}_{2}$ and (44) by $p^{\prime}_{1}$ and then subtracting, we see that

[TABLE]

Also, $p^{\prime}_{1}I^{\prime}_{1}$ lies within $6H$ of $p^{\prime}_{1}p^{\prime\prime}I^{\prime\prime}_{1}$ and $p^{\prime}_{2}I^{\prime}_{2}$ lies within $6H$ of $p^{\prime}_{2}p^{\prime\prime}I^{\prime\prime}_{2}$ , so by the triangle inequality $p^{\prime}_{1}p^{\prime\prime}I^{\prime\prime}_{1}$ and $p^{\prime}_{2}p^{\prime\prime}I^{\prime\prime}_{2}$ lie at distance at most $24H$ from each other. Hence, on dividing by $p^{\prime}_{1}p^{\prime\prime}$ , $I^{\prime\prime}_{1}$ and $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ lie at distance at most $48\frac{H}{P^{\prime}P^{\prime\prime}}$ from each other. In particular, if $p^{\prime}_{1}=p^{\prime}_{2}$ , then $I^{\prime\prime}_{1}=I^{\prime\prime}_{2}$ , and similarly $I^{\prime}_{1}=I^{\prime}_{2}$ . As a consequence, the number of octuplets with this property is at most $O(\frac{P^{\prime}}{\log P^{\prime}}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}\frac{X}{H})$ . Since $P^{\prime}\geq X^{\varepsilon^{2}}$ and $X$ is sufficiently large depending on $\varepsilon$ , the contribution of this case is thus negligible, so that there are $\gg_{\eta}(\frac{P^{\prime}}{\log P^{\prime}})^{2}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}\frac{X}{H}$ octuplets $(I,I^{\prime}_{1},I^{\prime}_{2},I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2},p^{\prime\prime})$ with $p^{\prime}_{1}\neq p^{\prime}_{2}$ .

Observe that if $I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2}$ are fixed, then $I,I^{\prime}_{1},I^{\prime}_{2}$ are completely determined by $p^{\prime\prime}$ thanks to the separation properties of $\mathcal{I}$ and $\mathcal{I}^{\prime\prime}$ ; in particular, there are $O(\frac{P^{\prime\prime}}{\log P^{\prime\prime}})$ ways to complete the quadruplet $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2})$ to an octuplet. Similarly, $I^{\prime\prime}_{1}$ is completely determined by $I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2}$ (since there is at most one interval in $\mathcal{I}^{\prime\prime}$ that lies within $48\frac{H}{P^{\prime}P^{\prime\prime}}$ from $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ ). Thus the number of eligible quadruplets $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2})$ is $O((\frac{P^{\prime}}{\log P^{\prime}})^{2}\frac{X}{H})$ . We conclude that there exist $\gg_{\eta}(\frac{P^{\prime}}{\log P^{\prime}})^{2}\frac{X}{H}$ quadruplets $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2})$ , each of which can be completed to an octuplet in $\gg_{\eta}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ ways. In particular, for such a quadruplet, (45) holds for $\gg_{\eta}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ choices of $p^{\prime\prime}$ (recalling that $I,I^{\prime}_{1},I^{\prime}_{2}$ are completely determined by the remaining coefficients of the octuplet). The claim follows. ∎

4. Local structure of $\alpha^{\prime\prime}$

We now analyse the structure of the function $\alpha^{\prime\prime}$ appearing in Proposition 3.2. The main result of this section asserts that $\alpha^{\prime\prime}_{I^{\prime\prime}}$ locally behaves like $\frac{T}{x_{I^{\prime\prime}}}$ with $T$ “not too large” (and up to a shift $\frac{a}{q}$ with small denominator), where $x_{I^{\prime\prime}}$ denotes the left endpoint of the interval $I^{\prime\prime}$ . Crucially, $T$ will not vary much with $I^{\prime\prime}$ , at least “locally”. It is here that we will rely on the hypothesis $H=X^{\theta}$ that $H$ is of polynomial size in $X$ .

Proposition 4.1.

Let $\theta,\eta,\rho,X,H,f,\varepsilon,P^{\prime},P^{\prime\prime},\mathcal{I}^{\prime\prime},\alpha^{\prime\prime}$ be as in Proposition 3.2. Then, for $\gg_{\varepsilon}\frac{X}{H}\left(\frac{P^{\prime}}{\log P^{\prime}}\right)^{2}$ of the pairs $(I^{\prime\prime}_{1},I^{\prime\prime}_{2})$ of intervals in $(\mathcal{I}^{\prime\prime})^{2}$ , there exist a natural number

[TABLE]

integers $a_{1},a_{2}$ , a real number

[TABLE]

and a set ${\mathcal{P}}(I^{\prime\prime}_{1},I^{\prime\prime}_{2})$ of primes in $[P^{\prime\prime}/2,P^{\prime\prime}]$ of cardinality $\gg_{\theta,\eta,\varepsilon,\rho}\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ such that

[TABLE]

for $j=1,2$ . Furthermore, for each such pair, there exist primes $p^{\prime}_{1},p^{\prime}_{2}\in[P^{\prime},2P^{\prime}]$ such that $I^{\prime\prime}_{1}$ lies within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ , and such that

[TABLE]

Proof.

Let $\theta,\eta,\rho,X,H,f,\varepsilon,P^{\prime},P^{\prime\prime},\mathcal{I}^{\prime\prime},\alpha^{\prime\prime}$ be as in Proposition 3.2. Thus for instance we now have $P^{\prime\prime},P^{\prime}\leq H^{\rho/100}$ . Henceforth we allow implied constants to depend on $\theta,\eta,\varepsilon,\rho$ . We abbreviate

[TABLE]

for the cardinality of $\mathcal{I}^{\prime\prime}$ and $d$ for the quantity

[TABLE]

thus the number of quadruples $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2})$ in Proposition 3.2 is $\gg dN$ . We construct a graph ${\mathcal{G}}=(V,E)$ whose vertices are just the intervals in $\mathcal{I}^{\prime\prime}$ (thus $V=\mathcal{I}^{\prime\prime}$ has $N$ vertices), and the edges $e$ are those unordered pairs $e=\{I^{\prime\prime}_{1},I^{\prime\prime}_{2}\}$ for which there exist distinct primes $p^{\prime}_{1},p^{\prime}_{2}$ in $[P^{\prime},2P^{\prime}]$ such that $p_{1}^{\prime}I^{\prime\prime}_{1}$ lies within $100\frac{H}{P^{\prime\prime}}$ of $p^{\prime}_{2}I^{\prime\prime}_{2}$ , and such that

[TABLE]

for a set ${\mathcal{P}}(e)$ of primes $p^{\prime\prime}$ in $[P^{\prime\prime}/2,P^{\prime\prime}]$ of cardinality $\gg\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ (note that these properties are symmetric in $I^{\prime\prime}_{1}$ and $I^{\prime\prime}_{2}$ ). Observe that the primes $p^{\prime}_{1},p^{\prime}_{2}$ are uniquely determined by $I^{\prime\prime}_{1},I^{\prime\prime}_{2}$ , for if there was another pair of primes $p^{\prime}_{3},p^{\prime}_{4}$ with the same properties, then $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ and $\frac{p^{\prime}_{4}}{p^{\prime}_{3}}I^{\prime\prime}_{2}$ would lie within $200\frac{H}{P^{\prime}P^{\prime\prime}}$ of each other, which implies that

[TABLE]

but if $(p^{\prime}_{1},p^{\prime}_{2})\neq(p^{\prime}_{3},p^{\prime}_{4})$ then the left-hand side has magnitude at least $\frac{1}{p^{\prime}_{3}p^{\prime}_{1}}\gg X^{-2\varepsilon^{2}}$ , which leads to a contradiction if $\varepsilon$ is small enough and $X$ is large enough. Thus, by Proposition 3.2, we see that the number of edges in ${\mathcal{G}}$ is $\gg dN$ . On the other hand, the degree of each vertex in ${\mathcal{G}}$ is $O(d)$ , since for fixed $I^{\prime\prime}_{1}$ there are only $O(d)$ choices for $p^{\prime}_{1}$ and $p^{\prime}_{2}$ , and $I^{\prime\prime}_{2}$ is uniquely determined by these choices. Thus ${\mathcal{G}}$ has $\asymp dN$ edges and the mean degree of ${\mathcal{G}}$ is $\asymp d$ .

At present, the sets ${\mathcal{P}}(e)$ of primes associated to each edge $e$ are large, but the intersections ${\mathcal{P}}(e_{1})\cap\dots\cap{\mathcal{P}}(e_{k})$ could be small. This will cause difficulties later. To get around this problem we use a random refinement trick of Gowers [8]. Let $\mathbf{p}^{\prime\prime}$ be a prime in $[P^{\prime\prime}/2,P^{\prime\prime}]$ selected uniformly at random, and let $\mathbf{G}=(V,\mathbf{E})$ be the subgraph of ${\mathcal{G}}$ consisting of the same vertex set $V$ as ${\mathcal{G}}$ , and with the edge set $\mathbf{E}$ consisting of all edges $e\in E$ with $\mathcal{P}(e)$ containing $\mathbf{p}$ . By the prime number theorem, each edge has probability $\gg 1$ of lying in $\mathbf{G}$ , so by linearity of expectation the expected number of edges in ${\mathbf{G}}$ is $\gg dN$ . In particular, we see that with probability $\gg 1$ , the random graph $\mathbf{G}$ has $\gg dN$ edges. Of course, $\mathbf{G}$ has maximum degree $O(d)$ since it is a subgraph of ${\mathcal{G}}$ . As we shall see later, the advantage of working with $\mathbf{G}$ instead of ${\mathcal{G}}$ is that the intersections ${\mathcal{P}}(e_{1})\cap\dots\cap{\mathcal{P}}(e_{k})$ have a high probability of being large when $e_{1},\dots,e_{k}$ are all constrained to lie in ${\mathcal{G}}$ .

If $\mathbf{A}$ is the adjacency matrix of $\mathbf{G}$ , then by the preceding discussion we have $1^{T}\mathbf{A}1\gg dN$ (where $1$ denotes the all-ones column vector) with probability $\gg 1$ . By the Blakley-Roy inequality [2], we now see that for any natural number $k$ , we have $1^{T}\mathbf{A}^{k}1\gg_{k}d^{k}N$ with probability $\gg 1$ . That is to say, with probability $\gg 1$ , the number of $(k+1)$ -tuples $(I^{\prime\prime}_{0},\dots,I^{\prime\prime}_{k})$ in $V^{k+1}$ such that $\{I^{\prime\prime}_{j},I^{\prime\prime}_{j+1}\}\in\mathbf{E}$ for $j=0,\dots,k-1$ is $\gg_{k}d^{k}N$ .

Now let $k$ be the first even integer for which

[TABLE]

Then (since $P^{\prime},P^{\prime\prime}\leq X^{\varepsilon}$ ) we have $k=O(1)$ and

[TABLE]

In particular, we may allow implied constants to depend888If one were to extend the arguments here to smaller values of $H$ , one would need to pay more attention as to the precise dependence of these constants on $k$ . on $k$ . From the preceding discussion, with probability $\gg 1$ , the number of $(k+2)$ -tuples

[TABLE]

such that $\{I^{\prime\prime}_{j,1},I^{\prime\prime}_{j+1,1}\},\{I^{\prime\prime}_{j,2},I^{\prime\prime}_{j+1,2}\},\{I^{\prime\prime}_{0,1},I^{\prime\prime}_{0,2}\}\in\mathbf{E}$ for $j=0,\dots,k/2-1$ is $\gg d^{k+1}N$ . This situation is depicted in Figure 9.

The number of possible choices for the quadruplet $(I^{\prime\prime}_{k/2,1},I^{\prime\prime}_{0,1},I^{\prime\prime}_{0,2},I^{\prime\prime}_{k/2,2})$ is $O(dN^{3})$ , since there are $N^{3}$ choices for $I^{\prime\prime}_{k/2,1},I^{\prime\prime}_{0,1},I^{\prime\prime}_{k/2,2}$ , and once $I^{\prime\prime}_{0,1}$ is fixed there are $O(d)$ choices for $I^{\prime\prime}_{0,2}$ . Thus by the Cauchy-Schwarz inequality, with probability $\gg 1$ , we have there are $\gg(d^{k+1}N)^{2}/(dN^{3})=d^{2k+1}/N$ pairs of such tuples with a common quadruplet $(I^{\prime\prime}_{k/2,1},I^{\prime\prime}_{0,1},I^{\prime\prime}_{0,2},I^{\prime\prime}_{k/2,2})$ . Relabeling, we conclude999This bound also follows from the work of Sidorenko [30], as the graph consisting of two $k$ -cycles (with $k$ even) connected by an edge is one of the confirmed cases of Sidorenko’s conjecture. that with probability $\gg 1$ , the number of $2k$ -tuples

[TABLE]

such that $\{I^{\prime\prime}_{j,i},I^{\prime\prime}_{j+1,i}\},\{I^{\prime\prime}_{0,1},I^{\prime\prime}_{0,2}\}\in\mathbf{E}$ for $j=0,\dots,k-1$ , $i=1,2$ is $\gg d^{2k+1}/N$ , where we adopt the periodic convention $I^{\prime\prime}_{k,i}=I^{\prime\prime}_{0,i}$ for $i=1,2$ . In particular, by definition of $\mathbf{G}$ , we have

[TABLE]

for all $j=0,1,\ldots,k-1$ and $i=1,2$ . The situation is depicted in Figure 10.

Call the $2k$ -tuples $\vec{I}^{\prime\prime}$ of the above form good, thus there are $\gg d^{2k+1}/N$ good tuples. Given a good tuple, to each edge $\{I^{\prime\prime}_{j,i},I^{\prime\prime}_{j+1,i}\}$ we have (uniquely determined) primes $p^{\prime}_{1,j,i},p^{\prime}_{2,j,i}$ in $[P^{\prime},2P^{\prime}]$ , such that $I^{\prime\prime}_{j+1,i}$ lies within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{1,j,i}}{p^{\prime}_{2,j,i}}I^{\prime\prime}_{j,i}$ for $j=0,1,\ldots,k-1$ and $i=1,2$ ; we also have primes $p^{\prime}_{1},p^{\prime}_{2}\in[P^{\prime},2P^{\prime}]$ such that $I^{\prime\prime}_{0,2}$ lies within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{1}}{p^{\prime}_{2}}I^{\prime\prime}_{0,1}$ . Again, we refer the reader to Figure 10 for a depiction of these relationships. Iterating the former claim, we see that $I^{\prime\prime}_{0,i}$ lies within $O(\frac{H}{P^{\prime}P^{\prime\prime}})$ from $\frac{\prod_{j=1}^{k}p^{\prime}_{2,j,i}}{\prod_{j=1}^{k}p^{\prime}_{1,j,i}}I^{\prime\prime}_{0,i}$ for $i=1,2$ , thus

[TABLE]

Multiplying out, we conclude that

[TABLE]

thanks to (48).

We now eliminate some degenerate cases. Suppose $\prod_{j=1}^{k}p^{\prime}_{2,j,1}-\prod_{j=1}^{k}p^{\prime}_{1,j,1}=0$ . Then, by the fundamental theorem of arithmetic, the $p^{\prime}_{1,j,1}$ are a permutation of the $p^{\prime}_{2,j,1}$ . By the prime number theorem, the total number of possibilities for the $p^{\prime}_{1,j,1},p^{\prime}_{2,j,1}$ is then at most $\ll_{k}(P^{\prime}/\log P^{\prime})^{k}\ll d^{k/2}$ . By Lemma 2.6, there are $O(d^{k}/N)$ choices for $p^{\prime}_{1,j,2},p^{\prime}_{2,j,2}$ , and finally there are $O(d)$ possibilities for $p^{\prime}_{1},p^{\prime}_{2}$ and $O(N)$ possibilities for $I^{\prime\prime}_{0,1}$ . All the other $I^{\prime\prime}_{j,i}$ are uniquely determined by this data, so the number of tuples with $\prod_{j=1}^{k}p^{\prime}_{2,j,1}-\prod_{j=1}^{k}p^{\prime}_{1,j,1}=0$ is

[TABLE]

which is negligible compared to $d^{2k+1}/N$ thanks to (48). Thus there are $\gg d^{2k+1}/N$ good tuples for which $\prod_{j=1}^{k}p^{\prime}_{2,j,1}-\prod_{j=1}^{k}p^{\prime}_{1,j,1}$ does not vanish. Repeating this argument for $\prod_{j=1}^{k}p^{\prime}_{2,j,2}-\prod_{j=1}^{k}p^{\prime}_{1,j,2}$ , we may see that with probability $\gg 1$ , there are $\gg d^{2k+1}/N$ good tuples for which $\prod_{j=1}^{k}p^{\prime}_{2,j,i}-\prod_{j=1}^{k}p^{\prime}_{1,j,i}\neq 0$ for $i=1,2$ . We will call such good tuples non-degenerate.

Another case we would like to exclude is when the set

[TABLE]

is unusually small, say

[TABLE]

for some small $\delta>0$ depending on $\varepsilon,\theta,\rho,\eta$ ) to be chosen later. Define a candidate tuple to be a tuple $\vec{I}^{\prime\prime}=(I^{\prime\prime}_{j,i})_{j\in\{0,1,\ldots,k-1\};i=1,2}\in V^{2k}$ with $\{I^{\prime\prime}_{0,1},I^{\prime\prime}_{0,2}\}\in E$ , $\{I^{\prime\prime}_{j,i},I^{\prime\prime}_{j+1,i}\}\in E$ for $j=0,\dots,k-1$ , and $i=1,2$ obeying (52) and with $\prod_{j=1}^{k}p^{\prime}_{2,j,i}-\prod_{j=1}^{k}p^{\prime}_{1,j,i}$ non-vanishing for $i=1,2$ . Observe that a tuple $\vec{I}^{\prime\prime}$ is a non-degenerate good tuple obeying (52) precisely if it is a candidate tuple with $\mathbf{p}\in\mathcal{P}(\vec{I}^{\prime\prime})$ . In particular, the probability that a given candidate tuple is actually good is $O(\delta)$ . On the other hand, from two applications of Lemma 2.6, the number of candidate tuples is at most

[TABLE]

and so, by linearity of expectation, the expected number of good tuples obeying (52) is $O_{(}\delta d^{2k+1}/N)$ . On the other hand, with probability $\gg 1$ we have $\gg d^{2k+1}/N$ non-degenerate good tuples. With $X$ large enough (which makes $P^{\prime}$ large compared with $\eta,\varepsilon,\rho,\theta$ ), and setting $\delta$ sufficiently small depending on $\eta,\varepsilon,\rho,\theta$ , we thus have with positive probability that there are $\gg d^{2k+1}/N$ non-degenerate good tuples $\vec{I}^{\prime\prime}$ for which

[TABLE]

Let us call such tuples very good, thus we can find a deterministic choice of $\mathbf{p}$ such that there are $\gg d^{2k+1}/N$ very good tuples.

Henceforth $\mathbf{p}$ is chosen deterministically as above. Let $\vec{I}^{\prime\prime}$ be a very good tuple, with attendant primes $p^{\prime}_{1,j,i},p^{\prime}_{2,j,i}$ and $p^{\prime}_{1},p^{\prime}_{2}$ for $j\in\{0,1,\ldots,k-1\}$ and $i=1,2$ . From (47), (53) we see that there is a collection ${\mathcal{P}}(\vec{I}^{\prime\prime})$ of primes in $[P^{\prime\prime}/2,P^{\prime\prime}]$ of cardinality

[TABLE]

such that

[TABLE]

and

[TABLE]

for all $p^{\prime\prime}\in{\mathcal{P}}(\vec{I}^{\prime\prime})$ , $j\in\{0,1,\ldots,k-1\}$ , and $i=1,2$ . For $X$ large enough, the error term $O_{\eta}(\frac{(P^{\prime})^{2}P^{\prime\prime}}{H})$ is less than $1/2$ in magnitude; thus the nearest integer to $p^{\prime}_{2,j,i}\alpha^{\prime\prime}_{I^{\prime\prime}_{j,i}}-p^{\prime}_{1,j,i}\alpha^{\prime\prime}_{I^{\prime\prime}_{j+1,i}}$ is divisible by all the primes in ${\mathcal{P}}(\vec{I}^{\prime\prime})$ , and is hence divisible by the product $Q\coloneqq\prod_{p^{\prime\prime}\in{\mathcal{P}}(\vec{I}^{\prime\prime})}p^{\prime\prime}$ of all the primes. Thus

[TABLE]

for all $j=0,1,\ldots,k-1$ and $i=1,2$ and similarly

[TABLE]

We multiply the former equation by $\prod_{0\leq j^{\prime}<j}p^{\prime}_{1,j^{\prime},i}\prod_{j<j^{\prime}<k}p^{\prime}_{2,j^{\prime},i}$ and sum the telescoping series for $j=0,\dots,k-1$ to conclude that

[TABLE]

This implies that

[TABLE]

for $i=1,2$ , where $q_{i}$ is the non-negative integer

[TABLE]

As $\vec{I}^{\prime\prime}$ is non-degenerate, $q_{i}$ is strictly positive. From (51) we conclude that

[TABLE]

From (55), we may write

[TABLE]

for $i=1,2$ and some integers $b_{1},b_{2}$ . Inserting this into (54), we conclude that

[TABLE]

or equivalently

[TABLE]

The left-hand side is a rational of denominator at most $O(d^{4})$ . Meanwhile, since ${\mathcal{P}}(\vec{I}^{\prime\prime})$ has cardinality $\gg\frac{P^{\prime}}{\log P^{\prime}}\gg X^{\varepsilon^{2}}/\log X$ , we have

[TABLE]

for some $c>0$ depending on $\varepsilon,\rho,\theta,\eta$ . Thus the expression $O(\frac{(P^{\prime})^{k+2}P^{\prime\prime}}{QH})$ is far smaller than the denominator on the left-hand side, and hence

[TABLE]

Since we can modify $\frac{b_{1}}{q_{1}}$ and $\frac{b_{2}}{q_{2}}$ by arbitrary integers without affecting the claimed properties, and $p^{\prime}_{1},p^{\prime}_{2}$ are distinct, we may in fact assume without loss of generality that

[TABLE]

thus we can write $\frac{b_{i}}{q_{i}}=\frac{ap^{\prime}_{i}}{q}$ for some integer $a$ , some $1\leq q\ll d^{2}$ , and for $i=1,2$ . In particular, from (56) we have

[TABLE]

for $i=1,2$ ; from (48) we thus have

[TABLE]

We can then write

[TABLE]

for some real number

[TABLE]

and we then write

[TABLE]

for some real number

[TABLE]

Inserting these equations back into (54), we obtain

[TABLE]

Since $I^{\prime\prime}_{0,2}$ lies within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{1}}{p^{\prime}_{2}}I^{\prime\prime}_{0,1}$ , we have

[TABLE]

and hence by (58)

[TABLE]

Combining this with (59), (57) we conclude that

[TABLE]

and thus

[TABLE]

for $i=1,2$ .

Finally, by two applications of Lemma 2.6, each pair $(I^{\prime\prime}_{0,1},I^{\prime\prime}_{0,2})$ is associated to at most $(O(\frac{d^{k}}{N}))^{2}$ very good tuples; since there are $\gg d^{2k+1}/N$ such tuples, the number of pairs $(I^{\prime\prime}_{0,1},I^{\prime\prime}_{0,2})$ that arise in this fashion is

[TABLE]

The claim follows. ∎

5. Global structure of $\alpha^{\prime\prime}$

Proposition 4.1 gives some control on $\alpha^{\prime\prime}$ , but it is currently “local” because the parameters $T,q$ that arise in this control depend on the pair $I^{\prime\prime}_{1},I^{\prime\prime}_{2}$ . Fortunately, one can use the “mixing” or “ergodicity” properties of the graph of such pairs to convert this local control to global control. To do this we first need a lemma.

Lemma 5.1 (Mixing lemma).

Let $\theta,\eta,X,H,f,\rho,\varepsilon,P^{\prime},P^{\prime\prime},\mathcal{I}^{\prime\prime},\alpha^{\prime\prime}$ be as in Proposition 3.2. We allow implied constants to depend on $\theta,\eta,\rho,\varepsilon$ . Let $\mathcal{A}_{1},\mathcal{A}_{2}$ be two subsets of $\mathcal{I}^{\prime\prime}$ . Then the number of quadruplets $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2})$ with $I^{\prime\prime}_{1}\in\mathcal{A}_{1},I^{\prime\prime}_{2}\in\mathcal{A}_{2}$ , $p^{\prime}_{1},p^{\prime}_{2}$ primes in $[P^{\prime},2P^{\prime}]$ , and $I^{\prime\prime}_{1}$ lying within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ is

[TABLE]

Proof.

Let $\psi:\mathbb{R}\to\mathbb{R}$ be a non-negative Schwartz function whose Fourier transform $\hat{\psi}(\xi)\coloneqq\int_{\mathbb{R}}\psi(x)e(-x\xi)\ dx$ is supported on $[-1,1]$ . Observe that if $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},p^{\prime}_{1},p^{\prime}_{2})$ is a quadruplet of the required form, then

[TABLE]

Thus it will suffice to bound the expression

[TABLE]

by (60). Using the Fourier inversion formula $\psi(x)=\int_{\mathbb{R}}\hat{\psi}(\xi)e(x\xi)\ d\xi$ , we can write this expression as

[TABLE]

which after a change of variable can be bounded by

[TABLE]

where

[TABLE]

for $i=1,2$ and

[TABLE]

From the triangle inequality we have

[TABLE]

while from the large sieve inequality (Lemma 2.3) we have

[TABLE]

Furthermore from [22, Lemma 2] we have

[TABLE]

for $|\xi|\leq\frac{X}{H}$ . The claim now follows from the triangle inequality and the Cauchy-Schwarz inequality. ∎

Using this lemma, we have the following tool for converting local approximate constancy to global approximate constancy. The corollary will allow us to show that many of the intervals $I^{\prime\prime}$ in Proposition 4.1 share essentially same values of $T$ and $q$ .

Corollary 5.2 (Approximate ergodicity).

Let $\theta,\eta,X,H,f,\rho,\varepsilon,P^{\prime},P^{\prime\prime},\mathcal{I}^{\prime\prime},\alpha^{\prime\prime}$ be as in Proposition 3.2. We allow implied constants to depend on $\theta,\eta,\rho,\varepsilon$ . Let $M,K,\delta>0$ . Let $(Z,d)$ be a metric space, and let $r>0$ be a radius with the property that every ball of radius $5r/2$ can contain at most $M$ disjoint balls of radius $r/2$ . For each $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , let $\mathcal{F}(I^{\prime\prime})$ be a finite subset of $Z$ with cardinality at most $K$ . Let $\mathcal{S}$ be a collection of sextuples $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},z_{1},z_{2},p^{\prime}_{1},p^{\prime}_{2})$ with $I^{\prime\prime}_{1},I^{\prime\prime}_{2}\in\mathcal{I}^{\prime\prime}$ with $z_{1}\in\mathcal{F}(I^{\prime\prime}_{1}),z_{2}\in\mathcal{F}(I^{\prime\prime}_{2}),d(z_{1},z_{2})\leq r$ , and $p^{\prime}_{1},p^{\prime}_{2}$ distinct primes in $[P^{\prime},2P^{\prime}]$ with $I^{\prime\prime}_{1}$ lying within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ . Suppose that

[TABLE]

Then either

[TABLE]

or else there exists $z_{0}\in Z$ and a collection ${\mathcal{T}}$ of pairs $(I^{\prime\prime},z)$ with $I^{\prime\prime}\in\mathcal{I}$ , $z\in\mathcal{F}(I^{\prime\prime})$ , and $d(z,z_{0})\leq 2r$ such that

[TABLE]

and such that there are $\gg\frac{\delta^{2}}{MK^{4}}\frac{X}{H}(P^{\prime}/\log P^{\prime})^{2}$ sextuples $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},z_{1},z_{2},p^{\prime}_{1},p^{\prime}_{2})\in\mathcal{S}$ such that $(I^{\prime\prime}_{1},z_{1})$ , $(I^{\prime\prime}_{2},z_{2})$ both lie in ${\mathcal{T}}$ .

Proof.

For technical reasons we first need to refine the set ${\mathcal{S}}$ . Let ${\mathcal{T}}_{0}$ be the set of all pairs $(I^{\prime\prime}_{1},z_{1})$ with $I^{\prime\prime}_{1}\in\mathcal{I}^{\prime\prime}$ and $z_{1}\in\mathcal{F}(I^{\prime\prime})$ . From (61) we have

[TABLE]

where

[TABLE]

We have $\#\mathcal{T}_{0}\leq 10KX/H$ . We conclude that there is a subset $\mathcal{T}_{1}$ of $\mathcal{T}_{0}$ with

[TABLE]

for all $(I^{\prime\prime}_{1},z_{1})\in{\mathcal{T}}_{1}$ , such that

[TABLE]

Let $\Omega$ be a maximal $r$ -separated net in $Z$ , thus every point in $Z$ lies within distance $r$ of at least one point in $\Omega$ . From (64) and the triangle inequality we conclude that

[TABLE]

If we define

[TABLE]

and

[TABLE]

then the left-hand side of (65) is bounded by

[TABLE]

which by Lemma 5.1 is bounded by

[TABLE]

Any pair $(I^{\prime\prime}_{2},z_{2})\in\mathcal{T}_{0}$ can contribute to $\mathcal{A}_{2}(z_{0})$ only if $B(z_{0},r/2)$ is contained in $B(z_{2},5r/2)$ . As the balls $B(z_{0},r/2)$ with $z_{0}\in\Omega$ are disjoint, we conclude that each such pair contributes to at most $M$ sets $\mathcal{A}_{2}(z_{0})$ , and hence

[TABLE]

and similarly

[TABLE]

By Cauchy-Schwarz, we may thus bound the left-hand side of (65) by

[TABLE]

and hence

[TABLE]

Thus, either (62) holds, or there exists $z_{0}\in\Omega$ with

[TABLE]

Suppose the latter claim is true. If we now let ${\mathcal{T}}_{2}$ denote the collection of those $(I^{\prime\prime}_{1},z_{1})\in\mathcal{T}_{1}$ with $I^{\prime\prime}_{1}\in\mathcal{A}_{1}(z_{0})$ and $z_{1}\in B(z_{0},r)$ , then we have

[TABLE]

From (63) there exist $\gg\frac{\delta}{MK^{3}}\frac{X}{H}\frac{\delta}{K}(P^{\prime}/\log P^{\prime})^{2}$ sextuples $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},z_{1},z_{2},p^{\prime}_{1},p^{\prime}_{2})\in\mathcal{S}$ such that $(I^{\prime\prime}_{1},z_{1})\in{\mathcal{T}}_{2}$ . Since $z_{1}\in B(z_{0},r)$ and $d(z_{1},z_{2})\leq r$ , we have $z_{2}\in B(z_{0},2r)$ . Thus, if we take ${\mathcal{T}}$ to be the collection of those $(I^{\prime\prime}_{1},z_{1})\in\mathcal{T}_{0}$ with $I^{\prime\prime}_{1}\in\mathcal{A}_{2}(z_{0})$ and $z_{1}\in B(z_{0},2r)$ , we obtain the claim. ∎

Let $\theta,\eta,X,H,f,\varepsilon,\rho,P^{\prime},P^{\prime\prime},\mathcal{I}^{\prime\prime},\alpha^{\prime\prime}$ be as in Proposition 3.2. Let $\delta>0$ be a small quantity (depending on $\theta,\eta,\varepsilon$ ) which we will specify in a moment. Inspired by Proposition 4.1, define a good quadruple to be a quadruple $(I^{\prime\prime},T,q,a)$ , where $I^{\prime\prime}$ is an interval in $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , $T$ is a real number with

[TABLE]

$q$ is a natural number with $1\leq q\leq H^{\rho}/\delta$ , $a\in\{0,\dotsc,q-1\}$ is coprime to $q$ , and there exists a real number $\theta$ with $|\theta|\leq\frac{1}{\delta}\frac{1}{H^{1-\rho}}$ such that

[TABLE]

for a set $\mathcal{P}$ of primes in $[P^{\prime\prime}/2,P^{\prime\prime}]$ of cardinality at least $\delta\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ . Proposition 4.1 guarantees that once $\delta$ is chosen sufficiently small in terms of $\theta,\varepsilon,\eta,\rho$ there exist $\gg X/H$ good quadruples. Throughout we fix $\delta$ sufficiently small so that this holds; in particular, implied constants may now depend on $\delta$ in addition to $\theta,\varepsilon,\eta,\rho$ .

We have some limitations on how many good quadruples can be associated to a single interval $I^{\prime\prime}$ :

Proposition 5.3.

Let $\delta,\rho$ be as above, and let $I^{\prime\prime}$ be an interval in $\mathcal{I}^{\prime\prime}$ . Let $K\geq\frac{2}{\delta}$ , and let $(I^{\prime\prime},T_{j},q_{j},a_{j})$ for $j=1,\dots,K$ be a collection of good quadruples. Then there exist $1\leq j<j^{\prime}\leq K$ with the following properties:

(i)

$q_{j}=q_{j^{\prime}}$ .

(ii)

$a_{j}=a_{j^{\prime}}$ .

(iii)

$T_{j}=T_{j^{\prime}}+O\left(\frac{X}{H^{1-\rho}}\right)$ .

Proof.

Without loss of generality we may take $K=\lceil\frac{2}{\delta}\rceil$ . For $j=1,\dotsc,K$ , let $\mathcal{P}_{j}$ be the set of primes in $[P^{\prime\prime}/2,P^{\prime\prime}]$ associated to the good quadruple $(I^{\prime\prime},T_{j},q_{j},a_{j})$ . Then

[TABLE]

and $\sum_{j=1}^{K}1_{\mathcal{P}_{j}}\leq K\ll 1/\delta$ . From this and the prime number theorem we conclude that $\sum_{j=1}^{K}1_{\mathcal{P}_{j}}\geq 2$ for at least $\gg\frac{P^{\prime\prime}}{\log P^{\prime\prime}}$ primes in $[P^{\prime},2P^{\prime}]$ ; this implies that there exist distinct indices $j,j^{\prime}\in\{1,\dotsc,K\}$ such that

[TABLE]

If one writes $Q\coloneqq\prod_{p^{\prime\prime}\in\mathcal{P}_{j}\cap\mathcal{P}_{j^{\prime}}}p^{\prime\prime}$ , we then have

[TABLE]

for some $c_{\delta}>0$ . On the other hand, from (67) one has

[TABLE]

and

[TABLE]

In particular,

[TABLE]

which when combined with (68) (and noting that the denominator on the left-hand side is at most $O_{\delta}(H^{2\rho})$ ) forces

[TABLE]

Since $a_{j}/q_{j}$ and $a_{j^{\prime}}/q_{j^{\prime}}$ are in lowest terms and in $[0,1)$ , this implies that $a_{j}=a_{j^{\prime}}$ and $q_{j}=q_{j^{\prime}}$ . Subtracting (69) from (70), we conclude that

[TABLE]

since $|T_{j}-T_{j^{\prime}}|\leq\frac{2}{\delta}\frac{X^{2}}{H^{2-\rho}}$ , we conclude from (68) that

[TABLE]

and hence $T_{j}-T_{j^{\prime}}\ll_{\delta}\frac{X}{H^{1-\rho}}$ . The claim follows. ∎

From the above proposition and the greedy algorithm, we conclude

Corollary 5.4.

For each $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , there exists a set $\mathcal{F}(I^{\prime\prime})$ of triples $(T,q,a)$ of cardinality

[TABLE]

such that, for any good quadruple $(I^{\prime\prime},T,q,a)$ , there exists $T^{\prime}\in\mathbb{R}$ such that $(T^{\prime},q,a)\in\mathcal{F}(I^{\prime\prime})$ and

[TABLE]

On the other hand, Proposition 4.1 provides us with a large number of quadruples:

Proposition 5.5.

Let $\delta$ be as above and $X$ sufficiently large depending on $\delta$ and $\varepsilon$ . All implied constants may depend on $\varepsilon,\eta,\theta,\rho$ . Then, for $\gg(X/H)\cdot(P^{\prime}/\log P^{\prime})^{2}$ of the pairs $(I^{\prime\prime}_{1},I^{\prime\prime}_{2})$ of intervals $(\mathcal{I}^{\prime\prime})^{2}$ , there exist $T_{1},T_{2},q^{\prime},a^{\prime}_{1},a^{\prime}_{2}$ such that $(T_{1},q^{\prime},a^{\prime}_{1})\in\mathcal{F}(I^{\prime\prime}_{1})$ and $(T_{2},q^{\prime},a^{\prime}_{2})\in\mathcal{F}(I^{\prime\prime}_{2})$ , and

[TABLE]

Furthermore, for each such pair, there exist primes $p^{\prime}_{1},p^{\prime}_{2}\in[P^{\prime},2P^{\prime}]$ coprime to $q^{\prime}$ such that $I^{\prime\prime}_{1}$ lies within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ , and such that

[TABLE]

Proof.

This is almost immediate from Proposition 4.1; the main difficulty is that the integers $a,q$ provided by that proposition need not be coprime.

We resolve this as follows. If $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ and $(T,q,a)\in\mathcal{F}(I^{\prime\prime})$ , then $q$ has at most $O(\frac{\log X}{\log P^{\prime\prime}})=O_{\varepsilon}(1)$ prime factors in $[P^{\prime\prime}/2,P^{\prime\prime}]$ . Thus, for each $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , there are at most $O(1)$ primes that divide $q$ for some $(T,q,a)\in\mathcal{F}(I^{\prime\prime})$ .

Proposition 4.1 provides us with $\gg\frac{X}{H}\left(\frac{P^{\prime}}{\log P^{\prime}}\right)^{2}$ pairs $(I^{\prime\prime}_{1},I^{\prime\prime}_{2})$ of intervals $(\mathcal{I}^{\prime\prime})^{2}$ , together with associated primes $p^{\prime}_{1},p^{\prime}_{2}$ , obeying the properties of that proposition. It could happen that $p^{\prime}_{1}$ or $p^{\prime}_{2}$ divides $q$ for some $(T,q,a)$ in $\mathcal{F}(I^{\prime\prime}_{1})$ or $\mathcal{F}(I^{\prime\prime}_{2})$ , but by the preceding paragraph, the number of times this can happen is at most $O(\frac{X}{H}\frac{P^{\prime}}{\log P^{\prime}})$ , which is a negligible portion when $X$ is large enough. Thus for $\gg\frac{X}{H}\left(\frac{P^{\prime}}{\log P^{\prime}}\right)^{2}$ of the above pairs, $p^{\prime}_{1}$ or $p^{\prime}_{2}$ do not divide any such $q$ .

From Proposition 4.1, we have

[TABLE]

for $j=1,2$ , where $Q\coloneqq\prod_{p^{\prime\prime}\in{\mathcal{P}}(I^{\prime\prime}_{1},I^{\prime\prime}_{2})}p^{\prime\prime}$ . We write $a_{1}/q$ in lowest terms as $a^{\prime}_{1}/q^{\prime}$ . Then $(I^{\prime\prime}_{1},T,q^{\prime},a^{\prime}_{1})$ is a good quadruple and $p^{\prime}_{1},p^{\prime}_{2}$ do not divide $q^{\prime}$ . From (46) we may thus also write $a_{2}/q$ in lowest terms as $a^{\prime}_{2}/q^{\prime}$ and still have that (72) holds. Then $(I^{\prime\prime}_{2},T,q^{\prime},a^{\prime}_{2})$ is a good quadruple, and the claim follows from Corollary 5.4. ∎

Let $Z$ be the collection of triples $(T,q,a)$ with $T\in\mathbb{R}$ , $q\geq 1$ , and $a$ coprime to $q$ , endowed with the metric101010The $\frac{1}{100}1_{a_{1}\neq a_{2}}$ term is present only to keep the metric $Z$ from degenerating, but otherwise plays no role in the argument; if one prefers, one could drop this term and observe that Corollary 5.2 also applies to degenerate metric spaces.

[TABLE]

and some sufficiently small constant $c(\delta)>0$ depending on $\delta$ (and thus ultimately on $\theta,\eta,\rho,\varepsilon$ ). Let $\mathcal{S}$ be the collection of sextuples

[TABLE]

with $I^{\prime\prime}_{1},I^{\prime\prime}_{2}\in\mathcal{I}^{\prime\prime}$ , $(T_{1},q^{\prime},a_{1})\in\mathcal{F}(I^{\prime\prime}_{1})$ , $(T_{2},q^{\prime},a_{2})\in\mathcal{F}(I^{\prime\prime}_{2})$ , and $p^{\prime}_{1},p^{\prime}_{2}$ distinct primes in $[P^{\prime},2P^{\prime}]$ with $I^{\prime\prime}_{1}$ lying within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{2}}{p^{\prime}_{1}}I^{\prime\prime}_{2}$ , with $p_{1},p_{2}$ coprime to $q^{\prime}$ and obeying (72) and (71). In particular (for $c(\delta)$ sufficiently small) we have

[TABLE]

From Proposition 5.5 we have $\#\mathcal{S}\gg(X/H)\cdot(P^{\prime}/\log P^{\prime})^{2}$ . Applying Corollary 5.2 with $r=\frac{1}{10}$ , $M=100$ , $K=\frac{2}{\delta}$ , we conclude that there exists $(T_{0},q_{0},a_{0})\in Z$ and a collection ${\mathcal{T}}$ of quadruples $(I^{\prime\prime},T,q,a)$ with $I^{\prime\prime}\in\mathcal{I}$ , $(T,q,a)\in\mathcal{F}(I^{\prime\prime})$ , and $d((T,q,a),(T_{0},q_{0},a_{0}))\leq\frac{1}{5}$ such that

[TABLE]

and there are $\gg\frac{X}{H}(P^{\prime}/\log P^{\prime})^{2}$ sextuples $(I^{\prime\prime}_{1},I^{\prime\prime}_{2},(T_{1},q_{1},a_{1}),(T_{2},q_{2},a_{2}),p^{\prime}_{1},p^{\prime}_{2})\in\mathcal{S}$ such that $(I^{\prime\prime}_{1},T_{1},q_{1},a_{1})$ , $(I^{\prime\prime}_{2},T_{2},q_{2},a_{2})$ both lie in ${\mathcal{T}}$ .

If $(I^{\prime\prime},T,q,a)\in{\mathcal{T}}$ , then $d((T,q,a),(T_{0},q_{0},a_{0}))\leq\frac{1}{5}$ , and hence by (73) we have $q=q_{0}$ and

[TABLE]

From (66) we thus have

[TABLE]

At present $q_{0}$ obeys the bounds $1\leq q_{0}\ll H^{\rho}$ . We can improve the control on $q_{0}$ significantly.

Proposition 5.6.

$q_{0}\ll 1$ .

Proof.

Consider the graph ${\mathcal{G}}$ whose vertex set $V$ is the set ${\mathcal{T}}$ as above, and whose edge set $E$ consists of pairs $(I^{\prime\prime}_{1},T_{1},q_{0},a_{1})$ , $(I^{\prime\prime}_{2},T_{2},q_{0},a_{2})$ in ${\mathcal{T}}$ with

[TABLE]

for some $p_{1}^{\prime}$ and $p_{2}^{\prime}$ . Then by the preceding dicussion $G$ has $\gg N$ vertices and $\gg dN$ edges, where $N\coloneqq X/H$ and $d\coloneqq(P^{\prime}/\log P^{\prime})^{2}$ .

Now let $k$ be the first even integer for which

[TABLE]

Using the Blakley-Roy inequality as in Section 4, the number of $(\frac{k}{2}+1)$ -tuples

[TABLE]

such that $\{Q_{j},Q_{j+1}\}\in E$ for $0\leq j<k/2$ is $\gg d^{k/2}N$ . The number of possible values for the pair $(Q_{0},Q_{k/2})$ is $O(N^{2})$ . Thus by the Cauchy-Schwarz inequality, there are $\gg d^{k}$ pairs of $\frac{k+2}{2}$ -tuples of the above form with matching pairs $(Q_{0},Q_{k/2})$ . Relabeling, we conclude that there the number of $k$ -tuples

[TABLE]

such that $\{Q_{j},Q_{j+1}\}\in E$ for $j=0,1,\ldots,k-1$ is $\gg d^{k}$ . On the other hand, we may upper bound the number of such tuples in a different way, as we will now do. Writing $Q_{j}=(I^{\prime\prime}_{j},T_{j},q_{0},a_{j})$ , we see from (72) that there are primes $p^{\prime}_{j,1},p^{\prime}_{j,2}\in[P^{\prime},2P^{\prime}]$ such that

[TABLE]

(with the periodic convention $a_{k}=a_{0}$ ) and such that $I^{\prime\prime}_{j}$ lies within $100\frac{H}{P^{\prime}P^{\prime\prime}}$ of $\frac{p^{\prime}_{j,2}}{p^{\prime}_{j,1}}I^{\prime\prime}_{j+1}$ for all $j=0,1,\ldots,k-1$ . From the first claim we have

[TABLE]

while from the second claim we have

[TABLE]

by repeating the derivation of (51). By Lemma 2.6, the number of tuples of primes $(p^{\prime}_{1,1},\dots,p^{\prime}_{k,1},p^{\prime}_{1,2},\dots,p^{\prime}_{k,2})$ obeying these constraints is $\ll\frac{d^{k}}{N}(\frac{1}{q_{0}^{1/2}}+\frac{1}{\log X}))$ . There are $\ll N$ choices for $I^{\prime\prime}_{1}$ , and this interval and the tuple of primes determine all the other $I^{\prime\prime}_{k}$ . Since all the sets ${\mathcal{F}}(I^{\prime\prime}_{j})$ have cardinality $O_{\delta}(1)$ , we conclude that the number of $k$ -tuples $(Q_{j})_{j=0,1,\ldots,k-1}$ under consideration is

[TABLE]

Comparing the upper and lower bounds yields

[TABLE]

and the claim follows. ∎

From (67), (75) we see that whenever $(I^{\prime\prime},T,q_{0},a)\in{\mathcal{T}}$ , one has

[TABLE]

for some $b\in\mathbb{Z}/q_{0}\mathbb{Z}$ . Since each $I^{\prime\prime}$ is associated to $O(1)$ quadruples in ${\mathcal{T}}$ , we conclude from (74) that for $\gg_{\varepsilon,\delta}X/H$ intervals $I^{\prime\prime}\in\mathcal{I}^{\prime\prime}$ , one has (77) for some $b\in\mathbb{Z}/q_{0}\mathbb{Z}$ .

Let $I^{\prime\prime}$ be one of these intervals, so that (see (40))

[TABLE]

Let $H^{*}\coloneqq\frac{H^{1-2\rho}}{P^{\prime}P^{\prime\prime}}$ . We may translate $I^{\prime\prime}$ by any shift of size at most $H^{*}$ without affecting this estimate. Averaging over such shifts, we conclude that

[TABLE]

and thus by the triangle inequality

[TABLE]

From (77), (76) and Taylor expansion, we have

[TABLE]

The contribution of the $O(H^{-\rho})$ is negligible, thus

[TABLE]

Recalling $(b,q_{0})=1$ and Proposition 5.6, we can apply a Fourier decomposition

[TABLE]

where $c_{b,\chi}\ll 1$ and $\chi$ ranges over Dirichlet characters of modulus $q_{1}$ . From the triangle inequality, we thus have

[TABLE]

Summing over the $\gg_{\varepsilon}X/H$ intervals $I^{\prime\prime}$ , we conclude that

[TABLE]

By the triangle inequality, there thus exist $q_{0}=q_{1}q_{2}$ and $\chi\ (q_{1})$ such that

[TABLE]

Writing $n=dm$ with $d|q_{2}^{\infty}$ and $(m,q_{2})=1$ we obtain by the triangle inequality

[TABLE]

where $d|q_{2}^{\infty}$ means that all the prime factors of $d$ are also prime factors of $q_{2}$ . Since $\sum_{d|q_{2}^{\infty}}d^{-1}\ll 1$ there exists an natural number $d=O(1)$ such that,

[TABLE]

Therefore by [24, Proposition A.3] we have $\mathbb{D}(f1_{(n,q_{2})=1}n^{-2\pi iT_{0}}\chi;T^{\prime};Q)\ll 1$ for some $Q\ll 1$ and $|T^{\prime}|\ll X$ . Therefore $\mathbb{D}(f;T;Q)\ll 1$ for some $|T|\ll X^{2}/H^{2-\rho}$ and $Q\ll 1$ as claimed.

6. Proof of Corollary 1.5 and Corollary 1.3

Now we prove Corollary 1.5 and Corollary 1.3. It is enough to prove the former corollary since, for any fixed $Q>0$ and $A>0$ , we have $\mathbb{D}(\lambda;X^{A};Q)\rightarrow\infty$ as $X\rightarrow\infty$ by the Vinogradov-Korobov zero-free region [29, §9.5].

We restrict attention to the correlation for $f(n)a(n+h)b(n+2h)$ , as the other two correlations are handled similarly. The proof proceeds along classical lines by noticing that

[TABLE]

where

[TABLE]

Notice that

[TABLE]

We now claim the bound

[TABLE]

If $|a(n)|\ll\Lambda(n)$ then this bound follows from [10, Proposition 4.2]. On the other hand if $|a(n)|\ll 1$ for all integers $n\geq 1$ , then, by Hölder’s inequality,

[TABLE]

The general case $a(n)\ll 1+\Lambda(n)$ now follows from the triangle inequality. Similarly for $b(n)$ . Therefore,

[TABLE]

and finally,

[TABLE]

Thus,

[TABLE]

Therefore if the left-hand side of $\eqref{eq:end}$ is $\geq\eta HX$ , then,

[TABLE]

for some absolute constant $c>0$ . Hence, for some $Y\in[c\eta^{3}X/3,X]$ , one has,

[TABLE]

Now the claim follows from Theorem 1.4.

Bibliography36

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Balog. The prime k 𝑘 k -tuplets conjecture on average. In Analytic number theory (Allerton Park, IL, 1989) , volume 85 of Progr. Math. , pages 47–75. Birkhäuser Boston, Boston, MA, 1990.
2[2] G. R. Blakley and P. Roy. A Hölder type inequality for symmetric matrices with nonnegative entries. Proc. Amer. Math. Soc. , 16:1244–1245, 1965.
3[3] V. Blomer. On triple correlations of divisor functions. Bull. Lond. Math. Soc. , 49(1):10–22, 2017.
4[4] S. Chowla. The Riemann hypothesis and Hilbert’s tenth problem . Mathematics and Its Applications, Vol. 4. Gordon and Breach Science Publishers, New York-London-Paris, 1965.
5[5] H. Davenport. On some infinite series involving arithmetical functions. ii. Quart. J. Math. Oxf. , 8:313–320, 1937.
6[6] P. D. T. A. Elliott. Probabilistic number theory. I , volume 239 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Science] . Springer-Verlag, New York-Berlin, 1979. Mean-value theorems.
7[7] N. Frantzikinakis and B. Host. Asymptotics for multilinear averages of multiplicative functions. Math. Proc. Cambridge Philos. Soc. , 161(1):87–101, 2016.
8[8] W. T. Gowers. A new proof of Szemerédi’s theorem for arithmetic progressions of length four. Geom. Funct. Anal. , 8(3):529–551, 1998.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Fourier uniformity of bounded multiplicative functions in short intervals on average

Abstract.

1. Introduction

Conjecture 1.1** (Local higher order Fourier Uniformity).**

Theorem 1.2** (Local Fourier Uniformity for s=1s=1s=1 at scale XθX^{\theta}Xθ).**

Corollary 1.3**.**

Theorem 1.4** (Main theorem).**

Corollary 1.5**.**

1.1. An overview of the proof

1.2. Some final remarks

Notational conventions.

Acknowledgments.

2. Auxiliary results

Lemma 2.1** (Discretizing).**

Proof.

Lemma 2.2** (Maximal large sieve).**

Proof.

Lemma 2.3** (Variant of large sieve).**

Proof.

Lemma 2.4** (Completion of sums).**

Proof.

Proposition 2.5** (Mean scales down).**

Proof.

Lemma 2.6** (Counting nearby products of primes).**

Proof.

3. Intervals and frequencies

Proposition 3.1** (Scaling down).**

Proof.

Proposition 3.2**.**

Proof.

4. Local structure of α′′\alpha^{\prime\prime}α′′

Proposition 4.1**.**

Proof.

5. Global structure of α′′\alpha^{\prime\prime}α′′

Lemma 5.1** (Mixing lemma).**

Proof.

Corollary 5.2** (Approximate ergodicity).**

Proof.

Proposition 5.3**.**

Proof.

Corollary 5.4**.**

Proposition 5.5**.**

Proof.

Proposition 5.6**.**

Proof.

6. Proof of Corollary 1.5 and Corollary 1.3

Conjecture 1.1 (Local higher order Fourier Uniformity).

Theorem 1.2 (Local Fourier Uniformity for $s=1$ at scale $X^{\theta}$ ).

Corollary 1.3.

Theorem 1.4 (Main theorem).

Corollary 1.5.

Lemma 2.1 (Discretizing).

Lemma 2.2 (Maximal large sieve).

Lemma 2.3 (Variant of large sieve).

Lemma 2.4 (Completion of sums).

Proposition 2.5 (Mean scales down).

Lemma 2.6 (Counting nearby products of primes).

Proposition 3.1 (Scaling down).

Proposition 3.2.

4. Local structure of $\alpha^{\prime\prime}$

Proposition 4.1.

5. Global structure of $\alpha^{\prime\prime}$

Lemma 5.1 (Mixing lemma).

Corollary 5.2 (Approximate ergodicity).

Proposition 5.3.

Corollary 5.4.

Proposition 5.5.

Proposition 5.6.