A new proof of Hal\'asz's Theorem, and its consequences
Andrew Granville, Adam J Harper, K. Soundararajan

TL;DR
This paper presents a new, more flexible proof of Halász's Theorem, leading to simpler proofs of prime distribution results like Hoheisel's and Linnik's theorems, with potential for broader applications.
Contribution
A novel proof of Halász's Theorem that enhances flexibility and applicability, enabling easier derivation of prime distribution results in short intervals and arithmetic progressions.
Findings
New proof of Halász's Theorem with improved flexibility
Simplified proofs of Hoheisel's and Linnik's Theorems
Potential for extending asymptotic analysis to short intervals and progressions
Abstract
Hal\'asz's Theorem gives an upper bound for the mean value of a multiplicative function . The bound is sharp for general such , and, in particular, it implies that a multiplicative function with has either mean value , or is "close to" for some fixed . The proofs in the current literature have certain features that are difficult to motivate and which are not particularly flexible. In this article we supply a different, more flexible, proof, which indicates how one might obtain asymptotics, and can be modified to short intervals and to arithmetic progressions. We use these results to obtain new, arguably simpler, proofs that there are always primes in short intervals (Hoheisel's Theorem), and that there are always primes near to the start of an arithmetic progression (Linnik's Theorem).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A new proof of Halász’s Theorem, and its consequences
Andrew Granville
AG: Département de mathématiques et de statistique
Université de Montréal
CP 6128 succ. Centre-Ville
Montréal, QC H3C 3J7
Canada; and Department of Mathematics
University College London
Gower Street
London WC1E 6BT
England.
,
Adam J Harper
AJH: Mathematics Institute, Zeeman Building, University of Warwick, Coventry CV4 7AL, England
and
K. Soundararajan
KS: Department of Mathematics, Stanford University, Stanford CA 94305, USA
Abstract.
Halász’s Theorem gives an upper bound for the mean value of a multiplicative function . The bound is sharp for general such , and, in particular, it implies that a multiplicative function with has either mean value [math], or is “close to” for some fixed . The proofs in the current literature have certain features that are difficult to motivate and which are not particularly flexible. In this article we supply a different, more flexible, proof, which indicates how one might obtain asymptotics, and can be modified to short intervals and to arithmetic progressions. We use these results to obtain new, arguably simpler, proofs that there are always primes in short intervals (Hoheisel’s Theorem), and that there are always primes near to the start of an arithmetic progression (Linnik’s Theorem).
Key words and phrases:
Multiplicative functions, Halász’s theorem, Hoheisel’s theorem, Linnik’s theorem, Siegel zeroes
2010 Mathematics Subject Classification:
11N56, 11M20
Andrew Granville has received funding in aid of this research from the European Research Council grant agreement n 670239, and from NSERC Canada under the CRC program. Adam Harper was supported, for parts of the research, by a postdoctoral fellowship from the Centre de recherches mathématiques in Montréal, and by a research fellowship at Jesus College, Cambridge. Kannan Soundararajan was partially supported by NSF grant DMS 1500237, and a Simons Investigator grant from the Simons Foundation. In addition, part of this work was carried out at MSRI, Berkeley during the Spring semester of 2017, supported in part by NSF grant DMS 1440140
1. Introduction
Throughout this paper, will denote a multiplicative function. We let
[TABLE]
and assume that all three Dirichlet series are absolutely convergent in Re. We will restrict attention to the class of multiplicative functions for which
[TABLE]
where is some fixed positive constant. If then it follows that for all , where denotes the -divisor function (the coefficient of in ). The class includes many of the most useful multiplicative functions; one could weaken the assumptions further if needed (for example, using a weak Ramanujan hypothesis instead, as in [19]), but we restrict ourselves to for simplicity.
1.1. A generalization of Halász’s Theorem
We begin by generalizing the classical version of Halász’s Theorem to the class .
Theorem 1.1** (Halász’s Theorem).**
Let be a fixed positive number, and let be large. Then, uniformly for all ,
[TABLE]
The following corollary may be easier to work with in practice.
Corollary 1.2**.**
For given , define by
[TABLE]
Then
[TABLE]
Notice , as . Therefore at worst Corollary 1.2 matches the trivial bound . This lossless quality of the analytic bound is one of the key features of Halász’s Theorem. On the other hand, from Lemma 2.7 below it follows that is always , so that at best Corollary 1.2 produces a bound of .
Existing proofs of Halász’s Theorem [10, 13, 14, 16, 17, 20] establish an inequality like Theorem 1.1, and then a consequence like Corollary 1.2 (usually with , sometimes less sharply). These proofs are all built on the original fundamental arguments of Halász, and employ “average of averages” type arguments, whereas our proof uses a more direct and hopefully more intuitive “triple convolution” application of Perron’s formula, which generalises naturally to diverse other situations (such as short intervals and arithmetic progressions). The reader may also consult [7] where the argument is presented in the simpler context of function fields, and [8] where an even simpler approach is presented in the case .
This new proof can help us to identify the “main term(s)” for the mean value of up to . Indeed, the following result will be proved in [9]: Fix . For , there are intervals , with , each of length such that
[TABLE]
for . These integrals can be determined under appropriate assumptions, to provide asymptotics for various interesting examples (see section 9, which cites asymptotics determined in [9]).
Theorem 1.1 and Corollary 1.2 are established in Section 2 below.
1.2. Multiplicative functions in short intervals
We are interested in the mean value of a multiplicative function in a short interval with as small as possible.
Theorem 1.3** (Halász’s Theorem for intervals).**
Given , real and , we have
[TABLE]
A simplified version of this theorem is given in the following corollary.
Corollary 1.4**.**
Given , real and , we have
[TABLE]
where
[TABLE]
Theorem 1.3 and Corollary 1.4 are established in Section 3. We remark that several aspects of Corollary 1.4 are in general best possible. For instance, taking for any given then will be almost constant on the interval , so there will be no cancellation in the mean value. This shows that the increased range of in the definition of , as well as the lack of a denominator (as compared with the definition of in Corollary 1.2), are both necessary here.
The term in Corollary 1.4 is also necessary, in view of the following example: Suppose that we are given values of for all , consistent with . We are interested in extending the definition of to large primes in such a way that certain sums over the corresponding are maximized. Since any prime divides at most one integer in , one can choose in such a way that for all such multiples the corresponding values all point in the same direction. Therefore, maximizing over all ways to extend (given the values at primes below ), we have
[TABLE]
Hence, using the prime number theorem,
[TABLE]
If, for example, each for then this is .
1.3. Slow variation of mean-values of multiplicative functions.
We are also interested in bounding how much the mean value of a multiplicative function can vary in short multiplicative intervals; a general version of this principle forms the main input in [19]. The example shows that the mean value (which is ) might rotate with ; but here we may consider the “renormalized” mean-values , which now varies slowly with . The following result (which will be established in Section 4) establishes a general result of this flavor. It extends the earlier work of Granville and Soundararajan [10] (improving on Elliott [3]) obtained in the special case .
Theorem 1.5** (Lipschitz Theorem).**
Let be given, and let be large. Let be a point in where the maximum of (for in this interval) is attained. For all we have
[TABLE]
If is always non-negative, then the estimate above holds with the improved exponent .
In Examples 2 and 3 of Section 9, we show that when there are examples of functions in where the exponent that appears above is attained, and there are examples where the exponent for non-negative functions is attained. In [10] it was suggested that the right Lipschitz exponent for might be as large as ; we now see that this is not the case.
1.4. Multiplicative functions in arithmetic progressions
We now establish analogues of Halász’s Theorem 1.1 and Corollary 1.2 for multiplicative functions in arithmetic progressions. There is a strong analogy with the situation in short intervals, with Dirichlet characters modulo being introduced here and playing a similar role as the “continuous characters” in the short interval theorems.
Theorem 1.6** (Halász’s Theorem for arithmetic progressions).**
Given , if and then
[TABLE]
where denotes the twisted Dirichlet series.
For each character we define as follows:
[TABLE]
The following corollary gives a simplified version of Theorem 1.6.
Corollary 1.7**.**
Given let . Then
[TABLE]
Theorem 1.6 and Corollary 1.7 will be proved in Section 3.
1.5. Isolating characters that correlate with a multiplicative function, and the pretentious large sieve.
Given a multiplicative function we shall show in Section 5 that there cannot be many characters that correlate with . For instance, there cannot be many characters for which the quantity is small. By extracting the few characters that correlate with , one can obtain a more refined understanding of the distribution of in arithmetic progressions. For a given and a character define
[TABLE]
Theorem 1.8**.**
Let , let be large, and be a natural number with . Put . For any natural number , there are at most characters for which
[TABLE]
If denotes the set of at most characters for which (1.2) holds, then for all in the range and all reduced residue classes we have
[TABLE]
Theorem 1.8 gives non-trivial bounds once for a sufficiently large constant . Precursors to this result may be found in the work of Elliott [4], where the effect of the principal character was removed and the difference estimated. Theorem 1.8 together with related variants will be established in Section 6.
We next describe one of these variants, which we call “the pretentious large sieve.” For any sequence of complex numbers with , the orthogonality relation for characters gives
[TABLE]
and, using the Cauchy–Schwarz inequality, this is
[TABLE]
This simple large sieve inequality is, in general, the best one can do. For a multiplicative function this becomes, when ,
[TABLE]
using Lemma 2.3 below. Our work shows that if a handful of characters are removed, then the above large sieve estimate can be improved in the range for a suitably large .
Theorem 1.9** (The pretentious large sieve).**
Suppose that . Let , and be as in Theorem 1.8. Let be a natural number, and let consist of the at most characters for which (1.2) holds. Then for all we have
[TABLE]
1.6. Primes in short intervals and arithmetic progressions.
We now give some applications of our work to counting primes in short intervals and arithmetic progressions. We shall obtain qualitative versions of Hoheisel’s theorem that there are many primes in intervals for some , and Linnik’s theorem that the least prime is below for an absolute constant . In both situations, more precise results are known, but our work is perhaps simpler in comparison with other approaches, avoiding the use of log-free zero-density estimates and refined zero-free regions. In the case of Linnik’s theorem, our work bears several features in common with Elliott’s proof in [5].
Corollary 1.10**.**
Let be small, and be a large real number. Then, with implied constants being absolute,
[TABLE]
Now we turn to primes in arithmetic progressions, where the story is more complicated because of the possibility of exceptional characters that might pretend to be the Möbius function.
Theorem 1.11**.**
Let be a natural number, and be large. Put and let be a natural number. Let denote the set of non-principal characters such that
[TABLE]
Then has at most elements, and for any non-principal character we have
[TABLE]
Further for any reduced residue class we have
[TABLE]
Corollary 1.12**.**
Let be a natural number and be large. Set . Then, for any reduced residue class ,
[TABLE]
unless there is a non-principal quadratic character such that
[TABLE]
If this exceptional character exists, then
[TABLE]
Theorem 1.11 also leads to an improvement of the large sieve estimate for primes, once one removes the effect of a few characters.
Corollary 1.13** (The pretentious large sieve for the primes).**
Let , and be as in Theorem 1.11, and let denote the set of Theorem 1.11 extended to include the principal character. Then
[TABLE]
Theorem 1.11 and Corollaries 1.12 and 1.13 will be established in Section 7. Finally in Section 8 we shall deduce the following qualitative form of Linnik’s theorem.
Corollary 1.14** (Linnik’s Theorem).**
There exists a constant such that if then there is a prime with .
2. Halász’s theorem for the class : Proofs of Theorem 1.1 and Corollary 1.2
In this section, we establish Theorem 1.1. The strategy of the proof also generalizes readily to give variants of Halász’s theorem in short intervals and arithmetic progressions, and we shall give these versions in the next section.
Let and denote (respectively) the smallest and largest prime factors of . Given a parameter , define the multiplicative functions (supported on small primes) and (supported on large primes) by
[TABLE]
so that is the convolution of and . Therefore setting (for with Re)
[TABLE]
We define and analogously. Note that , , and all depend on the parameter . We also remark that since is defined by a finite Euler product, converges and is analytic for Re. The following proposition, and variants of it, will play a key role in our proof of Halász’s theorem.
Proposition 2.1**.**
Let , let so that . Set . Then, for any ,
[TABLE]
equals
[TABLE]
The crucial point here is the presence of multiple Dirichlet polynomials inside the integral (visibly four, but really three since the factors and go together). To understand why this is useful one may think of the circle method for ternary problems (for example, sums of three primes). In such situations, three generating functions are involved, and a satisfactory bound is obtained by using an estimate for one of the generating functions and then using estimates (Parseval’s identity) to bound the remaining two. Proposition 2.1 allows us to employ a similar strategy here, bounding in magnitude, and then using a Plancherel bound (see Lemma 2.6) for the remaining two sums.
2.1. Introducing many convolutions
The kernel of Proposition 2.1 is contained in the following slightly simpler identity, which we will tailor (see Lemma 2.2) to obtain a more flexible variant.
Lemma 2.1**.**
For any with not an integer, and any , we have
[TABLE]
Proof.
We give two proofs. In the first proof, we interchange the integrals over and , and . First perform the integral over . Since we are left with
[TABLE]
and now performing the integral over this is
[TABLE]
Perron’s formula now shows that the above matches the left hand side of the stated identity.
For the second (of course closely related) proof, Perron’s formula gives that
[TABLE]
Integrating now over and gives
[TABLE]
where we use that (which follows upon comparing coefficients in ). ∎
Lemma 2.2**.**
Let , , , and be as above. Let and be real numbers. Then
[TABLE]
[TABLE]
Proof.
First we perform the integral over in (2.3), obtaining
[TABLE]
The term arising from above gives the third term in (2.4), using Perron’s formula. The term arising from gives
[TABLE]
upon evaluating the integral over . Perron’s formula now matches the two terms above with the first two terms in (2.4). This establishes our identity. ∎
To bound the contributions of the second and third terms in (2.4), as well as other related quantities, we use a well known result of Shiu establishing a Brun–Titchmarsh inequality for non-negative multiplicative functions.
Lemma 2.3**.**
Let and be fixed positive real numbers. Let be large, and suppose and . Then uniformly for all we have
[TABLE]
where is any reduced residue class.
Proof.
This follows from Theorem 1 of Shiu [18]. ∎
Lemma 2.4**.**
Keep notations as above. For , we have
[TABLE]
Proof.
Applying Lemma 2.3 (with and there) we find that
[TABLE]
As by the Chebyshev bound , we have
[TABLE]
which is
[TABLE]
as desired. ∎
2.2. Tailoring the integral
Now we turn to the task of bounding the integrals in (2.3), with a view to proving Proposition 2.1 and ultimately Halász’s Theorem 1.1. Fix . If we write the terms in the Dirichlet series in (2.3) as respectively, then by Perron’s formula the integral over equals a sum of the shape . Both and are , by the definition of , and so , , , and must all be . Hence in (2.3) we may replace
[TABLE]
and similarly for . Moreover, with these Dirichlet series truncated to finite sums we may move the line of integration to Re for any .
Choose , so that (after a change of variables) the inner integral over in (2.3) is
[TABLE]
We now show that the integral in (2.5) may be truncated at with an error term that is at most the second error term in Proposition 2.1. To do this, we shall find useful the following consequence of the Chebyshev bound : for all
[TABLE]
Lemma 2.5**.**
For , the part of the integral in (2.5) with , integrated over , is bounded by .
Proof.
The error introduced in truncating the integral in (2.5) at is, by the quantitative Perron formula (see Chapter 17 of Davenport [2]),
[TABLE]
We bound this error term by splitting the values of into various ranges. The terms with or are bounded by
[TABLE]
by (2.6). Integrating over and leads to a bound .
Now we turn to the terms with where . Note that is the largest divisor of free of prime factors , so that . Integrating over and , we find that
[TABLE]
where only has prime factors . Using twice, we deduce that the above is
[TABLE]
Hence these terms contribute
[TABLE]
splitting the sum over into intervals for , proving the lemma. The final inequality here requires an upper bound for the sum of in intervals of length around , which follows from Lemma 2.3. ∎
Proof of Proposition 2.1.
We begin with the identity in Lemma 2.2, replacing the inner integral in (2.3) by (2.5). We truncate the inner integral obtaining the error in Lemma 2.5. The terms in (2.4) are bounded by Lemma 2.4. Finally we replace by . ∎
2.3. Proof of Theorem 1.1
With Proposition 2.1 in hand, we may proceed to the proof of Halász’s theorem 1.1. We will bound the integral in Proposition 2.1 by a suitable application of Cauchy–Schwarz and the following simple mean value theorem for Dirichlet polynomials.
Lemma 2.6**.**
For any complex numbers and any we have
[TABLE]
Proof.
Let be an even non-negative function with for , for which , the Fourier transform of , is compactly supported. For example, take and note that is supported in . We may then bound our integral by
[TABLE]
Since and by symmetry, the above is
[TABLE]
Since is compactly supported we know that for all , and that for a given , our sum over is only supported on those values of for which . Therefore, using the Brun–Titchmarsh theorem, the sum over is seen to be . The lemma follows. ∎
Proof of Theorem 1.1.
Let be large, and take and in our work above. The error terms in Proposition 2.1 are , and it remains to handle the integral in (2.1). For given and in , the integral there is (since is small)
[TABLE]
Since , , are all at most ,
[TABLE]
and so
[TABLE]
Using Cauchy–Schwarz, Lemma 2.6 (noting that ), and then (2.6), the integral in (2.3) is
[TABLE]
Combining the last two estimates, we find that the quantity in (2.3) is
[TABLE]
Writing so that \min\Big{(}\log x,\frac{1}{\beta}\Big{)}\asymp\frac{1}{\sigma}, the integral in (2.1) is
[TABLE]
Since ,
[TABLE]
and so the quantity in (2.3) may be bounded by
[TABLE]
completing the proof of Theorem 1.1. ∎
The following simple lemma establishes limits on the quality of bounds that can be deduced from Halász’s theorem, and will be useful in deducing Corollary 1.2.
Lemma 2.7**.**
Let . Suppose , and is such that . Then
[TABLE]
Proof.
For any , a simple contour shift argument gives
[TABLE]
Therefore
[TABLE]
The portion of the integral with contributes an amount that is bounded in magnitude by
[TABLE]
and the lemma follows. ∎
Proof of Corollary 1.2.
We use the maximum modulus principle in the region
[TABLE]
to bound . By the definition of , the maximum of this quantity along the vertical line segment is , and by Lemma 2.7 this is . On the horizontal segments , may be bounded by , and the same bound holds on the vertical line segment . Thus
[TABLE]
Moreover, bounding by we also have
[TABLE]
Inserting these bounds into Halász’s Theorem 1.1 the integral there is
[TABLE]
and the corollary follows. ∎
In some applications it may happen that even though the function belongs a priori to some class the average size of might be smaller. For example, if is the indicator function of sums of two squares then lies in , whereas the average size of is . Another example comes from the normalized Fourier coefficients of holomorphic eigenforms. These lie naturally in , but the average size of these coefficients at primes is smaller than in absolute value. In such situations, one would like refined versions of our results taking the average behavior of into account. We give a sample result of this sort in the context of Corollary 1.2, and it would be desirable to flesh out similar variants for our other results.
Corollary 2.8**.**
Let and put . Suppose that is such that for all . Then
[TABLE]
where
[TABLE]
Proof.
We proceed as in the proof of Corollary 1.2 but now note that , and so
[TABLE]
Inserting this into Halász’s Theorem 1.1 we obtain this corollary. ∎
3. Halász’s theorem in short intervals and progressions: Proofs of Theorems 1.3 and 1.6
3.1. Short intervals
Proof of Theorem 1.3.
Since and
[TABLE]
we may assume that is small in the theorem (else the term in the stated bound dominates). We may also assume that else the result follows from Theorem 1.1. We follow the strategy of the proof of Theorem 1.1 with suitable modifications. We choose , and , and define , , and as in Section 2. We apply Lemma 2.2 twice, at and at , and subtract. Deferring the treatment of the secondary terms in (2.4) for a moment, we first explain how to bound the difference of contour integrals.
The difference of contour integrals mentioned above is (in place of (2.3))
[TABLE]
As described in section 2.2, we truncate this integral at height , the error term in Lemma 2.5 being acceptable since . Then we proceed as in the proof of Halász’s Theorem 1.1 in Section 2.3, with the only difference being that our integral here (in the step analogous to (2.3)) is multiplied through by a factor
[TABLE]
Thus, in place of (2.3), we have the bound
[TABLE]
The contribution of those for which the maximum of occurs with , is (since )
[TABLE]
This is acceptable for the estimate stated in Theorem 1.3.
It remains to deal with the terms corresponding to (2.4), which here are bounded by
[TABLE]
Using Lemma 2.3, the first term is
[TABLE]
To bound the second term, we distinguish the cases when and when so that . In the first case, we use the Brun–Titchmarsh inequality to estimate the sum over , and thus such terms contribute
[TABLE]
which is . In the second case, we sum over first using Lemma 2.3 and then sum over ; thus, we obtain
[TABLE]
This completes our proof of Theorem 1.3. ∎
To deduce Corollary 1.4 (and also in the proof of Theorem 1.5), we will find useful the following variant of the maximum modulus principle, which may be found as Lemma 2.2 of [10].
Lemma 3.1**.**
Let be a sequence of complex numbers such that is absolutely convergent for Re. For all real numbers and all we have
[TABLE]
and for any
[TABLE]
Proof of Corollary 1.4.
Take , and in Lemma 3.1, so that for
[TABLE]
where as before. Combining this with the upper bound in appropriate ranges of , we deduce Corollary 1.4 from Theorem 1.3 (in much the same way as we deduced Corollary 1.2 from Theorem 1.1). ∎
3.2. Arithmetic progressions
In this section we begin our study of the mean value of in an arithmetic progression with . Our starting point is the orthogonality relation of characters
[TABLE]
and now we may invoke our earlier work in understanding the mean-values of the multiplicative functions .
Proof of Theorem 1.6.
Starting with (3.3), we take and , and apply Lemma 2.2 to each sum . After summing over , the second and third terms in (2.4) contribute
[TABLE]
We bound these terms in a similar manner to the argument in the previous section. Thus, by Lemma 2.3, the first term in (3.4) is
[TABLE]
To bound the second term in (3.4), we distinguish the cases when and when so that . In the first case, we use the Brun–Titchmarsh inequality to estimate the sum over (noting that must lie in a fixed progression modulo ), and obtain
[TABLE]
which is . Finally in the second case when , we first bound the sum over using Lemma 2.3 and then perform the sum over (which is now at most ). Thus such terms contribute (multiplying through by since )
[TABLE]
These bounds are all acceptable for Theorem 1.6.
Now we turn to the corresponding contour integrals in (2.3). Arguing as in Section 2.2, we may truncate the corresponding integrals at height , incurring an error term in the analogue of Lemma 2.5 of
[TABLE]
which again is acceptable for Theorem 1.6. After summing over mod , we are left with a main term (analogously to Proposition 2.1) of
[TABLE]
where and are appropriate twists of and respectively. To bound this we need the following extension of Lemma 2.6.
Lemma 3.2**.**
Let be given. Then for any complex numbers we have
[TABLE]
Proof.
Arguing as in Lemma 2.6, the left side of (3.5) may be bounded by
[TABLE]
and the lemma follows upon using the Brun–Titchmarsh theorem to bound the number of primes in an interval that are also in a given arithmetic progression. ∎
We can now proceed much as in the proof of Halász’s Theorem 1.1 (replacing by for convenience). The contribution from any , fixed is (arguing as in (2.3))
[TABLE]
Using Cauchy–Schwarz and (3.5) (which is permissible since and ), the second factor above is bounded by
[TABLE]
by (2.6). Integrating over , and taking with so that , we obtain a bound of
[TABLE]
for our main term. Now and so the contribution to the integral from any is , which is an acceptable error, so we may restrict the maximum to the range . The result follows. ∎
Proof of Corollary 1.7.
This follows from Theorem 1.6, just as Corollary 1.2 follows from Theorem 1.1. ∎
4. A Lipschitz estimate for mean-values: Proof of Theorem 1.5
Let be large, and . Recall that is chosen such that is a maximum in the range . Given , our goal in Theorem 1.5 is to bound
[TABLE]
We apply Lemma 2.2 twice, with and , and divide through by and respectively. The terms arising from the second and third sums in (2.4) may be bounded (as in Lemma 2.4) by . The integral arising from (2.3) is
[TABLE]
We now tailor this integral as in Section 2.2, taking there , as before, and setting . Then, up to an error term which is acceptable for Theorem 1.5, we may bound the magnitude of (4.1) by
[TABLE]
From here on we follow the argument in Section 2.3, making suitable modifications.
Thus, arguing as in (2.3) and (2.8), we may bound (for given , ) the inner integral in (4) by
[TABLE]
Using the triangle inequality and since and are below , we may bound the maximum in (4.3) by
[TABLE]
Insert this bound in (4.3), and then into (4); we conclude that the quantity in (4) is bounded by
[TABLE]
A small calculation allows us to bound the contribution of the second term above by
[TABLE]
(The term can be replaced by except when .)
It remains to handle the first term in (4.4), for which we require the following lemma.
Lemma 4.1**.**
Let . Select a real number with which maximizes . Then for we have
[TABLE]
*If for all then we obtain the analogous result with in place of . *
Proof.
If then the stated estimate is immediate, and we suppose below that . By the maximality of , we have , and so, setting , we obtain
[TABLE]
We may now estimate the sum above using the prime number theorem (in the strong form ) and partial summation. For a suitably large constant , put . For we use , while for larger we can exploit the fact that . In this way, we obtain (and the reader may consult the proof of Lemma 2.3 of [10] for further details if needed)
[TABLE]
from which the first assertion of the lemma follows.
If for all , then using we find
[TABLE]
Now we argue as above, exploiting here that . ∎
Before applying the lemma, we comment that the estimates given there are in general the best possible. Suppose . Given with consider defined by . Here the bound of the lemma is attained for . Similarly for the non-negative case, consider the function defined by . For more discussion on these examples, see section 9 below.
Now we return to the first term in (4.4), seeking a bound for
[TABLE]
Note that if then the quantity in (4.6) may be bounded trivially by . Now restrict to the range , and distinguish the cases and . In the second case , we have
[TABLE]
upon using Lemma 3.1 with , and Lemma 4.1. In the first case when , put , so that using the second part of Lemma 3.1
[TABLE]
Appealing now to Lemma 4.1 where , the above is bounded by
[TABLE]
after a little calculation. Thus in all cases we may bound the quantity in (4.6) by
[TABLE]
Using this estimate in (4.4), we conclude that the first term there contributes
[TABLE]
The bound in Theorem 1.5 for general follows upon combining this estimate with (4.5). The bound for non-negative functions follows similarly, using now the stronger input of Lemma 4.1 for that situation.
5. Repulsion results
We first show that can only be large for lying in a few rare intervals; in other words, that large values “repel” one another. Such results are useful in many contexts (see for example Lemma 5.1 of [19] which formulates a similar result in a slightly more general setting); while we will not use Proposition 5.1 directly in this paper, it will be useful in [9] in extracting asymptotic formulae (such as the examples given in Section 9).
Proposition 5.1**.**
Let . If satisfy , and for all , and then
[TABLE]
Proof of Proposition 5.1.
The proof uses a simple but powerful “duality” idea that is fundamental in large values theory for Dirichlet polynomials. Since each , we have
[TABLE]
By Cauchy–Schwarz the square of this sum is
[TABLE]
The first factor above is . Expanding out the sum over , the second factor on the right hand side is
[TABLE]
where the first term comes from the terms, and the second term from the bound
[TABLE]
whenever , since . The proposition follows. ∎
Of greater use in the present paper is Proposition 5.2 below, which generalizes Proposition 5.1 to Dirichlet characters. To prove that we require the following lemma.
Lemma 5.1**.**
Let be a non-principal character. Let , and suppose that for some constant , and . Then for all we have
[TABLE]
where (for Re)
[TABLE]
Proof.
This follows from Lemma 4.1 of Koukoulopoulos [15], where an elementary proof based on a sieve argument is given.
The following alternative argument, which uses some basic analytic properties of -functions, may also be illuminating. Suppose, for simplicity, that is primitive, and put or depending on whether is even or odd. The completed -function extends to an entire function of the complex plane of order (see Chapter 9 of [2]). We claim that for fixed , the size of the completed -function is monotone increasing in . To see this, suppose , and note that by Hadamard’s factorization formula (see (16) and (18) of Chapter 12 of [2])
[TABLE]
where the product is over all non-trivial zeros of , which satisfy . Since and , each individual term in our product above is , and our claim follows.
Now if then
[TABLE]
so that the lemma follows in this case. If , then using Stirling’s formula and the monotonicity observation above we find that
[TABLE]
This completes the lemma for primitive , and it is a simple matter to extend to the general case. ∎
Before stating our proposition, let us recall some notation used previously in Section 3.2. If is given, and is a real parameter, and a Dirichlet character, then (with the sum being over all with prime factors all larger than )
[TABLE]
Proposition 5.2**.**
Let . Suppose that , , are distinct Dirichlet characters , and that , , are real numbers with . Let , , be non-negative real numbers. For and in the range we have
[TABLE]
Proof.
The proof is similar to that of Proposition 5.1. We start with
[TABLE]
By Cauchy–Schwarz the square of the sum here is
[TABLE]
The first factor is . Expanding out the sum over , the second factor is
[TABLE]
The terms contribute , while by Lemma 5.1 the terms contribute . We conclude that
[TABLE]
and the proposition follows. ∎
6. Multiplicative functions in arithmetic progressions and the pretentious large sieve
Given and a Dirichlet character , recall that we defined
[TABLE]
The proof of Halász’s theorem in arithmetic progressions presented in Section 3.2 may be refined to take into account the contribution of a given set of characters. Recall that for a character we defined by the relation
[TABLE]
Proposition 6.1**.**
Let be a set of characters and put
[TABLE]
Then, with
[TABLE]
we have
[TABLE]
Proof.
We start with the orthogonality relation (3.3), omitting the characters in , so that
[TABLE]
and then apply Lemma 2.2 to each of the inner sums taking and . Consider the contribution of the second term arising from (2.4), which is
[TABLE]
A similar decomposition holds for the third term arising from (2.4). The first term above together with its counterpart arising from the third term in (2.4) contribute the expression in (3.4), and may be bounded as before by . Similarly, the second term above together with its counterpart may be handled by Lemma 2.4, and is bounded by . From here on, we may follow the argument of Section 3.2, and making small modifications to that argument yields the proposition. ∎
The following result contains Theorem 1.8.
Theorem 6.1**.**
Let be a natural number, and let be a given integer. Let be large with , and set . Consider the following three definitions of a set of exceptional characters:
(i) The set consists of the characters corresponding to the largest values of .
(ii) The set consists of the characters corresponding to the smallest values of .
(iii) The set consists of the characters corresponding to the largest values of ; here take in the definition of .
In all of the cases , , or one has, uniformly for all in the range ,
[TABLE]
Further, for any character one has, uniformly for all in the range ,
[TABLE]
Proof.
By Proposition 5.2, there are at most characters with
[TABLE]
For , it is easy to check that the difference between and is . Therefore, we have that there are at most characters with
[TABLE]
For a character different from these characters, one has
[TABLE]
In other words, for such a character one has
[TABLE]
The estimate (6.1) for and (that is, cases (ii) and (iii)) follows now from Proposition 6.1, noting that the contribution from the error term in Proposition 6.1 is smaller than the upper bound in (6.1).
If , then applying Corollary 1.2 (the first corollary of Halász’s Theorem 1.1) to yields (6.2). By the maximality in the definition of , we deduce that the estimate (6.2) must also hold for any character .
Now we handle (6.1) in case (i). We have already shown the result for the set . Taking the difference between the sums defining and yields that
[TABLE]
Using (6.2), this yields a bound on the difference that is acceptable, and so the theorem follows in case (i) also. ∎
Proof of Theorem 1.9.
Let be a set of characters , and consider
[TABLE]
Expanding out the inner sum, we obtain
[TABLE]
Thus
[TABLE]
Taking to be the set of Theorem 6.1, and inserting (6.1) into (6.4), we obtain the result. ∎
7. Counting primes in short intervals and arithmetic progressions
Here and in the next section, we wish to understand (given )
[TABLE]
where is either a short interval near , or an arithmetic progression up to (with uniformity in the modulus ). One can relate the prime sums in (7.1) to multiplicative functions as follows. Set , and let denote the corresponding multiplicative function (so that for all primes ). Note that and so , whence also belongs to . Comparing Dirichlet series coefficients in the identity , we obtain
[TABLE]
We can try to understand averages of the RHS over using the hyperbola method, noting that the sums over and now involve multiplicative functions. However, this approach is doomed to failure because the Halász type results allow us to save at most one logarithm (at least when ), whereas the trivial bound obtained by taking absolute values in our expression above for with is, (in the particular case that and , for simplicity),
[TABLE]
To overcome this hurdle, we perform a “pre-sieving” to eliminate integers with prime factors below for a suitable , and reprove several of our earlier results, though now restricted to multiplicative functions supported only on large prime factors.
7.1. Multiplicative functions supported only on the primes
Suppose with for all and . Thus and , and making small modifications to the proof of Proposition 2.1 (specifically in the analogues of Lemmas 2.4 and 2.5), we arrive at the following result.
Proposition 7.1**.**
Given , let and . Let with for all primes and . Then, for any ,
[TABLE]
[TABLE]
In fact, the proofs here simplify a little because in bounding some error terms there is no sum over smooth numbers. For instance, in the proof of Lemma 2.4 we only have the terms with , so that there is no Euler product over primes , and hence no term.
We can obtain analogues of our previous results for multiplicative functions supported on large primes, by following the earlier proofs, using and replacing applications of Proposition 2.1 with Proposition 7.1 above. Thus if , with if has a prime factor , where , then the argument of Halász’s Theorem 1.1 with yields
[TABLE]
We may now follow the proof of Corollary 1.2 using the bound (for )
[TABLE]
in place of the bound . Thus we obtain that
[TABLE]
where
[TABLE]
Next we turn to the analogue of Theorem 1.3 in this setting. We obtain, assuming that and , and taking ,
[TABLE]
where
[TABLE]
Similarly, we may formulate an analogue for arithmetic progressions. Thus, if and , then (for with for )
[TABLE]
where
[TABLE]
For our work on primes in arithmetic progressions what will be useful are analogues of Theorem 1.8. Using Proposition 5.2, we find that if then for any natural number there are at most characters with
[TABLE]
If denotes the set of at most characters for which the above holds, then for all by (7.3) we have
[TABLE]
Moreover, arguing as in Section 6, we find that
[TABLE]
7.2. Primes in short intervals: Proof of Corollary 1.10
Let be small, and set
[TABLE]
Let be the indicator function of those numbers with all prime factors , and be the Möbius function restricted to numbers with all prime factors . Our interest is in counting primes in the interval and for in this interval we may write , which is usually , as
[TABLE]
where and are parameters to be chosen later with , and . Thus
[TABLE]
We begin by analyzing the first term in (7.8). Temporarily setting , a weak form of the fundamental lemma from sieve theory (see for example Corollary 6.10 of [6]) gives
[TABLE]
and so
[TABLE]
Thus the first term in (7.8) is
[TABLE]
Using the sieve again, the above equals
[TABLE]
Setting this aside for the moment, we turn to the second term in (7.8). Applying (7.4) to (and with we obtain
[TABLE]
where
[TABLE]
To estimate the above quantity, we invoke the following proposition.
Proposition 7.2**.**
Let , and suppose . Then
[TABLE]
In our application of Proposition 7.2 we have so that . Therefore Proposition 7.2 implies that , so that
[TABLE]
Performing now the sum over , and using the sieve once again, the second term in (7.8) is
[TABLE]
We now choose so that , and . Then, combining (7.9) and (7.10), we conclude that
[TABLE]
It remains to simplify the main term in (7.11). We may finesse this issue as follows. Summing up (7.11) over all intervals of length between and we find
[TABLE]
The prime number theorem (which, incidentally, may be deduced from Halász’s theorem) now allows us to evaluate the left side above. We conclude that
[TABLE]
which establishes Corollary 1.10. It remains lastly to prove Proposition 7.2.
Proof of Proposition 7.2.
Set . If then the product in the proposition may be estimated trivially as
[TABLE]
We may therefore assume that . The elementary inequality yields (as in usual proofs of the zero-free region)
[TABLE]
Now Lemma 2.4 of Koukoulopoulos [15] implies that111Actually Lemma 2.4 of [15] provides a much better dependence on , as it uses the Vinogradov–Korobov estimate for sums . This will improve to in the statement of Proposition 7.2, but it has no effect on the strength of (7.12). for all
[TABLE]
Partial summation gives
[TABLE]
and we now input the asymptotic formula above. This leads to the main term
[TABLE]
as . Moreover the error term contributes (after integrating by parts)
[TABLE]
which completes the proof.
Alternatively, we may upper bound by using the monotonicity principle described in Lemma 5.1. If , or if then Proposition 7.2 is immediate, so we may assume that and . Next observe that
[TABLE]
Next in the range , using Stirling’s formula we obtain
[TABLE]
where denotes Riemann’s -function, and the final inequality holds because and is monotone increasing for . ∎
7.3. Primes in progressions: Theorem 1.11 and Corollaries 1.12 and 1.13
Set , and let and be as in the previous section. Recall that , so that . Let be the set of non-principal characters for which
[TABLE]
By Proposition 5.2 applied to the function , we know that contains no more than elements. Let denote the set extended to include the principal character.
As before, let and be parameters with , and with and . Using the decomposition of , as in (7.8), we obtain that
[TABLE]
equals
[TABLE]
We now follow the argument in the previous section to handle the two sums above, starting with the first sum. Set , as in the previous section. Using partial summation and the fundamental lemma from sieve theory (Corollary 6.10 of [6]) we obtain (for any reduced residue class )
[TABLE]
If is a non-principal character, then using (7.14)
[TABLE]
For the principal character, we obtain similarly that
[TABLE]
Putting these results together for the inner sums over in the first term of (7.3), we may bound that term by
[TABLE]
Now we turn to the second term in (7.3). By (7.7) (with there replaced by and then here), we obtain
[TABLE]
Therefore the second term in (7.3) may be bounded by
[TABLE]
upon using the sieve once again. Recall that and choose once again , (so that in particular , provided is large). Then combining the above with (7.3) and (7.15) we conclude that
[TABLE]
This establishes (1.5) of Theorem 1.11, using the prime number theorem to account for the contribution of the principal character.
Now if is a non-principal character mod , with then
[TABLE]
by the last displayed equation. This implies (1.4), completing the proof of Theorem 1.11. Alternatively, arguing as in our proof of (1.5) (with the same parameters, but replacing the use of (7.7) by (7.3) applied to ), we obtain that for any non-principal character
[TABLE]
This again proves (1.4), and moreover (7.16) will be used in the proof of Corollary 1.12. ∎
Proof of Corollary 1.13.
Corollary 1.13 follows from Theorem 1.11 in much the same way that Theorem 1.9 was deduced from Theorem 1.8, in section 6. ∎
Proof of Corollary 1.12.
Apply our work above with . If is empty, then there is no exceptional character and a stronger form of (1.6) follows from Theorem 1.11. If is non-empty, then the unique character that it contains must be real (else its conjugate would also belong to the set). Now (1.8) follows at once from Theorem 1.11. Using (7.16) and (1.8) we see that if (1.6) fails, then we must have
[TABLE]
If the maximum above occurs at a point , then taking logarithms we have
[TABLE]
It remains to deduce (1.7) from (7.17). From (7.17) it follows that
[TABLE]
Using the prime number theorem and partial summation (as in our proof of Lemma 4.1, and recalling that ) we find that for ,
[TABLE]
Comparing the above two estimates, provided is large enough we obtain (since )
[TABLE]
Hence
[TABLE]
Bounding the first sum using (7.17) and the second sum by
[TABLE]
we deduce (1.7). ∎
8. Linnik’s Theorem: Proof of Corollary 1.14
Let be a sufficiently large fixed constant; what “sufficiently large” means will evolve from satisfying certain inequalities in the proof. Suppose that and every prime is . We appeal to Corollary 1.12 with , where is chosen large enough that the error terms in in Corollary 1.12 are all much smaller than the main terms.
Clearly (1.6) does not hold, and therefore we must have an exceptional real character for which (1.7) and (1.8) hold. In particular, (1.7) gives
[TABLE]
Now put , and apply Theorem 1.11 with for in the range . The set consists precisely of one character, namely the exceptional character : indeed
[TABLE]
Theorem 1.11, equation (1.5), now tells us that (since there are no primes below ) for all we have
[TABLE]
Partial summation, using this relation, yields
[TABLE]
so that
[TABLE]
We proved above that the left hand side is , and so must be , implying that
[TABLE]
At this stage, we invoke a sieve estimate from Friedlander and Iwaniec [6] (see Proposition 24.1, and the preceding discussion). Put
[TABLE]
This is a natural object in number theory: If is the quadratic character that corresponds to a fundamental discriminant , then counts (after accounting for automorphisms) the number of representations of by binary quadratic forms of discriminant . Friedlander and Iwaniec consider the sequence over and (with ), and sieve this sequence to extract the primes. (Note that if then for a prime one has .) They establish that if , and then
[TABLE]
The idea behind this result is that one can usually sieve (as in the fundamental lemma) up to being a small power of , and if there are few large primes for which then one has a sifting problem of low dimension so that it becomes possible to estimate accurately the number of primes left after sieving. The estimate (8.2) is useful only if is very small, but thanks to (8.1) this is precisely the situation we are in!
We therefore apply (8.2) taking and , so that
[TABLE]
If is suitably large, it then follows that
[TABLE]
say, and we have shown that the least prime is below . Note that we do not need Siegel’s theorem here; an effective lower bound of the form suffices.
9. Examples of Asymptotics
We will prove (1.1) in [9], and that will allow us to find asymptotics for the mean values of certain interesting . Here we discuss three important examples given in [9].
Example 1: If so that , then
[TABLE]
This is a well known result of Selberg and Delange.
Example 2: where sign. Here if mod 1, and otherwise. The Fourier coefficients of are given by
[TABLE]
and one can prove that for certain non-zero constants , where . Thus can be taken to be either or , and one can show that
[TABLE]
for some constant . Taking with, say , we deduce that
[TABLE]
Hence the Lipschitz exponent for is at most .
For any given in the range we can let for a carefully selected real number to obtain a lower bound on the difference that is of the same size, up to a constant, as the difference in (9.1):
[TABLE]
This gives examples, when , showing that the exponent in Theorem 1.5 cannot be made larger.
Example 3: where . Here if , and otherwise, so that for all . One can show that
[TABLE]
and one can develop this example to show that, when , the exponent for real, non-negative in Theorem 1.5 cannot be made larger.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. de la Bretèche, A. Granville and K. Soundararajan, Exponential sums with multiplicative coefficients and applications , preprint.
- 2[2] H. Davenport, Multiplicative number theory. Springer GTM 74 , (2000).
- 3[3] P.D.T.A. Elliott, Extrapolating the mean-values of multiplicative functions , Nederl. Akad. Wetensch. Indag. Math. 51 , no. 4 (1989), 409–420.
- 4[4] P.D.T.A. Elliott, Multiplicative functions on arithmetic progressions. VII. Large moduli , J. London Math. Soc. 66 (2002), 14–28.
- 5[5] P.D.T.A. Elliott, The least prime primitive root and Linnik’s theorem , Number theory for the millennium, I (Urbana, IL, 2000), A K Peters, Natick, MA, 2002, 393–418.
- 6[6] J.B. Friedlander and H. Iwaniec, Opera de Cribro , AMS Colloquium Publications vol. 57, 2010.
- 7[7] A. Granville, A. J. Harper, and K. Soundararajan, Mean values of multiplicative functions over function fields. Research in number theory (2015) 1: 25. doi:10.1007/s 40993-015-0023-5
- 8[8] A. Granville, A. J. Harper, and K. Soundararajan, A more intuitive proof of a sharp version of Halász’s theorem . Preprint.
