On the Waring-Goldbach problem with almost equal summands
Juho Salmensuu

TL;DR
This paper advances the understanding of the Waring-Goldbach problem by employing the transference principle to show that large integers can be expressed as sums of prime k-th powers within almost equal intervals, improving bounds on the interval size.
Contribution
It introduces new bounds on the interval size for primes in the Waring-Goldbach problem, reducing the previously known interval size from over 3/4 to approximately 0.525 to 0.55, depending on parameters.
Findings
Established that large integers can be represented as sums of prime k-th powers within short intervals.
Improved bounds on the interval size parameter for various values of k and s.
Extended previous results by lowering the threshold from over 3/4 to about 0.525-0.55.
Abstract
We use transference principle to show that whenever is suitably large depending on , every sufficiently large natural number satisfying some congruence conditions can be written in the form , where are primes, and . We also improve known results for when and . For example when and we have . All previously known results on the problem had .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Waring-Goldbach problem with almost equal summands
Juho Salmensuu
Abstract
We use transference principle to show that whenever is suitably large depending on , every sufficiently large natural number satisfying some congruence conditions can be written in the form , where are primes, and . We also improve known results for when and . For example when and we have . All previously known results on the problem had .
1 Introduction
Let . For each prime , define so that . Let where
[TABLE]
First result concerning Waring-Goldbach problem is from Hua [Hua38] who showed that every sufficiently large natural number can be written in the form
[TABLE]
where are primes and . Since then the number of required summands has been greatly reduced, the latest improvement being from Kumchev and Wooley [KW17] who proved that (2) holds, for large , if .
Another interesting way to study the Waring-Goldbach problem is to replace the set of primes with some sparse subset of the primes. A natural way to choose such subset is to restrict primes to lie in a short interval , where and 111The limitation is a consequence of a slight modification of Wright’s argument (see [Wri37]). We talk more about it in Section 2.. Let be the least exponent such that (2) can be solved, for all sufficiently large and , whenever . Wei and Wooley [WW15] were first ones able to show that for every there exists such that . They showed that if , then
[TABLE]
Huang [Hua16] improved that result by showing that , for all and . The latest result is from Kumchev and Liu [KL17] who showed that , when and . As we can see from the previous results there has been difficulties to prove that . We break that barrier in this paper. 222Recently also Xuancheng Shao [Sha19] has been able to break this barrier. He proves that , when . He obtain his result by finding new estimate for the exponential sum: .
Our goal is to prove that , when and is sufficiently large. Note that value is a necessary limit due to what we currently know about primes in short intervals [BHP01]. We use the transference principle to obtain our main result. This approach is motivated by the work of Matomäki, Maynard and Shao [MMS17]. They used transference principle to show that every sufficiently large natural number can be written as the sum of three primes which lie in the interval . Our main result is the following.
Theorem 1**.**
Let , , and . Let be such that, whenever is sufficiently large, we have for each interval of length and for every such that and ,
[TABLE]
Suppose that
[TABLE]
Then, for every sufficiently large integer , there exist primes such that for each and
[TABLE]
When inequality (3) holds for (see [Har07, Theorem 10.3]) and when inequality (3) holds for (see [Har07, Theorem 10.8]). Thus
,
,
, when and ,
, when and .
These significantly improve previously known results which always had .
2 Outline
In this section we give an outline of the proof of Theorem 1. We will introduce the used notation in Section 3.
In Section 4 we prove the following transference lemma that is the main ingredient in proving Theorem 1.
Lemma 1**.**
Let and . Let be a natural number and, for each let be a function that satisfies the following assumptions:
(Mean condition)* For each arithmetic progression with we have ;* 2. 2.
(Pseudorandomness condition)* There exists a majorant with pointwise, such that * 3. 3.
(Restriction estimate)* We have for some with and .*
Then for each we have
[TABLE]
where is a constant depending only on .
The idea of the proof is following: If function satisfies conditions 1-3, then we can find functions and such that , both and satisfy condition 3, satisfies condition 1 and is Fourier uniform (i.e. ). Functions and are often called anti-uniform and uniform part of , respectively. Using Hölder’s inequality we can then reduce the problem to showing that
[TABLE]
This problem can be solved using induction and strategy that is very similar to the proof of [EGM14, Theorem 4.1].
Next we explain how Lemma 1 implies Theorem 1.
Let be a weighted W-tricked characteristic function of -th powers of primes in short interval (for precise definitions, see Subsection 5.1). In Section 5 we show that if conditions 1-3 of Lemma 1 hold for the function , then by Lemma 1 it follows that every sufficiently large natural number can be written as the sum of th power of primes, which belong to the short interval. So it remains to show that the function satisfies conditions 1-3 of Lemma 1.
In Section 6 we establish condition 1 for our precise choice of with an easy calculation using knowledge about primes in arithmetic progressions in short intervals.
In Section 7 we establish condition 2 which essentially corresponds to understanding the function
[TABLE]
where , and . We split into two disjoint sets, major arcs and minor arcs , using the Hardy-Littlewood decomposition, and treat the function differently in those two. In Subsection 7.1 we prove that
[TABLE]
if . In Subsection 7.2 we establish similar bound for those that are not very close to zero. We also show that if is very close to zero, then
[TABLE]
With these results we are able to prove the pseudorandomness of .
In Section 8 we prove condition 3, by first using the main conjecture in Vinogradov’s mean value theorem established by Bourgain, Demeter and Guth [BDG16, Theorem 1.1]333Wooley has an alternative proof of the main conjecture in Vinogradov’s mean value theorem in [Woo17]. and Daemen’s result concerning localized solutions in Waring’s problem [Dae10, Theorem 3] to show that
[TABLE]
for . Then we apply Bourgain’s strategy (see [Bou89, Section 4]) to the inequality (4) in order to get
[TABLE]
for as requested.
Remark 1. Let us now say a few words about the lower bound of the number of required summands. In Theorem 1 we need
[TABLE]
The first requirement comes from the pseudorandomness condition on the minor arcs. This essentially says that the number of required summands goes to infinity as approaches . We expect this kind of behaviour because, when , we cannot anymore represent every sufficiently large natural number at form , where and is some small coefficient. This can be seen from Wright’s argument (see [Wri37]). Using a slight modification of Wright’s construction (namely taking , where ), we can see that non-representability concerns also those numbers for which .
The second requirement comes from the pseudorandomness condition on the major arcs. If we had , then the term would be dominated by .
The third requirement comes from the restriction estimate. Since the third requirement is the most limiting one when is large, it is interesting to ask whether it can be improved. When we can replace by approximately using similar calculations as in [Cho18, Section 5]. This suggests that, for , can be replaced by something that depends on . Therefore at least minor improvements to the requirement should be possible when .
Remark 2. When we can establish a similar result to Theorem 1 requiring only that the number of summands is . Based on the proof of Theorem 1 we only need to establish the restriction estimate and the pseudorandomness condition for the transference function defined in (26) when . That can be done in a similar way to how we do it in this paper when , but the calculations are much simpler.
Remark 3. Using [Kou15, Theorem 1.3] in the proof of Theorem 1 one can easily prove that if , and are as in Theorem 1, then for almost every sufficiently large integer , there exist primes such that for each and
[TABLE]
We can also prove a similar result when , and . The author want to thank Trevor Wooley for helping to observe this fact.
3 Notation
For , we define convolution by
[TABLE]
For a set , write for its characteristic function. Define . Let and . We define by
[TABLE]
The Fourier transform of a function is defined by
[TABLE]
where . We will also use notation as an abbreviation for .
Let and . We write if there exists a constant such that for all values of in the domain of . If takes only positive values we then define similarly if there exists a constant such that for all values of in the domain of . If the implied constant depends on some contant we use notations . If and we write We also write if
[TABLE]
The function is asymptotic to , denoted if
[TABLE]
We will use notation for . We also define norms
[TABLE]
for functions , and where is a metric space with metric .
For the function we define Gowers -norm by , where for some arbitrary and
[TABLE]
The functions and are regarded as functions on by defining if , where is regarded as embedded in in a natural manner. Note that is independent of the choice of .
Acknowledgments The author is grateful to his supervisor Kaisa Matomäki for suggesting the topic and for many useful discussions. The author also thanks Joni Teräväinen for reading the paper and giving useful comments. During the work author was supported by Academy of Finland project no. 293876 and by project funding from Emil Aaltonen foundation.
4 Transference principle
In this section our aim is to prove Lemma 1 that is a generalization of [MMS17, Proposition 3.1]. Lemma 1 is based on the transference principle. The transference principle was first introduced by Green [Gre05] and it has appeared to be a powerful tool to study additive problems. The following example shows how the transference principle works.
Let be a sparse set of positive integers and say that we are interested in existence of solutions of linear equation
[TABLE]
where and . That corresponds to finding a positive lower bound for the sum
[TABLE]
Depending on the set that might be difficult to find directly.
Let , where is some suitably chosen weight function. The key of the transference principle is to find some set with positive density such that . Due to the positive density of one might hope to prove that
[TABLE]
Then
[TABLE]
which, once made rigorous, implies that
[TABLE]
4.1 Sumset estimates
In this subsection we prove some helpful lemmas about sumsets which we use later to prove the result that is similar to (5).
Lemma 2**.**
For any , there exists a constant such that the following statement holds. Let be a natural number and . Let be two subsets with the properties that
[TABLE]
for each arithmetic progression with . Then
[TABLE]
The proof we are going to present mainly follows ideas of the proof of [EGM14, Theorem 4.1]. The main differences are that we only need part of that proof and we consider instead of . In order to prove Lemma 2 we need to first establish an arithmetic regularity lemma that is valid for multiple sets simultaneously. In general the arithmetic regularity lemma says that bounded function can be decomposed into a (well-equidistributed, virtual) -step nilsequence, an error which is small in -norm and a further error which is minuscule in the Gowers -norm, where is a parameter. The proof and some applications of such regularity lemma can be found in [GT10]. We only need the arithmetic regularity lemma in the case , and the proof of this simpler case can be found also in [Ebe16] which we will utilise.
Before we present and prove our regularity lemma, we need some necessary definitions. We define a metric on by
[TABLE]
Using usual metric on , the previously defined metric on and the discrete metric on we define a metric on by the sum of these metrics. Let . We say that is -irrational, if and implies that . We say a subtorus T of of dimension has complexity at most if there is some , all of whose coefficients have size at most , such that . In this case we implicitly identify with using . For instance, we say is -irrational in if is -irrational in . By a growth function, we mean increasing function .
Lemma 3**.**
Let . For , let be functions, a growth function and . Then there exist a quantity , positive integers and -irrational such that, for each , we have a decomposition
[TABLE]
of into functions such that
* for some function with ,* 2. 2.
, 3. 3.
.
*Proof. *This proof is a straight-forward generalization of the proof of [Ebe16, Theorem 7]. Let and, for , be growth functions depending on and in a manner to be determined. By [Ebe16, Theorem 5], for each , there exists and a decomposition
[TABLE]
of into functions such that
, where and with , 2. 2.
, 3. 3.
.
Let be the concatenation of the vectors . Now we define functions , for each .
By [Ebe16, Theorem 6] we can find such that for all and decomposes as
[TABLE]
where
, 2. 2.
for some and 3. 3.
is -irrational in a subtorus of complexity , which means that is -irrational in .
Then for each
[TABLE]
where is defined by
[TABLE]
Noting that , we can find exceeding and for all . But since , if is sufficiently large depending on then , and similarly , so if is sufficiently large depending on for all then for all . After all these dependencies are fixed we have , and the conclusion of the theorem holds.
Proof of Lemma 2. We can assume that is sufficiently large depending on , since otherwise we can choose and lemma is trivially true. Let be a growth function depending on . Let . Then by Lemma 3 there exists , positive integers and -irrational such that
[TABLE]
where , and
[TABLE]
for some with . Similarly
[TABLE]
where satisfy same requirements with subscripts and superscripts replaced by .
Let and consider, for and , the progressions
[TABLE]
Define by and by . Define also
[TABLE]
and
[TABLE]
Since is -Lipschitz we see that . Similarly . Now we have decomposition
[TABLE]
where , . Similar bounds hold with replaced by . Now given an arbitrary growth function depending on , we may choose to grow sufficiently rapidly depending on so that , whence and is -irrational. Write for the density of in and for the density of in .
Let be set of those pairs for which or . We see that
[TABLE]
since otherwise
[TABLE]
for either or , and that leads to a contradiction using Cauchy-Schwarz because . Now using deductions of the proof of [EGM14, Lemma 4.4] we get that if then
[TABLE]
for both and .
Next we prove a variant of [EGM14, Lemma 4.7].
Claim 1**.**
Let and be such that and . Then
[TABLE]
*Proof. *By (7) it suffices to prove
[TABLE]
From we get that
[TABLE]
Let be the set of those integers for which
[TABLE]
We see that (only values near the right end of do not belong to ).
Note that a product of two -Lipschitz functions, each of which is bounded pointwise by , is -Lipschitz. Therefore, when and is sufficiently rapidly growing, we can use [EGM14, Lemma A.3] to the function to get that
[TABLE]
where
[TABLE]
By [EGM14, Lemma A.11] we have that is -Lipschitz. Since is -irrational and can be chosen to grow arbitrarily fast, we have by [EGM14, Lemma 4.5] that
[TABLE]
where
[TABLE]
But by (8) and [EGM14, Lemma 4.6] with we have that
[TABLE]
Putting this all together,
[TABLE]
for a set of of size at least
[TABLE]
provided that is large enough depending on . We denote the set of those values by .
Now using the fact that when and [EGM14, Lemma A.12] with we see that
[TABLE]
for all . Similar bounds apply to \Big{|}f_{sml}^{\prime A}|_{I_{a,i}}*f_{struct}^{B}|_{I_{b,j}}(c)\Big{|} and \Big{|}f_{sml}^{\prime A}|_{I_{a,i}}*f_{sml}^{\prime B}|_{I_{b,j}}(c)\Big{|}. Therefore
[TABLE]
for all . Recalling that
[TABLE]
[TABLE]
[TABLE]
and provided that grows fast enough [EGM14, Lemma A.13] implies that
[TABLE]
for all in a subset of size at least
[TABLE]
All these lie in , which is of course contained in .
Conclusion of the proof of Lemma 2. Set . We can assume that since otherwise the claim is obvious. Then for all and . Now we have by assumption that and for all and . Thus by Claim 1 we get that
[TABLE]
where excluding an exceptional set with size at most . Now using (6), (10) and the fact that it follows that
[TABLE]
Lemma 4**.**
For any , there exists constant such that the following statements holds. Let be natural number and . Let be two subsets with the properties that
[TABLE]
for each arithmetic progression with . Then
[TABLE]
for each arithmetic progression with .
*Proof. *Let be as in Lemma 2 and set that . We can assume that since otherwise statement is obvious. Given we see that there exist progressions and with the same common difference such that , and . (Simply choose and , where is common difference of and ). Let and . Clearly . Recall that . Since it follows that
[TABLE]
and therefore
[TABLE]
Now define and , where is the common difference of the progression . We see that
[TABLE]
Our aim is now to use Lemma 2 to sets and . Recall that . Let and be progression such that . Set . Then by assumption
[TABLE]
Similarly . Since it follows by Lemma 2 that
[TABLE]
Thus by (11)
[TABLE]
4.2 Transference lemma
In this subsection we will finally establish Lemma 1, that is a crucial ingredient in proving our main theorem. Before that we use induction over Lemma 4 to get the following lemma that essentially is our version of (5).
Lemma 5**.**
For any and , there exists a constant such that the following statement holds. Let be natural numbers, and, for , let and be such that
[TABLE]
for each arithmetic progression with . Assume that
[TABLE]
Then, for each , we have
[TABLE]
*Proof. *We may assume that is sufficiently large, since the claim is obvious when (and we can choose to be sufficiently small). Fix a positive integer . Let us define and for . Let
[TABLE]
for . By (12) we see that , for , provided that is sufficiently small. Let , and, for , , where is as in Lemma 4. Let
[TABLE]
for each . We shall choose . Then it follows from Lemma 4 that
[TABLE]
for each arithmetic progression with . Similarly
[TABLE]
for each arithmetic progression with . Repeating this argument inductively, for each , we get that
[TABLE]
for each arithmetic progression with . Hence in particular
[TABLE]
for . Let . We see that
[TABLE]
since , where . Thus
[TABLE]
since . From (13) and (14) we get that for
[TABLE]
Repeating previous argument, it follows that
[TABLE]
Since whenever , it follows by (17) and (18) that
[TABLE]
We are now ready present and prove the transference lemma.
Lemma 6**.**
(Transference Lemma) 444Using this asymmetric version of the transference lemma and some other results of this paper one should be able to establish results concerning Waring-Goldbach problem with mixed powers on short intervals. Let and . For all , let and be positive real numbers such that
[TABLE]
and
[TABLE]
Let be a natural number and, for each let be a function that satisfies the following assumptions:
(Mean condition)* For each arithmetic progression with we have ;* 2. 2.
(Pseudorandomness condition)* There exists a majorant with pointwise, such that * 3. 3.
(Restriction estimate)* We have for some .*
Then for each we have
[TABLE]
where is a constant depending only on .
*Proof. *A symmetric version of the case has been shown in [MMS17, Section 4.3] by Matomäki, Maynard and Shao. With minor changes the same proof works for the asymmetric version with any using Lemma 5 in place of [MMS17, Proposition 3.2]. The main difference is that when [MMS17] uses Hölder’s inequality to get that
[TABLE]
we use Hölder’s inequality to get that
[TABLE]
where is chosen such that .
Lemma 1, which we will use to prove our main theorem, is a symmetric version of the previous lemma.
5 Proof of the Main Theorem
In this section we will prove Theorem 1 using Lemma 1 assuming some lemmas which we will prove later.
5.1 Definitions
Let such that ,
[TABLE]
where , and . We see that . Let be the characteristic function for the primes. Next, we define a majorant function for the function based on the linear sieve. Let
[TABLE]
where
[TABLE]
[TABLE]
and
[TABLE]
for certain to be chosen later. We know that , for all (see [Nat96, Theorem 9.3]). Set
[TABLE]
We will prove in Subsection 7.2.1 that . We now define functions , by
[TABLE]
and
[TABLE]
where
[TABLE]
5.2 Key lemmas
We will apply Lemma 1 to the functions and . The following three lemmas (to be proven later) show that the functions and satisfy the conditions of Lemma 1. We use the notation of Subsection 5.1.
Lemma 7**.**
(Mean condition) Let , and . Let and be as in (26). Let also be such that, for each interval of length , and every such that and , we have
[TABLE]
when is sufficiently large. Let be an arithmetic progression such that . If is sufficiently large then
[TABLE]
We shall quickly establish Lemma 7 in Section 6.
Lemma 8**.**
(Pseudorandomness condition) Let , , and . Let be as in (24) and be as in (27). Assume that \delta<\max\Big{(}\frac{2\theta-1}{k},\frac{\theta}{k(k/2+1)}\Big{)}. Then
[TABLE]
when is sufficiently large depending on .
We establish Lemma 8 in Section 7. The pseudorandomness condition (Lemma 8) is the hardest condition to establish and therefore we will spend most of the remaining paper proving Lemma 8. As stated in Section 2 to prove pseudorandomness we split the interval into minor and major arcs and treat those sets differently. For the minor arcs, we use an application of the [Hua16, Lemma 1] and for the major arcs, we develop some ideas that are from [Vau97, Section 4] and [Cho18, Section 4].
Lemma 9**.**
(Restriction estimate) Let and let be as in (26). Then there exists such that and
[TABLE]
We establish Lemma 9 in Section 8. The proof follows mostly by combining [BDG16, Theorem 1.1], [Dae10, Theorem 3] and some ideas of [Bou89, Section 4].
5.3 Conclusion
In this subsection we prove Theorem 1 assuming the lemmas presented in the previous subsection. Before presenting the proof we need the following lemma about local solutions of Waring’s problem.
Lemma 10**.**
Let and be such that , where and is as in (1). If , then congruence
[TABLE]
has a solution with .
*Proof. *Let be the number of solutions of the congruence (30). Let
[TABLE]
Let us first show that is multiplicative. For this, let , where . Using [Hua65, Lemma 8.1] it follows that
[TABLE]
Thus is multiplicative and so it suffices to prove the lemma with , where is a prime and . If we get from [Hua65, Lemma 8.3] that
[TABLE]
Together with [Hua65, Lemma 8.8] and [Hua65, Lemma 8.9] this implies the claim.
Proof of Theorem 1 assuming Lemmas 7, 8, 9. Let be a natural number for which and let . Our goal is to show that can be written in form
[TABLE]
where are primes which belong to the interval .
We now define the exact values of the variables and . Let
[TABLE]
We see that and . Hence (22) holds.
By lemma 10 we can choose with such that for some and . We shall apply Lemma 1 with where is as in (26).
Assuming Lemmas 7, 8 and 9 we have by Lemma 1 that, for each , there exists a representation
[TABLE]
where for each there exists a prime such that . Thus
[TABLE]
Set . Now if , it follows that as claimed. From (31) and definition of we see that
[TABLE]
[TABLE]
and
[TABLE]
Using these it follows that
[TABLE]
Thus when is large enough.
6 Mean condition
Proof of Lemma 7 By (26) we see that
[TABLE]
Since is an arithmetic progression with , there exist integers such that , and . Therefore, for , there exists such that , where
[TABLE]
By (see eq. (19)) we see that
[TABLE]
Set and . Note that and . Then
[TABLE]
By the mean value theorem and (22) we have that
[TABLE]
for any provided that is large enough. Similarly
[TABLE]
Thus and . We also get that
[TABLE]
provided that is large enough depending on . Now if is sufficiently large it follows by (29), (32) and (33) that
[TABLE]
By (19) we have that and thus . Using (19), [IR90, Proposition 4.2.1] and [IR90, Proposition 4.2.2] we get that . Therefore
[TABLE]
7 Pseudorandomness condition
We assume notation of Subsection 5.1. In this section we will prove Lemma 8. In order to do so we divide into two disjoint sets, major and minor arcs, using Hardy and Littlewood decomposition.
Let
[TABLE]
for to be chosen later and as in (24). For and , write . Let
[TABLE]
If is suitably small, and, if is sufficiently large, then and thus all intervals are disjoint. Let also . We call major arcs and minor arcs.
Next we decompose . From (27) we have that
[TABLE]
where
[TABLE]
Using (23) we can write
[TABLE]
where the function
[TABLE]
is called generating function.
7.1 Minor arcs
In this subsection we will prove the following lemma which immediately implies Lemma 8 for .
Lemma 11**.**
Let , , , and . Let also be as in (27) and be as in (34). Then
[TABLE]
Lemma 11 will easily follow from the following estimate for the generating function on the minor arcs.
Lemma 12**.**
Let , , , and . Let and be as in . Let also . Then
[TABLE]
We have the trivial bound . We also note that by the mean value theorem and (22)
[TABLE]
*Proof. *Let . By Dirichlet’s Theorem (see e.g. [Nat96, Theorem 4.1]) there exist integers and such that
[TABLE]
Because we must have . By (37)
[TABLE]
where with . Let
[TABLE]
In order to analyse we will use a result of Huang [Hua16, Lemma 1]. Note that from underlying proof it follows that [Hua16, Lemma 1] also holds when and is replaced by . Assume that . Then, for small enough ,
[TABLE]
Now by [Hua16, Lemma 1] for suitable small depending on and any either
[TABLE]
or there exist integers and such that
[TABLE]
and
[TABLE]
By (34), (39) and (42) we have
[TABLE]
for some , when is small enough and . Assuming that is sufficiently large, depending on , we have that
[TABLE]
and consequently and . Thus
[TABLE]
Now the claim follows from (40), (41) and (43).
Proof of Lemma 11. Let . By Dirichlet’s Theorem (see e.g. [Nat96, Theorem 4.1]) there exist integers and such that
[TABLE]
Because we must have . Thus
[TABLE]
From Lemma 12 and (34) we get that
[TABLE]
Together with (35), (36), (37) and (38) it follows that
[TABLE]
7.2 Major arcs
In this subsection we will establish Lemma 8 when . In particular we need to understand the generating function (37) on the major arcs. We use a standard strategy similar to [Vau97, Section 4.1] to approximate our generating function. The result we will prove is the following.
Lemma 13**.**
Let , , , and . Let also be as in (27). Then
[TABLE]
when is sufficiently large depending on .
7.2.1 Auxiliary lemmas
In this subsection we state two lemmas that follow from standard linear sieve estimates. They are needed in order to prove Lemma 13.
Lemma 14**.**
Let and such that . Let be as in (23). Then
[TABLE]
Additionally, if , then
[TABLE]
*Proof. *Let the sieving range be primes not dividing . Then it follows by Mertens formula (see e.g. [IK04, formula (2.16)]) and from the theory of linear sieve (see e.g. [Nat96, Theorem 9.6] and [Nat96, Theorem 9.8]) that, for any ,
[TABLE]
Define
[TABLE]
For the lower bound we use the following strategy.
[TABLE]
by the prime number theorem provided that is large enough. Since we have that
[TABLE]
From the previous lemma we get that
[TABLE]
for any and for some provided that is large enough.
Lemma 15**.**
Let and be such that . Let be as in (23). Then
[TABLE]
*Proof. *The proof follows the same general idea, which is used to prove the upper bounds with the linear sieve (see e.g. [Nat96, Section 9]). Set . We can assume that , which means that is also square-free. Therefore
[TABLE]
Let
[TABLE]
and
[TABLE]
Then
[TABLE]
We also note the following estimate
[TABLE]
Next we establish recursive formula for the upper bound of which is similar to [Nat96, Lemma 9.4]. In case we see by (47) that
[TABLE]
Hence by [Nat96, Lemma 9.2]
[TABLE]
Now let
[TABLE]
Then it follows that
[TABLE]
By (48), (49) and [Nat96, Lemma 9.4] we have that
[TABLE]
where
[TABLE]
Therefore by (46), (47) and [Nat96, Lemma 9.3]
[TABLE]
Thus by [Nat96, Theorem 9.6], [Nat96, Theorem 9.8] and Mertens formula (see e.g. [IK04, (2.16)])
[TABLE]
Using Mertens formula again we get that
[TABLE]
Since , it now follows that
[TABLE]
The claim now follows from (45).
7.2.2 The generating function
Our main goal in this subsection is to approximate the generating function on the major arcs by , where ,
[TABLE]
and
[TABLE]
say. For we define
[TABLE]
so that
[TABLE]
Set so that . Then, for all ,
[TABLE]
where and . Hence
[TABLE]
We need the following auxiliary lemma in order to estimate .
Lemma 16**.**
Let be a prime number, , and . Let be the number of solutions of
[TABLE]
with and . Then
[TABLE]
*Proof. *The claim follows from the facts that the equation
[TABLE]
has at most solutions with and, for any such , the equation
[TABLE]
has at most solutions with .
Now we can start estimating . The following lemma is based on [Vau97, Lemma 4.1] and has therefore a similar proof.
Lemma 17**.**
Let and . Then
[TABLE]
where . This also means that
[TABLE]
*Proof. *By (54) its enough to prove that
[TABLE]
where is a prime number and . Case follows directly from [Sch76, Chapter II, Corollary 2F]. Thus we can suppose that .
Assume first that . Then both and run through all residue classes modulo when runs through all residue classes modulo . Thus
[TABLE]
and thus (55) holds by [Vau97, Lemma 4.1] since .
Now it remains to prove (55) when . Let
[TABLE]
We have that . Also when runs through all residue classes modulo and runs through all residue classes modulo then runs through all residue classes modulo . Thus
[TABLE]
Thus
[TABLE]
where is the number of solutions of the congruence
[TABLE]
with . Let be such that and . If , then congruence (56) is insoluble, which gives us the claim. Hence we can assume that . We can also assume that since otherwise (55) is trivial. Now we must have that because otherwise (56) is insoluble and (55) is immediate. Thus is at most the number of solutions of
[TABLE]
with and . From Lemma 16 we get that
[TABLE]
Therefore by
[TABLE]
Next we show that the function (defined in (50)) can be approximated by an integral.
Lemma 18**.**
Let and . Then
[TABLE]
*Proof. *Let
[TABLE]
Using partial summation and integration by parts it follows that
[TABLE]
Now we are ready to prove an approximation lemma for the generating function . The proof will mostly follow the proof of [Vau97, Theorem 4.1].
Lemma 19**.**
Let , , and . If then
[TABLE]
Proof We see that
[TABLE]
Writing with and , we see that
[TABLE]
Thus
[TABLE]
where
[TABLE]
For and let . Then exists and is continuous and is monotonic on . We also see that , where , when . Thus by van der Corput method (see e.g. [Vau97, Lemma 4.2]) we have that
[TABLE]
where
[TABLE]
Since
[TABLE]
by the divisor bound (see e.g. [Nat96, Theorem A.11]), we have from Lemma 17 and (58) that
[TABLE]
where . Using integration by parts it follows that
[TABLE]
Therefore by Lemmas 17, 18 and (60)
[TABLE]
We can write previous lemma as follows.
Lemma 20**.**
Let , , and . If and , then
[TABLE]
The following two lemmas will be needed for showing that the main contribution of the major arcs comes when .
Lemma 21**.**
Let be such that and . Write , where is -smooth and . Then
[TABLE]
where
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
*Proof. *We mostly follow ideas presentend in [Cho18, Section 4].
Recalling (51) and (54) we see that
[TABLE]
Let , , and . By (52)
[TABLE]
Since
[TABLE]
we have that
[TABLE]
Because and , we see that and
[TABLE]
We split into several cases: (i) (ii) (iii) , and (iv) and .
Case (i) . Trivially true.
Case (ii) . We have , since is -smooth and . Therefore in this case from which it follows that by (61).
Case (iii) , and . Clearly . Also because and we have by (52) that . Thus by (53)
[TABLE]
Now because and it follows that the inner sum in the last expression vanishes. Therefore in this case.
Case (iv) and . As in case (iii) we have that . Since we get that
[TABLE]
Because it also follows that
[TABLE]
Thus by (61)
[TABLE]
Lemma 22**.**
Assume the notation of Lemma 21. Let , be as in (23) and be as in (28). Then
[TABLE]
*Proof. *Case follows from Lemma 14. Case and or is clear since vanishes by Lemma 21. Assume that , and . We can write
[TABLE]
By Lemma 21 it follows that
[TABLE]
Since we have by [Hua40, Theorem] that
[TABLE]
The claim now follows by divisor bound (see e.g. [Nat96, Theorem A.11]) and Lemma 15.
7.2.3 Proof of Lemma 13
Proof of Lemma 13 Let . First we will analyse the function on . Recall from (35) that we essentially need to analyse the function . By (36) and Lemma 20 we have that
[TABLE]
Similarly to (59) we note that . Hence by (35)
[TABLE]
From (21), (22), (24), (34) it follows that the error term is
[TABLE]
for some , provided that and are sufficiently small and . Now it follows from Lemma 22 that for the main term of in (63) holds that
[TABLE]
If then by Lemma 21 either the main term of (63) is [math] or we have and . In case and we have by (65) that the main term is . Therefore when .
Assume now that . Then and so that
[TABLE]
by (20) and (21). Hence by (63) and (64)
[TABLE]
Together with (25) this implies that
[TABLE]
Now it remains analyse function when . If then and
[TABLE]
Thus, when is small enough and ,
[TABLE]
since
[TABLE]
when and .
Combining Lemmas 11 and 13, and noting that
[TABLE]
we get Lemma 8.
We also record the following lemma for later use.
Lemma 23**.**
Let be suitably small, , , , , and . Let also be as in (27). Then
[TABLE]
*Proof. *The claim follows from the main term estimate (65), the error term estimate (64) and the fact that
[TABLE]
8 Restriction estimate
To establish Lemma 9 we will use a strategy which is very similar to what Sam Chow uses in his case [Cho18, Section 5]. Most significant differences are that we have our function defined on short interval and that we need to use Bourgain’s strategy (see [Bou89, Section 4]) only once, since we have power saving on the minor arcs. First we prove the following restriction estimate that has an additional factor.
Lemma 24**.**
Let , and . Let be as in (26). Then
[TABLE]
*Proof. *Let . Using orthogonality and the definition of we see that
[TABLE]
Let . By the mean value theorem . Let denote the number of integral solutions for the following system
[TABLE]
with . Now it follows from [Dae10, Theorem 3] and [BDG16, Theorem 1.1] that
[TABLE]
Since by (21) we obtain
[TABLE]
The claim now follows since
[TABLE]
8.1 Bourgain’s strategy
To obtain Lemma 9 from Lemma 24 we will use a strategy that was originally introduced by Bourgain [Bou89, Section 4] and is used similarly to our case in [Cho18, Section 5] and [BP17, Section 6]. The following lemma is our version of the Bourgain’s argument. It essentially says that if we have for a function and a value satisfying certain conditions, then for all .
Lemma 25**.**
Let , , and with and . Let and be such that for all . Let us have a Hardy-Littlewood decomposition:
[TABLE]
where and are some real variables with and for some large contant . Assume that
** 2. 2.
** 3. 3.
(Major arc estimate) If then 4. 4.
(Minor arc estimate) If then
Then
[TABLE]
*Proof. *For , define
[TABLE]
It is enough to prove that , for every , since that implies
[TABLE]
provided that .
Fix . Since by assumption 2
[TABLE]
we get that
[TABLE]
Thus we can assume that .
It suffices to show that if are any -spaced points in , then necessarily
[TABLE]
To prove (67), we define such that and for all . Furthermore, we define such that and for all . From Cauchy-Schwarz-inequality and assumption 1 it follows that
[TABLE]
Thus
[TABLE]
Now let be a parameter to be chosen later. Then by Hölder’s inequality
[TABLE]
Recalling , we obtain from the minor arc estimate (assumption 4) that
[TABLE]
Therefore
[TABLE]
Let , with . Note that . From the major arc estimate (assumption 3) we get that
[TABLE]
The right hand side of (69) is negligible compared to provided that is large enough. Thus, combining this with (68) and the major arc estimate (assumption 3), we get that
[TABLE]
Hence
[TABLE]
where
[TABLE]
The inequality (70) is very similar to [Bou89, Eq. (4.16)]. We have instead of , instead of and instead of . Assuming that , we can then apply Bourgain’s strategy and use [Bou89, Eq. (4.27)] and [Bou89, Lemma 4.28] to obtain
[TABLE]
where and are some arbitrarily chosen constants with and is a constant depending on and . If we choose to be sufficiently large depending on and , then . Therefore
[TABLE]
when and are suitably chosen. Hence (67) holds and the claim follows.
Now we are ready to prove Lemma 9.
Proof of the Lemma 9 The claim will follow applying Lemma 25 with , , , , and , where and are sufficiently small. Note that for all . We also use Hardy-Littlewood decomposition defined in Section 7. If is chosen to be suitable small depending on and , then provided that is large enough. Assumptions 1 - 4 follow, respectively, from Lemma 8 (by it ), Lemma 24, Lemma 23 and Lemma 11. Lemma 9 now follows from Lemma 25.
As noted in Section 5 this also completes the proof of Theorem 1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[BHP 01] R.. Baker, G. Harman and J. Pintz “The difference between consecutive primes. II” In Proc. London Math. Soc. (3) 83.3 , 2001, pp. 532–562 DOI: 10.1112/plms/83.3.532 · doi ↗
- 2[Bou 89] J. Bourgain “On Λ ( p ) Λ 𝑝 \Lambda(p) -subsets of squares” In Israel J. Math. 67.3 , 1989, pp. 291–311 DOI: 10.1007/BF 02764948 · doi ↗
- 3[BDG 16] Jean Bourgain, Ciprian Demeter and Larry Guth “Proof of the main conjecture in Vinogradov’s mean value theorem for degrees higher than three” In Ann. of Math. (2) 184.2 , 2016, pp. 633–682 DOI: 10.4007/annals.2016.184.2.7 · doi ↗
- 4[BP 17] T.. Browning and S.. Prendiville “A transference approach to a Roth-type theorem in the squares” In Int. Math. Res. Not. IMRN , 2017, pp. 2219–2248 DOI: 10.1093/imrn/rnw 096 · doi ↗
- 5[Cho 18] Sam Chow “Roth–Waring–Goldbach” In Int. Math. Res. Not. IMRN , 2018, pp. 2341–2374 DOI: 10.1093/imrn/rnw 307 · doi ↗
- 6[Dae 10] Dirk Daemen “The asymptotic formula for localized solutions in Waring’s problem and approximations to Weyl sums” In Bull. Lond. Math. Soc. 42.1 , 2010, pp. 75–82 DOI: 10.1112/blms/bdp 095 · doi ↗
- 7[Ebe 16] S. Eberhard “The abelian arithmetic regularity lemma” In Ar Xiv e-prints , 2016 ar Xiv: 1606.09303 [math.NT]
- 8[EGM 14] Sean Eberhard, Ben Green and Freddie Manners “Sets of integers with no large sum-free subset” In Ann. of Math. (2) 180.2 , 2014, pp. 621–652 DOI: 10.4007/annals.2014.180.2.5 · doi ↗
