Optimal mean value estimates beyond Vinogradov's mean value theorem
Julia Brandes, Trevor D. Wooley

TL;DR
This paper presents new, sharper mean value estimates for certain Diophantine systems, achieving the best possible bounds and establishing the Hasse principle for specific cubic and quadratic equations, surpassing previous limitations.
Contribution
It introduces the first bounds of this quality for non-Vinogradov type Diophantine systems and proves the Hasse principle in new parameter ranges.
Findings
Attained sharpest conjectured bounds for certain Diophantine systems.
Established the Hasse principle for systems with cubic and quadratic equations in specific variables.
Achieved the convexity barrier for these problems.
Abstract
We establish improved mean value estimates associated with the number of integer solutions of certain systems of diagonal equations, in some instances attaining the sharpest conjectured conclusions. This is the first occasion on which bounds of this quality have been attained for Diophantine systems not of Vinogradov type. As a consequence of this progress, whenever we obtain the Hasse principle for systems consisting of cubic and quadratic diagonal equations in variables, thus attaining the convexity barrier for this problem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Optimal mean value estimates beyond
Vinogradov’s mean value theorem
Julia Brandes
Mathematical Sciences, University of Gothenburg and Chalmers Institute of Technology, 412 96 Göteborg, Sweden
and
Trevor D. Wooley
Department of Mathematics, Purdue University, 150 N. University Street, West Lafayette, IN 47907-2067, USA
Abstract.
We establish improved mean value estimates associated with the number of integer solutions of certain systems of diagonal equations, in some instances attaining the sharpest conjectured conclusions. This is the first occasion on which bounds of this quality have been attained for Diophantine systems not of Vinogradov type. As a consequence of this progress, whenever we obtain the Hasse principle for systems consisting of cubic and quadratic diagonal equations in variables, thus attaining the convexity barrier for this problem.
Key words and phrases:
Exponential sums, Hardy–Littlewood method
2010 Mathematics Subject Classification:
11L15, 11D45, 11L07, 11P55
1. Introduction
In recent years, our understanding of systems of diagonal equations and their associated mean values has advanced rapidly. Whilst only a few years ago, such mean values had been comprehensively understood only in the most basic cases, the resolution of the main conjecture associated with Vinogradov’s mean value theorem by the second author [13, 14] and Bourgain, Demeter and Guth [1] has transformed the landscape. It now seems feasible to address the challenge of establishing similarly strong results for a much wider class of cognate problems.
In this memoir, we make progress towards, and in certain cases attain, the convexity barrier for a family of mean values associated with systems of equations that fail to be translation-dilation invariant and thus lie outside the scope of the efficient congruencing and -decoupling methods developed by the second author [13, 14] and Bourgain, Demeter and Guth [1]. The most accessible of our results addresses systems of cubic and quadratic diagonal equations. Let denote the number of integral solutions of the system of equations
[TABLE]
consisting of quadratic and cubic equations of diagonal shape. Here and throughout we assume the coefficients of such systems to be integral. It is clear that the presence of coefficients in such systems necessitates some kind of non-singularity condition, lest the equations interact in some non-generic way. We refer to an matrix as highly non-singular if and any collection of distinct columns of forms a non-singular matrix.
Our first result shows that satisfies the anticipated asymptotic formula for all sets of coefficients in general position, provided that and . This achieves the conjectured convexity barrier.
Theorem 1.1**.**
Suppose that and that . Assume further that the coefficient matrices
[TABLE]
are highly non-singular. Then there exist constants and such that
[TABLE]
Moreover, if the system (1.1) has non-singular real and -adic solutions for all primes , then .
In general, asymptotic formulæ like the one supplied by (1.2) are expected to hold whenever the number of variables exceeds twice the total degree of the system. However, thus far the validity of such an asymptotic formula has been proved only in a few isolated instances. Arguably the first non-trivial case in which this convexity barrier was achieved occurs in work of Cook [7, 8] concerning pairs and triples of diagonal quadratic equations. Recent work of Brüdern and the second author [5, 6] obtains asymptotic lower bounds at the convexity limit for systems of diagonal cubic forms. In the case of mixed systems of cubic and quadratic equations, work of the second author underlying [12, Theorem 1.2] achieves the convexity limit in the case with relating to systems consisting of one cubic and one quadratic diagonal equation. Most recently, investigations of the first author joint with Parsell [3, Theorem 1.4] establish an asymptotic formula tantamount to (1.2) for systems of cubic and quadratic diagonal equations, though under the more restrictive hypothesis that , thus missing the convexity barrier whenever . In subsequent work [2], the first author proved that an asymptotic formula of the shape (1.2) holds when and , which misses the convexity barrier when . Thus, Theorem 1.1 provides the first instance where bounds of the expected quality have been achieved for systems of cubic and quadratic equations in settings where both and exceed .
Theorem 1.1 is in fact a special case of our more general Theorem 1.5 below. Both of these results rest on our new estimates for certain mean values of Vinogradov type. In their most general form, such mean values encode the number of integral solutions of systems of the general shape
[TABLE]
in which are non-negative integers and the coefficients are integers. When all of the coefficient matrices
[TABLE]
are highly non-singular, then the main conjecture states that the number of integral solutions of the system (1.3) should be at most of order , for any , where denotes the system’s total degree. A corresponding lower bound, with , is provided by an argument akin to that delivering [11, equation (7.4)]. Systems of the shape (1.3) have previously been studied by the first author together with Parsell [3], where it was shown that the main conjecture for such systems holds when for all . In the latter circumstances, the system (1.3) can be viewed as a superposition of Vinogradov systems of various degrees (see Theorem 2.1 and Corollary 2.2 in that paper). In wider generality, bounds of the strength of those described in Theorems 1.1 and 1.5 were known hitherto only for systems of quadratic equations and systems of Vinogradov type, as well as superpositions of these two special classes of systems.
The goal of the work at hand is to enlarge the range of systems of type (1.3) for which the main conjecture is known to hold. When the coefficient matrices are highly non-singular for , we denote by the number of integral solutions of the system (1.3), where
[TABLE]
Write further
[TABLE]
so that denotes the total degree of the system.
In order to describe our new results concerning the mean value , we need to consider certain auxiliary systems of equations. Let be an integer and write . Then, given a positive number , we denote by the number of integer tuples and satisfying
[TABLE]
The main conjecture for systems of the shape (1.5) claims that
[TABLE]
Our first main result is as follows.
Theorem 1.2**.**
Suppose that , and are integers with , and assume (1.6) for . Then for any and any we have
[TABLE]
By combining the ideas of the proof of Theorem 1.2 with those underlying [3, Theorem 2.1], we can extend our results to cover also superpositions of systems of equations of the kind considered in Theorem 1.2. Fix a collection of degrees with associated multiplicities . Moreover, fix a tuple of non-negative integers with for , set , and define and for . Now define the parameter by putting
[TABLE]
We denote by
[TABLE]
the number of integer solutions of the system (1.3) with defined as in (1.7). These systems can be viewed as superpositions of systems of the shape considered in Theorem 1.2 with parameters , together with additional quadratic equations. Here, the total degree is given by
[TABLE]
where, in accordance with (1.4), we write
[TABLE]
In this notation, we have the following generalisation of Theorem 1.2.
Theorem 1.3**.**
Let be a non-negative integer. Suppose that , and let as well as be natural numbers with and for . Also, assume (1.6) for all degrees with . Then for and any we have
[TABLE]
We also have an alternative, unconditional formulation of this result, which is given in Theorem 3.3 below.
To illustrate the strength of our results in Theorems 1.2 and 1.3, we discuss in more detail some of the most relevant special cases. Among the systems of diagonal equations not of Vinogradov type, the most well-studied ones are systems of cubic equations and systems of cubic and quadratic equations, such as we considered in our motivating example in Theorem 1.1. Regarding such systems, it is immediate from work of the second author [12, Theorem 1.1] that for every one has , and this bound implies via [3, Theorem 2.1] that for all . Theorem 1.3 now allows us to improve this result.
Corollary 1.4**.**
Suppose that and . Then for any we have
[TABLE]
This follows from Theorem 1.3 in combination with Lemma 2.1 below. Corollary 1.4 represents only the second occasion, after the second author’s successful treatment of the cubic case of Vinogradov’s mean value theorem [13], that the convexity barrier has been attained for a system of diagonal equations involving cubic equations. In particular, we now have the main conjecture for mean values that correspond to systems consisting of one cubic and three quadratic diagonal equations. This is the main new input that enables us to prove Theorem 1.1.
Our results complement older ones that can be obtained by other means. On the one hand, it follows from Theorem 2.1 and Corollary 2.2 of the first author’s work with Parsell [3] in combination with Vinogradov’s mean value theorem [1, Theorem 1.1] that the conclusion of Theorem 1.3 holds unconditionally in the range
[TABLE]
On the other hand, for small the second author’s result [14, Corollary 1.2] can be combined with the arguments of [3, Theorem 2.1] to establish the conclusion of Theorem 1.3 unconditionally in the range
[TABLE]
Mean value estimates like those of Theorems 1.2 and 1.3 have long been employed to establish asymptotic formulæ for the number of solutions of simultaneous diagonal equations. For as in (1.7) and highly non-singular coefficient matrices , denote by the number of integral solutions of the system of equations
[TABLE]
with for . It is well known that, if is sufficiently large in terms of , and , there is an asymptotic formula of the shape
[TABLE]
where is a non-negative constant encoding the local solubility data for the system (1.10). The relevant question is how large has to be for an asymptotic formula like that of (1.11) to hold. Theorem 1.1 of [3] provides a bound for that is somewhat unwieldy, but can likely be reduced to
[TABLE]
by accounting for our revised treatment of the major arcs described in §§5–6 below. On the other hand, unless fundamentally new methods become available that avoid the use of mean values, we cannot expect to be able to establish such asymptotic formulæ when . Thanks to our new mean value estimates in Theorem 1.3, we are now able to make progress towards this theoretical barrier.
Theorem 1.5**.**
Let be a non-negative integer. Suppose that , and let as well as be natural numbers with and for . Also, assume (1.6) for all degrees with . Then for the asymptotic formula (1.11) holds with . If, furthermore, the system (1.10) has non-singular solutions in as well as in the fields for all , then the constant is positive.
Again, we refer to Theorem 4.1 below for an unconditional version of this result. Moreover, we note that in Lemma 2.1 below it is shown that the bound (1.6) holds for , and thus Theorem 1.1 can be deduced as a special case of Theorem 1.5, corresponding to the parameters and .
The proofs of our results rest on an idea that played a crucial role in the second author’s work on pairs of quadratic and cubic diagonal equations [12], and which has been explored further in the authors’ recent work on incomplete Vinogradov systems [4]. In these papers, the missing linear equation is artificially added in, which makes it possible to exploit the strong bounds on Vinogradov’s mean value theorem. By taking advantage of the translation-dilation invariance of the newly completed Vinogradov systems, we then relate these systems to the auxiliary mean values introduced above. Whilst our understanding of these auxiliary mean values remains unsatisfactory for general degree, the quantity may be comprehensively understood in terms of quadratic Vinogradov systems. This observation plays a pivotal role in our argument, and it is the main reason why we attain the convexity barrier in Theorem 1.1 and Corollary 1.4.
Notation. Throughout, the letters , , , and , as well as the entries of the vectors , , , and , will denote non-negative integers. The letter will be used to denote an arbitrary, but sufficiently small positive number, and we adopt the convention that whenever it appears in a statement, we assert that the statement holds for all sufficiently small . We take to be a large positive number which, just like the implicit constants in the notations of Landau and Vinogradov, is permitted to depend at most on , , , , the coefficient matrices , and . We employ the non-standard notation that when is integrable for some , then
[TABLE]
Here and elsewhere, we use vector notation liberally in a manner that is easily discerned from the context. In particular, when denotes the integer tuple , we write .
Acknowledgements. Both authors thank the Fields Institute in Toronto for excellent working conditions and support that made this work possible during the Thematic Program on Unlikely Intersections, Heights, and Efficient Congruencing. This work was further facilitated by subsequent visits of the first author to the University of Bristol, and of the second author to the University of Waterloo. The authors gratefully acknowledge the hospitality of both institutions. The work of both authors was supported by the National Science Foundation under Grant No. DMS-1440140 while they were in residence at the Mathematical Sciences Research Institute in Berkeley, California, during the Spring 2017 semester. The first author’s work was supported in part by Starting Grant 2017-05110 from Vetenskapsrådet. The second author’s work was supported by a European Research Council Advanced Grant under the European Union’s Horizon 2020 research and innovation programme via grant agreement No. 695223, and in the final stages by the National Science Foundation via Grant No. DMS-1854398 and DMS-2001549.
The authors are also very grateful to Scott Parsell for pointing out an oversight in an earlier version of this paper, which necessitated a fundamental re-write of parts of the argument.
2. Preliminaries and preparatory steps
Our goal in this and the next section is the proof of Theorem 1.3. Before delving to the core of the argument, we pause to introduce some notation and establish a mean value estimate that will be of use in our subsequent discussion. For we define the exponential sum by putting
[TABLE]
and we write
[TABLE]
Then, with the standard notation associated with Vinogradov’s mean value theorem in mind, we put
[TABLE]
We note that the main conjecture associated with Vinogradov’s mean value theorem is now known to hold for all degrees. This is classical when , it is a consequence of work of the second author [13] for degree , and for degrees exceeding three it follows from the work of Bourgain, Demeter and Guth, and of the second author (see [1, Theorem 1.1] and [14, Corollary 1.3]). Thus, for all one has
[TABLE]
For future reference, we record the trivial inequality
[TABLE]
which is valid for all .
We begin by bounding the mean value .
Lemma 2.1**.**
Let , and be large real numbers. Then one has
[TABLE]
Proof.
Upon considering the underlying system of equations, we see that the mean value on the left hand side of (2.4) is given by the number of integer solutions of the system of equations
[TABLE]
with , and for . The second of these equations permits the substitution into the first, whence
[TABLE]
Suppose first that is non-zero. Then for each of the possible choices for , and fixing the latter integer in such a manner, an elementary divisor function estimate shows there to be possible choices for the integers and , and hence also for and . These choices also fix , so we see that there are solutions of this first type. Meanwhile, if , then or , and at the same time either or . In any case, therefore, each of the possible choices for and determine and either or . Since there are possible choices left by this constraint for the latter, and is again fixed by these choices just as before, we find that there are solutions of this second type. The conclusion of the lemma follows by summing the contributions from both types of solutions. ∎
Upon taking in Lemma 2.1, we conclude that , which establishes (1.6) for . We remark also that the system (2.5) can be interpreted as being of Vinogradov shape of degree two by means of the substitution and for . Viewed in this way, Lemma 2.1 amounts to no more than a rephrasing of the classical elementary proof of the quadratic case in Vinogradov’s mean value theorem.
We now initiate the proof of Theorem 1.3, assuming the hypotheses of its statement. For , let
[TABLE]
Define for . When , write , where runs over all values for which . We then put
[TABLE]
Also, set for and for , and put . Then by orthogonality we have
[TABLE]
Set , and for a set of positive integers to be fixed later take
[TABLE]
Thus, on recalling (1.7), we see in particular that
[TABLE]
Further, let denote the set of all integral -tuples
[TABLE]
with pairwise distinct entries , and put
[TABLE]
We can bound in terms of . In particular, this will allow us to concentrate on the case when .
Lemma 2.2**.**
For any fixed choice of the positive integers , we have the bounds
[TABLE]
Proof.
When , the trivial bound delivers the estimate
[TABLE]
and the conclusion of the lemma follows in this case from (2.3). Suppose now that . Then from (2.3) and an application of Hölder’s inequality, we find that
[TABLE]
Thus the lemma is established in both cases. ∎
Suppose that the maximum in (2.8) is assumed at the tuple , which we consider fixed for the remainder of the analysis. For and , set when and , and likewise when . We then have the coefficient matrices
[TABLE]
We define via the relations for , and put for . Here, we employ notational conventions analogous to those described in the sequel to (2.6).
Write
[TABLE]
Thus, in the case and , we have .
Lemma 2.3**.**
One has
[TABLE]
Proof.
Recall (2.7) and (2.8). For temporary notational convenience, we put . Then, after possibly relabelling indices, we see from (2.3) that
[TABLE]
The desired conclusion now follows by essentially the same argument as in [3, Theorem 2.1]. Recall that the coefficient matrices are highly non-singular. Consequently, the matrices underlying the mean value in (2.8) inherit that property. Upon considering the underlying Diophantine equations and applying elementary row operations, we may thus assume without loss of generality that the first submatrix of each matrix is diagonal.
Recall the definition of the parameter from (1.7). Since has entries, we see that the matrix is of square format and hence diagonal. Thus we have for . In particular, the entries of with are independent of all the variables with . We may therefore interpret as the ordered pair with and . In this notation we can write
[TABLE]
where
[TABLE]
and
[TABLE]
The latter mean value counts integer solutions of the system
[TABLE]
where each solution is counted with a unimodular weight depending on . It then follows from the triangle inequality and Hua’s lemma that
[TABLE]
We now iterate this procedure for . For each index , we see from (1.7) that only when . Moreover, we have if , and if . Since we had arranged for the first submatrices of each to be diagonal, it follows that the entries of with are independent of all the variables with and , and also of all with . Together, these latter groups of variables form the vectors with . Hence by a similar argument to that encountered before, we can write , where and , noting in particular that the vector is empty. For put
[TABLE]
and take . Also, let
[TABLE]
Note that this mean value counts integer solutions to the system
[TABLE]
where each solution is counted with a unimodular weight depending on . An application of the triangle inequality shows that . We thus deduce that for we have
[TABLE]
and upon iterating we find that
[TABLE]
The conclusion of the lemma follows upon combining this bound with (2.9), (2.10) and (2.11). ∎
3. The underlying mean value
From Lemma 2.3 it is clear that the desired bound will follow if we can show that for . We thus proceed to establish the latter bound. In the discussion of Lemmata 3.1 and 3.2 that follows, it is expedient to drop all mention of the indices with . Note also that in this situation, we have and for . We introduce variables and define to be the identity matrix. Set further , and extend our previous notational conventions surrounding the vector so as to incorporate in the natural manner.
Next, we define
[TABLE]
We begin by establishing the bound contained in the following lemma.
Lemma 3.1**.**
One has .
Proof.
Define to be 1 when , and [math] otherwise. We decompose the set into the blocks for . The mean value counts the number of integral solutions of the system of equations
[TABLE]
where
[TABLE]
with for and for . Observe that in our current situation all coefficient matrices with are of format . Just as in the proof of Lemma 2.3, we can therefore assume without loss of generality that the coefficients with vanish except when . Also, note that the constraints on the expressions for imposed by the linear equations in (3.2) are void, since the ranges for the new variables automatically accommodate all possible values for within (3.2).
We now consider the effect of shifting every variable with index in a given block by an integer with . By the binomial theorem, for any family of shifts , one finds that is a solution of (3.2) if and only if it is also a solution of the system
[TABLE]
where
[TABLE]
Thus, for each fixed integer -tuple with (), the mean value is bounded above by the number of integral solutions of the system
[TABLE]
with and . On applying orthogonality and averaging over all possible choices for , we therefore infer that
[TABLE]
where
[TABLE]
The proof of the lemma is completed by reference to (2.1) and (3.1). ∎
We can now turn to the task of estimating . We will do this in somewhat wider generality than is required for the proofs of Theorems 1.3 and 1.5. This will allow us to prove the unconditional results adumbrated in the introduction.
When is an integer and , denote by the mean value
[TABLE]
When and are integers, this mean value counts the number of integer tuples and satisfying
[TABLE]
In particular, we have . The main conjecture for mean values of the shape (3.3) states that for all one should have
[TABLE]
Note that the case when corresponds to Vinogradov’s mean value theorem, and in this case the bound (3.4) is known (see equation (2.2) above).
Suppose that and are integers with and
[TABLE]
We now choose
[TABLE]
so that at the critical point we have . Note also that by (3.5) as well as the definition of in (1.4), the quantity is indeed an integer whenever .
Lemma 3.2**.**
Let and be integers with and satisfying the conditions and . Assume also that and . Then we have
[TABLE]
Proof.
Set
[TABLE]
and
[TABLE]
Then it follows from (3.1) via (2.3) that, after possibly relabelling variables, we have
[TABLE]
Recall now that we had arranged for the coefficient matrices to be diagonal. Consequently, the variables are independent of those having and . Then, by setting and , it follows that fully determines , and and together completely determine all entries of . On recalling (3.7), we may thus rewrite the integral on the right hand side of (3.8) to obtain the bound
[TABLE]
where and
[TABLE]
Define
[TABLE]
and
[TABLE]
Also, write
[TABLE]
and note that, as a consequence of (1.4) and (3.6), one has
[TABLE]
Then, since and , it follows via Hölder’s inequality that
[TABLE]
Since is an even integer, it follows by standard orthogonality considerations that and count solutions to their respective associated systems of equations with degrees , with each solution being counted with a unimodular weight depending on . It thus follows from the triangle inequality that for . Using the fact that the coefficient matrices with are all diagonal, and recalling (2.2), we thus discern that
[TABLE]
By an analogous chain of reasoning, we derive from the definition (3.3) of and a consideration of the underlying system of equations the corresponding bound
[TABLE]
Thus, from (3.10), (3.11) and (3) we have
[TABLE]
At this stage in our argument, we discern from (3.9) that
[TABLE]
Recall that , where is defined by (3.7). Since the first minors of the coefficient matrices for are now diagonal, we deduce from (2.2) that
[TABLE]
Finally, on substituting (3.14) into (3.13) and recalling (1.4), we conclude that
[TABLE]
This completes the proof of the lemma. ∎
We now resume the practice of appending the suffix to the parameters , , and that we temporarily abandoned during the discussion of Lemmata 3.1 and 3.2. We assume, moreover, that and are integers with and
[TABLE]
In accordance with (3.6), we now fix the parameters by taking
[TABLE]
Hence, whenever , the quantity is an integer. With these natural numbers defined thus, we recall the definition of from (2.7). We are now equipped to provide an unconditional version of Theorem 1.3
Theorem 3.3**.**
Suppose that . Assume further that satisfy the relations
[TABLE]
for . Let be a non-negative integer. Then for any , one has
[TABLE]
Proof.
We apply Lemma 2.2 with , followed by Lemmata 2.3, 3.1 and 3.2. This shows that
[TABLE]
and the proof is complete upon reference to (1.9). ∎
We can now complete the proof of Theorem 1.3. To this end, we choose and for . With this choice of parameters the hypotheses of Theorem 3.3 are satisfied whenever and are in accordance with the conditions of Theorem 1.3, and moreover the conjectural bound is then tantamount to both (1.6) and (3.4). Thus, in the case the desired conclusion is an immediate consequence of the conclusion of Theorem 3.3, and for general values of it follows in like manner upon utilising the additional flexibility offered by Lemma 2.2.
4. The Hardy-Littlewood method
We can now initiate the derivation of Theorem 1.5 from the mean value estimate of Theorem 1.3. We shall prove the following rather more general result.
Theorem 4.1**.**
Suppose that . Suppose further that lie in the respective ranges
[TABLE]
and satisfy the divisibility conditions
[TABLE]
for . Assume, moreover, that
[TABLE]
Let be a non-negative integer, put and define via (3.15) for . Set , suppose that and write for the total degree of the system as usual. Then the asymptotic formula
[TABLE]
holds with . If, furthermore, the system (1.10) has non-singular solutions in as well as in the fields for all , then the constant is positive.
Note that Theorem 1.5 follows from the special case of Theorem 4.1 in which and for .
We make use of the notation introduced in §2, and recall in particular (2.6) and its sequel. From now on we will set and , so that . Also, we will assume throughout that , , , , satisfy the hypotheses of the statement of Theorem 4.1.
When is a measurable set, put
[TABLE]
Our Hardy–Littlewood dissection is defined as follows. When and are parameters with , we take the major arcs to be the union of the boxes
[TABLE]
with and . The corresponding set of minor arcs is defined by putting . Unless indicated otherwise, we fix and , and abbreviate to and to .
We require certain auxiliary functions in order to analyse the contribution of the major arcs . Write
[TABLE]
and recall that the argument of [11, Theorem 7.1] gives
[TABLE]
Further, set
[TABLE]
and recall from the arguments of [11, Theorem 7.3] the estimate
[TABLE]
We put
[TABLE]
Following the same convention regarding vector notation as we applied for in (2.6) and its sequel, we have . Then as a consequence of [11, Theorem 7.2], we find that when , one has
[TABLE]
Finally, define
[TABLE]
and
[TABLE]
where
[TABLE]
The preliminary conclusion of our major arcs analysis is summarised in the following lemma.
Lemma 4.2**.**
There is a positive number for which
[TABLE]
Proof.
Since , it follows from (4.7) that
[TABLE]
Furthermore, by a change of variables we see that . The conclusion of the lemma therefore follows from our choice . ∎
In order to address the contribution of the minor arcs, we need the following Weyl-type estimate.
Lemma 4.3**.**
Suppose that . There exists such that for each -tuple of distinct indices there exists an index with for which one has
[TABLE]
Proof.
This is the content of [3, Lemma 3.1]. Note that the minor arcs in our setting are a subset of the minor arcs defined in the context of that lemma. ∎
We now complete the analysis of the minor arcs for Theorem 4.1.
Lemma 4.4**.**
Assume the hypotheses of Theorem 4.1. Then there is a positive number for which .
Proof.
Given a measurable set , we write
[TABLE]
We begin by estimating the last exponential sums in the product (4.2) trivially, so that
[TABLE]
For and sufficiently small, let denote the set of for which . In view of (2.3), we can identify a subset of indices with for which
[TABLE]
Write for the submatrix of having columns indexed by . The condition that the coefficient matrices be highly non-singular implies that the submatrices of are also highly non-singular. Thus, by orthogonality, we see from the definition (1.8) of the mean value that
[TABLE]
Consider a fixed . If has been chosen sufficiently small, Lemma 4.3 ensures that we can find an index with such that . Thus we see that we have the inclusion , whence
[TABLE]
Now recall that . Note also that the hypotheses of Theorem 4.1 under which we are currently working permit the assumption of those of Theorem 3.3. Thus, upon combining the estimate (4.11) with Theorem 3.3, inserting (4.1) and recalling (3.15), we obtain the bound
[TABLE]
By substituting this estimate into (4.10), we obtain the conclusion of the lemma. ∎
Upon combining the results of Lemmata 4.2 and 4.4, we infer that for some one has the asymptotic formula
[TABLE]
This completes our analysis of the minor arcs.
5. Initial considerations for the singular series
It remains to show that the singular series and singular integral converge as tends to infinity. We now put
[TABLE]
In this notation, the system under consideration can be viewed as a superposition of Vinogradov systems with respective degrees , all missing the linear slice, and thus it follows from the definition (1.9) that the total degree of this system is
[TABLE]
Throughout this and the next section, we work under the assumption that .
We first attend to the singular series. Put
[TABLE]
By applying (2.3), we find that for some choice of distinct indices we have the asymptotic bound
[TABLE]
where
[TABLE]
Note that both and are multiplicative in . For this reason, the key to understanding the singular series is to maintain good control over the multiplicative quantity
[TABLE]
as runs over the prime powers.
Define by setting for , and write , so that . For consistency we also set . Now, adopting a notation similar to that of Section 2, when we write for the submatrices
[TABLE]
of the coefficient matrices consisting of the columns indexed by . Note that the hypothesis that each is highly non-singular ensures that the same is true for each . For and we set , and we employ the same conventions regarding vector notation as in (4.6) and also (2.6) and its sequel. Thus, we write and , so that
[TABLE]
In this notation, it follows from standard orthogonality relations that
[TABLE]
counts the number of solutions of the system of congruences
[TABLE]
where and .
Our first goal is to apply a procedure inspired by the proof of Theorem 2.1 in [3] in order to disentangle the congruences in (5.5). This will enable us to replace the sum by a related expression in which for all indices the degree in the exponential sum is replaced by . Since is typically smaller than , we will reap the rewards of this preparatory step when the reduced degrees allow us to exert greater control on the size of the exponential sums in question.
Given a -tuple of variables , we adopt the convention that for . Also, when is a coefficient vector, we abbreviate the vector to , and we appropriate the notation and to denote the corresponding subvectors whose entries are indexed by . The following observation will play a part in our ensuing arguments.
Lemma 5.1**.**
Let , and be natural numbers, with . Suppose that and are fixed integers, and put
[TABLE]
Then for any fixed integers we have
[TABLE]
Proof.
By standard orthogonality relations, the sum
[TABLE]
counts solutions of the system of congruences
[TABLE]
where each solution is counted with a unimodular weight depending on the inert variables , together with the coefficients and . Thus, by the triangle inequality, one finds that
[TABLE]
We therefore discern that is bounded above by the number of solutions of (5.7) counted without weights, and hence by the number of solutions of the system of congruences
[TABLE]
We interpret the latter as the number of solutions of the system
[TABLE]
with and for . Thus, by orthogonality and the triangle inequality, one sees that
[TABLE]
The conclusion of the lemma is now immediate from (5.6). ∎
We now define
[TABLE]
The crucial bound for our analysis of the singular series is contained in the following lemma.
Lemma 5.2**.**
Let be a natural number, and suppose that the matrices are all highly non-singular. Then there exists a finite set of primes and a natural number , both depending at most on the coefficient matrices and in the latter case also , with the property that
[TABLE]
The constant is bounded above uniformly in , and one can take whenever for all .
Proof.
Recall that counts the number of solutions of the system of congruences (5.5) for and . Since is a multiplicative function of , it is apparent that it suffices to establish the conclusion of the lemma in the special case in which is a prime power, say for a given prime . By applying suitable elementary row operations within the coefficient matrices for that are invertible over , we may suppose without loss of generality that each coefficient matrix is in upper row echelon form. This operation corresponds to taking appropriate linear combinations of the congruences comprising (5.5). Here, we stress that the property that each is highly non-singular implies that the first submatrix of is now upper triangular. We denote this matrix by . Note that the power of dividing the diagonal entries of depends only on the first submatrices of the original coefficient matrices . In particular, by defining to be the set of all primes dividing any of the determinants of the latter submatrices, we ensure that when , then none of the diagonal entries of is divisible by .
We now employ an inductive argument in order to successively reduce the degrees of the exponential sums occurring within the mean value
[TABLE]
Observe that, as a result of our preparatory manipulations, the coefficient matrics with are upper triangular. Thus, the only exponential sum within the above formula for that depends on is the one involving . In order to save clutter, we temporarily drop the modulus in our exponential sums . We may thus write
[TABLE]
The inner sum is of the shape considered in Lemma 5.1 with . On writing (), we thus obtain the bound
[TABLE]
Now suppose that for some index with we have the bound
[TABLE]
where
[TABLE]
Again, since we may assume all coefficient matrices to be in upper row echelon form, the only exponential sum within the mean value defining that depends on the vector is the one involving . Thus, as in the case considered above, we may isolate the exponential sum indexed by and apply Lemma 5.1. As a result, we find that
[TABLE]
Inserting this bound into (5.8) reproduces (5.8) with replaced by . We may clearly iterate, and after steps we find that
[TABLE]
Clearly, the vectors with together list the coordinates of . Since is multiplicative, the assertion of the lemma is now confirmed upon taking to be the multiplicative function defined via the formula
[TABLE]
In particular, we note that depends at most on the coefficient matrices , and one has whenever . ∎
6. Conclusion of the major arcs analysis
With Lemma 5.2 we are now equipped to engage with our goal of showing that the singular series converges absolutely. In this context, for each prime number we define the -adic factor
[TABLE]
Lemma 6.1**.**
Suppose that the coefficient matrices associated with the system (1.10) are highly non-singular, and that for . Also, assume that . Then the -adic densities exist, the singular series is absolutely convergent, and . In particular, one has for some . Moreover, if the system (1.10) has a non-singular -adic solution for all primes , then .
Proof.
On recalling (4.8) and (5.1), we see that , and so the estimation of the quantity is our central focus. The multiplicativity of allows us to restrict our attention to the cases where is a prime power. Set and . If the product
[TABLE]
converges absolutely as , then so does with the same limit. In such circumstances, one has . It is therefore sufficient to show that for all primes the limit
[TABLE]
exists, and moreover that there exists a positive number having the property that for all but at most a finite set of primes .
On recalling (5.2), we find from (4.4) that
[TABLE]
The invertibility of the coordinate transform (5.4) implies that when , then there is at least one index with such that , with an implied constant depending at most on the coefficient matrices . Since and may be taken arbitrarily small, we deduce that there is a positive number , depending at most on the coefficient matrices , having the property that
[TABLE]
We now wish to apply Lemma 5.2. To this end, we first recall (5.3) and observe that a summation by parts yields the relation
[TABLE]
Since all coefficients on the right hand side are positive, and also both and are non-negative for all non-negative integers , it follows from Lemma 5.2 that we may majorise the right hand side of (6.3) by replacing with for . Set , noting that this maximum exists as is an integer which is bounded uniformly for all non-negative integers . Also, in analogy to the definition of , we put
[TABLE]
Thus, another summation by parts shows that the right hand side of (6.3) is no larger than
[TABLE]
We have therefore established the bound
[TABLE]
Since for all , we can infer further from (4.4) that there exists a positive number , depending at most on , such that
[TABLE]
For a fixed vector denote by the number of vectors satisfying and for . Then one has
[TABLE]
where the sum is over all vectors satisfying and having the property that for at least one index . For any fixed , the number of choices for having and is at most . It follows that
[TABLE]
and hence
[TABLE]
On recalling (6.2), (6.4) and (6.5) we find that
[TABLE]
It follows that the -adic density defined in (6.1) exists. In particular, whenever we have
[TABLE]
for some positive number depending at most on the coefficient matrices . On recalling the conclusion of Lemma 5.2, one sees that for all primes with , and thus
[TABLE]
Hence, the singular series converges absolutely and one has .
Furthermore, a standard argument yields
[TABLE]
where denotes the number of solutions of the congruences
[TABLE]
corresponding to the equations (1.10). Using again the observation that for all sufficiently large primes , we discern from (6.6) that there exists an integer with the property that
[TABLE]
For the remaining finite set of primes, a standard application of Hensel’s lemma shows that whenever the system (1.10) possesses a non-singular solution in . We thus conclude that under the hypotheses of the lemma we have as claimed. ∎
We next demonstrate the existence of the limit
[TABLE]
With this goal in mind, when is a positive real number, we introduce the auxiliary mean value
[TABLE]
Lemma 6.2**.**
Under the hypotheses of Theorem 4.1, there is a positive number for which one has , and hence the limit exists. In particular, one has
[TABLE]
Furthermore, if the system (1.10) has a non-singular solution inside the real unit cube , then the singular integral is positive.
Proof.
The first part of the proof is inspired by a singular series argument of Heath-Brown and Skorobogatov (see [9, pages 173 and 174]). Recall that
[TABLE]
where the integers are defined by means of (3.15). Thus, the hypotheses of Theorem 4.1 imply that . Let denote the set of -element subsets of . When , define
[TABLE]
and
[TABLE]
Set , and define the major arcs via (4.3). By making the necessary modifications to our initial analysis of the major arcs, we see from (4.9) that for any one has
[TABLE]
Note that we have for the term corresponding to in (6.7). Since all other summands are non-negative, it follows that for any and any , one has
[TABLE]
On the other hand, for the major arcs are disjoint, and we conclude from Theorem 3.3 that under the hypotheses of Theorem 4.1 we have
[TABLE]
In combination with (6.8) and (6.9) it follows that
[TABLE]
Since is a power of , we discern from (4.5) and (6.10) via (2.3) that for any we have
[TABLE]
Here, we exploited the fact that, since the coefficient matrices are highly non-singular, the condition implies that for some index with . This implies the first statement of the lemma. In particular, the singular integral converges absolutely.
In order to establish the second claim, we follow an argument of Schmidt [10]. When , define
[TABLE]
and recall that
[TABLE]
where the integral converges absolutely. Set
[TABLE]
and put
[TABLE]
We adapt the argument of §11 in Schmidt’s work [10] to show that as .
Set
[TABLE]
Then in light of (6.11) a change of the order of integration shows that
[TABLE]
and hence
[TABLE]
In order to analyse the integral on the right hand side of (6.12), it is convenient to consider two domains separately. Write , and set . From the power series expansion of we find that
[TABLE]
whence we discern that the domain contributes at most
[TABLE]
Note that in the last step we used our previous insight that the singular integral converges absolutely. Meanwhile, the contribution from is bounded above by
[TABLE]
for some positive number with , where again we took advantage of our earlier findings. Thus we infer from (6.12) that
[TABLE]
for all , and hence does indeed converge to , as claimed.
Suppose now that the system (1.10) has a non-singular solution inside . Then it follows from the implicit function theorem that the real manifold described by the equations in (1.10) has positive -dimensional volume inside . In such circumstances, Lemma 2 of Schmidt [10] shows that uniformly in . We therefore deduce from (6.13) that is indeed positive, confirming the second claim of the lemma. ∎
Upon combining (4.12) with Lemmata 6.1 and 6.2, we conclude that
[TABLE]
where . Moreover, the constant is positive whenever the system (1.10) possesses non-singular solutions in all local fields. This confirms the asymptotic formula (1.11), and completes our proof of Theorem 4.1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Bourgain, C. Demeter and L. Guth, Proof of the main conjecture in Vinogradov’s mean value theorem for degrees higher than three , Ann. of Math. (2) 184 (2016), no. 2, 633–682.
- 2[2] J. Brandes, The Hasse principle for systems of quadratic and cubic diagonal equations , Q. J. Math. 68 (2017), no. 3, 831–850.
- 3[3] J. Brandes and S. T. Parsell, Simultaneous additive equations: repeated and differing degrees , Canad. J. Math. 69 (2017), no. 2, 258–283.
- 4[4] J. Brandes and T. D. Wooley, Vinogradov systems with a slice off , Mathematika 63 (2017), no. 3, 797–817.
- 5[5] J. Brüdern and T. D. Wooley, The Hasse principle for pairs of diagonal cubic forms , Ann. of Math. (2) 166 (2007), no. 3, 865–895.
- 6[6] J. Brüdern and T. D. Wooley, The Hasse principle for systems of diagonal cubic forms , Math. Ann. 364 (2016), no. 3-4, 1255–1274.
- 7[7] R. J. Cook, Simultaneous quadratic equations , J. London Math. Soc. (2) 4 (1971), 319–326.
- 8[8] R. J. Cook, Simultaneous quadratic equations II , Acta Arith. 25 (1973/74), 1–5.
