Optimal mean value estimates beyond Vinogradov's mean value theorem

Julia Brandes; Trevor D. Wooley

arXiv:1901.03153·math.NT·August 21, 2020

TL;DR

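This paper establishes improved mean value estimates for certain systems of diagonal equations not of Vinogradov type, in some instances attaining the sharpest conjectured bounds. As a consequence, it proves the Hasse principle for systems of $v$ cubic and $u$ quadratic diagonal equations in $6v+4u+1$ variables whenever $u \geq 3v$, attaining the convexity barrier for this problem.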

Contribution

It introduces the first bounds of this quality for non-Vinogradov type Diophantine systems and proves the Hasse principle in new parameter ranges.

Findings

01

Attained sharpest conjectured bounds for certain Diophantine systems.

02

Established the Hasse principle for systems of $v$ cubic and $u$ quadratic diagonal equations in $6v+4u+1$ variables whenever $u \geq 3v$.

03

Achieved the convexity barrier for these problems.

Abstract

We establish improved mean value estimates associated with the number of integer solutions of certain systems of diagonal equations, in some instances attaining the sharpest conjectured conclusions. This is the first occasion on which bounds of this quality have been attained for Diophantine systems not of Vinogradov type. As a consequence of this progress, whenever $u \geq 3 v$ we obtain the Hasse principle for systems consisting of $v$ cubic and $u$ quadratic diagonal equations in $6 v + 4 u + 1$ variables, thus attaining the convexity barrier for this problem.

Equations

c_{i,1}^{(3)} x_{1}^{3} + \dots + c_{i,s}^{(3)} x_{s}^{3}

c_{j,1}^{(2)} x_{1}^{2} + \dots + c_{j,s}^{(2)} x_{s}^{2}

C^{(2)} = \bigl(c_{i,j}^{(2)}\bigr)_{\substack{1 \leqslant i \leqslant u \\ 1 \leqslant j \leqslant s}} \quad\text{and}\quad C^{(3)} = \bigl(c_{i,j}^{(3)}\bigr)_{\substack{1 \leqslant i \leqslant v \\ 1 \leqslant j \leqslant s}}

{\mathcal{N}}_{s,v,u}(X) = {\mathcal{C}} X^{s-3v-2u} + O(X^{s-3v-2u-\delta}).

c_{i,1}^{(l)} (x_{1}^{l} - y_{1}^{l}) + \dots + c_{i,s}^{(l)} (x_{s}^{l} - y_{s}^{l}) = 0 \quad (1 \leqslant i \leqslant r_{l},\ 1 \leqslant l \leqslant k),

C^{(l)} = \bigl(c_{i,j}^{(l)}\bigr)_{\substack{1 \leqslant i \leqslant r_{l} \\ 1 \leqslant j \leqslant s}}

r_{k} = v, \quad r_{k-1} = \dots = r_{2} = u, \quad r_{1} = 0.

K = \tfrac{1}{2} k(k-1) u + k v - u,

\sum_{i=1}^{\sigma-2} (x_{i}^{j} - y_{i}^{j}) + j (z_{1}^{j-1} h_{1} + z_{2}^{j-1} h_{2}) = 0 \quad (1 \leqslant j \leqslant l).

M_{l}^{*}(X) \ll X^{\frac{1}{2} l(l+1) + \varepsilon}.

I_{s,k}^{v,u}(X) \ll X^{\varepsilon} (X^{s} + X^{2s-K}).

r_{1} = 0, \quad r_{2} = w_{n} + u_{0}, \quad r_{l} = \begin{cases} w_{n} & \text{when } 3 \leqslant l < k_{n}, \\ w_{j} & \text{when } k_{j+1} < l < k_{j}, \\ w_{j-1} + v_{j} & \text{when } l = k_{j} \text{ for some } j. \end{cases}

I_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X) = I_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X; C^{(2)}, \dots, C^{(k)})

K = \sum_{l=2}^{k} l r_{l} = \sum_{j=1}^{n} K_{j} + 2u_{0},

K_{j} = \tfrac{1}{2} k_{j}(k_{j}-1) u_{j} + k_{j} v_{j} - u_{j}.

I_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X) \ll X^{\varepsilon} (X^{s} + X^{2s-K}).

I_{s,3}^{v,u}(X) \ll X^{\varepsilon} (X^{s} + X^{2s-3v-2u}).

s \geqslant \sum_{j=1}^{n} \Bigl( v_{j} \frac{k_{j}(k_{j}+1)}{2} + (u_{j} - v_{j}) \frac{k_{j}(k_{j}-1)}{2} \Bigr) + 2u_{0} = K + w_{n}.

s \ \cdots\ = K - \sum_{j=1}^{n} \bigl( (k_{j}-2) u_{j} + v_{j} \bigr).

c_{i,1}^{(l)} x_{1}^{l} + \dots + c_{i,s}^{(l)} x_{s}^{l} = 0 \quad (1 \leqslant i \leqslant r_{l},\ 2 \leqslant l \leqslant k)

N_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X) = ({\mathcal{C}} + o(1)) X^{s-K},

s \geqslant \sum_{j=1}^{n} \bigl( v_{j} k_{j}(k_{j}+1) + (u_{j} - v_{j}) k_{j}(k_{j}-1) \bigr) + 4u_{0} + 1 = 2K + 2w_{n} + 1

\oint G({\bm{\alpha}}) \,{\rm d}{\bm{\alpha}} = \int_{[0,1)^{n}} G({\bm{\alpha}}) \,{\rm d}{\bm{\alpha}}.

K_{l}({\bm{\alpha}}; Z, H) = \sum_{|h| \leqslant H} \sum_{|z| \leqslant Z} e\bigl(h \alpha^{(1)} + 2hz \alpha^{(2)} + \dots + l h z^{l-1} \alpha^{(l)}\bigr),

f_{l}({\bm{\alpha}}; X) = \sum_{|x| \leqslant X} e\bigl(\alpha^{(1)} x + \alpha^{(2)} x^{2} + \dots + \alpha^{(l)} x^{l}\bigr).

J_{s,l}(X) = \oint |f_{l}({\bm{\alpha}}; X)|^{2s} \,{\rm d}{\bm{\alpha}}.

J_{\sigma,l}(X) \ll X^{\varepsilon} (X^{\sigma} + X^{2\sigma - l(l+1)/2}).

|a_{1} \cdots a_{n}| \leqslant |a_{1}|^{n} + \dots + |a_{n}|^{n},

\oint |f_{2}({\bm{\alpha}}; X) K_{2}({\bm{\alpha}}; Z, H)|^{2} \,{\rm d}{\bm{\alpha}} \ll (XHZ)^{\varepsilon} (HZ^{2} + XZ^{2} + XZH).

x_{1}^{2} - x_{2}^{2}

Full text

Optimal mean value estimates beyond Vinogradov’s mean value theorem

Julia Brandes

Mathematical Sciences, University of Gothenburg and Chalmers Institute of Technology, 412 96 Göteborg, Sweden

and

Trevor D. Wooley

Department of Mathematics, Purdue University, 150 N. University Street, West Lafayette, IN 47907-2067, USA

Abstract.

We establish improved mean value estimates associated with the number of integer solutions of certain systems of diagonal equations, in some instances attaining the sharpest conjectured conclusions. This is the first occasion on which bounds of this quality have been attained for Diophantine systems not of Vinogradov type. As a consequence of this progress, whenever $u\geqslant 3v$ we obtain the Hasse principle for systems consisting of $v$ cubic and $u$ quadratic diagonal equations in $6v+4u+1$ variables, thus attaining the convexity barrier for this problem.

Key words and phrases:

Exponential sums, Hardy–Littlewood method

2010 Mathematics Subject Classification:

11L15, 11D45, 11L07, 11P55

1. Introduction

In recent years, our understanding of systems of diagonal equations and their associated mean values has advanced rapidly. Whilst only a few years ago, such mean values had been comprehensively understood only in the most basic cases, the resolution of the main conjecture associated with Vinogradov’s mean value theorem by the second author [13, 14] and Bourgain, Demeter and Guth [1] has transformed the landscape. It now seems feasible to address the challenge of establishing similarly strong results for a much wider class of cognate problems.

In this memoir, we make progress towards, and in certain cases attain, the convexity barrier for a family of mean values associated with systems of equations that fail to be translation-dilation invariant and thus lie outside the scope of the efficient congruencing and $\ell^{2}$-decoupling methods developed by the second author [13, 14] and Bourgain, Demeter and Guth [1]. The most accessible of our results addresses systems of cubic and quadratic diagonal equations. Let ${\mathcal{N}}_{s,v,u}(X)$ denote the number of integral solutions ${\mathbf{x}}\in[-X,X]^{s}$ of the system of equations

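$$\begin{aligned} c_{i,1}^{(3)}x_{1}^{3}+\dots+c_{i,s}^{(3)}x_{s}^{3}&=0\quad(1\leqslant i\leqslant v),\\ c_{j,1}^{(2)}x_{1}^{2}+\dots+c_{j,s}^{(2)}x_{s}^{2}&=0\quad(1\leqslant j\leqslant u),\end{aligned}\tag{1.1}$$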

consisting of $u$ quadratic and $v$ cubic equations of diagonal shape. Here and throughout we assume the coefficients $c_{i,j}^{(k)}$ of such systems to be integral. It is clear that the presence of coefficients in such systems necessitates some kind of non-singularity condition, lest the equations interact in some non-generic way. We refer to an $r\times s$ matrix $C$ as highly non-singular if $s\geqslant r$ and any collection of $r$ distinct columns of $C$ forms a non-singular matrix.
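As a small illustration of this definition (not part of the original argument), the following Python sketch tests whether an integer matrix is highly non-singular by checking every choice of $r$ columns. The helper names `exact_det` and `is_highly_nonsingular` are hypothetical, and the brute-force enumeration is only practical for small matrices with at least one row.

```python
from fractions import Fraction
from itertools import combinations


def exact_det(matrix):
    """Determinant of a square matrix, computed exactly by Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in matrix]
    n = len(m)
    det = Fraction(1)
    for c in range(n):
        # Find a row at or below the diagonal with a non-zero entry in column c.
        pivot = next((r for r in range(c, n) if m[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            det = -det  # a row swap flips the sign
        det *= m[c][c]
        for r in range(c + 1, n):
            factor = m[r][c] / m[c][c]
            for k in range(c, n):
                m[r][k] -= factor * m[c][k]
    return det


def is_highly_nonsingular(C):
    """True if s >= r and every r x r submatrix formed from r distinct columns is non-singular."""
    r, s = len(C), len(C[0])
    if s < r:
        return False
    return all(
        exact_det([[row[j] for j in cols] for row in C]) != 0
        for cols in combinations(range(s), r)
    )
```

For instance, `is_highly_nonsingular([[1, 1, 1], [1, 2, 3]])` holds because all three $2\times 2$ minors are non-zero, whereas replacing the second row by `[1, 1, 3]` makes the first two columns proportional and the test fails.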

Our first result shows that ${\mathcal{N}}_{s,v,u}(X)$ satisfies the anticipated asymptotic formula for all sets of coefficients in general position, provided that $s\geqslant 6v+4u+1$ and $u\geqslant 3v$. This achieves the conjectured convexity barrier.

Theorem 1.1.

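Suppose that $u\geqslant 3v$ and that $s\geqslant 6v+4u+1$. Assume further that the coefficient matrices

$$C^{(2)}=\bigl(c_{i,j}^{(2)}\bigr)_{\substack{1\leqslant i\leqslant u\\ 1\leqslant j\leqslant s}}\quad\text{and}\quad C^{(3)}=\bigl(c_{i,j}^{(3)}\bigr)_{\substack{1\leqslant i\leqslant v\\ 1\leqslant j\leqslant s}}$$

are highly non-singular. Then there exist constants ${\mathcal{C}}\geqslant 0$ and $\delta>0$ such that

$${\mathcal{N}}_{s,v,u}(X)={\mathcal{C}}X^{s-3v-2u}+O(X^{s-3v-2u-\delta}).\tag{1.2}$$

Moreover, if the system (1.1) has non-singular real and $p$-adic solutions for all primes $p$, then ${\mathcal{C}}>0$.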

In general, asymptotic formulæ like the one supplied by (1.2) are expected to hold whenever the number of variables exceeds twice the total degree of the system. However, thus far the validity of such an asymptotic formula has been proved only in a few isolated instances. Arguably the first non-trivial case in which this convexity barrier was achieved occurs in work of Cook [7, 8] concerning pairs and triples of diagonal quadratic equations. Recent work of Brüdern and the second author [5, 6] obtains asymptotic lower bounds at the convexity limit for systems of diagonal cubic forms. In the case of mixed systems of cubic and quadratic equations, work of the second author underlying [12, Theorem 1.2] achieves the convexity limit in the case $u=v=1$ with $s\geqslant 11$ relating to systems consisting of one cubic and one quadratic diagonal equation. Most recently, investigations of the first author joint with Parsell [3, Theorem 1.4] establish an asymptotic formula tantamount to (1.2) for systems of $v$ cubic and $u$ quadratic diagonal equations, though under the more restrictive hypothesis that $s\geqslant\lfloor 20v/3\rfloor+4u+1$ , thus missing the convexity barrier whenever $v\geqslant 2$ . In subsequent work [2], the first author proved that an asymptotic formula of the shape (1.2) holds when $v\geqslant 2u$ and $s\geqslant 6v+\lfloor 14u/3\rfloor+1$ , which misses the convexity barrier when $u\geqslant 2$ . Thus, Theorem 1.1 provides the first instance where bounds of the expected quality have been achieved for systems of $v$ cubic and $u$ quadratic equations in settings where both $u$ and $v$ exceed $1$ .
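The comparison of thresholds in the preceding paragraph can be tabulated with a short Python sketch. The function names are hypothetical labels for the three bounds on $s$ quoted above; the sketch only encodes the stated inequalities and says nothing about the side conditions ($u\geqslant 3v$ for Theorem 1.1, $v\geqslant 2u$ for [2]) under which each result applies.

```python
def convexity_threshold(v, u):
    """Smallest s permitted in Theorem 1.1: one more than twice the total degree 3v + 2u."""
    return 6 * v + 4 * u + 1


def brandes_parsell_threshold(v, u):
    """Bound quoted from [3, Theorem 1.4]: s >= floor(20v/3) + 4u + 1."""
    return (20 * v) // 3 + 4 * u + 1


def brandes_threshold(v, u):
    """Bound quoted from [2]: s >= 6v + floor(14u/3) + 1."""
    return 6 * v + (14 * u) // 3 + 1


def excess_over_convexity(v, u):
    """How many extra variables each earlier bound needs compared with 6v + 4u + 1."""
    base = convexity_threshold(v, u)
    return (brandes_parsell_threshold(v, u) - base, brandes_threshold(v, u) - base)
```

For $u=v=1$ all three thresholds equal $11$, in agreement with the case treated in [12]; for $v\geqslant 2$ the first excess is positive, and for $u\geqslant 2$ the second is.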

Theorem 1.1 is in fact a special case of our more general Theorem 1.5 below. Both of these results rest on our new estimates for certain mean values of Vinogradov type. In their most general form, such mean values encode the number of integral solutions of systems of the general shape

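$$c_{i,1}^{(l)}(x_{1}^{l}-y_{1}^{l})+\dots+c_{i,s}^{(l)}(x_{s}^{l}-y_{s}^{l})=0\quad(1\leqslant i\leqslant r_{l},\ 1\leqslant l\leqslant k),\tag{1.3}$$

in which $r_{1},\dots,r_{k}$ are non-negative integers and the coefficients $c_{i,j}^{(l)}$ are integers. When all of the coefficient matrices

$$C^{(l)}=\bigl(c_{i,j}^{(l)}\bigr)_{\substack{1\leqslant i\leqslant r_{l}\\ 1\leqslant j\leqslant s}}$$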

are highly non-singular, then the main conjecture states that the number of integral solutions ${\mathbf{x}},{\mathbf{y}}\in[-X,X]^{s}$ of the system (1.3) should be at most of order $X^{s+\varepsilon}+X^{2s-K}$ , for any $\varepsilon>0$ , where $K=r_{1}+2r_{2}+\ldots+kr_{k}$ denotes the system’s total degree. A corresponding lower bound, with $\varepsilon=0$ , is provided by an argument akin to that delivering [11, equation (7.4)]. Systems of the shape (1.3) have previously been studied by the first author together with Parsell [3], where it was shown that the main conjecture for such systems holds when $r_{l}\geqslant r_{l+1}$ for all $1\leqslant l\leqslant k-1$ . In the latter circumstances, the system (1.3) can be viewed as a superposition of Vinogradov systems of various degrees (see Theorem 2.1 and Corollary 2.2 in that paper). In wider generality, bounds of the strength of those described in Theorems 1.1 and 1.5 were known hitherto only for systems of quadratic equations and systems of Vinogradov type, as well as superpositions of these two special classes of systems.

The goal of the work at hand is to enlarge the range of systems of type (1.3) for which the main conjecture is known to hold. When the coefficient matrices $C^{(l)}$ are highly non-singular for $2\leqslant l\leqslant k$ , we denote by $I_{s,k}^{v,u}(X)=I_{s,k}^{v,u}(X;C^{(2)},\ldots,C^{(k)})$ the number of integral solutions ${\mathbf{x}},{\mathbf{y}}\in[-X,X]^{s}$ of the system (1.3), where

$$r_{k}=v,\quad r_{k-1}=\dots=r_{2}=u,\quad r_{1}=0.$$

Write further

$$K=\tfrac{1}{2}k(k-1)u+kv-u,\tag{1.4}$$

so that $K$ denotes the total degree of the system.

In order to describe our new results concerning the mean value $I_{s,k}^{v,u}(X)$ , we need to consider certain auxiliary systems of equations. Let $l\geqslant 2$ be an integer and write ${\sigma}=\frac{1}{2}l(l+1)$ . Then, given a positive number $X$ , we denote by $M^{*}_{l}(X)$ the number of integer tuples ${\mathbf{x}},{\mathbf{y}}\in[-X,X]^{{\sigma}-2}$ and ${\mathbf{z}},{\mathbf{h}}\in[-X,X]^{2}$ satisfying

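$$\sum_{i=1}^{\sigma-2}(x_{i}^{j}-y_{i}^{j})+j(z_{1}^{j-1}h_{1}+z_{2}^{j-1}h_{2})=0\quad(1\leqslant j\leqslant l).\tag{1.5}$$

The main conjecture for systems of the shape (1.5) claims that

$$M_{l}^{*}(X)\ll X^{\frac{1}{2}l(l+1)+\varepsilon}.\tag{1.6}$$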

Our first main result is as follows.

Theorem 1.2.

Suppose that $k\geqslant 3$, $v\geqslant 1$ and $u\geqslant 2v$ are integers with $u\mid kv$, and assume (1.6) for $l=k-1$. Then for any $s\geqslant u$ and any $\varepsilon>0$ we have

$$I_{s,k}^{v,u}(X)\ll X^{\varepsilon}(X^{s}+X^{2s-K}).$$

By combining the ideas of the proof of Theorem 1.2 with those underlying [3, Theorem 2.1], we can extend our results to cover also superpositions of systems of equations of the kind considered in Theorem 1.2. Fix a collection of degrees $k_{1}>k_{2}>\ldots>k_{n}\geqslant 3$ with associated multiplicities $v_{1},\ldots,v_{n}\in\mathbb{N}$ . Moreover, fix a tuple $u_{0},u_{1},\ldots,u_{n}$ of non-negative integers with $u_{i}\geqslant v_{i}$ for $1\leqslant i\leqslant n$ , set $k=k_{1}$ , and define $w_{0}=0$ and $w_{i}=u_{1}+\ldots+u_{i}$ for $1\leqslant i\leqslant n$ . Now define the parameter $r_{l}$ by putting

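$$r_{1}=0,\quad r_{2}=w_{n}+u_{0},\quad r_{l}=\begin{cases}w_{n}&\text{when }3\leqslant l<k_{n},\\ w_{j}&\text{when }k_{j+1}<l<k_{j},\\ w_{j-1}+v_{j}&\text{when }l=k_{j}\text{ for some }j.\end{cases}\tag{1.7}$$

We denote by

$$I_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)=I_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X;C^{(2)},\dots,C^{(k)})$$

the number of integer solutions ${\mathbf{x}},{\mathbf{y}}\in[-X,X]^{s}$ of the system (1.3) with ${\mathbf{r}}$ defined as in (1.7). These systems can be viewed as superpositions of systems of the shape considered in Theorem 1.2 with parameters $(k_{j},v_{j},u_{j})$, together with $u_{0}$ additional quadratic equations. Here, the total degree is given by

$$K=\sum_{l=2}^{k}lr_{l}=\sum_{j=1}^{n}K_{j}+2u_{0},$$

where, in accordance with (1.4), we write

$$K_{j}=\tfrac{1}{2}k_{j}(k_{j}-1)u_{j}+k_{j}v_{j}-u_{j}.$$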
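The bookkeeping behind the parameters $r_{l}$ and the total degree $K$ can be checked mechanically. The Python sketch below is an illustration only: the function names are hypothetical, the degrees are passed as a strictly decreasing list `ks` $=(k_{1},\dots,k_{n})$ with $k_{n}\geqslant 3$, and no attempt is made to validate the hypotheses $u_{j}\geqslant v_{j}$. It builds $r_{l}$ from the case distinction in (1.7) and computes $K$ both as $\sum_{l}lr_{l}$ and as $\sum_{j}K_{j}+2u_{0}$.

```python
def r_vector(ks, vs, us, u0):
    """Return {l: r_l} for 1 <= l <= k_1, following the case distinction in (1.7)."""
    n = len(ks)
    w = [0]
    for u in us:
        w.append(w[-1] + u)  # w[i] = u_1 + ... + u_i, with w[0] = 0
    r = {1: 0, 2: w[n] + u0}
    for l in range(3, ks[0] + 1):
        above = sum(1 for kj in ks if kj > l)  # number of degrees k_j exceeding l
        if l in ks:
            # l = k_j with j = above + 1, so r_l = w_{j-1} + v_j
            r[l] = w[above] + vs[above]
        else:
            # k_{j+1} < l < k_j with j = above (or l < k_n when above = n)
            r[l] = w[above]
    return r


def total_degree_from_r(r):
    """K as the weighted sum of l * r_l."""
    return sum(l * rl for l, rl in r.items())


def total_degree_from_blocks(ks, vs, us, u0):
    """K as the sum of the K_j plus 2 * u0."""
    return sum(k * (k - 1) * u // 2 + k * v - u for k, v, u in zip(ks, vs, us)) + 2 * u0
```

With `ks=[3]`, `vs=[v]`, `us=[3*v]` and `u0=u-3*v` one recovers $r_{2}=u$, $r_{3}=v$ and $K=3v+2u$, which are the parameters of the cubic and quadratic systems discussed at the start of this section.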

In this notation, we have the following generalisation of Theorem 1.2.

Theorem 1.3.

Let $u_{0}$ be a non-negative integer. Suppose that $k_{1}>\ldots>k_{n}\geqslant 3$, and let $u_{1},\ldots,u_{n}$ as well as $v_{1},\ldots,v_{n}$ be natural numbers with $u_{j}\geqslant 2v_{j}$ and $u_{j}\mid k_{j}v_{j}$ for $1\leqslant j\leqslant n$. Also, assume (1.6) for all degrees $l=k_{j}-1$ with $1\leqslant j\leqslant n$. Then for $s\geqslant r_{2}$ and any $\varepsilon>0$ we have

$$I_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)\ll X^{\varepsilon}(X^{s}+X^{2s-K}).$$

We also have an alternative, unconditional formulation of this result, which is given in Theorem 3.3 below.

To illustrate the strength of our results in Theorems 1.2 and 1.3, we discuss in more detail some of the most relevant special cases. Among the systems of diagonal equations not of Vinogradov type, the most well-studied ones are systems of cubic equations and systems of cubic and quadratic equations, such as we considered in our motivating example in Theorem 1.1. Regarding such systems, it is immediate from work of the second author [12, Theorem 1.1] that for every $\varepsilon>0$ one has $I_{5,3}^{1,1}(X)\ll X^{31/6+\varepsilon}$ , and this bound implies via [3, Theorem 2.1] that $I_{3+2u,3}^{1,u}(X)\ll X^{3+2u+1/6+\varepsilon}$ for all $u\geqslant 1$ . Theorem 1.3 now allows us to improve this result.

Corollary 1.4.

Suppose that $v\geqslant 1$ and $s\geqslant u\geqslant 3v$. Then for any $\varepsilon>0$ we have

$$I_{s,3}^{v,u}(X)\ll X^{\varepsilon}(X^{s}+X^{2s-3v-2u}).$$

This follows from Theorem 1.3 in combination with Lemma 2.1 below. Corollary 1.4 represents only the second occasion, after the second author’s successful treatment of the cubic case of Vinogradov’s mean value theorem [13], that the convexity barrier has been attained for a system of diagonal equations involving cubic equations. In particular, we now have the main conjecture for mean values that correspond to systems consisting of one cubic and three quadratic diagonal equations. This is the main new input that enables us to prove Theorem 1.1.

Our results complement older ones that can be obtained by other means. On the one hand, it follows from Theorem 2.1 and Corollary 2.2 of the first author’s work with Parsell [3] in combination with Vinogradov’s mean value theorem [1, Theorem 1.1] that the conclusion of Theorem 1.3 holds unconditionally in the range

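$$s\geqslant\sum_{j=1}^{n}\Bigl(v_{j}\frac{k_{j}(k_{j}+1)}{2}+(u_{j}-v_{j})\frac{k_{j}(k_{j}-1)}{2}\Bigr)+2u_{0}=K+w_{n}.$$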

On the other hand, for small $s$ the second author’s result [14, Corollary 1.2] can be combined with the arguments of [3, Theorem 2.1] to establish the conclusion of Theorem 1.3 unconditionally in the range

[TABLE]

Mean value estimates like those of Theorems 1.2 and 1.3 have long been employed to establish asymptotic formulæ for the number of solutions of simultaneous diagonal equations. For ${\mathbf{r}}$ as in (1.7) and highly non-singular coefficient matrices $C^{(l)}$ $(2\leqslant l\leqslant k)$ , denote by $N_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)$ the number of integral solutions of the system of equations

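$$c_{i,1}^{(l)}x_{1}^{l}+\dots+c_{i,s}^{(l)}x_{s}^{l}=0\quad(1\leqslant i\leqslant r_{l},\ 2\leqslant l\leqslant k)\tag{1.10}$$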

with $|x_{j}|\leqslant X$ for $1\leqslant j\leqslant s$ . It is well known that, if $s$ is sufficiently large in terms of ${\mathbf{k}}$ , ${\mathbf{v}}$ and ${\mathbf{u}}$ , there is an asymptotic formula of the shape

$$N_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)=({\mathcal{C}}+o(1))X^{s-K},\tag{1.11}$$

where ${\mathcal{C}}$ is a non-negative constant encoding the local solubility data for the system (1.10). The relevant question is how large $s$ has to be for an asymptotic formula like that of (1.11) to hold. Theorem 1.1 of [3] provides a bound for $s$ that is somewhat unwieldy, but can likely be reduced to

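$$s\geqslant\sum_{j=1}^{n}\bigl(v_{j}k_{j}(k_{j}+1)+(u_{j}-v_{j})k_{j}(k_{j}-1)\bigr)+4u_{0}+1=2K+2w_{n}+1$$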

by accounting for our revised treatment of the major arcs described in §§5–6 below. On the other hand, unless fundamentally new methods become available that avoid the use of mean values, we cannot expect to be able to establish such asymptotic formulæ when $s\leqslant 2K$ . Thanks to our new mean value estimates in Theorem 1.3, we are now able to make progress towards this theoretical barrier.

Theorem 1.5.

Let $u_{0}$ be a non-negative integer. Suppose that $k_{1}>\ldots>k_{n}\geqslant 3$, and let $u_{1},\ldots,u_{n}$ as well as $v_{1},\ldots,v_{n}$ be natural numbers with $u_{j}\geqslant 2v_{j}$ and $u_{j}\mid k_{j}v_{j}$ for $1\leqslant j\leqslant n$. Also, assume (1.6) for all degrees $l=k_{j}-1$ with $1\leqslant j\leqslant n$. Then for $s\geqslant 2K+1$ the asymptotic formula (1.11) holds with ${\mathcal{C}}\geqslant 0$. If, furthermore, the system (1.10) has non-singular solutions in $\mathbb{R}$ as well as in the fields $\mathbb{Q}_{p}$ for all $p$, then the constant ${\mathcal{C}}$ is positive.

Again, we refer to Theorem 4.1 below for an unconditional version of this result. Moreover, we note that in Lemma 2.1 below it is shown that the bound (1.6) holds for $l=2$ , and thus Theorem 1.1 can be deduced as a special case of Theorem 1.5, corresponding to the parameters $k=3$ and $u_{0}=u-3v$ .

The proofs of our results rest on an idea that played a crucial role in the second author’s work on pairs of quadratic and cubic diagonal equations [12], and which has been explored further in the authors’ recent work on incomplete Vinogradov systems [4]. In these papers, the missing linear equation is artificially added in, which makes it possible to exploit the strong bounds on Vinogradov’s mean value theorem. By taking advantage of the translation-dilation invariance of the newly completed Vinogradov systems, we then relate these systems to the auxiliary mean values $M^{*}_{l}(X)$ introduced above. Whilst our understanding of these auxiliary mean values remains unsatisfactory for general degree, the quantity $M^{*}_{2}(X)$ may be comprehensively understood in terms of quadratic Vinogradov systems. This observation plays a pivotal role in our argument, and it is the main reason why we attain the convexity barrier in Theorem 1.1 and Corollary 1.4.

Notation. Throughout, the letters $s$ , $u$ , $v$ , and $k$ , as well as the entries of the vectors ${\mathbf{k}}$ , ${\mathbf{u}}$ , ${\mathbf{v}}$ , and ${\mathbf{r}}$ , will denote non-negative integers. The letter $\varepsilon$ will be used to denote an arbitrary, but sufficiently small positive number, and we adopt the convention that whenever it appears in a statement, we assert that the statement holds for all sufficiently small $\varepsilon>0$ . We take $X$ to be a large positive number which, just like the implicit constants in the notations of Landau and Vinogradov, is permitted to depend at most on $s$ , ${\mathbf{k}}$ , ${\mathbf{v}}$ , ${\mathbf{u}}$ , the coefficient matrices $C^{(l)}$ $(2\leqslant l\leqslant k)$ , and $\varepsilon$ . We employ the non-standard notation that when $G:[0,1)^{n}\rightarrow\mathbb{C}$ is integrable for some $n\in\mathbb{N}$ , then

$$\oint G({\bm{\alpha}})\,{\rm d}{\bm{\alpha}}=\int_{[0,1)^{n}}G({\bm{\alpha}})\,{\rm d}{\bm{\alpha}}.$$

Here and elsewhere, we use vector notation liberally in a manner that is easily discerned from the context. In particular, when ${\mathbf{b}}$ denotes the integer tuple $(b_{1},\dots,b_{n})$ , we write $(q,{\mathbf{b}})=\gcd(q,b_{1},\dots,b_{n})$ .
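For readers who wish to experiment numerically, the convention $(q,{\mathbf{b}})=\gcd(q,b_{1},\dots,b_{n})$ can be expressed as a one-line fold. The function name `gcd_with_tuple` is a hypothetical helper introduced only for this illustration.

```python
from functools import reduce
from math import gcd


def gcd_with_tuple(q, b):
    """Return gcd(q, b_1, ..., b_n) for an integer q and an integer tuple b."""
    # Fold gcd over the entries of b, starting from q; an empty tuple returns q itself.
    return reduce(gcd, b, q)
```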

Acknowledgements. Both authors thank the Fields Institute in Toronto for excellent working conditions and support that made this work possible during the Thematic Program on Unlikely Intersections, Heights, and Efficient Congruencing. This work was further facilitated by subsequent visits of the first author to the University of Bristol, and of the second author to the University of Waterloo. The authors gratefully acknowledge the hospitality of both institutions. The work of both authors was supported by the National Science Foundation under Grant No. DMS-1440140 while they were in residence at the Mathematical Sciences Research Institute in Berkeley, California, during the Spring 2017 semester. The first author’s work was supported in part by Starting Grant 2017-05110 from Vetenskapsrådet. The second author’s work was supported by a European Research Council Advanced Grant under the European Union’s Horizon 2020 research and innovation programme via grant agreement No. 695223, and in the final stages by the National Science Foundation via Grant No. DMS-1854398 and DMS-2001549.

The authors are also very grateful to Scott Parsell for pointing out an oversight in an earlier version of this paper, which necessitated a fundamental re-write of parts of the argument.

2. Preliminaries and preparatory steps

Our goal in this and the next section is the proof of Theorem 1.3. Before delving into the core of the argument, we pause to introduce some notation and establish a mean value estimate that will be of use in our subsequent discussion. For $2\leqslant l\leqslant k$ we define the exponential sum $K_{l}({\bm{\alpha}};Z,H)$ by putting

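$$K_{l}({\bm{\alpha}};Z,H)=\sum_{|h|\leqslant H}\sum_{|z|\leqslant Z}e\bigl(h\alpha^{(1)}+2hz\alpha^{(2)}+\dots+lhz^{l-1}\alpha^{(l)}\bigr),$$

and we write

$$f_{l}({\bm{\alpha}};X)=\sum_{|x|\leqslant X}e\bigl(\alpha^{(1)}x+\alpha^{(2)}x^{2}+\dots+\alpha^{(l)}x^{l}\bigr).$$

Then, with the standard notation associated with Vinogradov’s mean value theorem in mind, we put

$$J_{s,l}(X)=\oint|f_{l}({\bm{\alpha}};X)|^{2s}\,{\rm d}{\bm{\alpha}}.$$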

We note that the main conjecture associated with Vinogradov’s mean value theorem is now known to hold for all degrees. This is classical when $l=2$ , it is a consequence of work of the second author [13] for degree $l=3$ , and for degrees exceeding three it follows from the work of Bourgain, Demeter and Guth, and of the second author (see [1, Theorem 1.1] and [14, Corollary 1.3]). Thus, for all ${\sigma}>0$ one has

$$J_{\sigma,l}(X)\ll X^{\varepsilon}(X^{\sigma}+X^{2\sigma-l(l+1)/2}).$$
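By orthogonality, for integral $s$ the mean value $J_{s,l}(X)$ counts integer solutions of $x_{1}^{j}+\dots+x_{s}^{j}=y_{1}^{j}+\dots+y_{s}^{j}$ $(1\leqslant j\leqslant l)$ with $|x_{i}|,|y_{i}|\leqslant X$. The following Python sketch (hypothetical function names, brute force, only usable for very small parameters) computes this count by grouping tuples according to their power sums, and records the exponent appearing in the bound above.

```python
from collections import Counter
from itertools import product


def vinogradov_count(s, l, X):
    """Count pairs (x, y) in [-X, X]^s x [-X, X]^s with equal power sums of degrees 1..l."""
    power_sums = Counter()
    for xs in product(range(-X, X + 1), repeat=s):
        key = tuple(sum(x ** j for x in xs) for j in range(1, l + 1))
        power_sums[key] += 1
    # Each class of c tuples sharing the same power sums contributes c * c pairs.
    return sum(c * c for c in power_sums.values())


def main_conjecture_exponent(sigma, l):
    """Exponent max(sigma, 2*sigma - l(l+1)/2) from the bound for J_{sigma,l}(X), ignoring epsilon."""
    return max(sigma, 2 * sigma - l * (l + 1) // 2)
```

When $s=l=2$, the sum and the sum of squares determine the pair $\{x_{1},x_{2}\}$ up to order, so the count is $2N^{2}-N$ with $N=2X+1$; this gives a convenient sanity check on the code.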

For future reference, we record the trivial inequality

$$|a_{1}\cdots a_{n}|\leqslant|a_{1}|^{n}+\dots+|a_{n}|^{n},\tag{2.3}$$

which is valid for all $a_{1},\ldots,a_{n}\in\mathbb{C}$ .
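The inequality holds because $|a_{1}\cdots a_{n}|\leqslant\max_{i}|a_{i}|^{n}$. A quick randomised check in Python, offered only as a sanity test with hypothetical helper names, reads as follows.

```python
import random


def product_modulus(a):
    """|a_1 ... a_n| for a sequence of complex numbers."""
    p = 1.0
    for z in a:
        p *= abs(z)
    return p


def power_sum(a):
    """|a_1|^n + ... + |a_n|^n, where n is the length of the sequence."""
    n = len(a)
    return sum(abs(z) ** n for z in a)


def check_trivial_inequality(trials, seed):
    """Test the inequality on random complex tuples of length 1 to 6."""
    rng = random.Random(seed)
    for _ in range(trials):
        n = rng.randint(1, 6)
        a = [complex(rng.uniform(-5, 5), rng.uniform(-5, 5)) for _ in range(n)]
        # A small relative tolerance absorbs floating-point rounding.
        if product_modulus(a) > power_sum(a) * (1 + 1e-12) + 1e-12:
            return False
    return True
```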

We begin by bounding the mean value $M^{*}_{2}(X)$ .

Lemma 2.1.

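Let $X$, $Z$ and $H$ be large real numbers. Then one has

$$\oint|f_{2}({\bm{\alpha}};X)K_{2}({\bm{\alpha}};Z,H)|^{2}\,{\rm d}{\bm{\alpha}}\ll(XHZ)^{\varepsilon}(HZ^{2}+XZ^{2}+XZH).\tag{2.4}$$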

Proof.

Upon considering the underlying system of equations, we see that the mean value on the left hand side of (2.4) is given by the number of integer solutions of the system of equations

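$$\begin{aligned}x_{1}^{2}-x_{2}^{2}&=2(h_{1}z_{1}-h_{2}z_{2}),\\ x_{1}-x_{2}&=h_{1}-h_{2},\end{aligned}\tag{2.5}$$

with $|x_{i}|\leqslant X$, $|h_{i}|\leqslant H$ and $|z_{i}|\leqslant Z$ for $i=1,2$. The second of these equations permits the substitution $h_{2}=h_{1}-x_{1}+x_{2}$ into the first, whence

$$(x_{1}-x_{2})(x_{1}+x_{2}-2z_{2})=2h_{1}(z_{1}-z_{2}).$$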

Suppose first that $h_{1}(z_{1}-z_{2})$ is non-zero. Then for each of the $O(HZ^{2})$ possible choices for $h_{1}$, $z_{1}$ and $z_{2}$ fixing the latter integer in such a manner, an elementary divisor function estimate shows there to be $O((HZ)^{\varepsilon})$ possible choices for the integers $x_{1}-x_{2}$ and $x_{1}+x_{2}-2z_{2}$, and hence also for $x_{1}$ and $x_{2}$. These choices also fix $h_{2}=h_{1}-x_{1}+x_{2}$, so we see that there are $O(H^{1+\varepsilon}Z^{2+\varepsilon})$ solutions of this first type. Meanwhile, if $h_{1}(z_{1}-z_{2})=0$, then $h_{1}=0$ or $z_{1}=z_{2}$, and at the same time either $x_{1}=x_{2}$ or $x_{1}=2z_{2}-x_{2}$. In any case, therefore, each of the $O(XZ)$ possible choices for $z_{2}$ and $x_{2}$ determines $x_{1}$ and either $h_{1}$ or $z_{1}$. Since there are $O(Z+H)$ possible choices left by this constraint for the latter, and $h_{2}$ is again fixed by these choices just as before, we find that there are $O(XZ(Z+H))$ solutions of this second type. The conclusion of the lemma follows by summing the contributions from both types of solutions. ∎
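The substitution step in this proof can be tested by brute force. The Python sketch below assumes that the underlying system takes the form $x_{1}^{2}-x_{2}^{2}=2(h_{1}z_{1}-h_{2}z_{2})$, $x_{1}-x_{2}=h_{1}-h_{2}$; this form is reconstructed from the substitution $h_{2}=h_{1}-x_{1}+x_{2}$ and the divisors $x_{1}-x_{2}$ and $x_{1}+x_{2}-2z_{2}$ named in the proof, and should be read as an assumption. Both function names are hypothetical. The first routine counts solutions of the assumed system directly, the second counts solutions of the factored equation $(x_{1}-x_{2})(x_{1}+x_{2}-2z_{2})=2h_{1}(z_{1}-z_{2})$ after eliminating $h_{2}$.

```python
from itertools import product


def count_direct(X, H, Z):
    """Count solutions of the assumed system over all six variables."""
    xs, hs, zs = range(-X, X + 1), range(-H, H + 1), range(-Z, Z + 1)
    total = 0
    for x1, x2, h1, h2, z1, z2 in product(xs, xs, hs, hs, zs, zs):
        if x1 - x2 == h1 - h2 and x1 * x1 - x2 * x2 == 2 * (h1 * z1 - h2 * z2):
            total += 1
    return total


def count_factored(X, H, Z):
    """Count solutions after substituting h2 = h1 - x1 + x2 into the quadratic equation."""
    xs, hs, zs = range(-X, X + 1), range(-H, H + 1), range(-Z, Z + 1)
    total = 0
    for x1, x2, h1, z1, z2 in product(xs, xs, hs, zs, zs):
        h2 = h1 - x1 + x2
        if abs(h2) > H:
            continue  # the eliminated variable must still lie in its box
        if (x1 - x2) * (x1 + x2 - 2 * z2) == 2 * h1 * (z1 - z2):
            total += 1
    return total
```

The two counts agree for every choice of box sizes, which is exactly the content of the substitution; the lemma then bounds the factored count by splitting according to whether $h_{1}(z_{1}-z_{2})$ vanishes.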

Upon taking $X=H=Z$ in Lemma 2.1, we conclude that $M_{2}^{*}(X)\ll X^{3+\varepsilon}$, which establishes (1.6) for $l=2$. We remark also that the system (2.5) can be interpreted as being of Vinogradov shape of degree two by means of the substitution $h_{i}=u_{i}-v_{i}$ and $z_{i}=u_{i}+v_{i}$ for $i=1,2$. Viewed in this way, Lemma 2.1 amounts to no more than a rephrasing of the classical elementary proof of the quadratic case in Vinogradov’s mean value theorem.
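For small $X$ the quantity $M_{2}^{*}(X)$ can be computed directly from its definition. When $l=2$ one has $\sigma=3$, so the system (1.5) reads $x-y+h_{1}+h_{2}=0$ and $x^{2}-y^{2}+2(z_{1}h_{1}+z_{2}h_{2})=0$ in the six variables $x,y,z_{1},z_{2},h_{1},h_{2}\in[-X,X]$. The Python sketch below (with the hypothetical name `m2_star`) enumerates these solutions; it is a numerical illustration of the order of magnitude $X^{3}$ and not a substitute for the proof.

```python
from itertools import product


def m2_star(X):
    """Count integer solutions of the system (1.5) with l = 2 in the box [-X, X]^6."""
    box = range(-X, X + 1)
    total = 0
    for x, h1, h2, z1, z2 in product(box, repeat=5):
        y = x + h1 + h2  # forced by the linear equation
        if abs(y) > X:
            continue
        if x * x - y * y + 2 * (z1 * h1 + z2 * h2) == 0:
            total += 1
    return total
```

The solutions with $h_{1}=h_{2}=0$ and $y=x$ already contribute $(2X+1)^{3}$ to the count, so the exponent $3$ cannot be lowered.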

We now initiate the proof of Theorem 1.3, assuming the hypotheses of its statement. For $l\geqslant 2$ , let

[TABLE]

Define ${\bm{\alpha}}^{(l)}=({\alpha}_{i}^{(l)})_{1\leqslant i\leqslant r_{l}}$ for $2\leqslant l\leqslant k$ . When $1\leqslant i\leqslant r_{2}$ , write ${\bm{\alpha}}_{i}=({\alpha}_{i}^{(l)})$ , where $l$ runs over all values for which $r_{l}\geqslant i$ . We then put

[TABLE]

Also, set $\bm{\gamma}_{j}=({\gamma}_{j}^{(l)})_{2\leqslant l\leqslant k}$ for $1\leqslant j\leqslant s$ and $\bm{\gamma}^{(l)}=({\gamma}_{j}^{(l)})_{1\leqslant j\leqslant s}$ for $2\leqslant l\leqslant k$ , and put $\bm{\gamma}=(\bm{\gamma}_{1},\dots,\bm{\gamma}_{s})=(\bm{\gamma}^{(2)},\dots,\bm{\gamma}^{(k)})^{T}$ . Then by orthogonality we have

[TABLE]

Set $t_{0}=2$ , and for a set of positive integers $t_{1},\ldots,t_{n}$ to be fixed later take

[TABLE]

Thus, on recalling (1.7), we see in particular that

[TABLE]

Further, let ${\mathcal{I}}$ denote the set of all integral $r_{2}$ -tuples

[TABLE]

with pairwise distinct entries $j_{m,h}\in\{1,\ldots,s\}$ , and put

[TABLE]

We can bound $I_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)$ in terms of ${\mathcal{G}}_{{\mathbf{t}},{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)$ . In particular, this will allow us to concentrate on the case when $s=s_{0}$ .

Lemma 2.2.

For any fixed choice of the positive integers $t_{1},\ldots,t_{n}$ , we have the bounds

[TABLE]

Proof.

When $s>s_{0}$ , the trivial bound $g_{k}(\bm{\gamma}_{j};X)=O(X)$ delivers the estimate

[TABLE]

and the conclusion of the lemma follows in this case from (2.3). Suppose now that $r_{2}\leqslant s\leqslant s_{0}$ . Then from (2.3) and an application of Hölder’s inequality, we find that

[TABLE]

Thus the lemma is established in both cases. ∎

Suppose that the maximum in (2.8) is assumed at the tuple ${\mathbf{j}}\in{\mathcal{I}}$ , which we consider fixed for the remainder of the analysis. For $2\leqslant l\leqslant k$ and $1\leqslant i\leqslant r_{l}$ , set $d_{i,w_{m-1}+h}^{(l)}=c_{i,j_{m,h}}^{(l)}$ when $1\leqslant h\leqslant u_{m}$ and $1\leqslant m\leqslant n$ , and likewise $d_{i,w_{n}+h}^{(l)}=c_{i,j_{0,h}}^{(l)}$ when $1\leqslant h\leqslant u_{0}$ . We then have the coefficient matrices

[TABLE]

We define ${\delta}_{i}^{(l)}$ via the relations $\bm{\delta}^{(l)}=(D^{(l)})^{T}{\bm{\alpha}}^{(l)}$ for $2\leqslant l\leqslant k$ , and put $\bm{\delta}_{j}=({\delta}_{j}^{(l)})_{2\leqslant l\leqslant k}$ for $1\leqslant j\leqslant r_{2}$ . Here, we employ notational conventions analogous to those described in the sequel to (2.6).

Write

[TABLE]

Thus, in the case $n=1$ and $u_{0}=0$ , we have ${\mathcal{G}}_{{\mathbf{t}},k}^{v,u}(X)=G_{t,k}^{v,u}(X)$ .

Lemma 2.3.

One has

[TABLE]

Proof.

Recall (2.7) and (2.8). For temporary notational convenience, we put $u_{n+1}=u_{0}$ . Then, after possibly relabelling indices, we see from (2.3) that

[TABLE]

The desired conclusion now follows by essentially the same argument as in [3, Theorem 2.1]. Recall that the coefficient matrices $C^{(l)}$ are highly non-singular. Consequently, the matrices $D^{(l)}$ underlying the mean value in (2.8) inherit that property. Upon considering the underlying Diophantine equations and applying elementary row operations, we may thus assume without loss of generality that the first $r_{l}\times r_{l}$ submatrix of each matrix $D^{(l)}$ is diagonal.

Recall the definition of the parameter $r_{l}$ from (1.7). Since ${\mathbf{j}}\in{\mathcal{I}}$ has $r_{2}$ entries, we see that the matrix $D^{(2)}$ is of square format and hence diagonal. Thus we have ${\delta}_{i}^{(2)}=d_{i,i}^{(2)}{\alpha}_{i}^{(2)}$ for $1\leqslant i\leqslant w_{n}+u_{0}$ . In particular, the entries of $\bm{\delta}_{i}$ with $1\leqslant i\leqslant w_{n}$ are independent of all the variables ${\alpha}_{w_{n}+i}^{(2)}$ with $1\leqslant i\leqslant u_{0}$ . We may therefore interpret ${\bm{\alpha}}$ as the ordered pair $({\bm{\alpha}}_{n+1}^{\dagger},{\bm{\alpha}}_{n+1}^{*})$ with ${\bm{\alpha}}_{n+1}^{\dagger}=({\bm{\alpha}}_{i})_{1\leqslant i\leqslant w_{n}}$ and ${\bm{\alpha}}_{n+1}^{*}=({\alpha}_{w_{n}+i}^{(2)})_{1\leqslant i\leqslant u_{0}}$ . In this notation we can write

[TABLE]

where

[TABLE]

and

[TABLE]

The latter mean value counts integer solutions ${\mathbf{x}},{\mathbf{y}}\in[-X,X]^{2u_{0}}$ of the system

[TABLE]

where each solution is counted with a unimodular weight depending on ${\bm{\alpha}}_{n+1}^{\dagger}$ . It then follows from the triangle inequality and Hua’s lemma that

[TABLE]

We now iterate this procedure for $j=n,n-1,\ldots,1$ . For each index $j$ , we see from (1.7) that $r_{l}>w_{j-1}$ only when $l\leqslant k_{j}$ . Moreover, we have $r_{l}\geqslant w_{j}$ if $l\leqslant k_{j}-1$ , and $r_{l}=w_{j-1}+v_{j}$ if $l=k_{j}$ . Since we had arranged for the first $r_{l}\times r_{l}$ submatrices of each $D^{(l)}$ to be diagonal, it follows that the entries of $\bm{\delta}_{i}$ with $1\leqslant i\leqslant w_{j-1}$ are independent of all the variables ${\alpha}_{w_{j-1}+i}^{(l)}$ with $1\leqslant i\leqslant u_{j}$ and $2\leqslant l\leqslant k_{j}-1$ , and also of all ${\alpha}_{w_{j-1}+i}^{(k_{j})}$ with $1\leqslant i\leqslant v_{j}$ . Together, these latter groups of variables form the vectors ${\bm{\alpha}}_{i}$ with $w_{j-1}+1\leqslant i\leqslant w_{j}$ . Hence by a similar argument to that encountered before, we can write ${\bm{\alpha}}_{j+1}^{\dagger}=({\bm{\alpha}}_{j}^{\dagger},{\bm{\alpha}}_{j}^{*})$ , where ${\bm{\alpha}}_{j}^{\dagger}=({\bm{\alpha}}_{i})_{1\leqslant i\leqslant w_{j-1}}$ and ${\bm{\alpha}}_{j}^{*}=({\bm{\alpha}}_{w_{j-1}+i})_{1\leqslant i\leqslant u_{j}}$ , noting in particular that the vector ${\bm{\alpha}}_{1}^{\dagger}$ is empty. For $2\leqslant j\leqslant n$ put

[TABLE]

and take ${\mathcal{F}}_{1}({\bm{\alpha}}_{1}^{\dagger})=1$ . Also, let

[TABLE]

Note that this mean value counts integer solutions $|{\mathbf{x}}|,|{\mathbf{y}}|\leqslant X$ to the system

[TABLE]

where each solution is counted with a unimodular weight depending on ${\bm{\alpha}}_{j}^{\dagger}$ . An application of the triangle inequality shows that $G_{j}({\bm{\alpha}}_{j}^{\dagger};X)\leqslant G_{j}(\bm{0};X)=G_{t_{j},k_{j}}^{v_{j},u_{j}}(X)$ . We thus deduce that for $1\leqslant j\leqslant n$ we have

[TABLE]

and upon iterating we find that

[TABLE]

The conclusion of the lemma follows upon combining this bound with (2.9), (2.10) and (2.11). ∎

3. The underlying mean value

From Lemma 2.3 it is clear that the desired bound ${\mathcal{G}}_{{\mathbf{t}},{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)\ll X^{K+\varepsilon}$ will follow if we can show that $G_{t_{j},k_{j}}^{v_{j},u_{j}}(X)\ll X^{K_{j}+\varepsilon}$ for $j=1,\ldots,n$ . We thus proceed to establish the latter bound. In the discussion of Lemmata 3.1 and 3.2 that follows, it is expedient to drop all mention of the indices $j$ with $1\leqslant j\leqslant n$ . Note also that in this situation, we have $r_{k}=v$ and $r_{l}=u$ for $2\leqslant l\leqslant k-1$ . We introduce variables ${\bm{\alpha}}^{(1)}\in[0,1)^{u}$ and define $D^{(1)}$ to be the $u\times u$ identity matrix. Set further $\bm{\delta}^{(1)}={\bm{\alpha}}^{(1)}$ , and extend our previous notational conventions surrounding the vector $\bm{\delta}$ so as to incorporate $\bm{\delta}^{(1)}$ in the natural manner.

Next, we define

[TABLE]

We begin by establishing the bound contained in the following lemma.

Lemma 3.1.

One has $G_{t,k}^{v,u}(X)\ll X^{-u}H_{t,k}^{v,u}(X)$ .

Proof.

Define ${\omega}_{l}$ to be 1 when $l=1$ , and $0$ otherwise. We decompose the set $\{1,\ldots,tu\}$ into the blocks ${\mathcal{B}}_{m}=\{(m-1)t+1,\ldots,mt\}$ for $1\leqslant m\leqslant u$ . The mean value $G_{t,k}^{v,u}(X)$ counts the number of integral solutions of the system of equations

[TABLE]

where

[TABLE]

with $-X\leqslant x_{i},y_{i}\leqslant X$ for $1\leqslant i\leqslant tu$ and $|h_{j}|\leqslant 2tX$ for $1\leqslant j\leqslant u$ . Observe that in our current situation all coefficient matrices $D^{(l)}$ with $1\leqslant l\leqslant k-1$ are of format $u\times u$ . Just as in the proof of Lemma 2.3, we can therefore assume without loss of generality that the coefficients $d_{j,m}^{(l)}$ with $1\leqslant l\leqslant k-1$ vanish except when $j=m$ . Also, note that the constraints on the expressions $\xi_{m}^{(1)}$ for $1\leqslant m\leqslant u$ imposed by the linear equations in (3.2) are void, since the ranges for the new variables $h_{j}$ automatically accommodate all possible values for $\xi_{m}^{(1)}$ within (3.2).

We now consider the effect of shifting every variable with index in a given block ${\mathcal{B}}_{m}$ by an integer $z_{m}$ with $|z_{m}|\leqslant X$ . By the binomial theorem, for any family of shifts ${\mathbf{z}}$ , one finds that $({\mathbf{x}},{\mathbf{y}})$ is a solution of (3.2) if and only if it is also a solution of the system

[TABLE]

where

[TABLE]
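The role of the linear equations in this shifting argument can be checked on small examples. The following sketch uses hypothetical helper names and a single block of variables, which is a simplification of the system (3.2). It illustrates that a solution of the power-sum equations of degrees $1,\ldots,k$ remains a solution after a common shift, whereas invariance may fail when the linear equation is absent.

```python
def power_sums(values, degrees):
    """Return the power sums sum(v**j) for each degree j in `degrees`."""
    return [sum(v ** j for v in values) for j in degrees]


def is_solution(x, y, degrees):
    """True when x and y have equal power sums in every listed degree."""
    return power_sums(x, degrees) == power_sums(y, degrees)


def shift(values, z):
    """Shift every variable by the same integer z."""
    return [v - z for v in values]


# Degrees 1 and 2 together: the solution survives every shift.
x, y = [1, 5, 6], [2, 3, 7]
invariant = all(is_solution(shift(x, z), shift(y, z), [1, 2]) for z in range(-10, 11))

# Degree 2 alone: 1 + 49 = 25 + 25, but the shifted tuples no longer agree.
broken = not is_solution(shift([1, 7], 1), shift([5, 5], 1), [2])
```

Here both `invariant` and `broken` evaluate to `True`, which is consistent with the decision to adjoin the linear slice before shifting.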

Thus, for each fixed integer $u$ -tuple ${\mathbf{z}}$ with $|z_{m}|\leqslant X$ ( $1\leqslant m\leqslant u$ ), the mean value $G_{t,k}^{v,u}(X)$ is bounded above by the number of integral solutions of the system

[TABLE]

with $|{\mathbf{v}}|,|{\mathbf{w}}|\leqslant 2X$ and $|{\mathbf{h}}|\leqslant 2tX$ . On applying orthogonality and averaging over all possible choices for ${\mathbf{z}}$ , we therefore infer that

[TABLE]

where

[TABLE]

The proof of the lemma is completed by reference to (2.1) and (3.1). ∎

We can now turn to the task of estimating $H_{t,k}^{v,u}(X)$ . We will do this in somewhat wider generality than is required for the proofs of Theorems 1.3 and 1.5. This will allow us to prove the unconditional results adumbrated in the introduction.

When $l\geqslant 2$ is an integer and ${\sigma}\geqslant m\geqslant 0$ , denote by $M_{l,{\sigma},m}(X)$ the mean value

[TABLE]

When ${\sigma}$ and $m$ are integers, this mean value counts the number of integer tuples ${\mathbf{x}},{\mathbf{y}}\in[-X,X]^{{\sigma}-m}$ and ${\mathbf{z}},{\mathbf{h}}\in[-X,X]^{m}$ satisfying

[TABLE]

In particular, we have $M_{l}^{*}(X)=M_{l,\frac{1}{2}l(l+1),2}(X)$ . The main conjecture for mean values of the shape (3.3) states that for all ${\sigma}\geqslant m$ one should have

[TABLE]

Note that the case when $m=0$ corresponds to Vinogradov’s mean value theorem, and in this case the bound (3.4) is known (see equation (2.2) above).
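For orientation, the Vinogradov case $m=0$ can be explored by brute force for very small parameters. The sketch below is an illustration under stated assumptions, not the mean value (3.3) itself: it counts solutions of $x_{1}^{j}+\ldots+x_{s}^{j}=y_{1}^{j}+\ldots+y_{s}^{j}$ for $1\leqslant j\leqslant k$ with variables in $[1,X]$ rather than $[-X,X]$, and the function name is ours.

```python
from itertools import product


def vinogradov_count(s, k, X):
    """Count pairs (x, y) in [1, X]^s x [1, X]^s with equal power sums of degrees 1..k."""
    frequency = {}
    for x in product(range(1, X + 1), repeat=s):
        key = tuple(sum(v ** j for v in x) for j in range(1, k + 1))
        frequency[key] = frequency.get(key, 0) + 1
    # Each value of the power-sum vector attained c times contributes c^2 pairs.
    return sum(c * c for c in frequency.values())
```

When $s=k=2$ the two power sums determine the multiset $\{x_{1},x_{2}\}$, so the count equals $2X^{2}-X$; for instance `vinogradov_count(2, 2, 3)` returns 15.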

Suppose that ${\sigma}$ and $m$ are integers with $2\leqslant m\leqslant{\sigma}$ and

[TABLE]

We now choose

[TABLE]

so that at the critical point ${\sigma}=\frac{1}{2}k(k-1)$ we have $tu=K$ . Note also that by (3.5) as well as the definition of $K$ in (1.4), the quantity $t$ is indeed an integer whenever $u|kv$ .

Lemma 3.2.

Let ${\sigma}$ and $m$ be integers with $2\leqslant m\leqslant{\sigma}$ and satisfying the conditions $2|m$ and $m|({\sigma}-\frac{1}{2}k(k-1))$ . Assume also that $u\geqslant\frac{m}{m-1}v$ and $u|kv$ . Then we have

[TABLE]

Proof.

Set

[TABLE]

and

[TABLE]

Then it follows from (3.1) via (2.3) that, after possibly relabelling variables, we have

[TABLE]

Recall now that we had arranged for the coefficient matrices $D^{(1)},\ldots,D^{(k-1)}$ to be diagonal. Consequently, the variables $\bm{\delta}_{1},\ldots,\bm{\delta}_{v}$ are independent of those ${\alpha}_{i}^{(l)}$ having $1\leqslant l\leqslant k-1$ and $v+1\leqslant i\leqslant u$ . Then, by setting ${\bm{\eta}}_{1}=({\bm{\alpha}}_{i})_{1\leqslant i\leqslant v}$ and ${\bm{\eta}}_{2}=({\bm{\alpha}}_{i})_{v+1\leqslant i\leqslant u}$ , it follows that ${\bm{\eta}}_{1}$ fully determines $\bm{\delta}_{1},\ldots,\bm{\delta}_{v}$ , and ${\bm{\eta}}_{1}$ and ${\bm{\eta}}_{2}$ together completely determine all entries of $\bm{\delta}$ . On recalling (3.7), we may thus rewrite the integral on the right hand side of (3.8) to obtain the bound

[TABLE]

where ${\mathfrak{H}}_{1}({\bm{\eta}}_{1})={\mathfrak{G}}_{1}(\bm{\delta})$ and

[TABLE]

Define

[TABLE]

and

[TABLE]

Also, write

[TABLE]

and note that, as a consequence of (1.4) and (3.6), one has

[TABLE]

Then, since $m\geqslant 2$ and $u\geqslant\frac{m}{m-1}v$ , it follows via Hölder’s inequality that

[TABLE]

Since $m$ is an even integer, it follows by standard orthogonality considerations that $U_{1}({\bm{\eta}}_{1})$ and $U_{2}({\bm{\eta}}_{1})$ count solutions to their respective associated systems of equations with degrees $1,\ldots,k-1$ , with each solution being counted with a unimodular weight depending on ${\bm{\eta}}_{1}$ . It thus follows from the triangle inequality that $U_{i}({\bm{\eta}}_{1})\leqslant U_{i}(\bm{0})$ for $i=1,2$ . Using the fact that the coefficient matrices $D^{(l)}$ with $1\leqslant l\leqslant k-1$ are all diagonal, and recalling (2.2), we thus discern that

[TABLE]

By an analogous chain of reasoning, we derive from the definition (3.3) of $M_{l,{\sigma},m}(X)$ and a consideration of the underlying system of equations the corresponding bound

[TABLE]

Thus, from (3.10), (3.11) and (3) we have

[TABLE]

At this stage in our argument, we discern from (3.9) that

[TABLE]

Recall that ${\mathfrak{H}}_{1}({\bm{\eta}}_{1})={\mathfrak{G}}_{1}(\bm{\delta})$ , where ${\mathfrak{G}}_{1}(\bm{\delta})$ is defined by (3.7). Since the first $v\times v$ minors of the coefficient matrices $D^{(l)}$ for $1\leqslant l\leqslant k$ are now diagonal, we deduce from (2.2) that

[TABLE]

Finally, on substituting (3.14) into (3.13) and recalling (1.4), we conclude that

[TABLE]

This completes the proof of the lemma. ∎

We now resume the practice of appending the suffix $j$ to the parameters $k$ , $u$ , $v$ and $K$ that we temporarily abandoned during the discussion of Lemmata 3.1 and 3.2. We assume, moreover, that ${\sigma}_{j}$ and $m_{j}$ are integers with $2\leqslant m_{j}\leqslant{\sigma}_{j}$ and

[TABLE]

In accordance with (3.6), we now fix the parameters $t_{j}$ by taking

[TABLE]

Hence, whenever $u_{j}|k_{j}v_{j}$ , the quantity $t_{j}$ is an integer. With these natural numbers $t_{j}$ defined thus, we recall the definition of $s_{0}$ from (2.7). We are now equipped to provide an unconditional version of Theorem 1.3.

Theorem 3.3.

Suppose that $k_{1}>\ldots>k_{n}\geqslant 3$ . Assume further that ${\mathbf{u}},{\mathbf{v}},{\mathbf{m}},{\bm{{\sigma}}}\in\mathbb{N}^{n}$ satisfy the relations

[TABLE]

for $1\leqslant j\leqslant n$ . Let $u_{0}$ be a non-negative integer. Then for any $\varepsilon>0$ , one has

[TABLE]

Proof.

We apply Lemma 2.2 with $s=s_{0}$ , followed by Lemmata 2.3, 3.1 and 3.2. This shows that

[TABLE]

and the proof is complete upon reference to (1.9). ∎

We can now complete the proof of Theorem 1.3. To this end, we choose $m_{j}=2$ and ${\sigma}_{j}=\frac{1}{2}k_{j}(k_{j}-1)$ for $1\leqslant j\leqslant n$ . With this choice of parameters the hypotheses of Theorem 3.3 are satisfied whenever ${\mathbf{k}},{\mathbf{v}}$ and ${\mathbf{u}}$ are in accordance with the conditions of Theorem 1.3, and moreover the conjectural bound $M_{k_{j}-1,{\sigma}_{j},2}(2t_{j}X)\ll X^{{\sigma}_{j}+\varepsilon}$ is then tantamount to both (1.6) and (3.4). Thus, in the case $s=s_{0}$ the desired conclusion is an immediate consequence of the conclusion of Theorem 3.3, and for general values of $s$ it follows in like manner upon utilising the additional flexibility offered by Lemma 2.2.

4. The Hardy–Littlewood method

We can now initiate the derivation of Theorem 1.5 from the mean value estimate of Theorem 1.3. We shall prove the following rather more general result.

Theorem 4.1.

Suppose that $k_{1}>\ldots>k_{n}\geqslant 3$ . Suppose further that ${\mathbf{u}},{\mathbf{v}},{\mathbf{m}},{\bm{{\sigma}}}\in\mathbb{N}^{n}$ lie in the respective ranges

[TABLE]

and satisfy the divisibility conditions

[TABLE]

for $1\leqslant j\leqslant n$ . Assume, moreover, that

[TABLE]

Let $u_{0}$ be a non-negative integer, put $t_{0}=2$ and define $t_{j}$ via (3.15) for $1\leqslant j\leqslant n$ . Set $s_{0}=t_{0}u_{0}+\ldots+t_{n}u_{n}$ , suppose that $s\geqslant 2s_{0}+1$ and write $K=2u_{0}+K_{1}+\ldots+K_{n}$ for the total degree of the system as usual. Then the asymptotic formula

[TABLE]

holds with ${\mathcal{C}}\geqslant 0$ . If, furthermore, the system (1.10) has non-singular solutions in $\mathbb{R}$ as well as in the fields $\mathbb{Q}_{p}$ for all $p$ , then the constant ${\mathcal{C}}$ is positive.

Note that Theorem 1.5 follows from the special case of Theorem 4.1 in which $m_{j}=2$ and ${\sigma}_{j}=\frac{1}{2}k_{j}(k_{j}-1)$ for $1\leqslant j\leqslant n$ .
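The bookkeeping behind this special case can be made explicit. The sketch below rests on two assumptions that are not restated in this section: that the block degree has the form $K_{j}=k_{j}v_{j}+u_{j}\bigl(\tfrac{1}{2}k_{j}(k_{j}-1)-1\bigr)$, which is our reading of "total degree" for $v_{j}$ equations of degree $k_{j}$ and $u_{j}$ equations of each degree $2,\ldots,k_{j}-1$, and that at the critical point one has $t_{j}u_{j}=K_{j}$. The function names are hypothetical.

```python
def block_total_degree(k, v, u):
    """Assumed form of K_j: v equations of degree k, u equations of each degree 2..k-1."""
    return k * v + u * (k * (k - 1) // 2 - 1)


def critical_parameters(blocks, u0):
    """Return (t_values, s0, K) for blocks [(k_j, v_j, u_j), ...] and u0 extra quadratics.

    Uses t_0 = 2 and t_j = K_j / u_j, which is an integer exactly when u_j divides k_j v_j.
    """
    t_values = []
    s0 = 2 * u0  # contribution t_0 * u_0
    K = 2 * u0   # contribution of the u_0 additional quadratic equations
    for k, v, u in blocks:
        K_j = block_total_degree(k, v, u)
        if K_j % u != 0:
            raise ValueError("t_j is not an integer; need u_j | k_j v_j")
        t_values.append(K_j // u)
        s0 += K_j  # t_j * u_j = K_j at the critical point
        K += K_j
    return t_values, s0, K
```

For one cubic equation with three quadratics attached, `critical_parameters([(3, 1, 3)], 0)` gives $t_{1}=3$ and $s_{0}=K=9$, so that $2s_{0}+1=19=6v+4u+1$ with $v=1$ and $u=3$. Adding $u_{0}=2$ further quadratic equations gives $s_{0}=13$ and $2s_{0}+1=27$, again of the shape $6v+4u+1$ with $u=5$.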

We make use of the notation introduced in §2, and recall in particular (2.6) and its sequel. From now on we will set $r=r_{2}+\ldots+r_{k}$ and $w=u_{0}+\ldots+u_{n}$ , so that $w=r_{2}=w_{n}+u_{0}$ . Also, we will assume throughout that ${\mathbf{k}}$ , ${\mathbf{v}}$ , ${\mathbf{u}}$ , ${\bm{{\sigma}}}$ , ${\mathbf{m}}$ satisfy the hypotheses of the statement of Theorem 4.1.

When ${\mathfrak{B}}\subseteq[0,1)^{r}$ is a measurable set, put

[TABLE]

Our Hardy–Littlewood dissection is defined as follows. When $Y$ and $Q$ are parameters with $1\leqslant Q\leqslant Y$ , we take the major arcs ${\mathfrak{M}}_{Y}={\mathfrak{M}}_{Y}(Q)$ to be the union of the boxes

[TABLE]

with $0\leqslant{\mathbf{a}}\leqslant q\leqslant Q$ and $(q,{\mathbf{a}})=1$ . The corresponding set of minor arcs ${\mathfrak{m}}_{Y}={\mathfrak{m}}_{Y}(Q)$ is defined by putting ${\mathfrak{m}}_{Y}(Q)=[0,1)^{r}\setminus{\mathfrak{M}}_{Y}(Q)$ . Unless indicated otherwise, we fix $Y=X$ and $Q=X^{1/(6r)}$ , and abbreviate ${\mathfrak{M}}_{X}$ to ${\mathfrak{M}}$ and ${\mathfrak{m}}_{X}$ to ${\mathfrak{m}}$ .

We require certain auxiliary functions in order to analyse the contribution of the major arcs $N_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X;{\mathfrak{M}})$ . Write

[TABLE]

and recall that the argument of [11, Theorem 7.1] gives

[TABLE]

Further, set

[TABLE]

and recall from the arguments of [11, Theorem 7.3] the estimate

[TABLE]

We put

[TABLE]

Following the same convention regarding vector notation as we applied for $\bm{\gamma}$ in (2.6) and its sequel, we have ${\bm{\vartheta}}=\bm{\gamma}-{\bm{\Lambda}}/q$ . Then as a consequence of [11, Theorem 7.2], we find that when ${\bm{\alpha}}={\mathbf{a}}/q+{\bm{\beta}}\in{\mathfrak{M}}$ , one has

[TABLE]

Finally, define

[TABLE]

and

[TABLE]

where

[TABLE]

The preliminary conclusion of our major arcs analysis is summarised in the following lemma.

Lemma 4.2.

There is a positive number ${\omega}$ for which

[TABLE]

Proof.

Since $\operatorname{vol}({\mathfrak{M}})\ll Q^{2r+1}X^{-K}$ , it follows from (4.7) that

[TABLE]

Furthermore, by a change of variables we see that ${\mathfrak{J}}_{X}(Q)=X^{s-K}{\mathfrak{J}}_{1}(Q)$ . The conclusion of the lemma therefore follows from our choice $Q=X^{1/(6r)}$ . ∎

In order to address the contribution of the minor arcs, we need the following Weyl-type estimate.

Lemma 4.3.

Suppose that ${\bm{\alpha}}\in{\mathfrak{m}}$ . There exists $\tau>0$ such that for each $w$ -tuple $(j_{1},\ldots,j_{w})$ of distinct indices there exists an index $i$ with $1\leqslant i\leqslant w$ for which one has

[TABLE]

Proof.

This is the content of [3, Lemma 3.1]. Note that the minor arcs in our setting are a subset of the minor arcs defined in the context of that lemma. ∎

We now complete the analysis of the minor arcs for Theorem 4.1.

Lemma 4.4.

Assume the hypotheses of Theorem 4.1. Then there is a positive number ${\omega}$ for which $N_{s,{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X;{\mathfrak{m}})\ll X^{s-K-{\omega}}$ .

Proof.

Given a measurable set ${\mathfrak{B}}\subseteq[0,1)^{r}$ , we write

[TABLE]

We begin by estimating the last $s-(2s_{0}+1)$ exponential sums in the product (4.2) trivially, so that

[TABLE]

For $1\leqslant i\leqslant w$ and $\tau>0$ sufficiently small, let ${\mathfrak{m}}^{(i)}$ denote the set of ${\bm{\alpha}}\in[0,1)^{r}$ for which $|g_{k}(\bm{\gamma}_{i};X)|\leqslant XQ^{-\tau}$ . In view of (2.3), we can identify a subset of indices ${\mathcal{J}}_{i}\subseteq\{1,\dots,2s_{0}+1\}\setminus\{i\}$ with $\operatorname{card}({\mathcal{J}}_{i})=s_{0}$ for which

[TABLE]

Write $C_{i}^{(l)}$ for the submatrix of $C^{(l)}$ having columns indexed by ${\mathcal{J}}_{i}$ . The condition that the coefficient matrices $C^{(l)}$ be highly non-singular implies that the submatrices $C_{i}^{(l)}$ of $C^{(l)}$ are also highly non-singular. Thus, by orthogonality, we see from the definition (1.8) of the mean value $I_{s_{0},{\mathbf{k}}}^{{\mathbf{v}},{\mathbf{u}}}(X)$ that

[TABLE]

Consider a fixed ${\bm{\alpha}}\in{\mathfrak{m}}$ . If $\tau$ has been chosen sufficiently small, Lemma 4.3 ensures that we can find an index $j$ with $1\leqslant j\leqslant w$ such that ${\bm{\alpha}}\in{\mathfrak{m}}^{(j)}$ . Thus we see that we have the inclusion ${\mathfrak{m}}\subseteq{\mathfrak{m}}^{(1)}\cup\dots\cup{\mathfrak{m}}^{(w)}$ , whence

[TABLE]

Now recall that $Q=X^{1/(6r)}$ . Note also that the hypotheses of Theorem 4.1 under which we are currently working permit the assumption of those of Theorem 3.3. Thus, upon combining the estimate (4.11) with Theorem 3.3, inserting (4.1) and recalling (3.15), we obtain the bound

[TABLE]

By substituting this estimate into (4.10), we obtain the conclusion of the lemma. ∎

Upon combining the results of Lemmata 4.2 and 4.4, we infer that for some $\omega>0$ one has the asymptotic formula

[TABLE]

This completes our analysis of the minor arcs.

5. Initial considerations for the singular series

It remains to show that the singular series ${\mathfrak{S}}(Q)$ and singular integral ${\mathfrak{J}}_{1}(Q)$ converge as $Q$ tends to infinity. We now put

[TABLE]

In this notation, the system under consideration can be viewed as a superposition of $w$ Vinogradov systems with respective degrees $\ell_{j}$ , all missing the linear slice, and thus it follows from the definition (1.9) that the total degree of this system is

[TABLE]

Throughout this and the next section, we work under the assumption that $s\geqslant 2K+1$ .

We first attend to the singular series. Put

[TABLE]

By applying (2.3), we find that for some choice of distinct indices $j_{1},\ldots,j_{w}\in\{1,\ldots,s\}$ we have the asymptotic bound

[TABLE]

where

[TABLE]

Note that both $A(q)$ and $A_{1}(q)$ are multiplicative in $q$ . For this reason, the key to understanding the singular series is to maintain good control over the multiplicative quantity

[TABLE]

as $q$ runs over the prime powers.

Define $\tau_{j}$ by setting $\tau_{j}=\frac{1}{2}\ell_{j}(\ell_{j}+1)-1$ for $1\leqslant j\leqslant w$ , and write $T_{j}=\tau_{1}+\ldots+\tau_{j}$ , so that $T_{w}=K$ . For consistency we also set $T_{0}=0$ . Now, adopting a notation similar to that of Section 2, when $2\leqslant l\leqslant k$ we write $D^{(l)}$ for the submatrices

[TABLE]

of the coefficient matrices $C^{(l)}$ consisting of the columns indexed by $j_{1},\ldots,j_{w}$ . Note that the hypothesis that each $C^{(l)}$ is highly non-singular ensures that the same is true for each $D^{(l)}$ . For $1\leqslant h\leqslant w$ and $2\leqslant l\leqslant k$ we set ${\Delta}_{h}^{(l)}={\Lambda}_{j_{h}}^{(l)}$ , and we employ the same conventions regarding vector notation as in (4.6) and also (2.6) and its sequel. Thus, we write $\bm{\Delta}_{j}=({\Delta}_{j}^{(l)})_{2\leqslant l\leqslant k}$ and $\bm{\Delta}^{(l)}=({\Delta}_{j}^{(l)})_{1\leqslant j\leqslant w}$ , so that

[TABLE]

In this notation, it follows from standard orthogonality relations that

[TABLE]

counts the number of solutions ${\mathbf{x}},{\mathbf{y}}\in(\mathbb{Z}/q\mathbb{Z})^{K}$ of the system of congruences

[TABLE]

where $1\leqslant i\leqslant r_{l}$ and $2\leqslant l\leqslant k$ .
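The orthogonality relation invoked here can be tested numerically in its simplest form. The sketch below is a one-congruence illustration, not the full system (5.5): it assumes a single congruence $x_{1}^{l}+\ldots+x_{t}^{l}\equiv y_{1}^{l}+\ldots+y_{t}^{l}\pmod{q}$ and compares a direct count of solutions with the normalised sum of $|S|^{2t}$ over all residues $a$ modulo $q$. The function names are ours.

```python
import cmath
from itertools import product


def count_direct(q, t, l):
    """Count (x, y) in (Z/qZ)^t x (Z/qZ)^t with sum x_i^l = sum y_i^l modulo q."""
    frequency = [0] * q
    for x in product(range(q), repeat=t):
        frequency[sum(pow(v, l, q) for v in x) % q] += 1
    return sum(c * c for c in frequency)


def count_via_exponential_sums(q, t, l):
    """Evaluate q^{-1} * sum over a mod q of |sum over x mod q of e(a x^l / q)|^{2t}."""
    total = 0.0
    for a in range(q):
        S = sum(
            cmath.exp(2j * cmath.pi * ((a * pow(x, l, q)) % q) / q)
            for x in range(q)
        )
        total += abs(S) ** (2 * t)
    return total / q
```

Up to floating-point error the two functions agree; for example both give 9 when $q=5$, $t=1$ and $l=2$.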

Our first goal is to apply a procedure inspired by the proof of Theorem 2.1 in [3] in order to disentangle the congruences in (5.5). This will enable us to replace the sum $B_{1}(q)$ by a related expression in which for all indices $j$ the degree $k$ in the exponential sum $S_{k}(q,\bm{\Delta}_{j})$ is replaced by $\ell_{j}$ . Since $\ell_{j}$ is typically smaller than $k$ , we will reap the rewards of this preparatory step when the reduced degrees allow us to exert greater control on the size of the exponential sums in question.

Given a $(k-1)$ -tuple of variables $\xi^{(2)},\ldots,\xi^{(k)}$ , we adopt the convention that ${\bm{\xi}}^{[l]}=(\xi^{(2)},\ldots,\xi^{(l)})$ for $2\leqslant l\leqslant k$ . Also, when ${\mathbf{d}}=(d_{2},\dots,d_{k})$ is a coefficient vector, we abbreviate the vector $(d_{2}\xi^{(2)},\dots,d_{k}\xi^{(k)})$ to ${\mathbf{d}}{\bm{\xi}}$ , and we appropriate the notation ${\mathbf{d}}^{[l]}$ and $({\mathbf{d}}{\bm{\xi}})^{[l]}$ to denote the corresponding subvectors whose entries are indexed by $2\leqslant i\leqslant l$ . The following observation will play a part in our ensuing arguments.

Lemma 5.1.

Let $l$ , $q$ and $t$ be natural numbers, with $2\leqslant l\leqslant k-1$ . Suppose that $d_{2},\dots,d_{k}$ and $c_{2},\dots,c_{k}$ are fixed integers, and put

[TABLE]

Then for any fixed integers $a^{(l+1)},\dots,a^{(k)}$ we have

[TABLE]

Proof.

By standard orthogonality relations, the sum

[TABLE]

counts solutions ${\mathbf{x}},{\mathbf{y}}\in(\mathbb{Z}/q\mathbb{Z})^{t}$ of the system of congruences

[TABLE]

where each solution is counted with a unimodular weight depending on the inert variables $a^{(l+1)},\ldots,a^{(k)}$ , together with the coefficients ${\mathbf{d}}$ and ${\mathbf{c}}$ . Thus, by the triangle inequality, one finds that

[TABLE]

We therefore discern that $T$ is bounded above by the number of solutions of (5.7) counted without weights, and hence by the number of solutions ${\mathbf{x}},{\mathbf{y}}\in(\mathbb{Z}/q\mathbb{Z})^{t}$ of the system of congruences

[TABLE]

We interpret the latter as the number of solutions of the system

[TABLE]

with ${\mathbf{x}},{\mathbf{y}}\in(\mathbb{Z}/q\mathbb{Z})^{t}$ and $1\leqslant e_{j}\leqslant(q,d_{j})$ for $2\leqslant j\leqslant l$ . Thus, by orthogonality and the triangle inequality, one sees that

[TABLE]

The conclusion of the lemma is now immediate from (5.6). ∎
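The passage from the congruences with coefficients $d_{j}$ to the parametrisation by $e_{j}$ with $1\leqslant e_{j}\leqslant(q,d_{j})$ rests on an elementary fact: the congruence $d\xi\equiv 0\pmod{q}$ has exactly $(q,d)$ solutions modulo $q$, namely the multiples of $q/(q,d)$. A minimal check, with hypothetical function names, reads as follows.

```python
from math import gcd


def solutions_of_linear_congruence(d, q):
    """All residues xi modulo q with d * xi = 0 modulo q, found by exhaustion."""
    return [xi for xi in range(q) if (d * xi) % q == 0]


def predicted_solutions(d, q):
    """The residues e * q / (q, d) for 1 <= e <= (q, d), sorted."""
    g = gcd(q, d)
    return sorted((e * (q // g)) % q for e in range(1, g + 1))
```

For instance, with $d=6$ and $q=9$ both functions return `[0, 3, 6]`, in line with $(9,6)=3$.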

We now define

[TABLE]

The crucial bound for our analysis of the singular series is contained in the following lemma.

Lemma 5.2.

Let $q$ be a natural number, and suppose that the matrices $D^{(l)}$ are all highly non-singular. Then there exists a finite set of primes ${\Omega}(D)$ and a natural number ${\mathcal{R}}(q)={\mathcal{R}}(q,D)$ , both depending at most on the coefficient matrices $D^{(l)}$ and in the latter case also $q$ , with the property that

[TABLE]

The constant ${\mathcal{R}}(q)$ is bounded above uniformly in $q$ , and one can take ${\mathcal{R}}(q)=1$ whenever $(q,p)=1$ for all $p\in{\Omega}(D)$ .

Proof.

Recall that $q^{2K-r}B_{1}(q)$ counts the number of solutions ${\mathbf{x}},{\mathbf{y}}\in(\mathbb{Z}/q\mathbb{Z})^{K}$ of the system of congruences (5.5) for $1\leqslant j\leqslant r_{l}$ and $2\leqslant l\leqslant k$ . Since $B_{1}(q)$ is a multiplicative function of $q$ , it is apparent that it suffices to establish the conclusion of the lemma in the special case in which $q$ is a prime power, say $q=p^{h}$ for a given prime $p$ . By applying suitable elementary row operations within the coefficient matrices $D^{(l)}$ for $2\leqslant l\leqslant k$ that are invertible over $\mathbb{Z}/p^{h}\mathbb{Z}$ , we may suppose without loss of generality that each coefficient matrix $D^{(l)}$ is in upper row echelon form. This operation corresponds to taking appropriate linear combinations of the congruences comprising (5.5). Here, we stress that the property that each $D^{(l)}$ is highly non-singular implies that the first $r_{l}\times r_{l}$ submatrix of $D^{(l)}$ is now upper triangular. We denote this matrix by $D_{0}^{(l)}$ . Note that the power of $p$ dividing the diagonal entries of $D_{0}^{(l)}$ depends only on the first $r_{l}\times r_{l}$ submatrices of the original coefficient matrices $D^{(l)}$ . In particular, by defining ${\Omega}(D)$ to be the set of all primes dividing any of the determinants of the latter submatrices, we ensure that when $p\not\in{\Omega}(D)$ , then none of the diagonal entries of $D_{0}^{(l)}$ is divisible by $p$ .

We now employ an inductive argument in order to successively reduce the degrees of the exponential sums occurring within the mean value

[TABLE]

Observe that, as a result of our preparatory manipulations, the $r_{l}\times r_{l}$ coefficient matrices $D^{(l)}$ with $2\leqslant l\leqslant\ell_{w}$ are upper triangular. Thus, the only exponential sum within the above formula for $B_{1}(p^{h})$ that depends on ${\mathbf{a}}_{w}^{[\ell_{w}]}$ is the one involving $\bm{\Delta}_{w}$ . In order to save clutter, we temporarily drop the modulus $p^{h}$ in our exponential sums $S_{k}(p^{h},\bm{\Delta}_{j})$ . We may thus write

[TABLE]

The inner sum is of the shape considered in Lemma 5.1 with $l=\ell_{w}$ . On writing ${\mathbf{d}}_{j}=(d_{j,j}^{(l)})_{2\leqslant l\leqslant\ell_{j}}$ ( $1\leqslant j\leqslant w$ ), we thus obtain the bound

[TABLE]

Now suppose that for some index $j$ with $1\leqslant j\leqslant w-1$ we have the bound

[TABLE]

where

[TABLE]

Again, since we may assume all coefficient matrices $D^{(l)}$ to be in upper row echelon form, the only exponential sum within the mean value defining ${\Upsilon}_{j}$ that depends on the vector ${\mathbf{a}}_{j}^{[\ell_{j}]}$ is the one involving $\bm{\Delta}_{j}$ . Thus, as in the case $j=w$ considered above, we may isolate the exponential sum indexed by $j$ and apply Lemma 5.1. As a result, we find that

[TABLE]

Inserting this bound into (5.8) reproduces (5.8) with $j$ replaced by $j-1$ . We may clearly iterate, and after $w$ steps we find that

[TABLE]

Clearly, the vectors ${\mathbf{a}}_{j}^{[\ell_{j}]}$ with $1\leqslant j\leqslant w$ together list the coordinates of ${\mathbf{a}}$ . Since $B_{1}(q)$ is multiplicative, the assertion of the lemma is now confirmed upon taking ${\mathcal{R}}(q)$ to be the multiplicative function defined via the formula

[TABLE]

In particular, we note that ${\mathcal{R}}(p^{h})$ depends at most on the coefficient matrices $D^{(l)}$ , and one has ${\mathcal{R}}(p^{h})=1$ whenever $p\not\in{\Omega}(D)$ . ∎

6. Conclusion of the major arcs analysis

With Lemma 5.2 we are now equipped to engage with our goal of showing that the singular series ${\mathfrak{S}}=\lim_{Q\to\infty}{\mathfrak{S}}(Q)$ converges absolutely. In this context, for each prime number $p$ we define the $p$ -adic factor

[TABLE]

Lemma 6.1.

Suppose that the coefficient matrices $C^{(l)}$ associated with the system (1.10) are highly non-singular, and that $r_{l}\geqslant r_{l+1}$ for $2\leqslant l\leqslant k-1$ . Also, assume that $s\geqslant 2K+1$ . Then the $p$ -adic densities $\chi_{p}$ exist, the singular series ${\mathfrak{S}}$ is absolutely convergent, and ${\mathfrak{S}}=\prod_{p}\chi_{p}$ . In particular, one has ${\mathfrak{S}}(Q)={\mathfrak{S}}+O(Q^{-\delta})$ for some $\delta>0$ . Moreover, if the system (1.10) has a non-singular $p$ -adic solution for all primes $p$ , then ${\mathfrak{S}}\gg 1$ .

Proof.

On recalling (4.8) and (5.1), we see that ${\mathfrak{S}}(Q)=\sum_{1\leqslant q\leqslant Q}A(q)$ , and so the estimation of the quantity $A(q)$ is our central focus. The multiplicativity of $A(q)$ allows us to restrict our attention to the cases where $q$ is a prime power. Set $\chi_{p}(H)=\sum_{h=0}^{H}A(p^{h})$ and $L_{p}(Q)=\lfloor\log Q/\log p\rfloor$ . If the product

[TABLE]

converges absolutely as $Q\rightarrow\infty$ , then so does ${\mathfrak{S}}(Q)$ with the same limit. In such circumstances, one has ${\mathfrak{S}}=\prod_{p}\chi_{p}$ . It is therefore sufficient to show that for all primes $p$ the limit

[TABLE]

exists, and moreover that there exists a positive number $\delta$ having the property that $\chi_{p}=1+O(p^{-1-\delta})$ for all but at most a finite set of primes $p$ .
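The reason why a bound of the shape $\chi_{p}=1+O(p^{-1-\delta})$ suffices is that $\sum_{p}p^{-1-\delta}$ converges. The sketch below illustrates this numerically for the model product $\prod_{p\leqslant n}(1+cp^{-1-\delta})$; the constant $c$ and the function names are assumptions made for the illustration and do not come from the paper.

```python
from math import log


def primes_up_to(n):
    """Sieve of Eratosthenes returning all primes p <= n (for n >= 1)."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return [i for i in range(2, n + 1) if sieve[i]]


def log_partial_product(n, delta, c=1.0):
    """Logarithm of the partial product of (1 + c * p^(-1-delta)) over primes p <= n."""
    return sum(log(1 + c * p ** (-1 - delta)) for p in primes_up_to(n))
```

Since $\log(1+x)\leqslant x$ and $\sum_{n\geqslant 2}n^{-1-\delta}\leqslant 1/\delta$, the values `log_partial_product(n, delta)` increase with `n` but remain below `1 / delta`.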

On recalling (5.2), we find from (4.4) that

[TABLE]

The invertibility of the coordinate transform (5.4) implies that when $({\mathbf{a}},p^{h})=1$ , then there is at least one index $j$ with $1\leqslant j\leqslant w$ such that $(p^{h},\bm{\Delta}_{j})\ll 1$ , with an implied constant depending at most on the coefficient matrices $C^{(l)}$ . Since $s-2K\geqslant 1$ and $\varepsilon$ may be taken arbitrarily small, we deduce that there is a positive number $c_{1}$ , depending at most on the coefficient matrices $C^{(l)}$ , having the property that

[TABLE]

We now wish to apply Lemma 5.2. To this end, we first recall (5.3) and observe that a summation by parts yields the relation

[TABLE]

Since all coefficients on the right hand side are positive, and also both $B_{1}(p^{h})$ and $B_{1}^{*}(p^{h})$ are non-negative for all non-negative integers $h$ , it follows from Lemma 5.2 that we may majorise the right hand side of (6.3) by replacing $B_{1}(p^{h})$ with ${\mathcal{R}}(p^{h})B_{1}^{*}(p^{h})$ for $0\leqslant h\leqslant L$ . Set ${\mathcal{R}}_{p}=\max_{h\geqslant 0}{\mathcal{R}}(p^{h})$ , noting that this maximum exists as ${\mathcal{R}}(p^{h})$ is an integer which is bounded uniformly for all non-negative integers $h$ . Also, in analogy to the definition of $B_{1}^{*}(p^{h})$ , we put

[TABLE]

Thus, another summation by parts shows that the right hand side of (6.3) is no larger than

[TABLE]

We have therefore established the bound

[TABLE]

Since $2\tau_{j}=\ell_{j}(\ell_{j}+1)-2\geqslant\ell_{j}^{2}$ for all $j$ , we can infer further from (4.4) that there exists a positive number $c_{2}$ , depending at most on $\varepsilon$ , such that

[TABLE]
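The inequality $2\tau_{j}\geqslant\ell_{j}^{2}$ and the identity $T_{w}=K$ are easy to confirm on examples. In the sketch below the function names are ours, and the list of degrees used for the cubic and quadratic case ($\ell_{j}=3$ for $v$ indices and $\ell_{j}=2$ for the remaining $u-v$) is an assumption based on the description of the system as a superposition of Vinogradov systems missing the linear slice.

```python
def tau(ell):
    """tau = ell(ell + 1)/2 - 1, the sum of the degrees 2, 3, ..., ell."""
    return ell * (ell + 1) // 2 - 1


def total_degree(ells):
    """T_w = tau_1 + ... + tau_w for a list of degrees ell_1, ..., ell_w."""
    return sum(tau(ell) for ell in ells)


# 2 * tau(ell) - ell^2 = ell - 2, which is non-negative exactly when ell >= 2.
slack = [2 * tau(ell) - ell * ell for ell in range(2, 8)]

# v = 2 cubic and u = 6 quadratic equations: ell = 3 twice and ell = 2 four times.
example_total = total_degree([3] * 2 + [2] * 4)
```

Here `slack` is `[0, 1, 2, 3, 4, 5]` and `example_total` equals $18=3v+2u$ with $v=2$ and $u=6$.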

For a fixed vector ${\mathbf{e}}\in\mathbb{Z}_{\geqslant 0}^{w}$ denote by $\Xi(p^{h},{\mathbf{e}})$ the number of vectors ${\mathbf{a}}\in\mathbb{Z}^{r}$ satisfying $1\leqslant{\mathbf{a}}\leqslant p^{h}$ and $(p^{h},{\mathbf{a}}_{j}^{[\ell_{j}]})=p^{e_{j}}$ for $1\leqslant j\leqslant w$ . Then one has

[TABLE]

where the sum is over all vectors ${\mathbf{e}}\in\mathbb{Z}^{w}$ satisfying $0\leqslant e_{j}\leqslant h$ and having the property that $e_{j}=0$ for at least one index $j$ . For any fixed $j$ , the number of choices for ${\mathbf{a}}_{j}^{[\ell_{j}]}\in\mathbb{Z}^{\ell_{j}-1}$ having $1\leqslant{\mathbf{a}}_{j}^{[\ell_{j}]}\leqslant p^{h}$ and $(p^{h},{\mathbf{a}}_{j}^{[\ell_{j}]})=p^{e_{j}}$ is at most $p^{(h-e_{j})(\ell_{j}-1)}$ . It follows that

[TABLE]

and hence

[TABLE]
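The counting bound used above, namely that at most $p^{(h-e)n}$ vectors ${\mathbf{a}}\in[1,p^{h}]^{n}$ satisfy $(p^{h},{\mathbf{a}})=p^{e}$, can be verified by exhaustion for small parameters. The following sketch uses a hypothetical function name and takes $n$ in place of $\ell_{j}-1$.

```python
from itertools import product
from math import gcd


def count_gcd_vectors(p, h, e, n):
    """Number of a in [1, p^h]^n with gcd(p^h, a_1, ..., a_n) = p^e."""
    q = p ** h
    target = p ** e
    count = 0
    for a in product(range(1, q + 1), repeat=n):
        g = q
        for coordinate in a:
            g = gcd(g, coordinate)
        if g == target:
            count += 1
    return count
```

For $p=3$, $h=2$ and $n=2$ the counts for $e=0,1,2$ are 72, 8 and 1, each no larger than the corresponding bound $3^{(2-e)\cdot 2}$, that is 81, 9 and 1.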

On recalling (6.2), (6.4) and (6.5) we find that

[TABLE]

It follows that the $p$ -adic density $\chi_{p}$ defined in (6.1) exists. In particular, whenever ${\mathcal{R}}_{p}=1$ we have

[TABLE]

for some positive number $c_{3}$ depending at most on the coefficient matrices $C^{(l)}$ . On recalling the conclusion of Lemma 5.2, one sees that ${\mathcal{R}}_{p}=1$ for all primes $p$ with $p\not\in{\Omega}(D)$ , and thus

[TABLE]

Hence, the singular series ${\mathfrak{S}}$ converges absolutely and one has ${\mathfrak{S}}=\prod_{p}\chi_{p}$ .

Furthermore, a standard argument yields

[TABLE]

where $M(q)$ denotes the number of solutions ${\mathbf{x}}\in(\mathbb{Z}/q\mathbb{Z})^{s}$ of the congruences

[TABLE]

corresponding to the equations (1.10). Using again the observation that ${\mathcal{R}}_{p}=1$ for all sufficiently large primes $p$ , we discern from (6.6) that there exists an integer $p_{0}$ with the property that

[TABLE]

For the remaining finite set of primes, a standard application of Hensel’s lemma shows that $\chi_{p}>0$ whenever the system (1.10) possesses a non-singular solution in $\mathbb{Q}_{p}$ . We thus conclude that under the hypotheses of the lemma we have ${\mathfrak{S}}\gg 1$ as claimed. ∎
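The Hensel-type lifting invoked for the remaining primes can be illustrated in one variable. The sketch below is a simplification: the proof applies the principle to the multivariate system (1.10), whereas here we lift a simple root of a single polynomial $f$ from modulus $p$ to modulus $p^{h}$ by Newton iteration. The function name is ours, and the modular inverse via `pow(x, -1, m)` requires Python 3.8 or later.

```python
def hensel_lift(f, df, root, p, h):
    """Lift a root of f modulo p with df(root) != 0 mod p to a root modulo p^h.

    f and df are callables giving the polynomial and its derivative on integers.
    """
    if f(root) % p != 0 or df(root) % p == 0:
        raise ValueError("need a non-singular root modulo p")
    modulus = p
    for _ in range(h - 1):
        modulus *= p
        # df(root) is a unit modulo p, hence modulo every power of p.
        inverse = pow(df(root), -1, modulus)
        root = (root - f(root) * inverse) % modulus
    return root
```

For example, starting from the root $3$ of $x^{2}-2$ modulo $7$, the call `hensel_lift(lambda x: x * x - 2, lambda x: 2 * x, 3, 7, 5)` returns a residue $r$ with $r\equiv 3\pmod{7}$ and $r^{2}\equiv 2\pmod{7^{5}}$.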

We next demonstrate the existence of the limit

[TABLE]

With this goal in mind, when $W$ is a positive real number, we introduce the auxiliary mean value

[TABLE]

Lemma 6.2.

Under the hypotheses of Theorem 4.1, there is a positive number ${\delta}$ for which one has ${\mathfrak{J}}^{*}_{1}(2Q)-{\mathfrak{J}}^{*}_{1}(Q)\ll Q^{-{\delta}}$ , and hence the limit $\chi_{\infty}$ exists. In particular, one has

[TABLE]

Furthermore, if the system (1.10) has a non-singular solution inside the real unit cube $(-1,1)^{s}$ , then the singular integral $\chi_{\infty}$ is positive.

Proof.

The first part of the proof is inspired by a singular series argument of Heath-Brown and Skorobogatov (see [9, pages 173 and 174]). Recall that

[TABLE]

where the integers $t_{j}$ are defined by means of (3.15). Thus, the hypotheses of Theorem 4.1 imply that $s\geqslant 2s_{0}+1\geqslant 2K+1$ . Let ${\mathcal{J}}$ denote the set of $s_{0}$ -element subsets $\{j_{1},\dots,j_{s_{0}}\}$ of $\{1,\dots,s\}$ . When $J\in{\mathcal{J}}$ , define

[TABLE]

and

[TABLE]

Set $Y=Q^{6r}$ , and define the major arcs ${\mathfrak{M}}_{Y}(Q)$ via (4.3). By making the necessary modifications to our initial analysis of the major arcs, we see from (4.9) that for any $J\in{\mathcal{J}}$ one has

[TABLE]

Note that we have $S_{k}(1,\bm{1})=1$ for the term corresponding to $q=1$ in (6.7). Since all other summands are non-negative, it follows that for any $Q\geqslant 1$ and any $J\in{\mathcal{J}}$ , one has

[TABLE]

On the other hand, for $Y=Q^{6r}$ the major arcs ${\mathfrak{M}}_{Y}(Q)$ are disjoint, and we conclude from Theorem 3.3 that under the hypotheses of Theorem 4.1 we have

[TABLE]

In combination with (6.8) and (6.9) it follows that

[TABLE]

Since $Y$ is a power of $Q$ , we discern from (4.5) and (6.10) via (2.3) that for any $Q>1$ we have

[TABLE]

Here, we exploited the fact that, since the coefficient matrices $C^{(l)}$ are highly non-singular, the condition $|{\bm{\beta}}|>Q$ implies that $|{\bm{\vartheta}}_{j}|\gg Q$ for some index $j$ with $1\leqslant j\leqslant s$ . This implies the first statement of the lemma. In particular, the singular integral $\chi_{\infty}$ converges absolutely.

In order to establish the second claim, we follow an argument of Schmidt [10]. When $T\geqslant 1$, define

[TABLE]

and recall that

[TABLE]

where the integral converges absolutely. Set

[TABLE]

and put

[TABLE]

We adapt the argument of §11 in Schmidt’s work [10] to show that $W_{T}\to\chi_{\infty}$ as $T\to\infty$.

Set

[TABLE]

Then in light of (6.11) a change of the order of integration shows that

[TABLE]

and hence

[TABLE]

In order to analyse the integral on the right-hand side of (6.12), it is convenient to consider two domains separately. Write $U_{1}=[-\sqrt{T},\sqrt{T}]^{r}$, and set $U_{2}=\mathbb{R}^{r}\setminus U_{1}$. From the power series expansion of $\psi_{T}$ we find that

[TABLE]

whence we discern that the domain $U_{1}$ contributes at most

[TABLE]

Note that in the last step we used our previous insight that the singular integral converges absolutely. Meanwhile, the contribution from $U_{2}$ is bounded above by

[TABLE]

for some positive number ${\delta}$ with ${\delta}<1$, where again we took advantage of our earlier findings. Thus we infer from (6.12) that

[TABLE]

for all $T\geqslant 1$, and hence $W_{T}$ does indeed converge to $\chi_{\infty}$, as claimed.

Suppose now that the system (1.10) has a non-singular solution inside $(-1,1)^{s}$. Then it follows from the implicit function theorem that the real manifold described by the equations in (1.10) has positive $(s-r)$-dimensional volume inside $(-1,1)^{s}$. In such circumstances, Lemma 2 of Schmidt [10] shows that $W_{T}\gg 1$ uniformly in $T$. We therefore deduce from (6.13) that $\chi_{\infty}$ is indeed positive, confirming the second claim of the lemma. ∎
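
The appeal to the implicit function theorem in this last step can be made explicit. The following is a sketch in which the map $F:\mathbb{R}^{s}\to\mathbb{R}^{r}$ (whose zero set is the real manifold defined by (1.10)), the splitting $\mathbf{x}=(\mathbf{w},\mathbf{y})$, the function $g$ and the open set $U$ are all notation introduced here for illustration rather than taken from the paper.

```latex
% Assumption: F(x_0) = 0 with x_0 = (w_0, y_0) \in (-1,1)^s, and the Jacobian of F at x_0 has rank r.
% After relabelling variables, x = (w, y) with w \in \mathbb{R}^{r}, y \in \mathbb{R}^{s-r}, and
\[
\det\Bigl(\frac{\partial F_{i}}{\partial w_{j}}(\mathbf{x}_{0})\Bigr)_{1\leqslant i,j\leqslant r}\neq 0 .
\]
% The implicit function theorem supplies an open set U \ni y_0 and a smooth map g : U \to \mathbb{R}^{r} with
\[
g(\mathbf{y}_{0})=\mathbf{w}_{0},\qquad
F\bigl(g(\mathbf{y}),\mathbf{y}\bigr)=\mathbf{0}\quad(\mathbf{y}\in U),
\]
% and, after shrinking U, (g(y), y) \in (-1,1)^s for all y \in U.
% The graph of g lies on the manifold and has (s-r)-dimensional volume
\[
\int_{U}\sqrt{\det\bigl(I+Dg(\mathbf{y})^{T}Dg(\mathbf{y})\bigr)}\,\mathrm{d}\mathbf{y}
\;\geqslant\;\operatorname{vol}(U)\;>\;0 .
\]
```

The final inequality holds because $Dg^{T}Dg$ is positive semi-definite, so the determinant under the square root is at least $1$.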

Upon combining (4.12) with Lemmata 6.1 and 6.2, we conclude that

[TABLE]

where ${\mathcal{C}}=\chi_{\infty}\prod_{p}\chi_{p}$. Moreover, the constant ${\mathcal{C}}$ is positive whenever the system (1.10) possesses non-singular solutions in all local fields. This confirms the asymptotic formula (1.11), and completes our proof of Theorem 4.1.
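
For orientation, the way a positive leading constant forces the existence of integral solutions can be written out in one line. This is a generic sketch: the counting function $N(X)$, the exponent $\theta$ and the constant $B$ are placeholders introduced here, and the precise shape of (1.11) should be taken from the statement of Theorem 4.1.

```latex
% Assumption: N(X) = \mathcal{C} X^{\theta} + E(X) with |E(X)| \le B X^{\theta-\delta},
% where \mathcal{C} > 0, \delta > 0, and \theta, B are hypothetical placeholders.
\[
N(X)\;\geqslant\;\mathcal{C}X^{\theta}-BX^{\theta-\delta}
\;=\;X^{\theta}\bigl(\mathcal{C}-BX^{-\delta}\bigr)
\;\geqslant\;\tfrac{1}{2}\,\mathcal{C}X^{\theta}
\qquad\text{whenever } X\geqslant (2B/\mathcal{C})^{1/\delta}.
\]
```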

Bibliography

[1] J. Bourgain, C. Demeter and L. Guth, Proof of the main conjecture in Vinogradov’s mean value theorem for degrees higher than three, Ann. of Math. (2) 184 (2016), no. 2, 633–682.
[2] J. Brandes, The Hasse principle for systems of quadratic and cubic diagonal equations, Q. J. Math. 68 (2017), no. 3, 831–850.
[3] J. Brandes and S. T. Parsell, Simultaneous additive equations: repeated and differing degrees, Canad. J. Math. 69 (2017), no. 2, 258–283.
[4] J. Brandes and T. D. Wooley, Vinogradov systems with a slice off, Mathematika 63 (2017), no. 3, 797–817.
[5] J. Brüdern and T. D. Wooley, The Hasse principle for pairs of diagonal cubic forms, Ann. of Math. (2) 166 (2007), no. 3, 865–895.
[6] J. Brüdern and T. D. Wooley, The Hasse principle for systems of diagonal cubic forms, Math. Ann. 364 (2016), no. 3-4, 1255–1274.
[7] R. J. Cook, Simultaneous quadratic equations, J. London Math. Soc. (2) 4 (1971), 319–326.
[8] R. J. Cook, Simultaneous quadratic equations II, Acta Arith. 25 (1973/74), 1–5.
