On the Erd\H{o}s--Ginzburg--Ziv Problem in large dimension

Lisa Sauermann; Dmitrii Zakharov

arXiv:2302.14737·math.CO·March 1, 2023

On the Erd\H{o}s--Ginzburg--Ziv Problem in large dimension

Lisa Sauermann, Dmitrii Zakharov

PDF

Open Access

TL;DR

This paper investigates the Erdős–Ginzburg–Ziv problem in high dimensions, providing significantly improved upper bounds for fixed m and large n by combining advanced combinatorial methods.

Contribution

It introduces new upper bounds for the problem in high dimensions, utilizing the slice rank polynomial method and a higher-uniformity Balog–Szemerédi–Gowers theorem.

Findings

01

Established upper bounds of the form D_{ε,m} * (C_{ε} m^{ε})^n for the problem

02

Improved understanding of the problem's behavior in large dimensions

03

Combined algebraic and combinatorial techniques for bounding solutions

Abstract

The Erd\H{o}s--Ginzburg--Ziv Problem is a classical extremal problem in discrete geometry. Given $m$ and $n$ , the problem asks about the smallest number $s$ such that among any $s$ points in the integer lattice $Z^{n}$ one can find $m$ points whose centroid is again a lattice point. Despite of a lot of attention over the last 50 years, this problem is far from well-understood. For fixed dimension $n$ , Alon and Dubiner proved that the answer grows linearly with $m$ . In this paper, we focus on the opposite case, where the number $m$ is fixed and the dimension $n$ is large. We drastically improve the previous upper bounds in this regime, showing that for every $ε > 0$ the answer is at most $D_{ε, m} \cdot (C_{ε} m^{ε})^{n}$ for all $m$ and $n$ . Our proof combines (a consequence of) the slice rank polynomial method with a higher-uniformity version of…

Equations68

∣ ℓ A ∣ = ∣ A + \dots + A ∣ \leq p^{n} = (p^{ε n})^{c} \leq ∣ A ∣^{c} .

∣ ℓ A ∣ = ∣ A + \dots + A ∣ \leq p^{n} = (p^{ε n})^{c} \leq ∣ A ∣^{c} .

s (Z_{m}^{n}) < m \cdot p \sum \frac{s ( Z _{p}^{n} )}{p - 1},

s (Z_{m}^{n}) < m \cdot p \sum \frac{s ( Z _{p}^{n} )}{p - 1},

s (Z_{m}^{n}) < m \cdot p \sum (D_{ε, p} + 1) \cdot (C_{ε} p^{ε})^{n},

s (Z_{m}^{n}) < m \cdot p \sum (D_{ε, p} + 1) \cdot (C_{ε} p^{ε})^{n},

x_{1}^{(j_{1})} + \dots + x_{k}^{(j_{k})} = 0 if and only if j_{1} = \dots = j_{k} .

x_{1}^{(j_{1})} + \dots + x_{k}^{(j_{k})} = 0 if and only if j_{1} = \dots = j_{k} .

Γ_{p, k} = 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{(p - 1) / k}} < p .

Γ_{p, k} = 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{(p - 1) / k}} < p .

C_{λ}^{'} = 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ ^{1/ λ}} .

C_{λ}^{'} = 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ ^{1/ λ}} .

Γ_{p, ⌈ λ p ⌉ + 1} = 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{(p - 1) / (⌈ λ p ⌉ + 1)}} \leq 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{1/ λ}} \leq 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ ^{1/ λ}} = C_{λ}^{'},

Γ_{p, ⌈ λ p ⌉ + 1} = 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{(p - 1) / (⌈ λ p ⌉ + 1)}} \leq 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{1/ λ}} \leq 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ ^{1/ λ}} = C_{λ}^{'},

Γ_{p, p} = 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{(p - 1) / p}} \leq 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ} \leq 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ ^{1/ λ}} = C_{λ}^{'},

Γ_{p, p} = 0 < γ < 1 in f \frac{1 + γ + \dots + γ ^{p - 1}}{γ ^{(p - 1) / p}} \leq 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ} \leq 0 < γ < 1 in f \frac{1}{( 1 - γ ) \cdot γ ^{1/ λ}} = C_{λ}^{'},

∣ A_{ℓ}^{'} ∣ \geq ∣ A_{p}^{'} ∣ - (ℓ - p) \cdot p \cdot (C_{λ}^{'})^{n} \geq ∣ A ∣ - p^{2} \cdot (C_{λ}^{'})^{n} > 0,

∣ A_{ℓ}^{'} ∣ \geq ∣ A_{p}^{'} ∣ - (ℓ - p) \cdot p \cdot (C_{λ}^{'})^{n} \geq ∣ A ∣ - p^{2} \cdot (C_{λ}^{'})^{n} > 0,

∣ {y_{k}^{(j)}, \dots, y_{p}^{(j)}} ∖ {y_{1}^{(1)}, \dots, y_{L}^{(1)}} ∣ \geq ℓ - 1

∣ {y_{k}^{(j)}, \dots, y_{p}^{(j)}} ∖ {y_{1}^{(1)}, \dots, y_{L}^{(1)}} ∣ \geq ℓ - 1

x_{k}^{(j)} = y_{k}^{(j)} + \dots + y_{p}^{(j)} = - (y_{1}^{(j)} + \dots + y_{k - 1}^{(j)}) = - (k - 1) \cdot y_{1}^{(j)} .

x_{k}^{(j)} = y_{k}^{(j)} + \dots + y_{p}^{(j)} = - (y_{1}^{(j)} + \dots + y_{k - 1}^{(j)}) = - (k - 1) \cdot y_{1}^{(j)} .

0 = x_{1}^{(j_{1})} + \dots + x_{k - 1}^{(j_{k - 1})} + x_{k}^{(j_{k})} = x_{1}^{(j)} + \dots + x_{k - 1}^{(j)} - (k - 1) \cdot y_{1}^{(j_{k})} = (k - 1) \cdot y_{1}^{(j)} - (k - 1) \cdot y_{1}^{(j_{k})}

0 = x_{1}^{(j_{1})} + \dots + x_{k - 1}^{(j_{k - 1})} + x_{k}^{(j_{k})} = x_{1}^{(j)} + \dots + x_{k - 1}^{(j)} - (k - 1) \cdot y_{1}^{(j_{k})} = (k - 1) \cdot y_{1}^{(j)} - (k - 1) \cdot y_{1}^{(j_{k})}

{x_{1}^{(j_{1})}, \dots, x_{k - 1}^{(j_{k - 1})}} = {y_{1}^{(j_{1})}, \dots, y_{k - 1}^{(j_{k - 1})}} = {y_{1}^{(j_{1})}, \dots, y_{1}^{(j_{k - 1})}}

{x_{1}^{(j_{1})}, \dots, x_{k - 1}^{(j_{k - 1})}} = {y_{1}^{(j_{1})}, \dots, y_{k - 1}^{(j_{k - 1})}} = {y_{1}^{(j_{1})}, \dots, y_{1}^{(j_{k - 1})}}

0 = x_{1}^{(j_{1})} + \dots + x_{k - 1}^{(j_{k - 1})} + x_{k}^{(j_{k})} = y_{1}^{(j_{1})} + \dots + y_{k - 1}^{(j_{k - 1})} + y_{k}^{(j)} + \dots + y_{p}^{(j)} = y_{1}^{(j_{1})} + \dots + y_{1}^{(j_{k - 1})} + y_{k}^{(j)} + \dots + y_{p}^{(j)},

0 = x_{1}^{(j_{1})} + \dots + x_{k - 1}^{(j_{k - 1})} + x_{k}^{(j_{k})} = y_{1}^{(j_{1})} + \dots + y_{k - 1}^{(j_{k - 1})} + y_{k}^{(j)} + \dots + y_{p}^{(j)} = y_{1}^{(j_{1})} + \dots + y_{1}^{(j_{k - 1})} + y_{k}^{(j)} + \dots + y_{p}^{(j)},

∣ {y_{1}^{(j_{1})}, \dots, y_{1}^{(j_{k - 1})}, y_{k}^{(j)}, \dots, y_{p}^{(j)}} ∣ \geq ∣ {y_{1}^{(j_{1})}, \dots, y_{1}^{(j_{k - 1})}} ∣ + ∣ {y_{k}^{(j)}, \dots, y_{p}^{(j)}} ∖ {y_{1}^{(1)}, \dots, y_{L}^{(1)}} ∣ \geq 2 + (ℓ - 1) = ℓ + 1,

∣ {y_{1}^{(j_{1})}, \dots, y_{1}^{(j_{k - 1})}, y_{k}^{(j)}, \dots, y_{p}^{(j)}} ∣ \geq ∣ {y_{1}^{(j_{1})}, \dots, y_{1}^{(j_{k - 1})}} ∣ + ∣ {y_{k}^{(j)}, \dots, y_{p}^{(j)}} ∖ {y_{1}^{(1)}, \dots, y_{L}^{(1)}} ∣ \geq 2 + (ℓ - 1) = ℓ + 1,

\frac{∣ A ∣}{2 p \cdot ∣ A ∣ ^{1/2} + 1} > \frac{∣ A ∣}{3 p \cdot ∣ A ∣ ^{1/2}} = \frac{∣ A ∣ ^{1/2}}{3 p} \geq \frac{3 p ^{3} \cdot ( C _{1/2}^{'} ) ^{n}}{3 p} = p^{2} \cdot (C_{1/2}^{'})^{n}

\frac{∣ A ∣}{2 p \cdot ∣ A ∣ ^{1/2} + 1} > \frac{∣ A ∣}{3 p \cdot ∣ A ∣ ^{1/2}} = \frac{∣ A ∣ ^{1/2}}{3 p} \geq \frac{3 p ^{3} \cdot ( C _{1/2}^{'} ) ^{n}}{3 p} = p^{2} \cdot (C_{1/2}^{'})^{n}

x_{1} + \dots + x_{p} = x_{t} + s = 1 \sum (p - 1) /2 (x_{i (s - 1)} + x_{j (s - 1)}) = y_{t} + s = 1 \sum (p - 1) /2 (y_{i (s - 1)} + y_{j (s - 1)}) = y_{1} + \dots + y_{p} = 0,

x_{1} + \dots + x_{p} = x_{t} + s = 1 \sum (p - 1) /2 (x_{i (s - 1)} + x_{j (s - 1)}) = y_{t} + s = 1 \sum (p - 1) /2 (y_{i (s - 1)} + y_{j (s - 1)}) = y_{1} + \dots + y_{p} = 0,

c^{''} \cdot (1 + ε ℓ^{'}) = (c - \frac{3}{4}) \cdot (1 + ε ℓ^{'}) \leq c - \frac{1}{2} = c^{'} .

c^{''} \cdot (1 + ε ℓ^{'}) = (c - \frac{3}{4}) \cdot (1 + ε ℓ^{'}) \leq c - \frac{1}{2} = c^{'} .

C (c) = max {(C (c^{'}))^{2}, (C_{1/ ℓ}^{'})^{ℓ / δ}},

C (c) = max {(C (c^{'}))^{2}, (C_{1/ ℓ}^{'})^{ℓ / δ}},

D (c, p) = max {(D (c^{'}, p))^{2}, p^{4} \cdot p^{2 ℓ / δ} \cdot 2^{(ℓ + 1) / δ}} .

D (c, p) = max {(D (c^{'}, p))^{2}, p^{4} \cdot p^{2 ℓ / δ} \cdot 2^{(ℓ + 1) / δ}} .

y \in F_{p}^{n} \sum N_{y} = ∣ A ∣^{ℓ - 1} .

y \in F_{p}^{n} \sum N_{y} = ∣ A ∣^{ℓ - 1} .

Y = {y \in F_{p}^{n} M_{y} \geq \frac{∣ A ∣}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}}}

Y = {y \in F_{p}^{n} M_{y} \geq \frac{∣ A ∣}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}}}

∣ Y ∣ \leq \frac{\sum _{y \in F_{p}^{n}} M _{y}}{∣ A ∣/ ( 2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n} )} \leq 2^{ℓ + 1} \cdot p^{2 ℓ} \cdot (C_{1/ ℓ}^{'})^{ℓ n} \cdot \frac{p \cdot ∣ A ∣ ^{c}}{∣ A ∣} = p^{2 ℓ + 1} \cdot (C_{1/ ℓ}^{'})^{ℓ n} \cdot ∣ A ∣^{c - 1} \leq ∣ A ∣^{c - (3/4)} .

∣ Y ∣ \leq \frac{\sum _{y \in F_{p}^{n}} M _{y}}{∣ A ∣/ ( 2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n} )} \leq 2^{ℓ + 1} \cdot p^{2 ℓ} \cdot (C_{1/ ℓ}^{'})^{ℓ n} \cdot \frac{p \cdot ∣ A ∣ ^{c}}{∣ A ∣} = p^{2 ℓ + 1} \cdot (C_{1/ ℓ}^{'})^{ℓ n} \cdot ∣ A ∣^{c - 1} \leq ∣ A ∣^{c - (3/4)} .

∣ {y_{1} + \dots + y_{ℓ - 1} ∣ (y_{1}, \dots, y_{ℓ - 1}) \in S} ∣ \leq ∣ Y ∣ \leq ∣ A ∣^{c - (3/4)} = ∣ A ∣^{c^{''}} .

∣ {y_{1} + \dots + y_{ℓ - 1} ∣ (y_{1}, \dots, y_{ℓ - 1}) \in S} ∣ \leq ∣ Y ∣ \leq ∣ A ∣^{c - (3/4)} = ∣ A ∣^{c^{''}} .

∣ ℓ^{'} A^{'} ∣ \leq ∣ A^{'} ∣^{c^{''} (1 + ε ℓ^{'})} \leq ∣ A^{'} ∣^{c^{'}},

∣ ℓ^{'} A^{'} ∣ \leq ∣ A^{'} ∣^{c^{''} (1 + ε ℓ^{'})} \leq ∣ A^{'} ∣^{c^{'}},

∣ A^{'} ∣ \geq ∣ A ∣^{1/2} \geq D (c, p)^{1/2} \cdot (C (c))^{n /2} \geq D (c^{'}, p) \cdot (C (c^{'}))^{n} .

∣ A^{'} ∣ \geq ∣ A ∣^{1/2} \geq D (c, p)^{1/2} \cdot (C (c))^{n /2} \geq D (c^{'}, p) \cdot (C (c^{'}))^{n} .

y \in Y \sum N_{y} < ∣ A ∣^{ℓ - 1 - δ} \leq \frac{∣ A ∣ ^{ℓ - 1}}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}},

y \in Y \sum N_{y} < ∣ A ∣^{ℓ - 1 - δ} \leq \frac{∣ A ∣ ^{ℓ - 1}}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}},

y \in Y \sum M_{y} N_{y} \leq ∣ A ∣ \cdot y \in Y \sum N_{y} \leq \frac{∣ A ∣ ^{ℓ}}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} .

y \in Y \sum M_{y} N_{y} \leq ∣ A ∣ \cdot y \in Y \sum N_{y} \leq \frac{∣ A ∣ ^{ℓ}}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} .

y \in F_{p}^{n} ∖ Y \sum M_{y} N_{y} \leq \frac{∣ A ∣}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} \cdot y \in F_{p}^{n} ∖ Y \sum N_{y} \leq \frac{∣ A ∣}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} \cdot ∣ A ∣^{ℓ - 1} = \frac{∣ A ∣ ^{ℓ}}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}},

y \in F_{p}^{n} ∖ Y \sum M_{y} N_{y} \leq \frac{∣ A ∣}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} \cdot y \in F_{p}^{n} ∖ Y \sum N_{y} \leq \frac{∣ A ∣}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} \cdot ∣ A ∣^{ℓ - 1} = \frac{∣ A ∣ ^{ℓ}}{2 ^{ℓ + 1} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}},

y \in F_{p}^{n} \sum M_{y} N_{y} = y \in Y \sum M_{y} N_{y} + y \in F_{p}^{n} ∖ Y \sum M_{y} N_{y} \leq \frac{∣ A ∣ ^{ℓ}}{2 ^{ℓ} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} .

y \in F_{p}^{n} \sum M_{y} N_{y} = y \in Y \sum M_{y} N_{y} + y \in F_{p}^{n} ∖ Y \sum M_{y} N_{y} \leq \frac{∣ A ∣ ^{ℓ}}{2 ^{ℓ} \cdot p ^{2 ℓ} \cdot ( C _{1/ ℓ}^{'} ) ^{ℓ n}} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Geometry and Mesh Generation · Digital Image Processing Techniques · Limits and Structures in Graph Theory

Full text

On the Erdős–Ginzburg–Ziv Problem in large dimension

Lisa Sauermann Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA. Email: [email protected]. Research supported by NSF Award DMS-2100157 and a Sloan Research Fellowship.

Dmitrii Zakharov Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA. Email: [email protected].

Abstract

The Erdős–Ginzburg–Ziv Problem is a classical extremal problem in discrete geometry. Given $m$ and $n$ , the problem asks about the smallest number $s$ such that among any $s$ points in the integer lattice $\mathbb{Z}^{n}$ one can find $m$ points whose centroid is again a lattice point. Despite of a lot of attention over the last 50 years, this problem is far from well-understood. For fixed dimension $n$ , Alon and Dubiner proved that the answer grows linearly with $m$ . In this paper, we focus on the opposite case, where the number $m$ is fixed and the dimension $n$ is large. We drastically improve the previous upper bounds in this regime, showing that for every $\varepsilon>0$ the answer is at most $D_{\varepsilon,m}\cdot(C_{\varepsilon}m^{\varepsilon})^{n}$ for all $m$ and $n$ . Our proof combines (a consequence of) the slice rank polynomial method with a higher-uniformity version of the Balog–Szemerédi–Gowers Theorem due to Borenstein and Croot.

1 Introduction

For given positive integers $m$ and $n$ , what is the minimum number $s$ such that among any $s$ points in the $n$ -dimensional integer lattice $\mathbb{Z}^{n}$ one can always find $m$ points whose centroid is again a lattice point in $\mathbb{Z}^{n}$ ? This problem is called the Erdős–Ginzburg–Ziv Problem, and its answer is denoted by $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ and called the Erdős–Ginzburg–Ziv constant of $\mathbb{Z}_{m}^{n}$ . This notation reflects that the problem can be naturally translated into $\mathbb{Z}_{m}^{n}$ (where one then asks about the smallest $s$ such that any sequence of length $s$ of elements of $\mathbb{Z}_{m}^{n}$ contains a subsequence of length $m$ summing to zero).

This problem has been studied for fifty years (see e.g. [12, 13]), and is still wide open despite of receiving a lot of attention (in particular, over the past five years more than twenty papers were published on this topic). Only very few values of $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ are known exactly: For $n=1$ Erdős, Ginzburg, and Ziv [8] proved that $\mathfrak{s}(\mathbb{Z}_{m}^{1})=2m-1$ , and for $n=2$ Reiher [17] proved that $\mathfrak{s}(\mathbb{Z}_{m}^{2})=4m-3$ . The only other infinite family of known values is when $m$ is a power of $2$ , then $\mathfrak{s}(\mathbb{Z}_{m}^{n})=(m-1)\cdot 2^{n}+1$ as established by Harborth [12].

Furthermore, and maybe more importantly, the growth behaviour of the function $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ is far from being understood. For fixed dimension $n$ , Alon and Dubiner [1] proved that $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ grows linearly with $m$ . Improving their bound for the linearity constant, the second author [23] furthermore showed that the bound $\mathfrak{s}(\mathbb{Z}_{m}^{n})\leq 4^{n}\cdot m$ holds for every positive integer $m$ all of whose prime factors are sufficiently large with respect to $n$ .

However, in the opposite regime for fixed $m$ , there was an enormous gap between the upper and lower bounds. It turns out that in this regime, the problem can essentially be reduced (up to constant factors) to the case where $m$ is a prime. By the pigeonhole principle, one can easily obtain an upper bound of $\mathfrak{s}(\mathbb{Z}_{m}^{n})\leq m\cdot m^{n}$ (as first observed by Harborth [12]). This trivial bound was improved to an upper bound of the form $C_{m}\cdot(\Gamma_{m})^{n}$ for every fixed prime $m\geq 5$ by Naslund [16] and for $m=3$ by Ellenberg–Gijswijt [7], where $C_{m}$ and $\Gamma_{m}$ are constants only depending on $m$ with $0.84m\leq\Gamma_{m}\leq 0.92m$ . Note that the base $\Gamma_{m}$ of the main term $(\Gamma_{m})^{n}$ in Naslund’s bound is smaller than the base $m$ in the corresponding term $m^{n}$ in the trivial bound, but the base $\Gamma_{m}$ is still linear in $m$ . This was improved by the first author [18], who showed a bound of the form $C_{m}\cdot(2\sqrt{m})^{n}$ for every fixed prime $m\geq 5$ , where again $C_{m}$ is a constant only depending on $m$ . Here, the base is of the form $m^{1/2+o(1)}$ , whereas the bases for the previous bounds were of the form $m^{1-o(1)}$ with $m$ being the base in the trivial bound. As discussed below, the $\sqrt{m}$ term in the base in this bound constitutes an important barrier for this problem.

Breaking this barrier, we drastically improve these upper bounds to an upper bound with base $m^{o(1)}$ . More precisely, we show the following theorem bounding $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ for any fixed integer $m$ and large dimension $n$ .

Theorem 1.1.

For every fixed $\varepsilon>0$ and every fixed integer $m\geq 2$ , we have $\mathfrak{s}(\mathbb{Z}_{m}^{n})\leq D_{\varepsilon,m}\cdot(C_{\varepsilon}m^{\varepsilon})^{n}$ for all $n$ . Here, $C_{\varepsilon}$ is a constant only depending on $\varepsilon$ , and $D_{\varepsilon,m}$ is a constant only depending on $\varepsilon$ and $m$ .

As mentioned above, the problem of upper-bounding $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ for fixed $m$ and large $n$ can easily be reduced to the case where $m=p$ is a prime. The problem is then essentially equivalent (up to constant factors depending on $m=p$ ) to the following additive combinatorics problem: For a fixed prime $p$ and large $n$ , what is the maximum possible size of a subset of $\mathbb{F}_{p}^{n}$ not containing $p$ distinct vectors with sum zero? Our upper bound for $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ in Theorem 1.1 is obtained by proving the following new upper bound for this additive combinatorics problem.

Theorem 1.2.

For every fixed $\varepsilon>0$ and every fixed prime $p$ , the following holds for all $n$ . For any subset $A\subseteq\mathbb{F}_{p}^{n}$ not containing distinct vectors $x_{1},\dots,x_{p}\in A$ with $x_{1}+\dots+x_{p}=0$ , we have $|A|\leq D_{\varepsilon,p}\cdot(C_{\varepsilon}p^{\varepsilon})^{n}$ . Here, $C_{\varepsilon}$ is a constant only depending on $\varepsilon$ , and $D_{\varepsilon,p}$ is a constant only depending on $\varepsilon$ and $p$ .

The previous bounds for $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ for fixed $m$ and large $n$ in [16, 18] have also been obtained by studying this additive combinatorics problem, but here we prove much stronger bounds for this problem and hence for $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ . There is extensive literature and research activity on zero-sum problems in abelian groups (e.g. see the survey [10]), and this additive combinatorics problem is one of the most central problems in this area.

In the case of $p=3$ , a subset $A\subseteq\mathbb{F}_{3}^{n}$ not containing three distinct vectors $x_{1},x_{2},x_{3}\in A$ with $x_{1}+x_{2}+x_{3}=0$ is precisely the same as a three-term progression-free subset $A\subseteq\mathbb{F}_{3}^{n}$ . The problem of determining the maximum possible size of such a subset is a very famous problem in additive combinatorics, called the cap-set problem. In 2017, Ellenberg and Gijsiwjt [7] achieved a breakthrough on this problem, proving that any subset $A\subseteq\mathbb{F}_{3}^{n}$ without a three-term arithmetic progression has size at most $2.756^{n}$ . Hence for $p=3$ , in Theorem 1.2 one has the bound $|A|\leq 2.756^{n}$ .

Ellenberg and Gijsiwjt [7] actually proved a more general result, bounding the size of a three-term progression-free subset $A\subseteq\mathbb{F}_{p}^{n}$ for any fixed prime $p\geq 3$ . Their proof relies on a new polynomial method that was introduced by Croot, Lev and Pach [5] just a few weeks earlier, and that was shortly afterwards reformulated and generalized by Tao [20] to what is now called the slice rank polynomial method. Since the slice rank polynomial gives the bound $|A|\leq 2.756^{n}$ in Theorem 1.2 for $p=3$ , it is very natural to also try to apply it in the setting of Theorem 1.2 for larger primes $p$ . Unfortunately, the proof for $p=3$ breaks down for larger $p$ , since the slice rank polynomial method cannot handle the distinctness condition for the vectors $x_{1},\dots,x_{p}$ in Theorem 1.2. This is because, with this distinctness condition, the relevant tensor is not a diagonal tensor anymore, and so one looses control over its slice rank.

The above-mentioned works of Naslund [16] and the first author [18] obtain a weaker upper bound in the setting of Theorem 1.2 by using certain manipulations (relying on combinatorial arguments) to reduce to the setting of diagonal tensors for applying the slice rank polynomial method (or variants thereof).

Here, we obtain a much stronger upper bound with a new approach that incorporates both (a consequence of) the slice rank polynomial method and a higher-uniformity version of the Balog–Szemerédi–Gowers Theorem. These different tools from additive combinatorics have not previously been combined, and we believe that there may be future potential in pursuing such an approach further.

In particular, our approach manages to break the “multi-colored barrier” for this problem. Indeed, as discussed in [18], for the “multi-colored” version of the setting in Theorem 1.2 the bound $C_{p}\cdot(2\sqrt{p})^{n}$ due to the first author is essentially tight (there is a lower bound of $\sqrt{p}^{n}$ for even $n$ ). Thus, an improvement of the bound beyond $\sqrt{p}^{n}$ needs to use the “single-set” setting in Theorem 1.2 in an essential way (with an argument that is not generalizable to the “multi-colored” setting). So far, most arguments relying on the slice rank polynomial method naturally generalized to the “multi-colored” setting, and so finding new approaches that are specific to the “single-set” setting has been a major challenge. In particular, this challenge also appears for the problem of improving the bounds of Ellenberg–Gijswijt [7] for the cap-set problem and more generally the problem of bounding the size of three-term progression-free subsets of $\mathbb{F}_{p}^{n}$ . These bounds are also (essentially) tight in the “multi-colored setting” (see [14]), and so an approach specific to the “single-set” setting would be needed to overcome this “multi-colored barrier”. Our approach of blending the slice rank polynomial method with the Balog–Szemerédi–Gowers Theorem is indeed specific “single-set setting”, and so we believe that it may be helpful in breaking the “multi-colored barrier” in other problems as well.

The particular higher-uniformity version of the Balog–Szemerédi–Gowers Theorem [2, 11] that we are using is due to Borenstein and Croot [3] and is stated in Section 4.2 (there are also other higher-uniformity versions, see in particular [19]). Besides this theorem and a (consequence of) the slice rank polynomial method (see Section 3.1), our proof also uses combinatorial and probabilistic arguments.

The best known lower bounds for $\mathfrak{s}(\mathbb{Z}_{m}^{n})$ for large $n$ are of the form $(m-1)\cdot c^{n}$ for all odd $m\geq 3$ with $c\approx 2.1398$ and are due to Edel [6] (in the case of $m=3$ , the constant is slightly better, namely $c\approx 2.2180$ due to Tyrell [21]). This in particular leads to the lower bound $\mathfrak{s}(\mathbb{Z}_{m}^{n})\geq 2.1398^{n}$ for any $m$ which is not a power of $2$ and large $n$ (if $m$ is a power of $2$ , then $\mathfrak{s}(\mathbb{Z}_{m}^{n})=(m-1)\cdot 2^{n}+1$ is known exactly). Here, the exponential base $2.1398$ is an absolute constant independent of $m$ . It is still an open question whether this is the right behaviour, or whether the correct exponential base should depend on $m$ :

Question 1.3.

Is there an absolute constant $c$ , such that for every fixed integer $m\geq 2$ we have $\mathfrak{s}(\mathbb{Z}_{m}^{n})\leq D_{m}\cdot c^{n}$ for all $n$ , where $D_{m}$ is a constant only depending on $m$ ?

Our upper bound in Theorem 1.2 does not give such a constant $c$ , but instead it gives a term of the form $m^{o(1)}$ (where the exponent $o(1)$ converges to zero for growing $m$ ). In the opposite regime, where the dimension $n$ is fixed, the second author [23] showed the bound $\mathfrak{s}(\mathbb{Z}_{m}^{n})\leq m\cdot 4^{n}$ if all prime factors of $m$ are sufficiently large with respect to $n$ .

Acknowledgements. The authors would like to thank Cosmin Pohoata for helpful conversations and for pointing out reference [3], as well as Jacob Fox for useful comments on an earlier version of this paper.

Notation. For a subset $A$ of an additively written abelian group (for us, the group will usually be $\mathbb{F}_{p}^{n}$ ), we write $\ell A=A+\dots+A=\{x_{1}+\dots+x_{\ell}\mid x_{1},\dots,x_{\ell}\in A\}$ , as usual. The cover number of a finite family $\mathcal{F}$ of non-empty subsets $X\subseteq S$ of some ground set $S$ is the size of the smallest subset $S^{\prime}\subseteq S$ such that $S^{\prime}$ intersects every set $X\in\mathcal{F}$ .

2 Proof Overview

2.1 Proof Structure

The Balog–Szemerédi–Gowers Theorem gives, for a subset $A\subseteq\mathbb{F}_{p}^{n}$ with many solutions to the equation $y_{1}+y_{2}=y_{1}^{\prime}+y_{2}^{\prime}$ with $y_{1},y_{2},y_{1}^{\prime},y_{2}^{\prime}\in A$ , a subset $A^{\prime}\subseteq A$ such that the sum-set $A^{\prime}+A^{\prime}$ is small. Similarly, the higher uniformity version of the theorem due to Borenstein–Croot [3] gives, under suitable conditions on $A$ , a subset $A^{\prime}\subseteq A$ such that the $\ell$ -fold sum-set $\ell A^{\prime}=A^{\prime}+\dots+A^{\prime}$ is small for certain $\ell$ . A priori, it is unclear how such a subset $A^{\prime}$ is useful for finding a solution to $x_{1}+\dots+x_{p}=0$ with distinct vectors $x_{1},\dots,x_{p}\in A$ .

The key for taking advantage of such a statement lies in the inductive setup for our proof, which allows us to incorporate the higher uniformity Balog–Szemerédi–Gowers Theorem in interplay with the slice rank polynomial method. The actual statement that we induct on is as follows.

Theorem 2.1.

For every $c>1$ , there is a positive integer $\ell$ and a constant $C(c)\geq 1$ such that for every sufficiently large prime $p$ (large enough in terms of $c$ ) there is a constant $D(c,p)\geq 1$ such that the following holds for all $n$ . If $A\subseteq\mathbb{F}_{p}^{n}$ is a subset of size $|A|\geq D(c,p)\cdot(C(c))^{n}$ with $|\ell A|=|A+\dots+A|\leq|A|^{c}$ , then $A$ contains $p$ distinct vectors $x_{1},\dots,x_{p}\in A$ with $x_{1}+\dots+x_{p}=0$ .

We prove this statement inductively (taking $c=h/2$ and inducting on $h=3,4,5,\dots$ ). Assuming Theorem 2.1, it is not difficult to deduce Theorem 1.2:

Proof of Theorem 1.2 assuming Theorem 2.1.

As in Theorem 1.2, let $\varepsilon>0$ be fixed. Noting that the statement in Theorem 1.2 is trivial for $\varepsilon\geq 1$ (since we always have $|A|\leq p^{n}$ ), we may assume that $0<\varepsilon<1$ . Now, let $c=1/\varepsilon$ , and let $\ell$ and $C(c)$ be as in Theorem 2.1. Let us choose $C_{\varepsilon}>C(c)$ large enough such that the statement in Theorem 2.1 holds for all primes $p\geq C_{\varepsilon}$ .

As in Theorem 1.2, let us now consider a prime $p$ . Note that for any prime $p<C_{\varepsilon}$ and any subset $A\subseteq\mathbb{F}_{p}^{n}$ we trivially have $|A|\leq p^{n}\leq(C_{\varepsilon}p^{\varepsilon})^{n}$ . Thus, we may assume that $p\geq C_{\varepsilon}$ , and so there is a constant $D(c,p)$ such that the statement in Theorem 2.1 holds. Let $D_{\varepsilon,p}=D(c,p)$ .

Suppose that $A\subseteq\mathbb{F}_{p}^{n}$ is a subset of size $|A|>D_{\varepsilon,p}\cdot(C_{\varepsilon}p^{\varepsilon})^{n}$ not containing distinct vectors $x_{1},\dots,x_{p}\in A$ with $x_{1}+\dots+x_{p}=0$ . Note that we have $|A|>D_{\varepsilon,p}\cdot(C_{\varepsilon}p^{\varepsilon})^{n}\geq D(c,p)\cdot(C(c))^{n}$ and

[TABLE]

Thus, we obtain a contradiction to Theorem 2.1. ∎

It is also not difficult to show that Theorem 1.2 implies our main result about Erdős–Ginzburg–Ziv constants in Theorem 1.1.

Proof of Theorem 1.1 assuming Theorem 1.2.

As in the statement of Theorem 1.1, let $\varepsilon>0$ and $m\geq 2$ be fixed. We have

[TABLE]

where the sum is over all prime factors $p$ of $m$ (see, for example, [9, Lemma 11]). On the other hand, for each prime factor $p$ of $m$ , we can bound $\mathfrak{s}(\mathbb{Z}_{p}^{n})$ using Theorem 1.2 as follows. Consider a sequence of elements of $\mathbb{Z}_{p}^{n}\cong\mathbb{F}_{p}^{n}$ without a subsequence of length $p$ summing to zero. Clearly, this sequence can contain at most $p-1$ copies of any particular vector in $\mathbb{F}_{p}^{n}$ . On the other hand, the set $A\subseteq\mathbb{F}_{p}^{n}$ of all vectors appearing at least once in the sequence does not contain $p$ distinct vectors summing to zero, and so by Theorem 1.2 we have $|A|\leq D_{\varepsilon,p}\cdot(C_{\varepsilon}p^{\varepsilon})^{n}$ . This means that the sequence has length at most $(p-1)\cdot D_{\varepsilon,p}\cdot(C_{\varepsilon}p^{\varepsilon})^{n}$ and hence $\mathfrak{s}(\mathbb{Z}_{p}^{n})\leq(p-1)\cdot D_{\varepsilon,p}\cdot(C_{\varepsilon}p^{\varepsilon})^{n}+1\leq(p-1)\cdot(D_{\varepsilon,p}+1)\cdot(C_{\varepsilon}p^{\varepsilon})^{n}$ . Thus, we obtain

[TABLE]

where the sum is again over all prime factors $p$ of $m$ . Thus, the desired statement holds when taking the constant $D_{\varepsilon,m}$ in Theorem 1.1 to be the sum of $m\cdot(D_{\varepsilon,p}+1)$ for the constants $D_{\varepsilon,p}$ in Theorem 1.2 over all prime factors $p$ of $m$ (and taking $C_{\varepsilon}$ to be the same constant as in Theorem 1.2). ∎

The main difficulty is of course to prove Theorem 2.1, and the rest of this paper is devoted to this. We start by giving an outline of the main ideas of the proof in the next subsection.

2.2 Outline of proof of Theorem 2.1

Noting that Theorem 2.1 gets strictly stronger as we increase $c$ , we may assume that $c=h/2$ for an integer $h\geq 3$ . We will then prove Theorem 2.1 by induction on $h$ .

To prove the theorem, we need to show that there is a solution to $x_{1}+\dots+x_{p}=0$ with $p$ distinct vectors $x_{1},\dots,x_{p}\in A$ . Our strategy for finding such a solution is to start with a solution to $x_{1}+\dots+x_{p}=0$ where $x_{1},\dots,x_{p}\in A$ are not necessarily distinct, and then modify this solution to make $x_{1},\dots,x_{p}$ distinct. More specifically, to modify the solution we split the vectors $x_{1},\dots,x_{p}$ into $\ell$ -tuples (and a small remainder of fewer than $\ell$ vectors), and then we replace each $\ell$ -tuple by another $\ell$ -tuple of vectors in $A$ with the same sum. Since at every step we replace $\ell$ vectors in $A$ with $\ell$ different vectors in $A$ with the same sum, the sum $x_{1}+\dots+x_{p}$ does not change throughout this process, and so at every step our vectors $x_{1},\dots,x_{p}\in A$ form a solution to $x_{1}+\dots+x_{p}=0$ . The difficulty is, however, to obtain a solution where $x_{1},\dots,x_{p}$ are distinct.

It turns out that using the slice rank polynomial method and some combinatorial arguments, we can ensure that at the start of our process we have a solution to $x_{1}+\dots+x_{p}=0$ with $x_{1},\dots,x_{p}\in A$ that can be split into $\ell$ -tuples (and fewer than $\ell$ remaining vectors) in such a way that each $\ell$ -tuple consists of $\ell$ distinct vectors (and also such that the vectors in the remainder are distinct from each other). Our aim in each step of the process is now to replace one of the $\ell$ -tuples with a different $\ell$ -tuple of distinct vectors in $A$ with the same sum, such that the $\ell$ vectors in the new $\ell$ -tuple are distinct from all the other vectors appearing among our solution to $x_{1}+\dots+x_{p}=0$ at that step. If we can do this step by step for each $\ell$ -tuple, this greedy procedure will lead to a solution to $x_{1}+\dots+x_{p}=0$ with distinct vectors $x_{1},\dots,x_{p}\in A$ .

Of course, it may happen that at some step, when we want to replace a certain $\ell$ -tuple with sum $w\in\mathbb{F}_{p}^{n}$ , we cannot find an $\ell$ -tuple of distinct vectors in $A$ with the same sum $w$ which is disjoint from all vectors currently appearing in our solution to $x_{1}+\dots+x_{p}=0$ . In this case, every subset $\{y_{1},\dots,y_{\ell}\}\subseteq A$ consisting of distinct vectors $y_{1},\dots,y_{\ell}\in A$ with sum $y_{1}+\dots+y_{\ell}=w$ must contain one of the vectors $x_{1},\dots,x_{p}$ . Hence the family of subsets $\{y_{1},\dots,y_{\ell}\}\subseteq A$ with distinct elements $y_{1},\dots,y_{\ell}\in A$ with sum $y_{1}+\dots+y_{\ell}=w$ must have a cover of size at most $p$ . We call a vector $w\in\mathbb{F}_{p}^{n}$ bad if this happens. Furthermore, we call an $\ell$ -tuple of $\ell$ distinct vectors in $A$ bad, if the sum of the $\ell$ vectors is bad. Then at every step of our process, if the relevant $\ell$ -tuple is not bad, we will be able to replace it in the desired way.

Thus, if at the start of our process each of the $\ell$ -tuples into which we split our starting solution to $x_{1}+\dots+x_{p}=0$ is not a bad $\ell$ -tuple, we will be able to run this modification process for each of the $\ell$ -tuples and obtain a solution to $x_{1}+\dots+x_{p}=0$ with distinct vectors $x_{1},\dots,x_{p}\in A$ in the end. So it suffices to find a solution to $x_{1}+\dots+x_{p}=0$ with $x_{1},\dots,x_{p}\in A$ that can be spit into $\ell$ -tuples in such a way, that each of these $\ell$ -tuples consists of $\ell$ distinct vectors and is not bad (and such that the fewer than $\ell$ remaining vectors are distinct from each other).

If there are only few bad $\ell$ -tuples $(y_{1},\dots,y_{\ell})\in A^{\ell}$ , then by a probabilistic subset sampling argument there is a relatively large subset $A^{\prime}\subseteq A$ such that there exists no bad $\ell$ -tuple $(y_{1},\dots,y_{\ell})\in A^{\ell}$ with $y_{1},\dots,y_{\ell}\in A^{\prime}$ . Then, relying on the slice rank polynomial method and further combinatorial arguments, one can find a solution to $x_{1}+\dots+x_{p}=0$ with $x_{1},\dots,x_{p}\in A^{\prime}$ that can be split into $\ell$ -tuples in the desired way (and then automatically none of these $\ell$ -tuples will be bad). Applying our process as discussed above, we can turn this into a solution to $x_{1}+\dots+x_{p}=0$ with $x_{1},\dots,x_{p}\in A$ such that $x_{1},\dots,x_{p}$ are distinct.

For the induction beginning, i.e. the case $c=3/2$ in Theorem 2.1, this already suffices. In fact, if $c=3/2$ , we can take $\ell=2$ in Theorem 2.1 and observe that then by the assumption $|A+A|\leq|A|^{3/2}$ the number of bad $2$ -tuples is at most $2p\cdot|A|^{3/2}$ (indeed, every possible sum $w=x_{1}+x_{2}$ can lead to at most $2p$ bad $2$ -tuples $(x_{1},x_{2})\in A^{2}$ with $x_{1}+x_{2}=w$ ). Thus, there are only few bad $2$ -tuples in $A^{2}$ , and the above argument applies.

In the induction step, we also need to consider a second case, namely that there are many bad $\ell$ -tuples $(x_{1},\dots,x_{\ell})\in A^{\ell}$ . For each bad $\ell$ -tuple $(x_{1},\dots,x_{\ell})$ , the sum $w=x_{1}+\dots+x_{\ell}$ is bad, and one of the vectors $x_{1},\dots,x_{\ell}$ is in the cover of size at most $p$ of the family of all $\ell$ -element subsets of $A$ with sum $w$ . Thus, upon reordering the vectors, every bad $\ell$ -tuple can be rewritten as $(y_{1},\dots,y_{\ell-1},z)$ where $z$ is in the cover of size at most $p$ of the family of all $\ell$ -element subsets of $A$ with sum $y_{1}+\dots+y_{\ell-1}+z$ . By our assumption $|\ell A|\leq|A|^{c}$ in Theorem 2.1, there are not so many possibilities for the sum $y_{1}+\dots+y_{\ell}+z$ , and for each of these possibilities there are at most $p$ choices for $z$ . One can now show that there must be many bad $\ell$ -tuples $(y_{1},\dots,y_{\ell-1},z)\in A^{\ell}$ , for which the sums $y_{1}+\dots+y_{\ell-1}$ are concentrated on relatively few possible values. This means that one can apply the higher uniformity Balog–Szemerédi–Gowers Theorem due to Borenstein–Croot [3] (whose precise statement is given in Theorem 4.2). This theorem (with suitably chosen parameters) implies that there is a relatively large subset $A^{\prime}\subseteq A$ with $|\ell^{\prime}A^{\prime}|\leq|A|^{c-1/2}$ for the value $\ell^{\prime}$ such that Theorem 2.1 holds for $c^{\prime}:=c-1/2$ with $\ell^{\prime}$ instead of $\ell$ (recall that Theorem 2.1 holds for $c-1/2=(h-1)/2$ by our induction hypothesis). Now, by the conclusion of Theorem 2.1 for $c^{\prime}$ and and $\ell^{\prime}$ , we can find a solution to $x_{1}+\dots+x_{p}=0$ with distinct vectors $x_{1},\dots,x_{p}\in A^{\prime}\subseteq A$ .

This finishes the outline of our proof of Theorem 2.1. The actual proof can be found in Section 4, after some preparations for the proof in Section 3.

3 Preparations

3.1 Solutions with not too many repetitions

In this subsection, we show that any large subset $A\subseteq\mathbb{F}_{p}^{n}$ must contain a solution to the equation $x_{1}+\dots+x_{p}=0$ with $x_{1},\dots,x_{p}\in A$ , such that no vector appears a lot of times among $x_{1},\dots,x_{p}$ . The precise statement is given in Proposition 3.2 below. This will help us to find a solution to $x_{1}+\dots+x_{p}=0$ with $x_{1},\dots,x_{p}\in A$ , such that $(x_{1},\dots,x_{p})$ can be split into $\ell$ -tuples (and fewer than $\ell$ remaining vectors) such that each $\ell$ -tuple consists of $\ell$ distinct vectors.

The proof of our proposition uses the $k$ -coloured Sum-Free Theorem, which can be proved with the slice rank polynomial method. As mentioned in the introduction, the slice rank polynomial method was introduced by Tao [20] following work of Croot–Lev–Pach [5] and Ellenberg–Gijswijt [7]. A proof of the $k$ -coloured Sum-Free Theorem following this method can be found in [15].

Theorem 3.1 ( $k$ -coloured Sum-Free Theorem).

Let $k\geq 3$ be an integer, and let $p$ be a prime. For some positive integer $n$ , let $(x_{1}^{(j)},\dots,x_{k}^{(j)})\in\mathbb{F}_{p}^{n}\times\dots\times\mathbb{F}_{p}^{n}$ for $j=1,\dots,L$ be a list of $k$ -tuples of vectors in $\mathbb{F}_{p}^{n}$ . Suppose that for all $j_{1},\dots,j_{k}\in\{1,\dots,L\}$ , we have

[TABLE]

Then we must have $L\leq(\Gamma_{p,k})^{n}$ , where

[TABLE]

As an immediate consequence of the $k$ -coloured Sum-Free Theorem one can show that every subset $A\subseteq\mathbb{F}_{p}^{n}$ of size $|A|\geq 4^{n}>(\Gamma_{p,p})^{n}$ must contain a solution to the equation $y_{1}+\dots+y_{p}=0$ such that $y_{1},\dots,y_{p}\in A$ are not all equal. In other words, we obtain a solution such that every vector in $\mathbb{F}_{p}^{n}$ appears at most $p-1$ times among $y_{1},\dots,y_{p}$ .

The statement of the following proposition is somewhat similar, showing that every large enough subset $A\subseteq\mathbb{F}_{p}^{n}$ contains a solution to $y_{1}+\dots+y_{p}=0$ , such that every vector in $\mathbb{F}_{p}^{n}$ appears at most $\lambda p$ times among $y_{1},\dots,y_{p}$ (for some fixed $0<\lambda\leq 1$ ).

Proposition 3.2.

For every fixed $0<\lambda\leq 1$ , there exists a constant $C^{\prime}_{\lambda}\geq 1$ such that for every prime $p>1/\lambda$ and every positive integer $n$ the following holds. If $A\subseteq\mathbb{F}_{p}^{n}$ is a subset of size $|A|>p^{2}\cdot(C^{\prime}_{\lambda})^{n}$ , there exist vectors $y_{1},\dots,y_{p}\in A$ with $y_{1},+\dots+y_{p}=0$ such that every vector in $\mathbb{F}_{p}^{n}$ appears among $y_{1},\dots,y_{p}$ at most $\lambda p$ times.

Proof.

We define

[TABLE]

Note that then for all primes $p>1/\lambda$ we have $\lceil\lambda p\rceil+1\geq 3$ and

[TABLE]

and furthermore also

[TABLE]

Now, as in the statement of the proposition, let $A\subseteq\mathbb{F}_{p}^{n}$ be a subset of size $|A|\geq p^{2}\cdot(C^{\prime}_{\lambda})^{n}$ . Let us suppose for contradiction that for any solution to the equation $y_{1}+\dots+y_{p}=0$ with $y_{1},\dots,y_{p}\in A$ , there is a vector appearing among $y_{1},\dots,y_{p}$ at least $\lceil\lambda p\rceil$ times.

For every solution $(y_{1},\dots,y_{p})\in A^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ let us consider the number $|\{y_{1},\dots,y_{p}\}|$ , i.e. the number of distinct vectors appearing among $y_{1},\dots,y_{p}$ (this number is in $\{1,\dots,p\}$ ). Let us say that two solutions $(y_{1},\dots,y_{p}),(y^{\prime}_{1},\dots,y^{\prime}_{p})\in A^{p}$ to this equation are disjoint if no vector appears in both of them.

Claim 3.3.

There exists a number $\ell\in\{1,\dots,p\}$ and a subset $A^{\prime}\subseteq A$ satisfying the following two conditions:

(i)

There is a collection of more than $(C^{\prime}_{\lambda})^{n}$ pairwise disjoint solutions $(y_{1},\dots,y_{p})\in(A^{\prime})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ with $|\{y_{1},\dots,y_{p}\}|=\ell$ .

(ii)

Every solution $(y_{1},\dots,y_{p})\in(A^{\prime})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ satisfies $|\{y_{1},\dots,y_{p}\}|\leq\ell$ .

Proof.

Let us define a sequence of subsets $A^{\prime}_{p}\supseteq A^{\prime}_{p-1}\supseteq\dots\supseteq A^{\prime}_{\ell}$ of $A\subseteq\mathbb{F}_{p}^{n}$ for some $\ell\in\{0,\dots,p\}$ with the following recursive process. Throughout this process we will ensure that for every $j=\ell,\dots,p$ , every solution $(y_{1},\dots,y_{p})\in(A^{\prime}_{j})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ satisfies $|\{y_{1},\dots,y_{p}\}|\leq j$ .

We start by defining $A^{\prime}_{p}=A^{\prime}$ . Clearly every solution $(y_{1},\dots,y_{p})\in(A^{\prime}_{p})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ satisfies $|\{y_{1},\dots,y_{p}\}|\leq p$ .

Suppose that for some index $1\leq j\leq p$ , we have already defined the set $A^{\prime}_{j}\subseteq A\subseteq\mathbb{F}_{p}^{n}$ with the property that every solution $(y_{1},\dots,y_{p})\in(A^{\prime}_{j})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ satisfies $|\{y_{1},\dots,y_{p}\}|\leq j$ .

Let us now consider a maximal collection of pairwise disjoint solutions $(y_{1},\dots,y_{p})\in(A^{\prime}_{j})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ with $|\{y_{1},\dots,y_{p}\}|=j$ . If this maximal collection has size larger than $(C^{\prime}_{\lambda})^{n}$ , then let us terminate the process and define $\ell=j$ (then we do not need to define another set $A^{\prime}_{j-1}$ ). Otherwise, this maximal collection has size at most $(C^{\prime}_{\lambda})^{n}$ and so there are at most $p\cdot(C^{\prime}_{\lambda})^{n}$ different vectors appearing in one of the solutions $(y_{1},\dots,y_{p})$ in our collection in $(A^{\prime}_{j})^{p}$ . Now, let the set $A^{\prime}_{j-1}$ be obtained from $A^{\prime}_{j}$ by deleting all the vectors appearing in some solution in the collection. Note that then, by the maximality of the chosen collection, no solutions $(y_{1},\dots,y_{p})$ to $y_{1}+\dots+y_{p}=0$ with $|\{y_{1},\dots,y_{p}\}|=j$ remain. Hence in the set $A^{\prime}_{j-1}$ every solution $(y_{1},\dots,y_{p})\in(A^{\prime}_{j_{1}})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ satisfies $|\{y_{1},\dots,y_{p}\}|\leq j-1$ .

This process defines subsets $A^{\prime}_{p}\supseteq A^{\prime}_{p-1}\supseteq\dots\supseteq A^{\prime}_{\ell}$ of $A\subseteq\mathbb{F}_{p}^{n}$ for some $\ell\in\{0,\dots,p\}$ . Note that at every step of the process we delete at most $p\cdot(C^{\prime}_{\lambda})^{n}$ vectors, meaning that $|A^{\prime}_{j-1}|\geq|A^{\prime}_{j}|-p\cdot(C^{\prime}_{\lambda})^{n}$ for $\ell+1\leq j\leq p$ . This implies that

[TABLE]

so the final set $A^{\prime}_{\ell}$ is non-empty.

We claim that $\ell\neq 0$ . Indeed, if we had $\ell=0$ , then $A^{\prime}_{0}$ would be a non-empty subset of $\mathbb{F}_{p}^{n}$ such that every solution $(y_{1},\dots,y_{p})\in(A^{\prime}_{j_{1}})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ satisfies $|\{y_{1},\dots,y_{p}\}|\leq 0$ . This is a contradiction, since for any $y\in A^{\prime}_{0}$ we can form a solution solution $(y_{1},\dots,y_{p})\in(A^{\prime}_{0})^{p}$ to $y_{1}+\dots+y_{p}=0$ by taking $(y_{1},\dots,y_{p})=(y,\dots,y)$ and then we have $|\{y_{1},\dots,y_{p}\}|=1$ . Thus, we must have $\ell\in\{1,\dots,p\}$ .

This means that the process above terminated with the set $A^{\prime}_{\ell}$ , which means that $A^{\prime}_{\ell}$ contains a collection of more than $(C^{\prime}_{\lambda})^{n}$ pairwise disjoint solutions $(y_{1},\dots,y_{p})\in(A^{\prime}_{\ell})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ with $|\{y_{1},\dots,y_{p}\}|=\ell$ . Thus, taking $A^{\prime}=A^{\prime}_{\ell}$ , condition (i) is satisfied. Furthermore, condition (ii) is satisfied, since throughout the process we maintained the property that every solution $(y_{1},\dots,y_{p})\in(A^{\prime}_{\ell})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ satisfies $|\{y_{1},\dots,y_{p}\}|\leq\ell$ . ∎

As in Claim 3.3, let us choose $\ell\in\{1,\dots,p\}$ and $A^{\prime}\subseteq A$ satisfying conditions (i) and (ii). By condition (i), there exists a collection $\mathcal{C}\subseteq(A^{\prime})^{p}$ of $|\mathcal{C}|>(C^{\prime}_{\lambda})^{n}$ pairwise disjoint solutions $(y_{1},\dots,y_{p})\in(A^{\prime})^{p}$ to the equation $y_{1}+\dots+y_{p}=0$ with $|\{y_{1},\dots,y_{p}\}|=\ell$ . By our assumption, for each of these solutions there is a vector appearing among $y_{1},\dots,y_{p}$ at least $\lceil\lambda p\rceil$ times. So let us define $k=\lceil\lambda p\rceil+1$ (then $k-1=\lceil\lambda p\rceil$ ) and let us re-order the vectors in each solution $(y_{1},\dots,y_{p})\in\mathcal{C}$ in our collection in such a way that $y_{1}=\dots=y_{k-1}$ .

Let $L=|\mathcal{C}|>(C^{\prime}_{\lambda})^{n}$ , and let $(y_{1}^{(j)},\dots,y_{p}^{(j)})$ for $j=1,\dots,L$ be the $p$ -tuples in $\mathcal{C}\subseteq(A^{\prime})^{p}$ . Then for every $j=1,\dots,L$ we have $y_{1}^{(j)}+\dots+y_{p}^{(j)}=0$ and $|\{y_{1}^{(j)},\dots,y_{p}^{(j)}\}|=\ell$ as well as $y^{(j)}_{1}=\dots=y^{(j)}_{k-1}$ . This implies that $|\{y_{k}^{(j)},\dots,y_{p}^{(j)}\}\setminus\{y_{1}^{(j)}\}|\geq\ell-1$ . Furthermore, for distinct $j,j^{\prime}\in\{1,\dots,L\}$ the $p$ -tuples $(y_{1}^{(j)},\dots,y_{p}^{(j)})$ and $(y_{1}^{(j^{\prime})},\dots,y_{p}^{(j^{\prime})})$ are disjoint, and so we can conclude that

[TABLE]

for $j=1,\dots,L$ .

Suppose we have $\lceil\lambda p\rceil=p$ . Then by our assumption for any solution to the equation $y_{1}+\dots+y_{p}=0$ with $y_{1},\dots,y_{p}\in A$ , there is a vector appearing among $y_{1},\dots,y_{p}$ at least $\lceil\lambda p\rceil=p$ times. In other words, for any solution to the equation $y_{1}+\dots+y_{p}=0$ with $y_{1},\dots,y_{p}\in A$ we must have $y_{1}=\dots=y_{p}$ . This implies that there cannot be any solution of the form $(y_{1}^{(j_{1})},\dots,y_{p}^{(j_{p})})$ with $y_{1}^{(j_{1})}+\dots+y_{p}^{(j_{p})}=0$ where $j_{1},\dots,j_{p}\in\{1,\dots,L\}$ are not all equal (indeed, if $j_{i}\neq j_{i^{\prime}}$ , then $y_{i}^{(j_{i})}\neq y_{i^{\prime}}^{(j_{i^{\prime}})}$ as $(y_{1}^{(j_{i})},\dots,y_{p}^{(j_{i})})$ and $(y_{1}^{(j_{i^{\prime}})},\dots,y_{p}^{(j_{i^{\prime}})})$ disjoint). Hence the $p$ -tuples $(y_{1}^{(j)},\dots,y_{p}^{(j)})$ for $j=1,\dots,L$ satisfy the assumptions of Theorem 3.1 for $k=p$ . On the other hand, we have $L>(C^{\prime}_{\lambda})^{n}\geq(\Gamma_{p,p})^{n}$ which is a contradiction to the conclusion of Theorem 3.1.

So we may now assume that $2\leq\lceil\lambda p\rceil\leq p-1$ , meaning that $2\leq k-1\leq p-1$ . For $j=1,\dots,L$ , let us define a $k$ -tuple $(x_{1}^{(j)},\dots,x_{k}^{(j)})\in\mathbb{F}_{p}^{n}\times\dots\times\mathbb{F}_{p}^{n}$ by setting $x_{i}^{(j)}=y_{i}^{(j)}=y_{1}^{(j)}$ for $i=1,\dots,k-1$ and

[TABLE]

Note that then we have $x_{1}^{(j)}+\dots+x_{k}^{(j)}=y_{1}^{(j)}+\dots+y_{k-1}^{(j)}+y_{k}^{(j)}+\dots+y_{p}^{(j)}=0$ for $j=1,\dots,L$ . Since $L>(C^{\prime}_{\lambda})^{n}\geq(\Gamma_{p,k})^{n}$ , from Theorem 3.1 we can conclude that there must exist $j_{1},\dots j_{k}\in\{1,\dots,L\}$ with $x_{1}^{(j_{1})}+\dots+x_{k}^{(j_{k})}=0$ such that $j_{1},\dots,j_{k}$ are not all equal.

Suppose we have $j_{1}=\dots=j_{k-1}$ , then let $j=j_{1}=\dots=j_{k-1}$ and observe that $j_{k}\neq j$ (since $j_{1},\dots,j_{k}$ are not all equal). But now we have

[TABLE]

and since $k-1\neq 0$ in $\mathbb{F}_{p}$ this implies that $y_{1}^{(j)}=y_{1}^{(j_{k})}$ . But this is a contradiction since the $p$ -tuples $(y_{1}^{(j)},\dots,y_{p}^{(j)})$ and $(y_{1}^{(j_{k})},\dots,y_{p}^{(j_{k})})$ are disjoint.

So let us now assume that the indices $j_{1},\dots,j_{k-1}$ are not all equal. Then the set

[TABLE]

has size at least $2$ (since the $p$ -tuples $(x_{1}^{(j)},\dots,x_{p}^{(j)})$ for $j=1,\dots,L$ are pairwise disjoint). Now, we have

[TABLE]

So $(y_{1}^{(j_{1})},\dots,y_{1}^{(j_{k-1})},y_{k}^{(j)},\dots,y_{p}^{(j)})\in(A^{\prime})^{p}$ satisfies $y_{1}^{(j_{1})}+\dots+y_{1}^{(j_{k-1})}+y_{k}^{(j)}+\dots+y_{p}^{(j)}=0$ and

[TABLE]

where for the second inequality we used (3.1). But this is a contradiction to condition (ii) in our choice of $\ell$ and $A^{\prime}$ as in Claim 3.3. This finishes the proof of Proposition 3.2.∎

3.2 Partitioning into rainbow sets

This subsection proves the following combinatorial lemma about partitioning a set with coloured elements into rainbow subsets. This, together with the results from the last subsection, allows us to split our solution $(x_{1},\dots,x_{p})$ of $x_{1}+\dots+x_{p}$ into $\ell$ -tuples, each consisting of $\ell$ distinct vectors, in the desired way.

Lemma 3.4.

Let $1\leq\ell\leq k$ be integers, and consider a colouring of a set $S$ of size $|S|=k$ (assigning each of the elements in $S$ a colour). Suppose that each colour occurs at most $k/\ell$ times. Then there is a partition $S=S_{1}\cup\dots\cup S_{\lfloor k/\ell\rfloor}\cup S_{\lfloor k/\ell\rfloor+1}$ with $|S_{j}|=\ell$ for $j=1,\dots,\lfloor k/\ell\rfloor$ and $|S_{\lfloor k/\ell\rfloor+1}|=k-\lfloor k/\ell\rfloor\cdot\ell$ such that for each $j=1,\dots,\lfloor k/\ell\rfloor+1$ all elements of $S_{j}$ have distinct colours.

Proof.

Let $m=\lfloor k/\ell\rfloor$ , and $r=k-m\ell\in\{0,\dots,\ell-1\}$ . Let us label the elements of the set $S$ as $s_{1},\ldots,s_{k}$ in such a way that each colour class forms a block of consecutive elements. It now suffices to show that we can find a partition $\{1,\dots,k\}=I_{1}\cup\dots\cup I_{m}\cup I_{m+1}$ with $|I_{j}|=\ell$ for $j=1,\dots,m$ and $|S_{m+1}|=r$ such that for each $j=1,\dots,m+1$ we have $|x-y|\geq m$ for any two distinct elements $x,y\in S_{j}$ . Indeed, then we can define $S_{j}=\{s_{x}\mid x\in I_{j}\}$ for $j=1,\dots,m+1$ . Note that then for any $j=1,\dots,m+1$ and any distinct $x,y\in I_{j}$ , the elements $s_{x}$ and $s_{y}$ cannot have the same colour, since otherwise all elements between $s_{x}$ and $s_{y}$ in the list $s_{1},\ldots,s_{k}$ would also be of that colour, and so the colour would appear at least $m+1>k/\ell$ times since $|x-y|\geq m$ .

To define the desired partition $\{1,\dots,k\}=\{1,\dots,m\ell+r\}=I_{1}\cup\dots\cup I_{m}\cup I_{m+1}$ , let $I_{m+1}=\{m+1,2m+2,\dots,rm+r\}$ and $I_{j}=\{x\in\{1,\dots,k\}\mid x\equiv j\pmod{m}\}\setminus I_{m+1}$ for $j=1,\dots,m$ . It is not hard to see that $|I_{j}|=\ell$ for $j=1,\dots,m$ and $|S_{m+1}|=r$ . Furthermore, for any two distinct $x,y\in S_{m+1}$ , we have $|x-y|\geq m+1>m$ , since $x$ and $y$ are both multiples of $m+1$ . For any $j=1,\dots,m$ and any two distinct $x,y\in S_{j}$ , we have $|x-y|\geq m$ , since $x\equiv j\equiv y\pmod{m}$ . ∎

4 Proof of Theorem 2.1

In this section, we finally prove Theorem 2.1. Note that if the theorem holds for some $c>1$ , then it also holds for all smaller values of $c$ . Hence it suffices to prove the theorem for $c=h/2$ for all integers $h\geq 3$ .

We prove Theorem 2.1 for $c=h/2$ for $h=3,4,\dots$ by induction on $h$ . The first subsection of this section contains the induction beginning $h=3$ , and the second subsection contains the induction step.

4.1 Induction beginning $\boldsymbol{c=3/2}$

As the starting point of our induction, let us prove Theorem 2.1 for $c=3/2$ . The following lemma shows that the desired statement holds for $\ell=2$ and $C(c)=(C^{\prime}_{1/2})^{2}$ (with $C^{\prime}_{1/2}$ as in Proposition 3.2) and $D(p,3/2)=9p^{6}$ for any prime $p\geq 3$ .

Lemma 4.1.

Let $C^{\prime}_{1/2}\geq 1$ be as in Proposition 3.2 for $\lambda=1/2$ . Let $p\geq 3$ be a prime and let $n$ be a positive integer. Suppose that $A\subseteq\mathbb{F}_{p}^{n}$ is a subset of $\mathbb{F}_{p}^{n}$ such that $|A|\geq 9p^{6}\cdot(C^{\prime}_{1/2})^{2n}$ and $|A+A|\leq|A|^{3/2}$ . Then $A$ contains $p$ distinct vectors $x_{1},\dots,x_{p}\in A$ with $x_{1}+\dots+x_{p}=0$ .

Proof.

Let us say that a pair $\{x,y\}\subseteq A$ with $x\neq y$ is bad if there are at most $p$ pairs $\{x^{\prime},y^{\prime}\}\subseteq A$ with $x^{\prime}\neq y^{\prime}$ satisfying $x^{\prime}+y^{\prime}=x+y$ . In other words, a pair $\{x,y\}\subseteq A$ is bad if there are at most $p$ different ways to write $x+y$ as a sum of two distinct elements of $A$ . Note that every element in $A+A$ can occur as the sum of at most $p$ bad pairs (since otherwise the pairs with this sum would not be bad). Thus, there can be at most $p\cdot|A+A|\leq p\cdot|A|^{3/2}$ bad pairs $\{x,y\}\subseteq A$ .

Let us consider the graph with vertex set $A$ where for distinct $x,y\in A$ we draw an edge between $x$ and $y$ if and only if $\{x,y\}$ is a bad pair. Then this graph has at most $p\cdot|A|^{3/2}$ edges, and hence has average degree at most $2p\cdot|A|^{1/2}$ . Thus, by the well-known Caro–Wei bound [4, 22], the graph has an independent set of size at least

[TABLE]

So let $A^{\prime}\subseteq A\subseteq\mathbb{F}_{p}^{n}$ be a subset of size $|A^{\prime}|>p^{2}\cdot(C^{\prime}_{1/2})^{n}$ such that there does not exist a bad pair $\{x,y\}\subseteq A$ with $x,y\in A^{\prime}$ .

By Proposition 3.2 for $\lambda=1/2$ , there exist vectors $y_{1},\dots,y_{p}\in A^{\prime}$ with $y_{1}+\dots+y_{p}=0$ such that every vector in $\mathbb{F}_{p}^{n}$ appears among $y_{1},\dots,y_{p}$ at most $p/2$ times.

Let us now consider a colouring of the set $\{1,\dots,p\}$ where the colours correspond to the different vectors appearing among $y_{1},\dots,y_{p}$ . In other words, two indices $i,j\in\{1,\dots,p\}$ receive the same colour if and only if $y_{i}=y_{j}$ . Then every colour appears at most $p/2$ times on the set $\{1,\dots,p\}$ , and so by Lemma 3.4, there exists a partition of $\{1,\dots,p\}$ into sets $\{i_{s},j_{s}\}$ of size $|\{i(s),j(s)\}|=2$ for $s=1,\dots,(p-1)/2$ and one set $\{t\}$ of size $1$ such that all of the sets in this partition are rainbow.

In other words, we can split the list of vectors $y_{1},\dots,y_{p}\in A^{\prime}$ into pairs $\{y_{i(s)},y_{j(s)}\}$ with $y_{i(s)}\neq y_{j(s)}$ for $s=1,\dots,(p-1)/2$ and one remaining vector $y_{t}$ . Recalling that $A^{\prime}$ does not contain any bad pair, we observe that the pairs $\{y_{i(s)},y_{j(s)}\}\subseteq A^{\prime}$ for $s=1,\dots,(p-1)/2$ are not bad.

Let $x_{t}=y_{t}$ . We will now replace each pair $\{y_{i(s)},y_{j(s)}\}\subseteq A^{\prime}$ by a pair $\{x_{i(s)},x_{j(s)}\}\subseteq A$ with the same sum in order to construct a solution $(x_{1},\dots,x_{p})\in A^{k}$ to the equation $x_{1}+\dots+x_{p}=0$ with distinct vectors $x_{1},\dots,x_{p}$ . To do this, consider the indices $s=1,\dots,(p-1)/2$ one by one. For each index $s$ , consider the sum $y_{i(s)}+y_{j(s)}\in\mathbb{F}_{p}^{n}$ . Since $\{y_{i(s)},y_{j(s)}\}$ is not bad, there are at least $p$ pairs $\{x^{\prime},y^{\prime}\}\subseteq A$ with $x^{\prime}\neq y^{\prime}$ and $x^{\prime}+y^{\prime}=y_{i(s)}+y_{j(s)}$ . These pairs must all be disjoint (since knowing $x^{\prime}$ and the sum $x^{\prime}+y^{\prime}=y_{i(s)}+y_{j(s)}$ already determines $y^{\prime}$ ), and so there must be at least one pair $\{x^{\prime},y^{\prime}\}\subseteq A$ with $x^{\prime}\neq y^{\prime}$ and $x^{\prime}+y^{\prime}=y_{i(s)}+y_{j(s)}$ which does not contain $x_{t}$ or any of the $2s-2\leq p-3$ vectors in the already chosen pairs $\{x_{i(1)},x_{j(1)}\},\dots,\{x_{i(s-1)},x_{j(s-1)}\}$ . So let us choose $\{x_{i(s)},x_{j(s)}\}\subseteq A$ to be such a pair. Doing this step by step for $s=1,\dots,(p-1)/2$ , we obtain a $p$ -tuple $(x_{1},\dots,x_{p})\in A^{p}$ . By construction, the vectors $x_{1},\dots,x_{p}\in A$ are distinct and we have

[TABLE]

as desired. ∎

4.2 Induction step

For the induction step in the proof of Theorem 2.1, we will use the following result of Borenstein and Croot [3, Theorem 4], which is a higher-uniformity version of the Balog–Szemerédi–Gowers Theorem [2, 11].

Theorem 4.2 ([3]).

For every $0<\varepsilon<1/2$ and $c>1$ , there exists $\delta>0$ such that the following holds for all sufficiently large $k$ and all sufficiently large finite subsets $A$ of an additively written abelian group. If $S\subseteq A^{k}$ is a subset satisfying $|S|\geq|A|^{k-\delta}$ and $|\{y_{1}+\dots+y_{k}\mid(y_{1},\dots,y_{k})\in S\}|\leq|A|^{c}$ , then there is a subset $A^{\prime}\subseteq A$ of size $|A^{\prime}|\geq|A^{\prime}|^{1-\varepsilon}$ such that $|\ell^{\prime}A^{\prime}|=|A^{\prime}+\dots+A^{\prime}|\leq|A^{\prime}|^{c(1+\varepsilon\ell^{\prime})}$ for all positive integers $\ell^{\prime}$ .

In order to perform the induction step for the proof of Theorem 2.1, let us now assume that $c=h/2$ for an integer $h\geq 4$ and that we have already proved Theorem 2.1 for $c^{\prime}=(h-1)/2=c-(1/2)$ . Note that $c\geq 2$ , and define $c^{\prime\prime}=c-(3/4)>1$ .

Let us take a positive integer $\ell^{\prime}$ as in Theorem 2.1 for $c^{\prime}=c-(1/2)$ . Let us now choose $0<\varepsilon<1/2$ small enough (depending on $c$ ) such that

[TABLE]

Let us now apply Theorem 4.2 to $0<\varepsilon<1/2$ and $c^{\prime\prime}>1$ . We obtain some $\delta>0$ and some positive integer $k$ such that the statement in Theorem 4.2 holds for all sufficiently large $A$ . By decreasing $\delta$ if needed, we may assume that $0<\delta<1/4$ . Let $\ell=k+1$ .

Recall that we assume that Theorem 2.1 holds for $c^{\prime}$ as our induction hypothesis. So choose $C(c^{\prime})\geq 1$ as well as $D(c^{\prime},p)\geq 1$ for every sufficiently large prime $p$ as in Theorem 2.1. Furthermore, let $C^{\prime}_{1/\ell}\geq 1$ be as in Proposition 3.2 for $\lambda=1/\ell$ . Now, let us define

[TABLE]

and for every sufficiently large prime $p$ (large enough for Theorem 2.1 for $c^{\prime}$ ), let us define

[TABLE]

Now, assuming that $p$ is sufficiently large in terms of $c$ , any subset $A\subseteq\mathbb{F}_{p}^{n}$ of size $|A|\geq D(c,p)\cdot(C(c))^{n}\geq D(c,p)\geq p$ is large enough for the statement in Theorem 4.2.

As in Theorem 2.1, let us now assume that $A\subseteq\mathbb{F}_{p}^{n}$ is a subset of size $|A|\geq D(c,p)\cdot(C(c))^{n}$ with $|\ell A|=|A+\dots+A|\leq|A|^{c}$ . We need to show that $A$ contains $p$ distinct vectors $x_{1},\dots,x_{p}\in A$ with $x_{1}+\dots+x_{p}=0$ .

Let us say that a vector $w\in\ell A\subseteq\mathbb{F}_{p}^{n}$ is bad if the family of all subsets $\{x_{1},\dots,x_{\ell}\}\subseteq A$ with distinct elements $x_{1},\dots,x_{\ell}\in A$ satisfying $x_{1}+\dots+x_{\ell}=w$ has a cover $Z_{w}\subseteq A$ of size $|Z_{w}|\leq p$ . Let us say that a subset $\{x_{1},\dots,x_{\ell}\}\subseteq A$ with distinct elements $x_{1},\dots,x_{\ell}\in A$ is bad if the sum $x_{1}+\dots+x_{\ell}$ is bad.

Now, for every bad subset $\{x_{1},\dots,x_{\ell}\}\subseteq A$ , one of the elements of $\{x_{1},\dots,x_{\ell}\}$ must be in $Z_{w}$ for $w=x_{1}+\dots+x_{\ell}$ (indeed, $Z_{w}$ is a cover of all the size- $\ell$ subsets of $A$ summing to $w$ ). Thus, by suitably ordering, we can turn every bad subset $\{x_{1},\dots,x_{\ell}\}$ into an $\ell$ -tuple $(y_{1},\dots,y_{\ell-1},z)$ such that $w=y_{1}+\dots+y_{\ell-1}+z$ is bad and $z\in Z_{w}$ . Hence the number of bad subsets $\{x_{1},\dots,x_{\ell}\}\subseteq A$ is at most the number of $\ell$ -tuples $(y_{1},\dots,y_{\ell-1},z)\in A^{\ell}$ such that $w=y_{1}+\dots+y_{\ell-1}+z$ is bad and $z\in Z_{w}$ .

For every $y\in\mathbb{F}_{p}^{n}$ , let us now define $N_{y}$ to be the number of $(\ell-1)$ -tuples $(y_{1},\dots,y_{\ell-1})\in A^{\ell-1}$ with $y_{1}+\dots+y_{\ell-1}=y$ . Note that we have $N_{y}=0$ for all $y\not\in(\ell-1)A$ .

For every $y\in\mathbb{F}_{p}^{n}$ , let us furthermore define $M_{y}$ to be the number of vectors $z\in A$ such that $y+z$ is bad and $z\in Z_{y+z}$ . Note that we clearly have $M_{y}\leq|A|$ for all $y\in\mathbb{F}_{p}^{n}$ .

Now, the number of $\ell$ -tuples $(y_{1},\dots,y_{\ell-1},z)$ such that $w=y_{1}+\dots+y_{\ell-1}+z$ is bad and $z\in Z_{w}$ is precisely $\sum_{y\in\mathbb{F}_{p}^{n}}M_{y}N_{y}$ . Indeed, for every possible value of $y=y_{1}+\dots+y_{\ell-1}$ there are $M_{y}$ possibilities to choose $z\in A$ such that $w=y+z=y_{1}+\dots+y_{\ell-1}+z$ is bad and $z\in Z_{w}$ , and there are furthermore $N_{y}$ possibilities to choose $y_{1},\dots,y_{\ell-1}\in A$ with $y_{1}+\dots+y_{\ell-1}=y$ .

Thus, the number of bad subsets $\{x_{1},\dots,x_{\ell}\}\subseteq A$ is at most $\sum_{y\in\mathbb{F}_{p}^{n}}M_{y}N_{y}$ .

Now, observe that

[TABLE]

Indeed, $\sum_{y\in\mathbb{F}_{p}^{n}}N_{y}$ is the total number of $(\ell-1)$ -tuples $(y_{1},\dots,y_{\ell-1})\in A^{\ell-1}$ .

Next, we claim that $\sum_{y\in\mathbb{F}_{p}^{n}}M_{y}\leq p\cdot|\ell A|$ . Indeed, $\sum_{y\in(\ell-1)A}M_{y}$ is the number of pairs $(y,z)\in\mathbb{F}_{p}^{n}\times A$ such that $y+z$ is bad and $z\in Z_{y+z}$ . We can choose such pairs by first choosing a bad $w=y+z$ , then choosing $z\in Z_{w}$ , and finally calculating $y=w-z$ . Note that there are at most $|\ell A|$ choices for a bad $w$ (since by definition every bad $w$ is an element of the set $\ell A\subseteq\mathbb{F}_{p}^{n}$ ), and for every such choice of $w$ there are only $|Z_{w}|\leq p$ choices for $z\in Z_{w}$ . Hence the number of pairs $(y,z)\in\mathbb{F}_{p}^{n}\times A$ such that $y+z$ is bad and $z\in Z_{y+z}$ is indeed at most $|\ell A|\cdot p$ , and we indeed have $\sum_{y\in\mathbb{F}_{p}^{n}}M_{y}\leq p\cdot|\ell A|$ .

Recalling our assumption $|\ell A|\leq|A|^{c}$ , we can now conclude that $\sum_{y\in\mathbb{F}_{p}^{n}}M_{y}\leq p\cdot|\ell A|\leq p\cdot|A|^{c}$ . Let us now define

[TABLE]

Note that then we have

[TABLE]

Here, we used in the last inequality that $|A|\geq D(c,p)\cdot(C(c))^{n}\geq p^{4}\cdot p^{2\ell/\delta}\cdot(C^{\prime}_{1/\ell})^{\ell n/\delta}\geq p^{8\ell+4}\cdot(C^{\prime}_{1/\ell})^{4\ell n}$ (as $0<\delta<1/4$ ).

We will now distinguish two cases depending on the size of the sum $\sum_{y\in Y}N_{y}$ .

Case 1: $\boldsymbol{\sum_{y\in Y}N_{y}\geq|A|^{\ell-1-\delta}}$ . Recall that $k=\ell-1$ and that Theorem 4.2 holds with $c^{\prime\prime}=c-(3/4)$ and our chosen $0<\varepsilon<1/2$ with our values for $\delta$ and $k$ . Also recall that $A\subseteq\mathbb{F}_{p}^{n}$ is sufficiently large for Theorem 4.2 with these parameters.

Let us now define $S\subseteq A^{k}=A^{\ell-1}$ to be the collection of $(\ell-1)$ -tuples $(y_{1},\dots,y_{\ell-1})\in A^{\ell-1}$ such that $y_{1}+\dots+y_{\ell-1}\in Y$ . Then $|S|=\sum_{y\in Y}N_{y}\geq|A|^{\ell-1-\delta}=|A|^{k-\delta}$ . We furthermore have

[TABLE]

Therefore, by Theorem 4.2 there exists a subset $A^{\prime}\subseteq A$ of size $|A^{\prime}|\geq|A|^{1-\varepsilon}\geq|A|^{1/2}$ with

[TABLE]

where the second inequality follows from (4.1).

Note that

[TABLE]

This means that all assumptions are satisfied in Theorem 2.1 for $c^{\prime}$ (which was our induction hypothesis), recalling our choice of $\ell^{\prime}$ and our assumption that $p$ is large enough. Thus, by applying Theorem 2.1 for $c^{\prime}$ , we can conclude that $A^{\prime}$ contains $p$ distinct vectors $x_{1},\dots,x_{p}\in A^{\prime}$ with $x_{1}+\dots+x_{p}=0$ . As $A^{\prime}\subseteq A$ , this means in particular that $A$ contains $p$ such vectors.

Case 2: $\boldsymbol{\sum_{y\in Y}N_{y}<|A|^{\ell-1-\delta}}$ . In this case, we have

[TABLE]

using that $|A|\geq D(c,p)\cdot(C(c))^{n}\geq 2^{(\ell+1)/\delta}\cdot p^{2\ell/\delta}\cdot(C^{\prime}_{1/\ell})^{\ell n/\delta}$ . Recalling that $M_{y}\leq|A|$ for all $y\in\mathbb{F}_{p}^{n}$ , this implies

[TABLE]

On the other hand, by the definition of $Y$ , we have

[TABLE]

where the second inequality follows from (4.2). Thus, the number of bad subsets $\{x_{1},\dots,x_{\ell}\}\subseteq A$ is at most

[TABLE]

Let

[TABLE]

and note that $0\leq q\leq 1$ , since $|A|\geq D(c,p)\cdot(C(c))^{n}\geq 2^{(\ell+1)/\delta}\cdot p^{2\ell/\delta}\cdot(C^{\prime}_{1/\ell})^{\ell n/\delta}\geq 2p^{2}\cdot(C^{\prime}_{1/\ell})^{n}$ .

Let us now consider a random subset $A^{*}\subseteq A$ obtained by including every vector in $A$ into the subset $A^{*}$ with probability $q$ , independently for all vectors in $A$ . Then we have $\mathbb{E}[|A^{*}|]=q\cdot|A|=2p^{2}\cdot(C^{\prime}_{1/\ell})^{n}$ .

Recall that every bad subset $\{x_{1},\dots,x_{\ell}\}\subseteq A$ consists of $\ell$ distinct vectors in $A$ , and that we bounded the number of bad subsets in (4.3). For each such bad subset $\{x_{1},\dots,x_{\ell}\}\subseteq A$ , the probability of having $x_{1},\dots,x_{\ell}\in A^{*}$ is $q^{\ell}$ . Let $Y_{\text{bad}}$ be the number of bad subsets $\{x_{1},\dots,x_{\ell}\}\subseteq A$ with $x_{1},\dots,x_{\ell}\in A^{*}$ , then

[TABLE]

Hence

[TABLE]

Thus, there exists some outcome of the random subset $A^{*}\subseteq A$ such that we have $|A^{*}|-Y_{\text{bad}}>p^{2}\cdot(C^{\prime}_{1/\ell})^{n}$ . For each of the $Y_{\text{bad}}$ bad subsets $\{x_{1},\dots,x_{\ell}\}\subseteq A$ with $x_{1},\dots,x_{\ell}\in A^{*}$ , let us now delete one of the elements $x_{1},\dots,x_{\ell}\in A^{*}$ from the set $A^{*}$ , and let $A^{\prime}\subseteq A^{*}$ be the set obtained this way. Then $A^{\prime}\subseteq A$ is a subset of size $|A^{\prime}|\geq|A^{*}|-Y_{\text{bad}}>p^{2}\cdot(C^{\prime}_{1/\ell})^{n}$ and there does not exist any bad subset $\{x_{1},\dots,x_{\ell}\}\subseteq A$ with $x_{1},\dots,x_{\ell}\in A^{\prime}$ .

Applying Proposition 3.2 with $\lambda=1/\ell$ to the set $A^{\prime}\subseteq\mathbb{F}_{p}^{n}$ , we can find vectors $y_{1},\dots,y_{p}\in A^{\prime}$ with $y_{1}+\dots+y_{p}=0$ such that every vector in $\mathbb{F}_{p}^{n}$ appears among $y_{1},\dots,y_{p}$ at most $p/\ell$ times. Let $t$ and $r$ be non-negative integers such that $p=t\ell+r$ and $0\leq r\leq\ell-1$ . In other words, this means that $t=\lfloor p/\ell\rfloor$ and $r=p-\lfloor p/\ell\rfloor\cdot\ell$ .

Let us now consider a colouring of the set $\{1,\dots,p\}$ where the colours correspond to the different vectors appearing among $y_{1},\dots,y_{p}$ . In other words, two indices $i,j\in\{1,\dots,p\}$ receive the same colour if and only if $y_{i}=y_{j}$ . Then every colour appears at most $p/\ell$ times on the set $\{1,\dots,p\}$ , and so by Lemma 3.4, there exists a partition of $\{1,\dots,p\}=S_{1}\cup\dots\cup S_{t+1}$ with $|S_{j}|=\ell$ for $j=1,\dots,t$ and $|S_{t+1}|=r$ such that for each $j=1,\dots,t+1$ all elements of $S_{j}$ have distinct colours. So for each $j=1,\dots,t+1$ , the vectors $y_{i}$ with $i\in S_{j}$ are distinct.

Recalling that $y_{1},\dots,y_{p}\in A^{\prime}$ and $A^{\prime}$ does not contain any bad subset, we can conclude that for each $j=1,\dots,t$ , the set $\{y_{i}\mid i\in S_{j}\}$ is not bad. Hence, for $j=1,\dots,t$ , the sum $\sum_{i\in S_{j}}y_{i}$ is not bad (since the vectors $y_{i}$ for $i\in S_{j}$ are distinct and $|S_{j}|=\ell$ ).

Let us now construct the desired distinct vectors $x_{1},\dots,x_{p}\in A$ with $x_{1}+\dots+x_{p}=0$ . We start by defining $x_{i}=y_{i}$ for $i\in S_{t+1}$ (recall that the vectors $y_{i}$ for $i\in S_{t+1}$ are distinct). Note that clearly $\sum_{i\in S_{t+1}}x_{i}=\sum_{i\in S_{t+1}}y_{i}$ .

Now, for $j=1,\dots,t$ , let us step by step replace the vectors $y_{i}$ for $i\in S_{j}$ with vectors $x_{i}\in A$ such that $\sum_{i\in S_{j}}x_{i}=\sum_{i\in S_{j}}y_{i}$ . For each index $j=1,\dots,t$ , recall that the sum $\sum_{i\in S_{j}}y_{i}$ is not bad. This means that the family of all subsets $\{x^{\prime}_{1},\dots,x^{\prime}_{\ell}\}\subseteq A$ with distinct elements $x^{\prime}_{1},\dots,x^{\prime}_{\ell}\in A$ satisfying $x^{\prime}_{1}+\dots+x^{\prime}_{\ell}=\sum_{i\in S_{j}}y_{i}$ does not have a cover of size at most $p$ . Since we have chosen at most $p$ different vectors $x_{i}$ throughout our process so far, there must exist a subsets $\{x^{\prime}_{1},\dots,x^{\prime}_{\ell}\}\subseteq A$ with distinct elements $x^{\prime}_{1},\dots,x^{\prime}_{\ell}\in A$ satisfying $x^{\prime}_{1}+\dots+x^{\prime}_{\ell}=\sum_{i\in S_{j}}y_{i}$ , such that $\{x^{\prime}_{1},\dots,x^{\prime}_{\ell}\}$ is disjoint from the set of all our previously chosen vectors $x_{i}$ for $i\in S_{1}\cup\dots\cup S_{j-1}\cup S_{t+1}$ . Let us now assign the vectors $x_{i}$ for $i\in S_{j}$ to be $x^{\prime}_{1},\dots,x^{\prime}_{\ell}$ (in arbitrary order). Then we have $\sum_{i\in S_{j}}x_{i}=x^{\prime}_{1}+\dots+x^{\prime}_{\ell}=\sum_{i\in S_{j}}y_{i}$ , and the vectors $x_{i}$ for $i\in S_{j}$ are distinct and are also distinct from all the vectors $x_{i}$ for $i\in S_{1}\cup\dots\cup S_{j-1}\cup S_{t+1}$ .

Continuing this process step by step for all $j=1,\dots,t$ , in the end we obtain distinct vectors $x_{i}\in A$ for all $i\in S_{1}\cup\dots\cup S_{t+1}=\{1,\dots,p\}$ such that $\sum_{i\in S_{j}}x_{i}=\sum_{i\in S_{j}}y_{i}$ for $j=1,\dots,t+1$ . In other words, $x_{1},\dots,x_{p}\in A$ are distinct vectors, and we have

[TABLE]

This finishes the proof.

Bibliography23

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] N. Alon and M. Dubiner, A lattice point problem and additive number theory , Combinatorica 15 (1995), 301–309.
2[2] A. Balog and E. Szemerédi, A statistical theorem of set addition , Combinatorica 14 (1994), 263–268.
3[3] E. Borenstein and E. Croot, On a certain generalization of the Balog-Szemerédi-Gowers theorem , SIAM J. Discrete Math. 25 (2011), 685–694.
4[4] Y. Caro, New Results on the Independence Number , Technical Report, Tel-Aviv University, 1979.
5[5] E. Croot, V. F. Lev, and P. P. Pach, Progression-free sets in ℤ 4 n superscript subscript ℤ 4 𝑛 \mathbb{Z}_{4}^{n} are exponentially small , Ann. of Math. 185 (2017), 331–337.
6[6] Y. Edel, Sequences in abelian groups G 𝐺 G of odd order without zero-sum subsequences of length exp ⁡ ( G ) exp 𝐺 \operatorname{exp}(G) , Des. Codes Cryptogr. 47 (2008), 125–134.
7[7] J. S. Ellenberg and D. Gijswijt, On large subsets of 𝔽 q n superscript subscript 𝔽 𝑞 𝑛 \mathbb{F}_{q}^{n} with no three-term arithmetic progression , Ann. of Math. 185 (2017), 339–343.
8[8] P. Erdős, A. Ginzburg, and A. Ziv, Theorem in the additive number theory , Bull. Res. Council Israel 10F (1961), 41–43.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

On the Erdős–Ginzburg–Ziv Problem in large dimension

Abstract

1 Introduction

Theorem 1.1**.**

Theorem 1.2**.**

Question 1.3**.**

2 Proof Overview

2.1 Proof Structure

Theorem 2.1**.**

Proof of Theorem 1.2 assuming Theorem 2.1.

Proof of Theorem 1.1 assuming Theorem 1.2.

2.2 Outline of proof of Theorem 2.1

3 Preparations

3.1 Solutions with not too many repetitions

Theorem 3.1** (kkk-coloured Sum-Free Theorem).**

Proposition 3.2**.**

Proof.

Claim 3.3**.**

Proof.

3.2 Partitioning into rainbow sets

Lemma 3.4**.**

Proof.

4 Proof of Theorem 2.1

4.1 Induction beginning c=3/2\boldsymbol{c=3/2}c=3/2

Lemma 4.1**.**

Proof.

4.2 Induction step

Theorem 4.2** ([3]).**

Theorem 1.1.

Theorem 1.2.

Question 1.3.

Theorem 2.1.

Theorem 3.1 ( $k$ -coloured Sum-Free Theorem).

Proposition 3.2.

Claim 3.3.

Lemma 3.4.

4.1 Induction beginning $\boldsymbol{c=3/2}$

Lemma 4.1.

Theorem 4.2 ([3]).