On the Erd\H{o}s--Ginzburg--Ziv Problem in large dimension
Lisa Sauermann, Dmitrii Zakharov

TL;DR
This paper investigates the Erdős–Ginzburg–Ziv problem in high dimensions, providing significantly improved upper bounds for fixed m and large n by combining advanced combinatorial methods.
Contribution
It introduces new upper bounds for the problem in high dimensions, utilizing the slice rank polynomial method and a higher-uniformity Balog–Szemerédi–Gowers theorem.
Findings
Established upper bounds of the form D_{ε,m} * (C_{ε} m^{ε})^n for the problem
Improved understanding of the problem's behavior in large dimensions
Combined algebraic and combinatorial techniques for bounding solutions
Abstract
The Erd\H{o}s--Ginzburg--Ziv Problem is a classical extremal problem in discrete geometry. Given and , the problem asks about the smallest number such that among any points in the integer lattice one can find points whose centroid is again a lattice point. Despite of a lot of attention over the last 50 years, this problem is far from well-understood. For fixed dimension , Alon and Dubiner proved that the answer grows linearly with . In this paper, we focus on the opposite case, where the number is fixed and the dimension is large. We drastically improve the previous upper bounds in this regime, showing that for every the answer is at most for all and . Our proof combines (a consequence of) the slice rank polynomial method with a higher-uniformity version of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Geometry and Mesh Generation · Digital Image Processing Techniques · Limits and Structures in Graph Theory
On the Erdős–Ginzburg–Ziv Problem in large dimension
Lisa Sauermann Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA. Email: [email protected]. Research supported by NSF Award DMS-2100157 and a Sloan Research Fellowship.
Dmitrii Zakharov Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA. Email: [email protected].
Abstract
The Erdős–Ginzburg–Ziv Problem is a classical extremal problem in discrete geometry. Given and , the problem asks about the smallest number such that among any points in the integer lattice one can find points whose centroid is again a lattice point. Despite of a lot of attention over the last 50 years, this problem is far from well-understood. For fixed dimension , Alon and Dubiner proved that the answer grows linearly with . In this paper, we focus on the opposite case, where the number is fixed and the dimension is large. We drastically improve the previous upper bounds in this regime, showing that for every the answer is at most for all and . Our proof combines (a consequence of) the slice rank polynomial method with a higher-uniformity version of the Balog–Szemerédi–Gowers Theorem due to Borenstein and Croot.
1 Introduction
For given positive integers and , what is the minimum number such that among any points in the -dimensional integer lattice one can always find points whose centroid is again a lattice point in ? This problem is called the Erdős–Ginzburg–Ziv Problem, and its answer is denoted by and called the Erdős–Ginzburg–Ziv constant of . This notation reflects that the problem can be naturally translated into (where one then asks about the smallest such that any sequence of length of elements of contains a subsequence of length summing to zero).
This problem has been studied for fifty years (see e.g. [12, 13]), and is still wide open despite of receiving a lot of attention (in particular, over the past five years more than twenty papers were published on this topic). Only very few values of are known exactly: For Erdős, Ginzburg, and Ziv [8] proved that , and for Reiher [17] proved that . The only other infinite family of known values is when is a power of , then as established by Harborth [12].
Furthermore, and maybe more importantly, the growth behaviour of the function is far from being understood. For fixed dimension , Alon and Dubiner [1] proved that grows linearly with . Improving their bound for the linearity constant, the second author [23] furthermore showed that the bound holds for every positive integer all of whose prime factors are sufficiently large with respect to .
However, in the opposite regime for fixed , there was an enormous gap between the upper and lower bounds. It turns out that in this regime, the problem can essentially be reduced (up to constant factors) to the case where is a prime. By the pigeonhole principle, one can easily obtain an upper bound of (as first observed by Harborth [12]). This trivial bound was improved to an upper bound of the form for every fixed prime by Naslund [16] and for by Ellenberg–Gijswijt [7], where and are constants only depending on with . Note that the base of the main term in Naslund’s bound is smaller than the base in the corresponding term in the trivial bound, but the base is still linear in . This was improved by the first author [18], who showed a bound of the form for every fixed prime , where again is a constant only depending on . Here, the base is of the form , whereas the bases for the previous bounds were of the form with being the base in the trivial bound. As discussed below, the term in the base in this bound constitutes an important barrier for this problem.
Breaking this barrier, we drastically improve these upper bounds to an upper bound with base . More precisely, we show the following theorem bounding for any fixed integer and large dimension .
Theorem 1.1**.**
For every fixed and every fixed integer , we have for all . Here, is a constant only depending on , and is a constant only depending on and .
As mentioned above, the problem of upper-bounding for fixed and large can easily be reduced to the case where is a prime. The problem is then essentially equivalent (up to constant factors depending on ) to the following additive combinatorics problem: For a fixed prime and large , what is the maximum possible size of a subset of not containing distinct vectors with sum zero? Our upper bound for in Theorem 1.1 is obtained by proving the following new upper bound for this additive combinatorics problem.
Theorem 1.2**.**
For every fixed and every fixed prime , the following holds for all . For any subset not containing distinct vectors with , we have . Here, is a constant only depending on , and is a constant only depending on and .
The previous bounds for for fixed and large in [16, 18] have also been obtained by studying this additive combinatorics problem, but here we prove much stronger bounds for this problem and hence for . There is extensive literature and research activity on zero-sum problems in abelian groups (e.g. see the survey [10]), and this additive combinatorics problem is one of the most central problems in this area.
In the case of , a subset not containing three distinct vectors with is precisely the same as a three-term progression-free subset . The problem of determining the maximum possible size of such a subset is a very famous problem in additive combinatorics, called the cap-set problem. In 2017, Ellenberg and Gijsiwjt [7] achieved a breakthrough on this problem, proving that any subset without a three-term arithmetic progression has size at most . Hence for , in Theorem 1.2 one has the bound .
Ellenberg and Gijsiwjt [7] actually proved a more general result, bounding the size of a three-term progression-free subset for any fixed prime . Their proof relies on a new polynomial method that was introduced by Croot, Lev and Pach [5] just a few weeks earlier, and that was shortly afterwards reformulated and generalized by Tao [20] to what is now called the slice rank polynomial method. Since the slice rank polynomial gives the bound in Theorem 1.2 for , it is very natural to also try to apply it in the setting of Theorem 1.2 for larger primes . Unfortunately, the proof for breaks down for larger , since the slice rank polynomial method cannot handle the distinctness condition for the vectors in Theorem 1.2. This is because, with this distinctness condition, the relevant tensor is not a diagonal tensor anymore, and so one looses control over its slice rank.
The above-mentioned works of Naslund [16] and the first author [18] obtain a weaker upper bound in the setting of Theorem 1.2 by using certain manipulations (relying on combinatorial arguments) to reduce to the setting of diagonal tensors for applying the slice rank polynomial method (or variants thereof).
Here, we obtain a much stronger upper bound with a new approach that incorporates both (a consequence of) the slice rank polynomial method and a higher-uniformity version of the Balog–Szemerédi–Gowers Theorem. These different tools from additive combinatorics have not previously been combined, and we believe that there may be future potential in pursuing such an approach further.
In particular, our approach manages to break the “multi-colored barrier” for this problem. Indeed, as discussed in [18], for the “multi-colored” version of the setting in Theorem 1.2 the bound due to the first author is essentially tight (there is a lower bound of for even ). Thus, an improvement of the bound beyond needs to use the “single-set” setting in Theorem 1.2 in an essential way (with an argument that is not generalizable to the “multi-colored” setting). So far, most arguments relying on the slice rank polynomial method naturally generalized to the “multi-colored” setting, and so finding new approaches that are specific to the “single-set” setting has been a major challenge. In particular, this challenge also appears for the problem of improving the bounds of Ellenberg–Gijswijt [7] for the cap-set problem and more generally the problem of bounding the size of three-term progression-free subsets of . These bounds are also (essentially) tight in the “multi-colored setting” (see [14]), and so an approach specific to the “single-set” setting would be needed to overcome this “multi-colored barrier”. Our approach of blending the slice rank polynomial method with the Balog–Szemerédi–Gowers Theorem is indeed specific “single-set setting”, and so we believe that it may be helpful in breaking the “multi-colored barrier” in other problems as well.
The particular higher-uniformity version of the Balog–Szemerédi–Gowers Theorem [2, 11] that we are using is due to Borenstein and Croot [3] and is stated in Section 4.2 (there are also other higher-uniformity versions, see in particular [19]). Besides this theorem and a (consequence of) the slice rank polynomial method (see Section 3.1), our proof also uses combinatorial and probabilistic arguments.
The best known lower bounds for for large are of the form for all odd with and are due to Edel [6] (in the case of , the constant is slightly better, namely due to Tyrell [21]). This in particular leads to the lower bound for any which is not a power of and large (if is a power of , then is known exactly). Here, the exponential base is an absolute constant independent of . It is still an open question whether this is the right behaviour, or whether the correct exponential base should depend on :
Question 1.3**.**
Is there an absolute constant , such that for every fixed integer we have for all , where is a constant only depending on ?
Our upper bound in Theorem 1.2 does not give such a constant , but instead it gives a term of the form (where the exponent converges to zero for growing ). In the opposite regime, where the dimension is fixed, the second author [23] showed the bound if all prime factors of are sufficiently large with respect to .
Acknowledgements. The authors would like to thank Cosmin Pohoata for helpful conversations and for pointing out reference [3], as well as Jacob Fox for useful comments on an earlier version of this paper.
Notation. For a subset of an additively written abelian group (for us, the group will usually be ), we write , as usual. The cover number of a finite family of non-empty subsets of some ground set is the size of the smallest subset such that intersects every set .
2 Proof Overview
2.1 Proof Structure
The Balog–Szemerédi–Gowers Theorem gives, for a subset with many solutions to the equation with , a subset such that the sum-set is small. Similarly, the higher uniformity version of the theorem due to Borenstein–Croot [3] gives, under suitable conditions on , a subset such that the -fold sum-set is small for certain . A priori, it is unclear how such a subset is useful for finding a solution to with distinct vectors .
The key for taking advantage of such a statement lies in the inductive setup for our proof, which allows us to incorporate the higher uniformity Balog–Szemerédi–Gowers Theorem in interplay with the slice rank polynomial method. The actual statement that we induct on is as follows.
Theorem 2.1**.**
For every , there is a positive integer and a constant such that for every sufficiently large prime (large enough in terms of ) there is a constant such that the following holds for all . If is a subset of size with , then contains distinct vectors with .
We prove this statement inductively (taking and inducting on ). Assuming Theorem 2.1, it is not difficult to deduce Theorem 1.2:
Proof of Theorem 1.2 assuming Theorem 2.1.
As in Theorem 1.2, let be fixed. Noting that the statement in Theorem 1.2 is trivial for (since we always have ), we may assume that . Now, let , and let and be as in Theorem 2.1. Let us choose large enough such that the statement in Theorem 2.1 holds for all primes .
As in Theorem 1.2, let us now consider a prime . Note that for any prime and any subset we trivially have . Thus, we may assume that , and so there is a constant such that the statement in Theorem 2.1 holds. Let .
Suppose that is a subset of size not containing distinct vectors with . Note that we have and
[TABLE]
Thus, we obtain a contradiction to Theorem 2.1. ∎
It is also not difficult to show that Theorem 1.2 implies our main result about Erdős–Ginzburg–Ziv constants in Theorem 1.1.
Proof of Theorem 1.1 assuming Theorem 1.2.
As in the statement of Theorem 1.1, let and be fixed. We have
[TABLE]
where the sum is over all prime factors of (see, for example, [9, Lemma 11]). On the other hand, for each prime factor of , we can bound using Theorem 1.2 as follows. Consider a sequence of elements of without a subsequence of length summing to zero. Clearly, this sequence can contain at most copies of any particular vector in . On the other hand, the set of all vectors appearing at least once in the sequence does not contain distinct vectors summing to zero, and so by Theorem 1.2 we have . This means that the sequence has length at most and hence . Thus, we obtain
[TABLE]
where the sum is again over all prime factors of . Thus, the desired statement holds when taking the constant in Theorem 1.1 to be the sum of for the constants in Theorem 1.2 over all prime factors of (and taking to be the same constant as in Theorem 1.2). ∎
The main difficulty is of course to prove Theorem 2.1, and the rest of this paper is devoted to this. We start by giving an outline of the main ideas of the proof in the next subsection.
2.2 Outline of proof of Theorem 2.1
Noting that Theorem 2.1 gets strictly stronger as we increase , we may assume that for an integer . We will then prove Theorem 2.1 by induction on .
To prove the theorem, we need to show that there is a solution to with distinct vectors . Our strategy for finding such a solution is to start with a solution to where are not necessarily distinct, and then modify this solution to make distinct. More specifically, to modify the solution we split the vectors into -tuples (and a small remainder of fewer than vectors), and then we replace each -tuple by another -tuple of vectors in with the same sum. Since at every step we replace vectors in with different vectors in with the same sum, the sum does not change throughout this process, and so at every step our vectors form a solution to . The difficulty is, however, to obtain a solution where are distinct.
It turns out that using the slice rank polynomial method and some combinatorial arguments, we can ensure that at the start of our process we have a solution to with that can be split into -tuples (and fewer than remaining vectors) in such a way that each -tuple consists of distinct vectors (and also such that the vectors in the remainder are distinct from each other). Our aim in each step of the process is now to replace one of the -tuples with a different -tuple of distinct vectors in with the same sum, such that the vectors in the new -tuple are distinct from all the other vectors appearing among our solution to at that step. If we can do this step by step for each -tuple, this greedy procedure will lead to a solution to with distinct vectors .
Of course, it may happen that at some step, when we want to replace a certain -tuple with sum , we cannot find an -tuple of distinct vectors in with the same sum which is disjoint from all vectors currently appearing in our solution to . In this case, every subset consisting of distinct vectors with sum must contain one of the vectors . Hence the family of subsets with distinct elements with sum must have a cover of size at most . We call a vector bad if this happens. Furthermore, we call an -tuple of distinct vectors in bad, if the sum of the vectors is bad. Then at every step of our process, if the relevant -tuple is not bad, we will be able to replace it in the desired way.
Thus, if at the start of our process each of the -tuples into which we split our starting solution to is not a bad -tuple, we will be able to run this modification process for each of the -tuples and obtain a solution to with distinct vectors in the end. So it suffices to find a solution to with that can be spit into -tuples in such a way, that each of these -tuples consists of distinct vectors and is not bad (and such that the fewer than remaining vectors are distinct from each other).
If there are only few bad -tuples , then by a probabilistic subset sampling argument there is a relatively large subset such that there exists no bad -tuple with . Then, relying on the slice rank polynomial method and further combinatorial arguments, one can find a solution to with that can be split into -tuples in the desired way (and then automatically none of these -tuples will be bad). Applying our process as discussed above, we can turn this into a solution to with such that are distinct.
For the induction beginning, i.e. the case in Theorem 2.1, this already suffices. In fact, if , we can take in Theorem 2.1 and observe that then by the assumption the number of bad -tuples is at most (indeed, every possible sum can lead to at most bad -tuples with ). Thus, there are only few bad -tuples in , and the above argument applies.
In the induction step, we also need to consider a second case, namely that there are many bad -tuples . For each bad -tuple , the sum is bad, and one of the vectors is in the cover of size at most of the family of all -element subsets of with sum . Thus, upon reordering the vectors, every bad -tuple can be rewritten as where is in the cover of size at most of the family of all -element subsets of with sum . By our assumption in Theorem 2.1, there are not so many possibilities for the sum , and for each of these possibilities there are at most choices for . One can now show that there must be many bad -tuples , for which the sums are concentrated on relatively few possible values. This means that one can apply the higher uniformity Balog–Szemerédi–Gowers Theorem due to Borenstein–Croot [3] (whose precise statement is given in Theorem 4.2). This theorem (with suitably chosen parameters) implies that there is a relatively large subset with for the value such that Theorem 2.1 holds for with instead of (recall that Theorem 2.1 holds for by our induction hypothesis). Now, by the conclusion of Theorem 2.1 for and and , we can find a solution to with distinct vectors .
This finishes the outline of our proof of Theorem 2.1. The actual proof can be found in Section 4, after some preparations for the proof in Section 3.
3 Preparations
3.1 Solutions with not too many repetitions
In this subsection, we show that any large subset must contain a solution to the equation with , such that no vector appears a lot of times among . The precise statement is given in Proposition 3.2 below. This will help us to find a solution to with , such that can be split into -tuples (and fewer than remaining vectors) such that each -tuple consists of distinct vectors.
The proof of our proposition uses the -coloured Sum-Free Theorem, which can be proved with the slice rank polynomial method. As mentioned in the introduction, the slice rank polynomial method was introduced by Tao [20] following work of Croot–Lev–Pach [5] and Ellenberg–Gijswijt [7]. A proof of the -coloured Sum-Free Theorem following this method can be found in [15].
Theorem 3.1** (-coloured Sum-Free Theorem).**
Let be an integer, and let be a prime. For some positive integer , let for be a list of -tuples of vectors in . Suppose that for all , we have
[TABLE]
Then we must have , where
[TABLE]
As an immediate consequence of the -coloured Sum-Free Theorem one can show that every subset of size must contain a solution to the equation such that are not all equal. In other words, we obtain a solution such that every vector in appears at most times among .
The statement of the following proposition is somewhat similar, showing that every large enough subset contains a solution to , such that every vector in appears at most times among (for some fixed ).
Proposition 3.2**.**
For every fixed , there exists a constant such that for every prime and every positive integer the following holds. If is a subset of size , there exist vectors with such that every vector in appears among at most times.
Proof.
We define
[TABLE]
Note that then for all primes we have and
[TABLE]
and furthermore also
[TABLE]
Now, as in the statement of the proposition, let be a subset of size . Let us suppose for contradiction that for any solution to the equation with , there is a vector appearing among at least times.
For every solution to the equation let us consider the number , i.e. the number of distinct vectors appearing among (this number is in ). Let us say that two solutions to this equation are disjoint if no vector appears in both of them.
Claim 3.3**.**
There exists a number and a subset satisfying the following two conditions:
- (i)
There is a collection of more than pairwise disjoint solutions to the equation with .
- (ii)
Every solution to the equation satisfies .
Proof.
Let us define a sequence of subsets of for some with the following recursive process. Throughout this process we will ensure that for every , every solution to the equation satisfies .
We start by defining . Clearly every solution to the equation satisfies .
Suppose that for some index , we have already defined the set with the property that every solution to the equation satisfies .
Let us now consider a maximal collection of pairwise disjoint solutions to the equation with . If this maximal collection has size larger than , then let us terminate the process and define (then we do not need to define another set ). Otherwise, this maximal collection has size at most and so there are at most different vectors appearing in one of the solutions in our collection in . Now, let the set be obtained from by deleting all the vectors appearing in some solution in the collection. Note that then, by the maximality of the chosen collection, no solutions to with remain. Hence in the set every solution to the equation satisfies .
This process defines subsets of for some . Note that at every step of the process we delete at most vectors, meaning that for . This implies that
[TABLE]
so the final set is non-empty.
We claim that . Indeed, if we had , then would be a non-empty subset of such that every solution to the equation satisfies . This is a contradiction, since for any we can form a solution solution to by taking and then we have . Thus, we must have .
This means that the process above terminated with the set , which means that contains a collection of more than pairwise disjoint solutions to the equation with . Thus, taking , condition (i) is satisfied. Furthermore, condition (ii) is satisfied, since throughout the process we maintained the property that every solution to the equation satisfies . ∎
As in Claim 3.3, let us choose and satisfying conditions (i) and (ii). By condition (i), there exists a collection of pairwise disjoint solutions to the equation with . By our assumption, for each of these solutions there is a vector appearing among at least times. So let us define (then ) and let us re-order the vectors in each solution in our collection in such a way that .
Let , and let for be the -tuples in . Then for every we have and as well as . This implies that . Furthermore, for distinct the -tuples and are disjoint, and so we can conclude that
[TABLE]
for .
Suppose we have . Then by our assumption for any solution to the equation with , there is a vector appearing among at least times. In other words, for any solution to the equation with we must have . This implies that there cannot be any solution of the form with where are not all equal (indeed, if , then as and disjoint). Hence the -tuples for satisfy the assumptions of Theorem 3.1 for . On the other hand, we have which is a contradiction to the conclusion of Theorem 3.1.
So we may now assume that , meaning that . For , let us define a -tuple by setting for and
[TABLE]
Note that then we have for . Since , from Theorem 3.1 we can conclude that there must exist with such that are not all equal.
Suppose we have , then let and observe that (since are not all equal). But now we have
[TABLE]
and since in this implies that . But this is a contradiction since the -tuples and are disjoint.
So let us now assume that the indices are not all equal. Then the set
[TABLE]
has size at least (since the -tuples for are pairwise disjoint). Now, we have
[TABLE]
So satisfies and
[TABLE]
where for the second inequality we used (3.1). But this is a contradiction to condition (ii) in our choice of and as in Claim 3.3. This finishes the proof of Proposition 3.2.∎
3.2 Partitioning into rainbow sets
This subsection proves the following combinatorial lemma about partitioning a set with coloured elements into rainbow subsets. This, together with the results from the last subsection, allows us to split our solution of into -tuples, each consisting of distinct vectors, in the desired way.
Lemma 3.4**.**
Let be integers, and consider a colouring of a set of size (assigning each of the elements in a colour). Suppose that each colour occurs at most times. Then there is a partition with for and such that for each all elements of have distinct colours.
Proof.
Let , and . Let us label the elements of the set as in such a way that each colour class forms a block of consecutive elements. It now suffices to show that we can find a partition with for and such that for each we have for any two distinct elements . Indeed, then we can define for . Note that then for any and any distinct , the elements and cannot have the same colour, since otherwise all elements between and in the list would also be of that colour, and so the colour would appear at least times since .
To define the desired partition , let and for . It is not hard to see that for and . Furthermore, for any two distinct , we have , since and are both multiples of . For any and any two distinct , we have , since . ∎
4 Proof of Theorem 2.1
In this section, we finally prove Theorem 2.1. Note that if the theorem holds for some , then it also holds for all smaller values of . Hence it suffices to prove the theorem for for all integers .
We prove Theorem 2.1 for for by induction on . The first subsection of this section contains the induction beginning , and the second subsection contains the induction step.
4.1 Induction beginning
As the starting point of our induction, let us prove Theorem 2.1 for . The following lemma shows that the desired statement holds for and (with as in Proposition 3.2) and for any prime .
Lemma 4.1**.**
Let be as in Proposition 3.2 for . Let be a prime and let be a positive integer. Suppose that is a subset of such that and . Then contains distinct vectors with .
Proof.
Let us say that a pair with is bad if there are at most pairs with satisfying . In other words, a pair is bad if there are at most different ways to write as a sum of two distinct elements of . Note that every element in can occur as the sum of at most bad pairs (since otherwise the pairs with this sum would not be bad). Thus, there can be at most bad pairs .
Let us consider the graph with vertex set where for distinct we draw an edge between and if and only if is a bad pair. Then this graph has at most edges, and hence has average degree at most . Thus, by the well-known Caro–Wei bound [4, 22], the graph has an independent set of size at least
[TABLE]
So let be a subset of size such that there does not exist a bad pair with .
By Proposition 3.2 for , there exist vectors with such that every vector in appears among at most times.
Let us now consider a colouring of the set where the colours correspond to the different vectors appearing among . In other words, two indices receive the same colour if and only if . Then every colour appears at most times on the set , and so by Lemma 3.4, there exists a partition of into sets of size for and one set of size such that all of the sets in this partition are rainbow.
In other words, we can split the list of vectors into pairs with for and one remaining vector . Recalling that does not contain any bad pair, we observe that the pairs for are not bad.
Let . We will now replace each pair by a pair with the same sum in order to construct a solution to the equation with distinct vectors . To do this, consider the indices one by one. For each index , consider the sum . Since is not bad, there are at least pairs with and . These pairs must all be disjoint (since knowing and the sum already determines ), and so there must be at least one pair with and which does not contain or any of the vectors in the already chosen pairs . So let us choose to be such a pair. Doing this step by step for , we obtain a -tuple . By construction, the vectors are distinct and we have
[TABLE]
as desired. ∎
4.2 Induction step
For the induction step in the proof of Theorem 2.1, we will use the following result of Borenstein and Croot [3, Theorem 4], which is a higher-uniformity version of the Balog–Szemerédi–Gowers Theorem [2, 11].
Theorem 4.2** ([3]).**
For every and , there exists such that the following holds for all sufficiently large and all sufficiently large finite subsets of an additively written abelian group. If is a subset satisfying and , then there is a subset of size such that for all positive integers .
In order to perform the induction step for the proof of Theorem 2.1, let us now assume that for an integer and that we have already proved Theorem 2.1 for . Note that , and define .
Let us take a positive integer as in Theorem 2.1 for . Let us now choose small enough (depending on ) such that
[TABLE]
Let us now apply Theorem 4.2 to and . We obtain some and some positive integer such that the statement in Theorem 4.2 holds for all sufficiently large . By decreasing if needed, we may assume that . Let .
Recall that we assume that Theorem 2.1 holds for as our induction hypothesis. So choose as well as for every sufficiently large prime as in Theorem 2.1. Furthermore, let be as in Proposition 3.2 for . Now, let us define
[TABLE]
and for every sufficiently large prime (large enough for Theorem 2.1 for ), let us define
[TABLE]
Now, assuming that is sufficiently large in terms of , any subset of size is large enough for the statement in Theorem 4.2.
As in Theorem 2.1, let us now assume that is a subset of size with . We need to show that contains distinct vectors with .
Let us say that a vector is bad if the family of all subsets with distinct elements satisfying has a cover of size . Let us say that a subset with distinct elements is bad if the sum is bad.
Now, for every bad subset , one of the elements of must be in for (indeed, is a cover of all the size- subsets of summing to ). Thus, by suitably ordering, we can turn every bad subset into an -tuple such that is bad and . Hence the number of bad subsets is at most the number of -tuples such that is bad and .
For every , let us now define to be the number of -tuples with . Note that we have for all .
For every , let us furthermore define to be the number of vectors such that is bad and . Note that we clearly have for all .
Now, the number of -tuples such that is bad and is precisely . Indeed, for every possible value of there are possibilities to choose such that is bad and , and there are furthermore possibilities to choose with .
Thus, the number of bad subsets is at most .
Now, observe that
[TABLE]
Indeed, is the total number of -tuples .
Next, we claim that . Indeed, is the number of pairs such that is bad and . We can choose such pairs by first choosing a bad , then choosing , and finally calculating . Note that there are at most choices for a bad (since by definition every bad is an element of the set ), and for every such choice of there are only choices for . Hence the number of pairs such that is bad and is indeed at most , and we indeed have .
Recalling our assumption , we can now conclude that . Let us now define
[TABLE]
Note that then we have
[TABLE]
Here, we used in the last inequality that (as ).
We will now distinguish two cases depending on the size of the sum .
Case 1: . Recall that and that Theorem 4.2 holds with and our chosen with our values for and . Also recall that is sufficiently large for Theorem 4.2 with these parameters.
Let us now define to be the collection of -tuples such that . Then . We furthermore have
[TABLE]
Therefore, by Theorem 4.2 there exists a subset of size with
[TABLE]
where the second inequality follows from (4.1).
Note that
[TABLE]
This means that all assumptions are satisfied in Theorem 2.1 for (which was our induction hypothesis), recalling our choice of and our assumption that is large enough. Thus, by applying Theorem 2.1 for , we can conclude that contains distinct vectors with . As , this means in particular that contains such vectors.
Case 2: . In this case, we have
[TABLE]
using that . Recalling that for all , this implies
[TABLE]
On the other hand, by the definition of , we have
[TABLE]
where the second inequality follows from (4.2). Thus, the number of bad subsets is at most
[TABLE]
Let
[TABLE]
and note that , since .
Let us now consider a random subset obtained by including every vector in into the subset with probability , independently for all vectors in . Then we have .
Recall that every bad subset consists of distinct vectors in , and that we bounded the number of bad subsets in (4.3). For each such bad subset , the probability of having is . Let be the number of bad subsets with , then
[TABLE]
Hence
[TABLE]
Thus, there exists some outcome of the random subset such that we have . For each of the bad subsets with , let us now delete one of the elements from the set , and let be the set obtained this way. Then is a subset of size and there does not exist any bad subset with .
Applying Proposition 3.2 with to the set , we can find vectors with such that every vector in appears among at most times. Let and be non-negative integers such that and . In other words, this means that and .
Let us now consider a colouring of the set where the colours correspond to the different vectors appearing among . In other words, two indices receive the same colour if and only if . Then every colour appears at most times on the set , and so by Lemma 3.4, there exists a partition of with for and such that for each all elements of have distinct colours. So for each , the vectors with are distinct.
Recalling that and does not contain any bad subset, we can conclude that for each , the set is not bad. Hence, for , the sum is not bad (since the vectors for are distinct and ).
Let us now construct the desired distinct vectors with . We start by defining for (recall that the vectors for are distinct). Note that clearly .
Now, for , let us step by step replace the vectors for with vectors such that . For each index , recall that the sum is not bad. This means that the family of all subsets with distinct elements satisfying does not have a cover of size at most . Since we have chosen at most different vectors throughout our process so far, there must exist a subsets with distinct elements satisfying , such that is disjoint from the set of all our previously chosen vectors for . Let us now assign the vectors for to be (in arbitrary order). Then we have , and the vectors for are distinct and are also distinct from all the vectors for .
Continuing this process step by step for all , in the end we obtain distinct vectors for all such that for . In other words, are distinct vectors, and we have
[TABLE]
This finishes the proof.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Alon and M. Dubiner, A lattice point problem and additive number theory , Combinatorica 15 (1995), 301–309.
- 2[2] A. Balog and E. Szemerédi, A statistical theorem of set addition , Combinatorica 14 (1994), 263–268.
- 3[3] E. Borenstein and E. Croot, On a certain generalization of the Balog-Szemerédi-Gowers theorem , SIAM J. Discrete Math. 25 (2011), 685–694.
- 4[4] Y. Caro, New Results on the Independence Number , Technical Report, Tel-Aviv University, 1979.
- 5[5] E. Croot, V. F. Lev, and P. P. Pach, Progression-free sets in ℤ 4 n superscript subscript ℤ 4 𝑛 \mathbb{Z}_{4}^{n} are exponentially small , Ann. of Math. 185 (2017), 331–337.
- 6[6] Y. Edel, Sequences in abelian groups G 𝐺 G of odd order without zero-sum subsequences of length exp ( G ) exp 𝐺 \operatorname{exp}(G) , Des. Codes Cryptogr. 47 (2008), 125–134.
- 7[7] J. S. Ellenberg and D. Gijswijt, On large subsets of 𝔽 q n superscript subscript 𝔽 𝑞 𝑛 \mathbb{F}_{q}^{n} with no three-term arithmetic progression , Ann. of Math. 185 (2017), 339–343.
- 8[8] P. Erdős, A. Ginzburg, and A. Ziv, Theorem in the additive number theory , Bull. Res. Council Israel 10F (1961), 41–43.
