This paper characterizes the structure of large sum-free subsets of integers, establishes a stability version of Hu's theorem, and confirms a conjecture on the number of partitions into two sum-free sets.
Contribution
It provides a structural characterization of large sum-free sets, a stability version of Hu's theorem, and confirms a conjecture on the count of such partitions.
Findings
01
Structural description of sum-free sets with density ≥ 2/5 - c
02
A stability version of Hu's theorem for maximum union size
03
Number of subsets partitionable into two sum-free sets is Θ(2^{4n/5})
Abstract
A set of integers is called sum-free if it contains no triple (x,y,z) of not necessarily distinct elements with x+y=z. In this paper, we provide a structural characterisation of sum-free subsets of {1,2,…,n} of density at least 2/5−c, where c is an absolute positive constant. As an application, we derive a stability version of Hu's Theorem [Proc. Amer. Math. Soc. 80 (1980), 711-712] about the maximum size of a union of two sum-free sets in {1,2,…,n}. We then use this result to show that the number of subsets of {1,2,…,n} which can be partitioned into two sum-free sets is Θ(24n/5), confirming a conjecture of Hancock, Staden and Treglown [arXiv:1701.04754].
(by \eqrefeq:large-approximate-n2n2m and \eqrefeq:large-approximate-mnm2)≤(13/10+33η)n−13m/2+5≤13ηn
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
On the structure of large sum-free sets of integers
A set of integers is called sum-free if it contains no triple (x,y,z) of not necessarily distinct elements with x+y=z. In this
paper, we provide a structural characterisation of sum-free subsets of {1,2,…,n} of density at least 2/5−c, where c is an absolute positive constant.
As an application, we derive a stability version of Hu’s Theorem [Proc. Amer. Math. Soc. 80 (1980), 711–712] about the maximum size of a union of two sum-free sets in {1,2,…,n}. We then use this result to show that the number of subsets of {1,2,…,n} which can be partitioned into two sum-free sets is Θ(24n/5), confirming a conjecture of Hancock, Staden and Treglown [arXiv:1701.04754].
1. Introduction
A triple (x,y,z) of not necessarily distinct integers is called a Schur triple if x+y=z. Given a positive integer r, we say that a subset A of [n]:={1,2,…,n} is r-wise sum-free if there exists an r-colouring of A which contains no monochromatic Schur triples. When r=1, we simply call such sets sum-free.
Here we derive a structural theorem for large sum-free sets, and apply it to prove a sharp bound, up to a constant factor, on the number of 2-wise sum-free subsets of [n].
In the following subsections we will review what is already known before presenting our results.
1.1. Sum-free sets and their structure
A natural extremal question, which was asked by Abbott and Wang [1] in 1977, is how large an r-wise sum-free subset of [n] can be. We denote the maximum by μ(n,r). It is not difficult to see that μ(n,1)=n−⌊n/2⌋, and this bound is attained by the set of odd numbers in [n] and by the interval {⌊n/2⌋+1,…,n}.
The following definition helps motivate the study of μ(n,r) for r≥2. Let h(r) denote the largest positive integer m for which there exists some way of partitioning [m] into r sets that are sum-free modulo m+1. For example, one has h(2)=4, h(3)=13 and h(4)=44 (see [1, Problem I]).
Abbot and Wang [1] showed
[TABLE]
for every integer r≥2, and conjectured that the equality holds. In 1980, Hu [32] provided a short and elegant proof of this conjecture for r=2, that is, μ(n,2)=n−⌊n/5⌋.
To see why μ(n,2)≥n−⌊n/5⌋, one can consider the set {a∈[n]:a≡1,4(mod5)}∪{b∈[n]:b≡2,3(mod5)}. For r≥3, though there are several interesting general upper bounds for μ(n,r) (see [1, 30]), none of them matches the lower bound given by Abbot and Wang.
Given the extremal result, great efforts has been made to better understand the general structure of large sum-free sets in [n]. The earliest result in this direction was obtained by Freiman [25] who showed that, loosely speaking, a sum-free set of density greater than 5/12 is ‘interval like’ or consists entirely of odd numbers.
Theorem 1.1** (Freiman).**
Every sum-free subset A of
[n] with ∣A∣≥5n/12+2 satisfies one of the following conditions:
(i)
A* consists of odd numbers;*
(ii)
the minimum element of A is at least ∣A∣.
In an unpublished note, Deshouillers, Freiman and Sós proved that the conclusion of Theorem 1.1 continues to hold when 5n/12+2 is replaced by 2n/5+1. The following examples show that the condition ∣A∣≥2n/5+1 cannot be relaxed. Indeed, supposing that n is divisible by 5, we consider the sets A1={a∈[n]:a≡1,4(mod5)}, A2={a∈[n]:a≡2,3(mod5)}, and A3={n/5+1,…,2n/5}∪{4n/5+1,…,n}. We can see that each Ai is a sum-free subset of [n] of size 2n/5, and that they are very far from satisfying property (i) or (ii) from Theorem 1.1.
A few years later, Deshouillers, Freiman, Sós and Temkin [15] succeeded in slightly breaking the 2n/5 barrier (see Theorem 1.2 below). Roughly speaking, they proved that the structure of a sum-free set in [n] of size greater than 2n/5−O(1) is described by Theorem 1.1, or close to one of the sets Ai mentioned previously.111Their result provides no information about sum-free sets in [n] of size less than 2n/5−n.
For every x>0, there exist numbers n0∈N and K>0 such that whenever A is a sum-free set in [n] satisfying n≥n0 and ∣A∣≥2n/5−x, then A has one of the following properties:
(i)
all the elements of A are odd;
(ii)
all the elements of A are congruent to 1 or 4 modulo 5;
(iii)
all the elements of A are congruent to 2 or 3 modulo 5;
(iv)
the minimum element of A is greater than or equal to ∣A∣;
(v)
A* is contained in [5n−K,52n+K]∪[54n−K,n].*
Besides being interesting in their own right, these results have found several applications (see [38, 26, 8, 9]).
We remark that very few structural results are known for large sum-free sets in finite abelian groups, cf. [13, 28, 35, 14, 36, 16, 5].
1.2. Counting sum-free sets
Let SFr(n) denote the collection of r-wise sum-free subsets of [n]. By considering all possible subsets of the set {⌊n/2⌋+1,…,n}, we see that [n] contains at least 2n/2 sum-free sets. Cameron and Erdős [12] in 1990 conjectured that this trivial lower bound is within a constant factor of the truth, that is, ∣SF1(n)∣=O(2n/2). Their conjecture resisted various attempts at proof for over ten years [2, 11, 25], until it was confirmed independently by Green [26] and Sapozhenko [41]. In fact, they proved that there are asymptotically c(n)2n/2 such sets, where c(n) takes two different constant values depending on the parity of n. Recently, a refinement of the Cameron–Erdős conjecture was obtained by Alon, Balogh, Morris and Samotij [4], giving an upper bound on the number of sum-free sets in [n] of size s, for all s∈{1,2,…,⌈n/2⌉}.
For r=2, recall that the set {a∈[n]:a≡1,4(mod5)}∪{b∈[n]:b≡2,3(mod5)} is 2-wise sum-free, and so are all of its subsets, giving ∣SF2(n)∣≥24n/5. Inspired by [41, 26], Hancock, Staden and Treglown [30] considered this counting problem, among other things, and conjectured that this simple bound is in fact the correct estimate on ∣SF2(n)∣. Thus they put forward the following conjecture.
Conjecture 1.3** (Hancock–Staden–Treglown).**
∣SF2(n)∣=O(24n/5).
Note that Hancock et al. applied the container theorems of Balogh, Morris and Samotij [10], and Saxton and Thomason [42], to establish ∣SF2(n)∣=24n/5+o(n). We recommend [30, 31] and the references therein for related results concerning L-free subsets of [n], where L is a homogeneous system of linear equations.
1.3. Our results
Here we go one step beyond Theorem 1.2, and provide a structural characterisation of sum-free sets of size greater than (2/5−c)n, where c is an absolute positive constant.
Theorem 1.4**.**
There exists an absolute positive constant c so that the following holds for every n∈N and every η∈R with 2/n≤η≤c. Let A be a sum-free subset of [n] with ∣A∣≥(2/5−η)n. Then one of the following alternatives occurs:
(i)
all the elements of A are odd;
(ii)
all the elements of A are congruent to 1 or 4 modulo 5;
(iii)
all the elements of A are congruent to 2 or 3 modulo 5;
(iv)
the minimum element of A is greater than or equal to ∣A∣;
(v)
A* is contained in [(51−200η)n,(52+200η)n]∪[(54−200η)n,n].*
Note that there are sum-free subsets of [n] of density 3/8 structurally different from those appeared in the above theorem, such as {a∈[n]:a≡3,4,5(mod8)} and {a∈[n]:a≡4,5,6(mod8)}. As an application of Theorem 1.4, we derive a stability version of Hu’s result (Proposition 3.2), which may be of independent interest.
The proof of Theorem 1.4 draws on a number of ideas from [15]. In particular, as in [15] we make use of an inverse theorem of Lev and Smeliansky [37] for subsets of integers with small difference set. We also develop a number of new ideas in order to deal with the case when the smallest element of A is sublinear in n, thereby making the argument substantially more involved.
The second part of the paper deals with Conjecture 1.3. We show
[TABLE]
settling the conjecture in the affirmative.
Theorem 1.5**.**
The number of 2-wise sum-free subsets of [n] is O(24n/5).
The proof technique is inspired by the methods of [12, 26, 4, 3, 9]. Among other tools we use a container lemma of Hancock et al. [30], an arithmetic removal lemma of Green [27], our stability version of Hu’s theorem, and a recent bound on the number of sets of integers with small sumset due to Green and Morris [29].
1.4. Organisation and notation
The rest is organised as follows. Section 2 is devoted to the study of large sum-free subsets of [n]. In Section 2.1 we provide the main lemmas and use them to obtain Theorem 1.4. We collect together some useful results in Section 2.2 and prove the main lemmas in Sections 2.3, 2.4 and 2.5. Section 3 deals with the enumerating problem. In Section 3.1, we outline the proof of Theorem 1.5. We present the main tools in Section 3.2 and prove Theorem 1.5 in Section 3.3. We close, in Section 4, with some remarks and open problems.
Given two non-empty sets A,B⊂Z, we define
[TABLE]
to be their sumset and difference set, respectively.
For repeated addition we write kA for the k-fold sumset A+…+A, in contrast to k⋅A:={ka:a∈A}. For a finite set A of integers, denote by min(A) and max(A) the minimum and maximum elements of A respectively, and let ℓ(A):=max(A)−min(A)+1. Let A+ stands for the set {a∈A:a>0}. The greatest common divisor of all the elements in A−A will be denoted by d(A).
We denote by E the set of all even and by O the set of all odd numbers in [n]; the value of n will always be clear from the context. Denote
[TABLE]
For real numbers α and β, we employ the interval notation
[TABLE]
and similarly for open intervals. Throughout the paper we omit floor and ceiling signs where the argument is unaffected.
Here we state three main lemmas and explain how to obtain Theorem 1.4 from them. In Lemma 2.1, we deal with the sum-free sets A for which the ratio max(A)min(A) is large. In Lemma 2.2, we deal with the case when the ratio max(A)min(A) is neither too large nor too small. Lemmas 2.1 and 2.2 follow closely the approach from [15], and only minor adaptations are needed in our setting. Finally in Lemma 2.3, which is much more delicate, we study the case that the ratio max(A)min(A) is small. The methods used in [15] do not seem to adapt easily to this case, so we have been forced to devise our own arguments.
Our first main lemma, proven in Section 2.3, says that if the ratio max(A)min(A) is large then A satisfies condition (v) from Theorem 1.4.
Lemma 2.1** (Large range).**
Let 1/n≤η≤1/1602, and let A be a sum-free subset of [n] such that n∈A, d(A)=1, ∣A∣≥(2/5−η)n, and
[TABLE]
Then A is contained in [(51−η)n,(52+32η)n]∪[(54−31η)n,n].
Our second main lemma rules out the possibility that the ratio max(A)min(A) is neither too large nor too small. We provide the proof in Section 2.4.
Lemma 2.2**.**
Let 1/n≤η≤1/1752, and let A be a sum-free subset of [n] such that n∈A, d(A)=1, and
[TABLE]
Then ∣A∣≤(2/5−2η)n.
Our third and final main lemma, proven in Section 2.5, states that if min(A) is small compared to max(A) then A satisfies condition (i), (ii) or (iii) from Theorem 1.4.
Lemma 2.3** (Small range).**
There exists an absolute positive constant c such that the following holds for every n∈N and every η∈R with 1/n≤η≤c. Let A be a sum-free subset of [n] satisfying A∩E=∅, ∣A∣≥(2/5−η)n, and
[TABLE]
Then A is contained in either F1,4 or F2,3.
With these lemmas in hand, we can prove Theorem 1.4.
Set c=min{c\reflem:small,17521}, where c\reflem:small is the absolute positive constant from Lemma 2.3. Denote by m and N the minimum and maximum elements of A respectively. We may assume without restriction of generality that A∩E=∅ and min(A)<∣A∣, that is, A does not satisfy properties (i) and (iv). In order to apply the main lemmas, we must show that d(A)=1 and η≥1/N. Suppose to the contrary that d(A)>1. Then there are two possibilities: either d(A)=2 or d(A)≥3. In the later case, we clearly have ∣A∣≤n/3+1. In the former case, since A∩E=∅, A consists of even numbers. In particular, the set {a/2:a∈A} is a sum-free subset of [n/2], and so ∣A∣≤n/4+1. In either case, we always have ∣A∣≤n/3+1, which contradicts the assumptions that ∣A∣≥(2/5−η)n and 1/n≤η≤1/1752. To verify the inequality η≥1/N, we note that ∣A∣≤N/2+1 as A is a sum-free subset of [N]. Since ∣A∣≥(2/5−η)n, this implies N≥n/2 when 2/n≤η≤1/1752, giving the required bound η≥2/n≥1/N.
The proof now falls naturally into three cases:
[TABLE]
We can easily rule out case (b) using Lemma 2.2. If case (c) occurs then Lemma 2.3 would imply that A is a subset of either F1,4 or F2,3.
Finally we deal with case (a). We may apply Lemma 2.1 to conclude A⊆[(51−η)N,(52+32η)N]∪[(54−31η)N,N]. In particular, we have ∣A∣≤(2/5+65η)N. This upper bound on ∣A∣, in conjunction with the assumption that ∣A∣≥(2/5−η)n, shows N≥(1−163η)n, which in turn implies A⊆[(51−200η)n,(52+200η)n]∪[(54−200η)n,n] when η≤1/1752.
∎
2.2. Inverse theorems
Here we collect together a number of inverse theorems that are essential for proving the main lemmas.
Sets with small sumset are a central object of interest in Arithmetic Combinatorics and have been extensively studied in recent years (see, for example, [43]). One of the main results in this area is Freiman’s inverse theorem [23] which states that if A⊂Z and ∣A+A∣≤K∣A∣ for some fixed K, then A is a dense subset of a generalised arithmetic progression of bounded rank. In fact, the statement still holds in a slightly more general situation, when one considers A+B instead of A+A. This was shown by Ruzsa [40].
For relatively small K, one can obtain more precise information, which plays a crucial role in our study. It is not hard to see that for any finite and non-empty sets A,B⊂Z, one has
[TABLE]
with equality if and only if A and B are arithmetic progressions with the same step. There has been much on generalising this result. For instance, Lev and Smeliansky [37] proved the following theorem.
Lemma 2.4** (Lev–Smeliansky).**
Let A and B be two finite sets of integers such that ∣A+B∣≤∣A∣+∣B∣+min(∣A∣,∣B∣)−4. Then A is contained in an arithmetic progression of length ∣A+B∣−∣B∣+1 and B is contained in an arithmetic progression of length ∣A+B∣−∣A∣+1, where both progressions have the same step.
The special case of the above result for A=B is the famous Freiman’s 3k−4 theorem [23]. For our investigation we shall, however, need a “difference version” of this theorem, which follows readily from Lemma 2.4.
Lemma 2.5**.**
Let A be a finite set in Z such that d(A)=1. Then
[TABLE]
Proof.
Suppose for a contradiction that ∣(A−A)+∣<min{21(∣A∣+ℓ(A)−2),23∣A∣−2}. As ∣(A−A)+∣≥21∣A−A∣−21, it follows that ∣A−A∣≤min{∣A∣+ℓ(A)−2,3∣A∣−4}. By Lemma 2.4, we learn that A is contained in an arithmetic progression of length ∣A−A∣−∣A∣+1≤ℓ(A)−1. This implies d(A)>1, a contradiction.
∎
To our knowledge, the only extension of the 3k−4 Theorem that applies to any set A⊂Z with ∣A+A∣=3∣A∣+o(∣A∣) was accomplished by Jin [34]. His proof is a tour de force of non-standard analysis.
Lemma 2.6** (Jin).**
There exist an absolute positive constant c and a natural number K such that for every finite set A of integers with ∣A∣>K and ∣A+A∣=3∣A∣−3+r for some integer r with 0≤r≤c∣A∣, A satisfies at least one of the following properties:
(i)
A* is a subset of an arithmetic progression of length 2∣A∣−1+2r;*
(ii)
A⊆P1∪P2* for some arithmetic progressions P1,P2 with common step and ∣P1∣+∣P2∣≤∣A∣+r.*
2.3. Large range
Here we give the proof of Lemma 2.1. We shall need a simple but crucial observation from [15, Proposition 2.1]. Its proof can be found in the appendix.
Lemma 2.7**.**
Let A be a sum-free set of positive integers and let m be an arbitrary element of A. Then A satisfies the following conditions:
(i)
∣A∩([u,v]∪[u+m,v+m])∣≤v−u+1* for all u,v∈N with u≤v;*
(ii)
∣A∩[u,u+2m−1]∣≤m* for every u∈N;*
(iii)
∣A∩[u,v]∣≤21(v−u+m+1)* for all u,v∈N with u≤v.*
We emphasise that in the first condition, the two intervals [u,v] and [u+m,v+m] are not necessarily disjoint.
Throughout the proof let m denote the minimum element of A. In the first step, we show that m is not much larger than n/5.
Claim 2.8**.**
m≤(1/5+15η)n.
Proof.
Suppose to the contrary that m>(1/5+15η)n. As m∈A, we may apply Lemma 2.7 (ii) to u=n−2m+1 and obtain
[TABLE]
Since ∣A∣>m by the assumption, this gives m=min(A)≤n−2m, and so m≤n/3. One thus has
[TABLE]
It follows from (2.3) that [m,n] is covered by the intervals [m,21(n−m)], (21(n−m),21n], (n−2m,n] and [m+m,m+21(n−m)]; so also is A.222One may verify this claim for n/5≤m≤n/4, and for n/4≤m≤n/3 separately. For the remainder of the proof we shall use this information to bound ∣A∣.
Applying Lemma 2.7 (i) with u=m and v=21(n−m) we find
[TABLE]
We next bound A∩(21(n−m),21n]. For abbreviation, let B=A∩(21(n−m),21n]. Using (2.1) gives
[TABLE]
To estimate ∣2B∣, we first observe that 2B⊆[n−m+1,n] as B⊆(21(n−m),21n], and (A−A)+⊆[n−m] since A⊆[m,n]. Moreover since A is sum-free and B⊆A, we must have A∩2B=∅ and A∩(A−A)+=∅. Hence 2B,(A−A)+ and A are disjoint subsets of [n], resulting in
[TABLE]
Note that d(A)=1 by the assumption, and n−m≤2∣A∣−3 by (2.3) and the assumption that ∣A∣≥(2/5−η)n and η≥1/n. Lemma 2.5 then implies
[TABLE]
Assembling all the information, we get
[TABLE]
Recalling that A is covered by the intervals [m,21(n−m)], (21(n−m),21n], (n−2m,n] and [m+m,m+21(n−m)], and using estimates (2.2), (2.4) and (2.5), we deduce that ∣A∣≤3n/4−m/4−3∣A∣/4+7/4. Since ∣A∣≥(2/5−η)n and η≥1/n by the assumption, this leads to m≤3n−7∣A∣+7≤(1/5+14η)n, which contradicts our hypothesis that m≥(1/5+15η)n.
∎
In the second step, we establish an approximate version of the lemma.
Claim 2.9**.**
All integers in [(51+η)n,(52−η)n]∪[(54+η)n,n], with at most 14ηn exceptions, belong to A.
Before proving Claim 2.9, we shall use it to finish the proof of Lemma 2.1. Suppose to the contrary that A⊈[(51−η)n,(52+32η)n]∪[(54−31η)n,n]. Then there exists
a∈A∩[(52+32η)n,(54−31η)n]
since min(A)≥(1/5−η)n by the assumption. From this we get
[TABLE]
showing that the intervals a+[(51+η)n,(52−η)n] and [(54+η)n,n] have at least min{29ηn,(1/5−3η)n}=29ηn elements in common. Thus, using pigeonhole principle and Claim 2.9, we find a+b=c for some b,c∈A, which contradicts the assumption that A is sum-free.
∎
Finally we give a proof of Claim 2.9 using Claim 2.8, Lemmas 2.5 and 2.7.
where the last inequality holds since m≤(1/5+15η)n by Claim 2.8, and ∣A∣≥(2/5−η)n and η≥1/n by the assumption. Moreover, as A is a sum-free subset of [m,n], A∩[n−m] and (A−A)+ are disjoint subsets of [n−m]. Hence
[TABLE]
Since A is sum-free, (2⋅A)∩(n−m,n] and A∩(n−m,n] are disjoint, which gives
[TABLE]
We know from (2.6) that A∩([m,21(n−m)]∪(21n,n−2m]) is at least
[TABLE]
as ∣A∣≥(2/5−η)n by the assumption, A∩(21(n−m),21n]≤10ηn by the previous estimate, and ∣A∩(n−2m,n]∣≤m by Lemma 2.7 (ii).
We next apply Lemma 2.7 (i) with u=21n−m and v=n−3m to obtain
[TABLE]
Using (2.6) once again, we may bound A∩[m,21n−m] from below by
[TABLE]
This implies ∣2A∩[2m,n−2m]∣≥−(1/5+22η)n+2m−3, due to (2.1). Moreover, since m≥(1/5−η)n and η≤1/1602 by the assumption, one has (21n,n−2m]⊆[2m,n−2m]. We thus get
(21n,n−2m]∖2A≤∣[2m,n−2m]∖2A∣≤(6/5+22η)n−6m+4. From this and the assumption that 2A∩A=∅, we obtain
[TABLE]
Clearly we can bound [m,21(n−m)]∖A from above by
[TABLE]
assuming m≥(1/5−η)n and 1/n≤η≤1/1602.
From (2.3) and (2.7) we see that all elements of [m,21(n−m)]∪(n−m,n] belong to A, with (13η+10η)n≤14ηn exceptions. As [m,21(n−m)]∪(n−m,n] contains [(51+η)n,(52−η)n]∪[(54+η)n,n] when (1/5−η)n≤m≤(1/5+15η)n and η≤1/1602, the claim follows.
∎
2.4. Middle range
Our goal is to prove Lemma 2.2. For this purpose, we shall require the following variant of a fairly simple result due to Deshouillers et al. [15, Lemma 2.3]. We provide the proof in the appendix for completeness of exposition.
Lemma 2.10**.**
Let k∈N and ϵ≥0, and let A⊆[0,k−1] and
[TABLE]
be two sets of integers such that ∣A∣≥(1−ϵ)k and bi+1−bi≤k for each i∈[ℓ−1]. Then
Throughout the proof let m denote the minimum element of A.
Suppose to the contrary that ∣A∣≥(2/5−2η)n.
Since A is a sum-free subset of [n], we thus have
[TABLE]
To get a contradiction we seek to show ∣(A−A)+∣≥(3/5+3η)n. The following claim serves as an intermediate step.
for m≤n/5 and ∣A∣≥(2/5−2η)n.
Moreover A∩[n−m] and (A−A)+ are disjoint subsets of [n−m] since A is sum-free set in [m,n]. Therefore, we have
[TABLE]
(ii) It follows from (i) that
[TABLE]
In the final step, we shall bound ∣(A−A)+∣ from below.
Claim 2.12**.**
∣(A−A)+∣≥(3/5+3η)n.
Proof.
Let {a1<a2<…<ak} be the set consisting of all elements a∈A∩[n−2m] such that A∩[a−m+1,a−1]=∅. Denote ak+1=n−2m+1, and Ai=A∩[ai,ai+1) for i∈[k].
It is not difficult to see that the following holds:
(∗) For each i∈[k], the gap between any two consecutive elements of Ai is less than m.
Let D=A∩[n−m+1,n], and set ϵ=(5ηn+2)/m. From Claim 2.11 (i) we have ∣D∣≥(1−ϵ)m. Moreover, property (∗) implies that we may apply Lemma 2.10 to A=D−(n−m+1) and B=−Ai, obtaining
[TABLE]
Moreover, using parts (ii) and (iii) of Lemma 2.7 yields
[TABLE]
Furthermore, we can infer from property (∗) that (A−A)+∩[m−1],D−A1,…,D−Ak are disjoint subsets of (A−A)+. So
[TABLE]
Observe that ϵ=(5ηn+2)/m≤min{51η,61} since 35ηn≤m and 1/n≤η by the assumption. Combining this with the assumption that m≤(1/5−η)n and ∣A∣≥(2/5−2η)n, we conclude that the right hand side of (2.4) is greater than
[TABLE]
when η≤1/1752.
Hence ∣(A−A)+∣≥(3/5+3η)n, as promised.
∎
Claim 2.12 obviously contradicts (2.12). This finishes our proof of Lemma 2.2.
∎
2.5. Small range
This section is devoted to the proof of Lemma 2.3. As the proof is quite complicated, we first give a high level overview of our approach. Let A0=A∩[n/2]. The proof naturally splits into four steps
Show that ∣A0∣≥(1/5−o(1))n using Lemma 2.13 (i). This step is performed in Claim 2.14.
2.
Use the estimate from the first step together with inverse theorems (Lemmas 2.4 and 2.6) to show that A0⊆Ia∪Ib, where Ia={a,a+d,…,a+(ℓa−1)d}, Ib={b,b+d,…,b+(ℓb−1)d}, and ℓa+ℓb=(1+o(1))∣A0∣. This is performed in Claim 2.15.
3.
Show that A0 is contained in either F1,4 or F2,3 (Claim 2.16). This step is performed as follows:
3.1
Combining steps 1 and 2 and the property that A0 is sum-free, we obtain a number of inequalities that must be satisfied by the endpoints of Ia and Ib.
3.2
Use the inequalities from the previous step to show that d=5, and either {a,b}≡{1,4}(mod5) or {a,b}≡{2,3}(mod5).
4.
We use a ‘bootstrapping’ argument (Lemma 2.13) to upgrade the ‘50%-structured characterisation’ of A from step 3 to a 100%-structured characterisation.
Our bootstrapping lemma is the following simple result, proven in the appendix, which states that if a set A of integers is dense in some interval I, then the difference set and sumset of A contain long subintervals of I−I and I+I respectively.
Lemma 2.13** (Folklore).**
Every finite set A of integers has the following properties:
(i)
A−A* contains [2∣A∣−ℓ(A)−1];*
(ii)
If A⊆[0,k] for some positive k, then 2A contains [2k−2∣A∣+2,2∣A∣−2].
These properties are only useful when the size of A is at least ℓ(A)/2+1, though it is convenient not to make this a requirement.
Throughout the proof, let A0=A∩[n/2],A1=A∖A0, and me=min(A∩E).
We shall use Lemma 2.13 to show that ∣A0∣ is relatively large.
Claim 2.14**.**
∣A0∣≥(1/5−38η)n.
Proof.
To obtain a contradiction, suppose ∣A0∣<(1/5−38η)n. As ∣A∣≥(2/5−η)n by the assumption, this implies ∣A1∣>(1/5+38η−η)n.
We shall divide the proof into two cases, depending on whether d(A1)=1 or d(A1)>1.
Case 1:d(A1)>1.
We must have d(A1)≤2, since otherwise ∣A1∣≤n/6+1<(1/5+38η−η)n, a contradiction. Thus d(A1)=2, that is, either A1⊆E or A1⊆O. In either case, Lemma 2.13 (i) shows that A1−A1 contains all the even numbers between [math] and 4∣A1∣−n/2≥3n/10, giving me≥3n/10.
We first consider the case A1⊆E. As me≥3n/10, we have ∣A0∩[n/4]∣≤n/8, giving ∣A0∩(n/4,n/2]∣≥∣A0∣−n/8. Thus 2⋅A0∩(n/2,n] contains at least ∣A0∣−n/8 even numbers in (n/2,n]∖A1. (Note that 2⋅A0∩A1=∅ since A is sum-free.) It follows that the number of even integers in (n/2,n] is at least
[TABLE]
for η small, which is impossible.
We are left with the case A1⊆O. Let Me=max(A∩E), and let O′ denote the set of all the odd numbers less than Me in A. We have already shown that me≥3n/10. In addition, since A1⊆O, we have n/2≥Me. As A is sum-free, O′+{Me} is a subset of {Me+1,Me+3,…,2Me−1}∖A, and so A has at most Me/2−∣O′∣ odd elements in [Me,2Me]. Moreover, (2Me,n] contains at most (n−2Me)/2 odd numbers. Thus ∣A∩O∣≤∣O′∣+(Me/2−∣O′∣)+(n−2Me)/2=(n−Me)/2, and hence ∣A∩E∣=∣A∣−∣A∩O∣≥(2/5−η)n−(n−Me)/2≥Me/2−n/8 when η is small enough. However, ∣A∩E∣≤(Me−me)/2+1≤Me/2−3n/20+1 since me≥3n/20. Using these bounds yields 3n/20−1≤n/8, which is impossible for n large.
Case 2:d(A1)=1.
Due to Lemma 2.13 (i), we have A1−A1⊇[2∣A1∣−ℓ(A1)−1]. As A is sum-free, it follows that 2∣A1∣−ℓ(A1)−1<min(A)≤35ηn, giving ℓ(A1)≥2∣A1∣−35ηn−1.
Since d(A1)=1, Lemma 2.5 implies
[TABLE]
for 1/n≤η≤c. Moreover, since A0 and (A1−A1)+ are disjoint subsets of [n/2], we see that n/2≥∣A0∣+∣(A1−A1)+∣. From these estimates we obtain
[TABLE]
So ∣A1∣≤(1/5+2η+36η)n<(1/5+38η−η)n for small η, a contradiction.
∎
In the rest of the proof, we use the κ-notation for constants tending to zero as their parameters do so, that is, κ(η)→0 whenever η→0.
We shall infer from Claim 2.14 that ∣A0+A0∣/∣A0∣ is small, and then rely on the inverse theorems (Lemmas 2.4 and 2.6) to get detailed structural information on A0.
Claim 2.15**.**
The set A0 has the following properties:
(i)
d(A0)=1;
(ii)
A0⊆P1∪P2* for some arithmetic progressions P1 and P2 with the same step and ∣P1∣+∣P2∣≤(1+κ(η))∣A0∣.*
Proof.
(i) Toward a contradiction, suppose d(A0)>1. We must have d(A0)<3 because ∣A0∣≥(1/5−κ(η))n>n/6+1 by Claim 2.14. Hence A0⊆E or A0⊆O. If A0⊆E, then {a/2:a∈A0} is not sum-free since it is a set in [n/4] of size ∣A0∣>n/8+1, contradicting our assumption that A0 is sum-free. Now suppose A0⊆O. Then me>n/2. To bound ∣A∣, we partition A=A′\mathaccent0⋅∪A′′, in which A′=A∩[me−1] and A′′=A∩[me,n]. Since me=min(A∩E) and A is sum-free, A′ and me−A′ are disjoint sets of odd numbers in [me−1], giving ∣A′∣≤me/4. To deal with A′′, we note that A′′−A′′ contains [2∣A′′∣−(n−me)−2] by Lemma 2.13 (i). As A is sum-free, it follows that 2∣A′′∣−(n−me)−2≤min(A)=κ(η)n, resulting in ∣A′′∣≤(n−me)/2+κ(η)n. Therefore, we have ∣A∣=∣A′∣+∣A′′∣≤n/2−me/4+18ηn≤(1/4+κ(η))n, contradicting the assumption that ∣A∣≥(2/5−η)n.
(ii) Since A is sum-free, 2A0 and A are disjoint subsets of [n]. Hence
[TABLE]
as ∣A∣≥(2/5−η)n by the assumption, and ∣A0∣≥(1/5−κ(η))n due to Claim 2.14. By applying Lemma 2.4 when ∣2A0∣≤3∣A0∣−4 and Lemma 2.6 in the case ∣2A0∣≥3∣A0∣−3, we deduce that A0 satisfies one of the following conditions:
(a)
A0 is a subset of an arithmetic progression of length (2+κ(η))∣A0∣;
(b)
A0⊆P1∪P2 for some arithmetic progressions P1 and P2 with the same step and ∣P1∣+∣P2∣≤(1+κ(η))∣A0∣.
To prove property (ii), it thus suffices to show that case (a) is impossible. In this case A0 is located in an interval of length (2+κ(η))∣A0∣, as d(A0)=1 by property (i). Since min(A0)=κ(η)n by the assumption and ∣A0∣≥(1/5−κ(η))n by Claim 2.14, it follows that min(A0)=κ(η)∣A0∣ and A0⊆[(2+κ(η))∣A0∣]. By Theorem 1.1, we thus have A0⊆O, which contradicts property (i).
∎
We shall use the previous claims to obtain the following characterisation of A0.
Claim 2.16**.**
Either A0⊆F1,4 or A0⊆F2,3.
Before we proceed with the proof of Claim 2.16, we show how it implies the lemma. From Claim 2.16 we have A0⊆F1,4 or A0⊆F2,3.
We shall show that if A0⊆F1,4 then A⊆F1,4. Conversely, suppose that there exists a∈A∖F1,4. Since a≡1,4(mod5), we can find i,j∈{1,4} such that a≡i+j(mod5). Now Claim 2.14 tells us that ∣A0∣≥(1/5−κ(η))n. Let X=A0∩(5⋅Z+1) and Y=A0∩(5⋅Z+4). From Lemma 2.13 (ii), we learn that 2X, X+Y and 2Y contain all the elements in [κ(η)n,(1−κ(η))n] of 5⋅Z+2, 5⋅Z and 5⋅Z+3, respectively. As 2A=A and ∣A∣≥(2/5−η)n, this shows that a≥(1−κ(η))n, and that A contains all but at most k(η)n elements of [n]∩({1,4}+5⋅Z). Hence both A and a−A contain all but at most κ(η)n elements of [n]∩(5⋅Z+j), and so A∩(a−A)=∅,
contradicting the assumption that A∩(A−A)=∅.
In much the same way, the condition A0⊆F2,3 would force A⊆F2,3.
∎
We close this section by deducing Claim 2.16 from Claims 2.14 and 2.15.
Finally we come to what is, in some sense, the trickiest part of our proof. Due to Claim 2.15 (ii), there exist two arithmetic progressions Ia={a,a+d,…,a+(ℓa−1)d} and Ib={b,b+d,…,b+(ℓb−1)d} in [n/2] such that A0⊆Ia∪Ib and ∣Ia∣+∣Ib∣≤(1+κ(η))∣A0∣. In particular,
[TABLE]
Clearly ∣A0∣≤∣Ia∣+∣Ib∣≤n/d+2. Combined with the bound ∣A0∣≥(1/5−κ(η))n from Claim 2.14 we get d≤5. We distinguish three cases d=1,d=2 and d∈{3,4,5}.
Case 1:d=1.
In this case both Ia and Ib are intervals. Without loss of generality we can assume that ∣Ia∣≥∣Ib∣. Since A0⊆Ia∪Ib, it follows that ∣Ia∣≥∣A0∣/2, and so from (2.16) and Lemma 2.13 (i) we have A0−A0⊇(A0∩Ia)−(A0∩Ia)⊇[(1/2−κ(η))∣A0∣]. But min(A0)=κ(η)∣A0∣ by the assumption, resulting in A0∩(A0−A0)=∅, a contradiction.
Case 2:d=2.
Since d(A0)=1 by Claim 2.15 (i), we must have A0∩Ia=∅,A0∩Ib=∅ and a=b(mod2). So we can assume that a≡0(mod2) and b≡1(mod2). For u∈{a,b}, denote by mu the smallest element of A∩Iu. We have min{ma,mb}=min(A)=κ(η)n by the assumption. We thus have ma≤n/30, or ma>n/30 and mb=κ(η)n.
We first deal with the case ma≤n/30. We claim that ∣Iu∣≤n/20 for all u∈{a,b}. If this is not true then ∣Iu∣≥n/20 for some u∈{a,b}. From (2.16) and Lemma 2.13 (i), it follows that (A0∩Iu)−(A0∩Iu) contains all the even numbers between 1 and (1−κ(η))n/20. Since ma≤n/30, this leads to ma∈A0−A0, a contradiction. We thus have ∣A0∣≤∣Ia∣+∣Ib∣≤n/10, contradicting Claim 2.14.
We now consider the case ma>n/30 and mb=κ(η)n. For u∈{a,b}, let Mu be the largest element of Iu. As Ia⊆[n/2], we have the constraint C1:Ma≤n/2. We next show that ma and Ma satisfy C2:Ma≤2ma+n/20. Indeed if Ma≥2ma+n/20, then we can deduce from (2.16) that A0∩Ia and ma+(A0∩Ia) would have at least (1−κ(η))n/40 even elements in common, which contradicts the assumption that A is sum-free. We shall need one more constraint C3:ma≥(2−κ(η))Mb. Indeed as A0∩Ia is a sum-free subset of even integers in [n/2], we find ∣A0∩Ia∣≤n/4+1. Together with the estimate ∣A0∣≥(1/5−κ(η))n from Claim 2.14, we see that ∣A0∩Ib∣≥(1/20−κ(η))n. From (2.16) and Lemma 2.13 (ii), it follows that 2(A0∩Ib) contains all the even numbers of [κ(η)n,(2−κ(η))Mb]. As A0 is sum-free and ma>n/30, this implies ma≥(2−κ(η))Mb, as claimed. Under the constraints C1, C2 and C3, one has
[TABLE]
which contradicts the lower bound ∣A0∣≥(1/5−κ(η))n from Claim 2.14.
Case 3:d∈{3,4,5}.
For each u∈{a,b}, let αu and βu be two real numbers such that min(Iu)=αun/2 and max(Iu)=βun/2. Set ϵ=1/1000. We first show that for η>0 sufficiently small, the parameters αa,βa,αb and βb satisfy the following constraints:
(C1)
0≤αu≤βu≤1 for each u∈{a,b},
(C2)
(βa−αa)+(βb−αb)≥2d/5−ϵ,
(C3)
If u,v,w∈{a,b} and u+v≡w(modd), then βu+βv≤αw+ϵ or βw≤αu+αv+ϵ.
Indeed, as Ia and Ib are subsets of [n/2] the first constraint follows. The second holds since ∣Ia∣+∣Ib∣≥∣A0∣≥(1/5−κ(η))n by Claim 2.14. For the third, note first that from (C1) and (C2) one has
[TABLE]
From (2.16) and (2.17) we find min{∣A0∩Ia∣,∣A0∩Ib∣}≥0.19n/(2d)−κ(η)n>0.01n. In particular, A0∩Iu and A0∩Iv are non-empty, and so (2.1) implies
[TABLE]
where the last equality holds because Iu and Iv are two arithmetic progressions with the same step. It follows that (A0∩Iu)+(A0∩Iv) contains all but at most κ(η)n elements of the arithmetic progression {x∣x∈[(αu+αv)2n,(βu+βv)2n],x≡u+v(modd)}. On the other hand, (2.16) tells us that A0∩Iw contains all but at most κ(η)n members of the arithmetic progression {x∣x∈[αwn/2,βwn/2],x≡w≡u+v(modd)}. Therefore we must have βu+βv≤αw+ϵ or βw≤αu+αv+ϵ, as otherwise (A0∩Iu)+(A0∩Iv) and A0∩Iw would have at least min{ϵ⋅2dn−κ(η)n,0.01n}>0 elements in common, which contradicts the assumption that A0 is sum-free.
In what follows we shall exploit the constraints (C1)–(C3) to show that the set {a,b} is sum-free modulo d.
Note that since d(A0)=1, we must have a=b(modd′) for every divisor d′ of d with d′>1. Due to symmetry between a and b, we thus only need to take care of the following three cases.
Case 3.1:d=3,a≡1(mod3) and b≡2(mod3).
Using (C3) with u=v=a and w=b, we deduce that 2βa≤αb+ϵ or βb≤2αa+ϵ. If 2βa≤αb+ϵ, then
[TABLE]
since βb≤1 and αa,βa≥0 by (C1), which contradicts (C2). We thus have βb≤2αa+ϵ. By symmetry we also get βa≤2αb+ϵ. Hence
[TABLE]
since βa,βb≤1 by (C1). But this bound is inconsistent with (C2).
Case 3.2:a≡0(modd). Property (C3) tells us that βa+βb≤αb+ϵ or βb≤αa+αb+ϵ. If the former condition occurs, then from (C1) and (C2) we get
[TABLE]
which is impossible. Hence βb≤αa+αb+ϵ. Combined with the constraint βa≤1 from (C1), we again get a contradiction
[TABLE]
Case 3.3:d∈{4,5}, a,b≡0(modd), and a≡b(modd′) for every divisor d′ of d with d′>1. We begin by reducing to the case that {a,b} is sum-free modulo d. Indeed consider the relation b≡2a(modd). As in the proof of Case 3.1, this would imply βb≤2αa+ϵ. Thus
[TABLE]
since βa,βb≤1 and αb≥0 by (C1). But once again this contradicts (C2). The case a≡2b(modd) follows by symmetry.
We have shown that the set {a,b} is sum-free modulo d. Combined this with the condition that a≡b(modd′) for every divisor d′ of d with d′>1, we conclude that, up to a permutation of a and b, either a≡1(mod5) and b≡4(mod5) or a≡2(mod5) and b≡3(mod5).
∎
Recently the method of containers has emerged as a powerful tool for tackling various problems in combinatorics. Roughly speaking this method states that the independent sets in many ‘natural’ hypergraphs exhibit a certain kind of ‘clustering’, which allows one to count them one cluster at a time. Balogh, Morris and Samotij [10] and Saxton and Thomason [42], proved general container theorems for hypergraphs H whose edges are fairly ‘evenly distributed’ over the vertices of H.
In the proof of Theorem 1.5, we shall apply a special case of a container result of Hancock, Staden and Treglown [30, Theorem 4.7]. We remark that their proof uses the theorems of Balogh, Morris and Samotij [10], and Saxton and Thomason [42].
Lemma 3.1** (Hancock–Staden–Treglown).**
There exists a collection C of subsets of [n]2 with the following three properties:
(i)
If (A1,A2) is a pair of disjoint sum-free subsets of [n], then there exists a pair (C1,C2)∈C such that (A1,A2)⊆(C1,C2);
(ii)
∣C∣=2o(n);
(iii)
For any (C1,C2)∈C, each Ci contains at most o(n2) Schur triples.
We refer to the elements of C from Lemma 3.1 as containers.
A counting strategy. Our general strategy is influenced by the approach used in [9], which in turn dates back to earlier s of Cameron and Erdős [12] and Green [26]. Given A∈SF2(n) and a partition A=A1\mathaccent0⋅∪A2 of A into two sum-free sets, we consider some container (C1,C2)∈C with (A1,A2)⊆(C1,C2). As C is so small, the number of A for which ∣C1∪C2∣≤(4/5−η)n is o(24n/5). If, however, ∣C1∪C2∣≥(4/5−η)n then it is possible to say something about the structure of (C1,C2), and hence about the structure of a typical set A∈SF2(n). We then use a direct argument rather than counting such sets within the containers.
As discussed above, we need to get a handle on the structure of large containers. For this purpose, we first deduce from Theorem 1.4 a structural result on 2-wise sum-free sets of size close to 4n/5, which may be of independent interest.
Proposition 3.2**.**
There exists an absolute positive constant c such that the following holds for every n∈N and every η∈R with 2/n≤η≤c. Let C1 and C2 be two sum-free sets (not necessarily disjoint) in [n] with ∣C1∪C2∣≥(4/5−η)n. Then, up to a permutation of C1 and C2, one of the following situations occurs:
(i)
∣C1∖F1,4∣+∣C2∖F2,3∣≤14ηn;
(ii)
∣C1∖I1∣+∣C2∖I2∣≤2424ηn, where I1=(5n,52n]∪(54n,n] and I2=(52n,54n].
We begin by showing that neither ∣C1∣ nor ∣C2∣ are substantially greater than 2n/5.
Claim 3.3**.**
max{∣C1∣,∣C2∣}≤(2/5+3η)n.
Proof.
Denote C2~=C2∖C1 and R=[n]∖(C1∪C2). As ∣C1∪C2∣≥(4/5−η)n, one has ∣R∣≤(1/5+η)n. Let c1,…,ck be the elements of C1, indexed in increasing order, and let D={c2−c1,…,ck−c1}. Since C1 is sum-free, D∩C1=∅, and consequently D⊆C2~∪R. It follows that
[TABLE]
Let ℓ=∣D∩C2~∣. From the definition of D, there are ℓ distinct numbers i1,…,iℓ in {2,3,…,k}, indexed in increasing order, so that D∩C2~={ci1−c1,…,ciℓ−c1}.
Since C2~ is sum-free, one has cit−ci1=(cit−c1)−(ci1−c1)∈/C2~ for all t with 2≤t≤ℓ. Also cit−ci1∈/C1, as C1 is sum-free. Hence cit−ci1∈R for each t∈{2,…,ℓ}, and so ∣R∣≥ℓ−1. Thus
[TABLE]
Using these bounds on ∣D∩C2~∣ gives ∣C1∣=∣D∣+1≤2∣R∣+2≤(2/5+3η)n when η≥1/n. In the same manner we can show ∣C2∣≤(2/5+3η)n.
∎
We consider the sets C1~=C1∖C2 and C2~=C2∖C1. Clearly one has
C1~∩C2~=∅. Since max{∣C1∣,∣C2∣}≤(2/5+3η)n by Claim 3.3 and ∣C1∪C2∣≥(4/5−η)n by the assumption, we find ∣C1∩C2∣≤7ηn
and min{∣C1~∣,∣C2~∣}≥(2/5−4η)n.
We shall derive the lemma from this information and Theorem 1.4.
Applying Theorem 1.4 to C1~ and C2~, and noting that min{∣C1~∣,∣C2~∣}≥(2/5−4η)n and C1~∩C2~=∅, we conclude that, up to a permutation of C1~ and C2~, one of the following conditions must be true:
(i’)
C1~⊆F1,4 and C2~⊆F2,3;
(ii’)
C1~⊂I1~ and min(C2~)≥(2/5−4η)n, where
[TABLE]
If condition (i’) holds, then ∣C1∖F1,4∣+∣C2∖F2,3∣≤2∣C1∩C2∣≤14ηn. Suppose, then, that condition (ii’) is true. In particular, one has ∣C1~∖I1∣≤1200ηn+3. Hence
[TABLE]
as ∣C1∩C2∣≤7ηn.
It remains to bound ∣C2∖I2∣. From condition (ii’) and the fact that C1~∩C2~=∅, we learn that C2~∖I2⊆C2~∩I1~⊆I1~∖C1~. Thus C2∖I2⊆(C2~∖I2)∪(C1∩C2)⊆(I1~∖C1~)∪(C1∩C2), leading to
[TABLE]
where the second inequality follows from condition (ii’), and in the last we evaluated ∣I1~∣≤(2/5+1200η)+3, ∣C1~∣≥(2/5−4η)n and ∣C1∩C2∣≤7ηn. From these upper bounds on ∣C1∖I1∣ and ∣C2∖I2∣, we find
[TABLE]
We also need a removal lemma of Green [27, Corollary 1.6] for sum-free sets.
Lemma 3.4** (Green).**
Suppose that C⊆[n] is a set containing o(n2) Schur
triples. Then there exists a sum-free subset C~ of C such that ∣C∖C~∣=o(n).
From Lemma 3.1, Proposition 3.2 and Lemma 3.4 we obtain the following description of almost all A∈SF2(n). Note that we shall identify each set A∈SF2(n) with a pair (A1,A2) of disjoint sum-free sets so that A=A1\mathaccent0⋅∪A2.
Corollary 3.5**.**
Given δ>0, every set A∈SF2(n), with at most o(24n/5) exceptions, has one of the following structures (up to a permutation of A1 and A2):
(a)
∣A1∖F1,4∣+∣A2∖F2,3∣≤δn;
(b)
∣A1∖I1∣+∣A2∖I2∣≤δn, in which I1=(5n,52n]∪(54n,n] and I2=(52n,54n].
In the remainder of the paper we refer to sets that satisfy condition (a) and condition (b) from Corollary 3.5 as type (a) and type (b) respectively. Note that Corollary 3.5 implies that, in order to prove Theorem 1.5, it suffices to show that there are at most O(24n/5) sets A∈SF2(n) of type (a) and type (b).
Let η=min{29δ,(3430δ)2,21c\refprop:stability}, where c\refprop:stability is the absolute positive constant from Proposition 3.2. For each set A∈SF2(n), we fix a pair (A1,A2) of disjoint sum-free sets in [n], and a container (C1,C2)∈C such that A=A1∪A2 and (A1,A2)⊂(C1,C2). According to Lemma 3.1 (ii), the number of set A∈SF2(n) for which ∣C1∪C2∣≤(4/5−η)n is certainly at most 2(4/5−η)n⋅2o(n)=o(24n/5), so suppose ∣C1∪C2∣≥(4/5−η)n. By Lemma 3.4, there exists a pair (C1~,C2~) of sum-free sets such that (C1~,C2~)⊆(C1,C2) and ∣C1∖C1~∣+∣C2∖C2~∣=o(n). Observe that ∣C1~∪C2~∣=∣C1∪C2∣−o(n)≥(4/5−2η)n. Since 2η≤c\refprop:stability by the choice of η, we may appeal to Proposition 3.2 to conclude that:
(a′)
∣C1~∖F1,4∣+∣C2~∖F2,3∣≤28ηn, or
(b′)
∣C1~∖I1∣+∣C2~∖I2∣≤3429ηn.
If case (a′) is true, then ∣A1∖F1,4∣+∣A2∖F2,3∣≤28ηn+o(n)<δn for η≤δ/29. If, however, case (b′) occurs then ∣A1∖I1∣+∣A2∖I2∣≤3429ηn+o(n)≤δn since η≤(3430δ)2, completing the proof.
∎
3.2. Restricted partitions and sumsets
In this section, we introduce some tools that are useful for counting
sets A∈SF2(n) of type (a) and type (b).
A handy tool for the study of sumsets is Plünnecke’s inequality [39].
Lemma 3.6** (Plünnecke Inequality).**
If S is a set of integers and ∣S+S∣≤R∣S∣, then
[TABLE]
for any positive integer k.
We shall need the following bound on the number of s-subsets S of {1,2,…,D} with ∣S+S∣≤R∣S∣, due to Green and Morris [29, Theorem 1.1].
Lemma 3.7** (Green–Morris).**
Fix δ>0 and R>0. Then the following holds for all integers s with s≥s0(δ,R). For any D∈N there are at most
[TABLE]
sets S⊆[D] with ∣S∣=s and ∣S+S∣≤R∣S∣.
Lemma 3.7 will be used in conjunction with some estimates on binomial coefficients, which we list here for future reference. It is well-known that for every integers n and k with 0≤k≤n and every real α with 0≤α≤1/2, we have
[TABLE]
where H(x)=−xlog2(x)−(1−x)log2(1−x) is the binary entropy function.
Another component in our argument is a crude bound on the number of restricted integer partitions (see [4, Lemma 5.1]).
Lemma 3.8**.**
Given k,ℓ∈N, let pℓ∗(k) denote the number of integer partitions of k into ℓ distinct parts. Then
[TABLE]
To handle sets with large sumset, we shall apply the following lower tail estimate, which is a special case of Janson’s inequality (see [33, Theorem 2.14]).
Lemma 3.9** (Janson Inequality).**
Suppose that {Ui}i∈I is a collection of subsets of a finite set Γ. Let
[TABLE]
where the second sum is over ordered pairs (i,j) such that i=j and Ui∩Uj=∅.
Then the number of subsets of Γ that contain at most μ/2 sets Ui is at most
[TABLE]
3.3. Counting sets of type (a) and type (b)
Throughout we identify each set A∈SF2(n) with a pair (A1,A2) of disjoint sum-free sets so that A=A1\mathaccent0⋅∪A2.
The following lemma deals with sets of type (a).
Lemma 3.10**.**
There are (1+o(1))2⌈4n/5⌉ sets A∈SF2(n) of type (a), provided that δ>0 is sufficiently small.
Proof.
There are 2⌈4n/5⌉ sets A∈SF2(n) with A1⊆F1,4 and A2⊆F2,3. So, to prove the lemma, it suffices to show that the number of type (a) sets A with 0<∣A1∖F1,4∣+∣A2∖F2,3∣≤δn is o(24n/5). By symmetry we only need to deal with the case that A1∖F1,4 contains at least one element, t say. If t<n/2, then we may select n/20 disjoint pairs (x,x+t) in F1,4, and A1 can not contain both of the elements of any of them since it is sum-free. The number of choices for the pair (A1∩F1,4,A2∩F2,3) is thus no more than 23n/103n/20⋅22n/5=27n/103n/20. Furthermore, since ∣A1∖F1,4∣+∣A2∖F2,3∣≤δn, the number of pairs (A1∖F1,4,A2∖F2,3) is at most (∑i≤δn(in))2≤22H(δ)n, due to (3.1). We deduce that there are at most 27n/103n/20⋅22H(δ)n=o(24n/5)
ways to choose (A1,A2). If t≥n/2 then a very similar argument applies with pairs (x,x−t).
∎
We now turn our attention to sets of type (b). Note that Corollary 3.5, Lemmas 3.10 and 3.11 together imply Theorem 1.5.
Lemma 3.11**.**
If δ>0 is sufficiently small, then there are O(24n/5) sets A∈SF2(n) of type (b).
The proof of Lemma 3.11 is fairly long and technical so, in order to aid the reader, we shall start by giving a brief sketch. The argument is split into four claims; the first three being relatively straightforward, and the last being somewhat more involved.
We begin, in Claim 3.12, by using a direct argument to give a description of almost all sets A∈SF2(n) of type (b). In Claims 3.13, 3.14 and 3.15, we use this description to bound the number of sets A∈SF2(n) with S=A∩[n/5] fixed. Specifically, writing ℓ=∣S∣ and k=∑a∈S(n/5−a), in Claim 3.13 we use Claim 3.12, Lemmas 3.8 and 3.9 to deal with the case
k≫ℓ2. Then, in Claim 3.14, we use Claim 3.12 and Lemma 3.8 to handle the case k=O(ℓ2) and ∣S+S∣≫∣S∣. Finally, in Claim 3.15, we treat the remaining (hard) case; however, since we now have ∣S+S∣=O(∣S∣), we may apply Lemma 3.7 in place of Lemma 3.8.
Fix δ>0 sufficiently small, and let n∈N. We shall show that there are at most O(24n/5) sets A∈SF2(n) of type (b). Since for us the residue of n modulo 5 will not matter, we assume for simplicity throughout the proof that n is divisible by 5. We begin by proving that a typical set A∈SF2(n) of type (b) has the following property:
[TABLE]
Claim 3.12**.**
With o(24n/5) exceptions, all sets A∈SF2(n) of type (b) satisfy (α).
Proof.
Let A∈SF2(n) be a set of type (b) that does not posses property (α). If A1 contains an element t∈[(51−1001)n]∪[(52+1001)n,(54−1001)n], then we can pick at least n/400 disjoint pairs (x,x+t) in I1. Thus the number of ways to choose (A1∩I1,A2∩I2) is at most 279n/2003n/400⋅22n/5=2159n/2003n/400. In addition, since ∣A1∖I1∣+∣A2∖I2∣≤δn, there are at most 22H(δ)n choices for (A1∖I1,A2∖I2). From these estimates it follows that there are at most 2159n/2003n/400⋅22H(δ)n=o(24n/5) possible assignments for (A1,A2). The same conclusion can be drawn for the case that A2 has at least one element in [(52−1001)n]∪[(54+1001)n,n].
∎
From now on we may restrict our attention to those A∈SF2(n) satisfying (α). Let
[TABLE]
denote the collection of elements of A which are at most n/5. We shall count the number of sets A∈SF2(n) with S(A) fixed. The following simple but crucial observation will be exploited several times to bound the number of ways to choose A∩{n/5+1,…,n}.
Observation**.**
Every set A∈SF2(n) with property (α) satisfies the following:
(i)
S(A)=A1∩[(51−1001)n,51n], and A∩(S(A)+S(A))⊆A2∩[(52−501)n,52n];
(ii)
If X⊆A2∩[(52−501)n,52n], then A∩{4n/5+1,…,n} and S+(A∩2X) are disjoint subsets of {4n/5+1,…,n}.
Proof.
To ease notation we shall write S for S(A).
(i) The first statement holds since A2∩[n/5]=∅ and min(A1)≥(51−1001)n by property (α). Since S=A1∩[(51−1001)n,51n], we have 2S⊆2A1∩[(52−501)n,52n]. As A1∩2A1=∅ and A=A1∪A2, this forces
[TABLE]
(ii) As X⊆A2∩[(52−501)n,52n], we have 2X⊆2A2∩[(54−251)n,54n]. Since A2∩2A2=∅ and A=A1∪A2, it follows that
[TABLE]
As S=A1∩[(51−1001)n,51n] due to (i), this implies S+(A∩2X)⊆2A1∩[(1−201)n,n]. In particular, one has S+(A∩2X)⊆{4n/5+1,…,n}. Furthermore, the intersection of A and S+(A∩2X) is contained in
(A∩2A1)∩[(1−201)n,n]⊆A2∩[(1−201)n,n]=(α)∅.
These properties imply the statement.
∎
The remainder of the proof involves some careful counting using the observation as well as Lemmas 3.7, 3.8 and 3.9.
We shall break up the calculation into three claims. In the first two, we count the sets A for which ∑a∈S(A)(n/5−a) is large (Claim 3.13), or ∑a∈S(A)(n/5−a) is small and ∣S(A)+S(A)∣ is large (Claim 3.14). Finally we count the remaining sets in Claim 3.15.
Let S(k,ℓ) denote the collection of sets S⊆[n/5] with ∣S∣=ℓ and
[TABLE]
Claim 3.13**.**
For a given fixed ℓ∈N, there are at most e−ℓ24n/5 sets A∈SF2(n) of type (b) which satisfy (α) and with S(A)∈S(k,ℓ) for some k≥ℓ2/δ2.
Proof.
For k≥ℓ2/δ2 and S∈S(k,ℓ), let I(S) denote the family of all sets A∈SF2(n) of type (b) that satisfy (α) and with S(A)=S. We shall first bound I(S) and then sum over choices of S. Define the graph G of ‘forbidden monochromatic pairs’ by setting
[TABLE]
We partition I(S)=I1(S)\mathaccent0⋅∪I2(S), in which I1(S) consists of all those sets A∈I(S) having the property that A∩V(G) contains at most k/8 edges of G.
We shall use Janson Inequality to estimate ∣I1(S)∣.
Observe that G has k edges and maximum degree at most 2ℓ, since S(A)=S∈S(k,ℓ). Let μ and Δ be the quantities defined in the statement of Lemma 3.9 and note that we are applying the lemma with ∣Γ∣=n/5. We have
[TABLE]
Accordingly μ2/(8μ+8Δ)≥k/(96ℓ), and so the number of choices for A∩V(G) is at most e−k/96ℓ2n/5. On the other hand, we can pick A∩{2n/5+1,…,n} in at most 23n/5 ways. We thus have
[TABLE]
We proceed to bound ∣I2(S)∣. For each subset T⊆V(G) so that T contains at least k/8 edges of G, we define I2(S,T) to be the collection of sets A∈I2(S) with A∩V(G)=T. We see immediately that I2(S)=⋃TI2(S,T), and so the task is now to estimate ∣I2(S,T)∣. Observe that a set A∈I2(S,T) is uniquely determined by the intersection of A and {2n/5+1,…,n}. For this reason we fix S and T, and bound the number of ways to choose A∩{2n/5+1,…,n}. Since G has maximum degree at most 2ℓ, we may select k/16ℓ disjoint edges in T=A∩V(G), say {xi,xi+si} with 1≤i≤k/16ℓ. Let B={xi+si:1≤i≤k/16ℓ}. Then, si∈A1∩[(51−1001)n,51n] by Observation (i), and so xi∈A1∩[51n,(51+1001)n] due to property (α) and the fact that xi+si≤2n/5. Hence B⊆2A1∩[(52−1001)n,52n].
Since A1 is sum-free and B⊆A, this forces
[TABLE]
We thus have
[TABLE]
This suggests us splitting I2(S,T)=I2′(S,T)\mathaccent0⋅∪I2′′(S,T), where I2′(S,T) contains every set A∈I2(S,T) with ∣A∩2B∣≤∣2B∣/4.
We consider a set A∈I2′(S,T). Since ∣A∩2B∣≤∣2B∣/4 by the definition of I2′(S,T), we may pick A∩2B from the family of all subsets of 2B in at most 2H(1/4)∣2B∣ ways. Thus, noting that 2B⊆{2n/5+1,…,n} by (3.4) and that A is determined by A∩{2n/5+1,…,n}, we have
[TABLE]
Suppose now that A∈I2′′(S,T). Evidently there are at most 22n/5 ways to choose A∩{2n/5+1,…,4n/5}. We shall fix this set and bound the number of possibilities for A∩{4n/5+1,…,n}. As 2B⊆{2n/5+1,…,4n/5} by (3.4), S+(A∩2B) is already determined. Moreover, it follows from property (3.3) and Observation (ii) that A∩{4n/5+1,…,n} and S+(A∩2B) are two disjoint subsets of {4n/5+1,…,n}. Hence there are at most 2n/5−∣S+(A∩2B)∣≤2n/5−∣A∩2B∣ possible outcomes for the set A∩{4n/5+1,…,n}. Therefore, we get the estimate
[TABLE]
in which the second inequality follows from the definition of I2′′(S,T).
Finally there are at most (ℓ2e2k)ℓ choices for S∈S(k,ℓ) by Lemma 3.8, and hence, using (3.2) and (3.7), we can bound the number of sets A from above by
[TABLE]
where the second inequality holds since g(x)=xae−bx is decreasing on [a/b,∞) and g(x+1/b)<g(x)/2 for x≥4a/b. (Note that we have ℓ2/δ2≥4ℓ⋅130ℓ since δ>0 is sufficiently small.)
∎
Claim 3.14**.**
For a given fixed ℓ∈N, there are at most e−ℓ24n/5 sets A∈SF2(n) of type (b) that satisfy (α) and with
(β1)
S(A)∈S(k,ℓ)* for some k≤ℓ2/δ2;*
(β2)
∣S(A)+S(A)∣≥∣S(A)∣/δ2.
Proof.
The proof is similar in spirit to that of Claim 3.13. Fixing an integer k with k≤ℓ2/δ2 and a set S∈S(k,ℓ) with ∣2S∣≥ℓ/δ2, we denote by I(S) the collection of all sets A∈SF2(n) of type (b) that satisfy (α) and with S(A)=S. Further partition I(S)=I1(S)\mathaccent0⋅∪I2(S), where I1(S) consists of all sets A∈I(S) with ∣A∩2S∣≤∣2S∣/4.
We first count I1(S). Notice that 2S⊆{n/5+1,…,2n/5} due to Observation (i), and ∣A∩2S∣≤∣2S∣/4 by the definition of I1(S). From this we deduce that there are no more than 2n/5−∣2S∣2H(1/4)∣2S∣ choices for A∩{n/5+1,…,2n/5}. Since we can take A∩{2n/5+1,…,n} in at most 23n/5 possible ways, it follows that
[TABLE]
for ∣2S∣≥ℓ/δ2.
We next deal with I2(S). For each subset T⊆2S with ∣T∣≥∣2S∣/4, we define I2(S,T) to be the collection of sets A∈I2(S) with A∩2S=T. We shall fix such a set T and further partition I2(S,T)=I2′(S,T)\mathaccent0⋅∪I2′′(S,T), in which I2′(S,T) consists of sets A∈I2(S,T) with ∣A∩2T∣≤∣2T∣/4.
Note that
[TABLE]
Suppose first that A∈I2′(S,T). Then ∣A∩2T∣≤∣2T∣/4 by the definition of I2′(S,T), and so we can choose A∩2T in at most 2H(1/4)∣2T∣ ways. Moreover, by Observation (i) we have
2S⊆{n/5+1,…,2n/5} and 2T⊆4S⊆{2n/5+1,…,4n/5}. So there are at most 24n/5−∣2S∣−∣2T∣ possibilities for A∖(S∪2S∪2T). (Recall that S=A∩[n/5].) We therefore obtain
[TABLE]
Suppose now that A∈I2′′(S,T). Since 2S⊆{n/5+1,…,2n/5} by Observation (i), and the sets S=A∩[n/5] and T=A∩2S have been chosen, we see that A is uniquely determined by A∩({n/5+1,…,4n/5}∖2S) and A∩{4n/5+1,…,n}. We can trivially bound the number of choices for A∩({n/5+1,…,4n/5}∖2S) by 23n/5−∣2S∣. We shall fix this set and bound the number of ways to choose A∩{4n/5+1,…,n}. Note that fixing A∩[4n/5] determines S+(A∩2T). Furthermore, we know from Observation (i) that T=A∩2S is contained in A2∩[(52−501)n,52n], and consequently A∩{4n/5+1,…,n} and S+(A∩2T) are disjoint subsets of {4n/5+1,…,n} due to Observation (ii).
Hence we can assign A∩{4n/5+1,…,n} in at most 2n/5−∣S+(A∩2T)∣≤2n/5−∣A∩2T∣ possible ways, as S=∅. Putting everything together we get
[TABLE]
where the second inequality holds since ∣A∩2T∣≥∣2T∣/4 by the definition of I2′′(S,T).
Finally adding inequalities (3.8) and (3.12), and summing over all S, we get the following bound on the number of sets A:
[TABLE]
where the first inequality holds since ∣S(k,ℓ)∣≤(ℓ2e2k)ℓ due to Lemma 3.8.
∎
The following claim now completes the proof of Lemma 3.11.
Claim 3.15**.**
There exists an absolute constant ℓ0 so that for every integer ℓ≥ℓ0 there are at most e−ℓ/524n/5 sets A∈SF2(n) of type (b) which satisfy (α) and with
(γ1)
S(A)∈S(k,ℓ)* for some k≤ℓ2/δ2;*
(γ2)
∣S(A)+S(A)∣≤∣S(A)∣/δ2.
Proof.
This is the most difficult case, and we shall have to count more carefully, using Lemma 3.7.
For each k∈N and λ>0, let S(λ)(k,ℓ) denote the collection of sets S∈S(k,ℓ) such that
[TABLE]
Given S∈S(λ)(k,ℓ), we denote by I(S) the collection of all sets A∈SF2(n) of type (b) that satisfy (α) and with S(A)=S. It is not hard to see that the number of sets A that satisfies the hypothesis of Claim 3.15 is bounded from above by ∑∣I(S)∣, where the sum is taken over all triples (k,λ,S) with k≤ℓ2/δ2, λ=(2−δ)(1+δ)i for some integer i with 0≤i≤δ3lnδ1, and S∈S(λ)(k,ℓ). To count I(S), we partition I(S)=J(S)\mathaccent0⋅∪K(S), in which J(S) consists of all sets A∈I(S) with ∣A∩2S∣≤∣2S∣/20.
We shall use Lemma 3.7 to count the number of triples (k,λ,S). As noted above, there are only Oδ(ℓ2) choices for k and λ; this will be absorbed by the error term 2O(δℓ). We may apply Lemma 3.7 to R=(1+δ)λ, s=ℓ and D=k, and conclude that there are at most 2O(δℓ)(ℓ(1+δ)λℓ/2) choices for S∈S(λ)(k,ℓ). (Note that (1+δ)λ=Oδ(1), k=Oδ(ℓ2) and ℓ is sufficiently large.)
We are now ready to estimate the sum ∑(k,λ,S)∣J(S)∣. Analysis similar to that in the proof of Claim 3.14 shows
[TABLE]
since ∣2S∣≥λℓ for all S∈S(λ)(k,ℓ). Summing over all choices of (k,λ,S), and recalling that λ=(2−δ)(1+δ)i≥2−δ, we thus get
[TABLE]
We proceed to bound the sum ∑(k,λ,S)∣K(S)∣.
For each p∈N and μ>0, let T(μ)(S,p) be the collection of sets T⊆2S with
[TABLE]
For any set T∈T(μ)(S,p) and any integer q with 0≤q≤∣2T∣, let K(T,q) stand for the collection of those sets A∈K(S) with A∩2S=T and ∣A∩2T∣=q. From the definition of K(S), we know that ∣2S∣/20≤∣T∣≤∣2S∣ (otherwise K(T,q)=∅), and so λℓ/20≤p≤(1+δ)λℓ. Moreover, as ∣2S∣≤∣S∣/δ4 by our choice of λ, Lemma 3.6 implies ∣4S∣≤∣S∣/δ16, giving ∣2T∣≤∣4S∣≤∣2S∣/δ16≤20∣T∣/δ16. Accordingly we only need to care about those μ so that μ=(2−δ)(1+δ)j for some integer j with 0≤j≤δ17lnδ1.
Summarizing, we have
[TABLE]
where the sum is over all quadruples (p,μ,T,q) such that λℓ/20≤p≤(1+δ)λℓ, μ=(2−δ)(1+δ)j for some integer j with 0≤j≤δ17lnδ1, T∈T(μ)(S,p), and q≤∣2T∣.
From the previous discussion, we deduce that there are only Oδ(p) choices for p and μ; this will be absorbed by the error term 2O(δp). Using Lemma 3.7 with R=(1+δ)μ, s=p and D=2k, we find that there are at most 2O(δp)(p(1+δ)μp/2) choices for T∈T(μ)(S,p).
Since q≤∣2T∣≤(1+δ)μp, we have only Oδ(p) possibilities for q, and this will also be absorbed by the error term 2O(δp).
We are reduced to enumerating K(T,q) for fixed T∈T(μ)(S,p) and q≤(1+δ)μp. Since ∣A∩2T∣=q, there are at most (q∣2T∣) choices for A∩2T. In addition, since 2S and 2T are disjoint subsets of {2n/5+1,…,4n/5} due to Observation (i), we can allocate A∩({2n/5+1,…,n}∖(2S∪2T)) in at most 23n/5−∣2S∣−∣2T∣ possible ways. Furthermore, specifying A∩[4n/5] determines S+(A∩2T). As A∩{4n/5+1,…,n} and S+(A∩2T) are disjoint subsets of {4n/5+1,…,n} by Observation (ii), this implies that there are at most 2n/5−∣S+(A∩2T)∣≤2n/5−q possibilities for A∩{4n/5+1,…,n}.
Here we evaluate ∣S+(A∩2T)∣≥∣A∩2T∣=q for S=∅. Therefore, recalling that ∣2S∣≥λℓ and μp≤∣2T∣≤(1+δ)μp, we get
[TABLE]
From what has already been proved we may bound ∑(k,λ,S)∣K(S)∣ from above by
[TABLE]
where in the last inequality we used the fact that the term 2O(δℓ) is absorbed by the error term 2O(δp). We shall deploy the entropy estimate (3.1) to control the last expression. For abbreviation, set x=(1+δ)λ/2, y=(1+δ)μ/2, z=q/(1+δ)μp, and D={(x,p,y,z)∈R4:x≥1,0≤p≤2xℓ,y≥1,0≤z≤1}.
Recalling that λ,μ≥2−δ, 0≤p≤(1+δ)λℓ and 0≤q≤(1+δ)μp, we then have (x,p,y,z)∈D. Now using inequality (3.1) and simplifying yields
[TABLE]
where h(x,p,y,z):=(xH(x1)−1+δ2x)ℓ+(yH(y1)−1+δ2y+O(δ))p+2yp⋅(H(z)−z).
A straightforward but slightly tedious calculation shows that the maximum value of h(x,p,y,z) on D is (log2(11581)+O(δ))ℓ≈−0.505ℓ, attained at z=31, y=716+O(δ), p=2xℓ and x=115196+O(δ).444We can solve this optimisation problem backwardly using the following simple facts. Firstly, the function f(z)=H(z)−z achieves its maximum at z=1/3.
Secondly, given ρ>0, the function g(t)=tH(t1)−ρt is maximised at t=2ρ/(2ρ−1). Hence
[TABLE]
Finally adding inequalities (3.3) and (3.14), and summing over all triples (k,λ,S), we conclude that the number of sets A is at most
[TABLE]
The proof of Lemma 3.11 is at long last complete.
∎
4. Concluding remarks
In this paper we studied the general structure of large sum-free sets of integers. From this we obtained a good bound on the total number of 2-wise sum-free subsets of [n]. It is likely that our methods extend to give an asymptotic formula for this number, but we do not pursue this here. We close with some remarks
and possible directions for further research.
Sets with small difference constant
The main open problem is to determine the critical density
threshold at which Theorem 1.4 ceases to hold. Note that in the theorem, the value for c given by our argument is something like 10−6c\refthm:Jin2, where c\refthm:Jin is the absolute positive constant from Jin’s inverse theorem (Lemma 2.6). Note that Jin obtained his result via non-standard analysis, and thus no explicit value of c\refthm:Jin can be extracted from his proof.
Using the following conjecture instead of Lemma 2.6, we would certainly get a reasonable value for c.
Conjecture 4.1**.**
There exists a natural number K such that for any finite set of integers A so that ∣A∣≥K and ∣A−A∣=3∣A∣−3+r for some integer r with 0<r<31∣A∣−2, one of the following properties holds:
(i)
A* is a subset of an arithmetic progression of length 2∣A∣−1+2r;*
(ii)
A⊆P1∪P2* for some arithmetic progressions P1,P2 with common step and ∣P1∣+∣P2∣≤∣A∣+r.*
We remark that the sumset version of Conjecture 4.1 was proposed by Freiman [24]. The following example shows that the condition r<31∣A∣−2 is necessary.
Example 4.2**.**
Let y≥4x, and consider the set A={0,y,2y}+[0,x−1]. We have A−A={0,±y,±2y}+[−x+1,x−1], and so ∣A−A∣=10x−5=(3∣A∣−3)+(31∣A∣−2).
But A is neither a subset of an arithmetic progression of length (2∣A∣−1)+2⋅(31∣A∣−2) nor a subset of an union of two arithmetic progressions of total length ∣A∣+(31∣A∣−2).
It is worth mentioning that Eberhard, Green and Manners [17] provided a rough structure theorem for sets of integers of difference constant less than 4. Specifically, they proved that if A is a subset of Z with ∣A−A∣≤(4−ϵ)∣A∣ then A has density at least 21+2−1000ϵ on some arithmetic progression of length ≫ϵ∣A∣. They then used this result to show the existence of a set of n positive integers with no sum-free subset of size greater than 31n+o(n), answering a famous question of Erdős [19] from 1965.
Union of intersecting families
One can pursue the following general questions for any monotone propertyP:
(i)
What is the maximum size of a union of r objects with property P?
(ii)
How many objects which can be partitioned into r subobjects having property P are there?
In this paper, we addressed the second question for the sum-free property. In what follows, we shall single out another monotone property for further research.
A family of sets is called intersecting if it does not contain two disjoint sets. Given a positive integer r, a family F is said to be r-wise intersecting if there exists a partition of F into r intersecting families.
Let Ir(n,k) denote the collection of all r-wise intersecting families F⊆(k[n]).
The celebrated Erdős-Ko-Rado theorem from 1961 states that for n≥2k the largest member of I1(n,k) has size (k−1n−1). Recently Ellis and Lifshitz [18] considered the problem, first raised by Erdős [20], of determining the maximum possible size of a family in Ir(n,k) when r≥2. Specifically, they showed ∣F∣≤(kn)−(kn−r) for any F∈Ir(n,k) provided that r≥2 and n≥2k+C(r)k2/3, with equality holds if and only if F={F∈(k[n]):F∩R=∅} for some R∈(r[n]). In the case r=2, this significantly improves a previous result due to Frankl and Füredi [21]. It would be interesting to determine whether C(r)k2/3 is the best possible error term. Note that an example given by Frankl and Füredi [21] shows that this term cannot be reduced to below k.
The problem of enumerating I1(n,k) was first investigated by Balogh, Das, Delcourt, Liu and Sharifzadeh [6]. Building on the of Balogh et al., Frankl and Kupavskii
[22] and, independently, Balogh, Das, Liu, Sharifzadeh and Tran [7] established the asymptotic formula ∣I1(n,k)∣=(n+o(1))2(k−1n−1) for n≥2k+3klnk. Motivated by this result and the theorem of Ellis and Lifshitz, we make the following conjecture.
Conjecture 4.3**.**
∣Ir(n,k)∣=((rn)+o(1))2(kn)−(kn−r)* for r≥2 and n≥2k+C(r)k0.9, where the term o(1) tends to [math] as n→∞.*
Acknowledgement
The author was supported by the Alexander Humboldt Foundation, and by the GACR grant GJ16-07822Y, with institutional support RVO:67985807. He would like to thank Jan Hladky and Phuong Dao for fruitful discussions. He is also grateful to an anonymous referee whose suggestions helped improve and clarify the manuscript.
We begin by showing that the set [x,x+k−1]∖(A+B) has at most 2ϵk elements for each integer x∈[b1,bℓ+1]. Indeed let i∈[ℓ] be the largest integer such that bi≤x. For convenience, set bℓ+1=bℓ+1. From the definition of i and the fact that bi+1−bi≤k, we find [x,x+k−1]⊆[bi,bi+1+k−1]={bi,bi+1}+[0,k−1]. Moreover, since ∣A∣≥(1−ϵ)k, there are at most 2kϵ elements in {bi,bi+1}+[0,k−1] which do not belong to {bi,bi+1}+A. Hence [x,x+k−1] contains only elements of A+B, with at most 2kϵ exceptions, as claimed.
Finally, because [b1,bℓ+k] can be covered by at most (k+bℓ−b1+1)/k+1 intervals of the form [x,x+k−1] with x∈[b1,bℓ+1], we find
(i) A proof of this result can be found in [15, Lemma 2.2].
(ii) Denote s=∣A∣. We wish to show that x∈2A for each x∈[2k−2s+2,2s−2]. Since [2k−2s+2,2s−2]=[k−s+1,s−1]+[k−s+1,s−1], one has x=y+z for some integers y,z∈[k−s+1,s−1]. Note that y+i,z−i∈[0,k] for every integer i∈[−k+s−1,k−s+1], and ∣[0,k]∖A∣≤k−s+1. Thus by the pigeonhole principle, there exists j∈[−k+s−1,k−s+1] so that y+j,z−j∈A. We then have x=(y+j)+(z−j)∈2A.
∎
Bibliography43
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] H. L. Abbott and E. T. H. Wang, Sum-free sets of integers , Proceedings of the American Mathematical Society 67 (1977), 11–16.
2[2] N. Alon, Independent sets in regular graphs and sum-free subsets of abelian groups , Israel Journal of Mathematics 78 (1991), 247–256.
3[3] N. Alon, J. Balogh, R. Morris and W. Samotij, Counting sum-free sets in abelian groups , Israel Journal of Mathematics 199 (2014), 309–344.
4[4] N. Alon, J. Balogh, R. Morris and W. Samotij, A refinement of the Cameron–Erdős conjecture , Proceedings of the London Mathematical Society 108 (2014), 44–72.
5[5] R. Balasubramanian, G. Prakash and D. S. Ramana, Sum-free subsets of finite abelian groups of type III , European Journal of Combinatorics 58 (2016), 181–202.
6[6] J. Balogh, S. Das, M. Delcourt, H. Liu and M. Sharifzadeh, Intersecting families of dicrete structures are typically trivial , Journal of Combinatoral Theory. Series A 32 (2015), 224–245.
7[7] J. Balogh, S. Das, H. Liu, M. Sharifzadeh and T. Tran, Structure and supersaturation for intersecting families , ar Xiv:1802.08018.
8[8] J. Balogh, H. Liu, M. Sharifzadeh and A. Treglown, The number of maximal sum-free subsets of integers , Proceedings of the American Mathematical Society 143 (2015), 4713–4721.