Small Sets with Large Difference Sets
Luka Milicevic

TL;DR
This paper constructs sets in modular integers with full difference sets but arbitrarily small polynomial or mixed polynomial sumsets, partially answering a question by Nathanson about such sets.
Contribution
It provides new constructions of sets with full difference sets and small polynomial sumsets, extending previous results to more complex polynomial combinations.
Findings
Sets with full difference sets and small polynomial sumsets are constructed.
Results extend to complex polynomial combinations like quadratic and mixed sums.
Partial answers to Nathanson's problem on polynomial sumsets in modular integers.
Abstract
For every and , Haight constructed a set ( stands for the integers modulo ) for a suitable , such that and . Recently, Nathanson posed the problem of constructing sets for given polynomials and , such that and , where is the set , when has variables. In this paper, we give a partial answer to Nathanson's question. For every and , we find a set for suitable , such that , but , where . We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLimits and Structures in Graph Theory · graph theory and CDMA systems · Graph Labeling and Dimension Problems
Small Sets with Large Difference Sets
Luka Milićević E-mail address: [email protected] Department of Pure Mathematics and Mathematical Statistics
Wilberforce Road
Cambridge CB3 0WB
UK
Abstract
For every and , Haight constructed a set ( stands for the integers modulo ) for a suitable , such that and . Recently, Nathanson posed the problem of constructing sets for given polynomials and , such that and , where is the set , when has variables. In this paper, we give a partial answer to Nathanson’s question. For every and , we find a set for suitable , such that , but , where . We also extend this result to construct, for every and , a set for suitable , such that , but , where .
00footnotetext: 2010 Mathematics Subject Classification: 11B13; 11P99
1 Introduction
The problem of comparing different expressions involving the same subset of an abelian group (e.g. and ) is one of the central topics in additive combinatorics. For example, one of the starting points in the study of this field is the Plünnecke-Ruzsa inequality that bounds in terms of and .
Theorem 1.1**.**
(Plünnecke-Ruzsa inequality, [10], [12]) Let be a subset of an abelian group. Then, for any we have
[TABLE]
To illustrate the difficulties in determining the right bounds for such inequalities, we note that even for the comparison of and the right exponents are not known. In fact, the best known lower bounds for in terms of have not changed for more than 40 years.
Theorem 1.2** (Freiman, Pigaev; Ruzsa, [3], [11]).**
Let be a subset of an abelian group. Then .
In the opposite direction, the best known lower bound is given by the following result.
Theorem 1.3** (Hennecart, Robert, Yudin, [7]).**
There exist arbitrarily large sets such that , where .
In 1973, Haight [6] found for each and , an integer and a set such that and . Recently, Ruzsa [13] gave a similar construction, and observed that Haight’s work even gives a constant for each with the property that there are arbitrarily large with sets such that and . The ideas in both constructions are relatively similar, but Ruzsa’s argument is cosiderably more concise.
In [9], Nathanson applied Ruzsa’s method to construct sets with , but small, for rings that are more general than . In the same paper, he posed the following more general question. Given a polynomial with coefficients in , and a set , write . His question can be stated as: given two polynomials over and , does there exist arbitrarily large and a set such that , but ?111Actually, Nathanson poses this question for more general rings , but for , the formulation we give here is a natural one.
Let us now state the main result of this paper, which answers the first interesting cases of Nathanson’s question. Once again we recall the notation
[TABLE]
and more generally,
[TABLE]
Theorem 1.4**.**
Given and any , there is a natural number and a set such that
[TABLE]
In fact we prove rather more.
Theorem 1.5**.**
For , any and any , there is a natural number and a set such that
[TABLE]
Moreover, we can take to be a product of distinct primes, and we can take the smallest prime dividing to be arbitrarily large.
We shall discuss each of the cases separately. Note also an interesting phenomenon in the opposite direction. Namely, if we are not allowed freedom in the choice of the modulus, a statement like the theorem above cannot hold. The reason is that, by the result of Glibichuk and Rudnev (Lemma 1 in [4]) whenever for a prime , is a set of size at least , then (and certainly implies ). Hence, unlike the linear case, already for quadratic expressions we have strong obstructions.
In fact, this problem is comparable in spirit to sum-product phenomenon, which can be stated as the following theorem.
Theorem 1.6**.**
(Bourgain, Katz, Tao [2], Sum-product estimate.) Let be given. Then there is such that whenever for a prime satisfies
[TABLE]
then one has
[TABLE]
This was further generalized to arbitrary modulus .
Theorem 1.7**.**
*(Bourgain [1], Sum-product estimate for composite moduli.) Given such that , write for the natural projection from .
Let be given. We then have such that the following holds. Whenever satisfies*
[TABLE]
and,
[TABLE]
then
[TABLE]
Hence, the sum-product phenomenon still holds in general , even when N is composite, and given the similarity of our problem, it could well be that the result of Glibichuk and Rudnev stated above holds in the more general setting as well. (Note that if , then it satisfies the technical condition in Theorem 1.7.)
Conjecture 1.8**.**
There is such that whenever and , then we have .
1.1 Acknowledgements
I would like to thank Trinity College Cambridge and the Department of Pure Mathematics and Mathematical Statistics of Cambridge University for their generous support, and Imre Leader for encouragement and helpful discussions concerning this paper.
2 Overview of the Construction
We begin the paper by reviewing Ruzsa’s construction and generalizing its main ideas slightly to the context of polynomial expressions in . As it turns out, to be able to construct a set such that , but , it will suffice to consider expressions which are sums of terms of the form , and , with and then choose the maps so that the number of values attained by each expression is small. For example, one of the expressions that we have to consider already for the case is . This discussion takes place in Section 3 and the rest of the paper is devoted to constructions of maps for various expressions.
In Section 4, we construct sets such that but is small. In this construction, we come to a basic version of one of the main ideas, which we call the identification of coordinates. Very roughly, if is a product of distinct prime , using approximate homomorphisms between and , we can essentially treat as a vector space of dimension . Then, altough we might not ensure that each coordinate attains few values, we can ensure that their sum attains few values.
In Section 5, we construct sets such that but is small. There, we improve our results for the expression that involve a single variable using a variant of Weyl’s equidistribution theorem for polynomials. Using this result, the identification of coordinates is developped further and we conclude this section with the strongest form of identification of coordinates.
The final part of the construction, finding sets with small, is carried out in Section 6. There, we also touch upon some limitations of the usual approach and therefore develop different ideas to treat some of the remaining expressions. Namely, for certain choices of coefficients, in the expression
[TABLE]
the identification of coordinates cannot work. For this expression, we give a different, probabilistic argument.
The final section is devoted to some open problems and questions that naturally arise, including the motivation for some of these. We have tried to organize the paper so that methods used naturally develop from the case to the case , highlighting the new difficulties that arise and why the earlier arguments are not powerful enough for the later expressions.
3 Overview of Ruzsa’s argument and Initial Steps
We now briefly discuss Ruzsa’s construction of sets such that , but . His ideas will be important for the later constructions given in this paper.
Let us first analyse the requirement that . Given any , we thus have such that . If we write for such a , this yields a map with the property that all and are contained in . Removing all other elements from does not change the equality , and it can only make smaller, so Ruzsa’s starting point is to consider a set of the form
[TABLE]
where is map from to itself. We shall do the same in this paper as well, and throughout the paper we will devote ourselves to finding suitable modulus and maps on .
Thus, we have to understand how to find a suitable and a map which then give rise to the desired set . Let us now examine the elements of . These are sums , where . But each element of is either or for some . Hence, elements of are of the form
[TABLE]
for a subset and . Immediately we see that the number of different expressions here is bounded in terms of (in fact, it equals ). Further, we consider which of the are equal, grouping the corresponding terms and together, and renaming the variables along the path to . Hence, every element of is of the form
[TABLE]
where , and all are different. Once again, treating as formal variables, the number of expressions we wrote is bounded in terms of . The plan now is to make sure that each such expression attains a small number of values, so that in total only at most values attained.
Ruzsa’s main idea in the costruction is the separation of functions, which we now discuss. In all these expressions we have the same map occuring. However, we can turn the problem of constructing a single function that works for all expressions into a much easier problem of constructing a function for each expression separately. We first list all the expressions of the form (1), sorted in the asscending order by the number of variables appearing. Thus, our list start from expressions of the form . Next, we split as a product of coprime numbers , with one for each expression so that by Chinese Remainder Theorem we have .
We promise that however we choose an expression and values of , we get at least one zero coordinate (which need not depend on the expression) and we call this ZCP (Zero Coordinate Promise). If -th expression has only one variable appearing, thus it is of the form , we can easily ensure ZCP by setting the -th component of the function as . Now, take any expression
[TABLE]
and assume that for every such expression with fewer than variables ZCP holds. Let be the product of for the expressions with fewer than variables. Note that, if we are given , and if any two among them have the same value in , by induction hypothesis, ZCP already holds. Hence, we may assume not only that are different, but that they are different modulo . Write for the residue of mod . Then, looking at coordinate, we have to define such that
[TABLE]
equals zero for all choices of such that are different. But, we can rewriting as already tells us that we are actually looking for a new function for each variable! Hence, our goal is to find functions such that the expression is once again zero. But linear maps once again work.
We start our own work in this paper by slightly generalizing Ruzsa’s idea to polynomial setting. In what follows, by an -degree term we think of a product of terms of the from or , the only rule being that indices of the map and variable to which it is applied (and which is possibly added) coincide. For example, and are both -degree terms, but is not, since the indices are not valid.
Proposition 3.1**.**
Let be given, and let . Suppose that for every and every formal expression in functions and variables of the form
[TABLE]
we can find a modulus , which is a product of arbitrarily large distinct primes, and functions , so that the takes at most values in , when the functions are substituted in . Then, for every , there is a modulus , product of arbitrarily large distinct primes, and a set such that and
[TABLE]
Proof.
We proceed as in the Ruzsa’s construction (except that we do not insist of only having zero value in a coordinate, small number of values suffices). As before, we sort the expressions by the number of variables appearing, and process them in groups of those having the same number of a variables. We now turn to details.
Let . Let be all the expressions in variables of the following form. Each expression is a sum of terms, each being a product of short terms or , followed by terms which are products of short terms, etc. with a final contribution of terms, each being or . As in the discussion before, these are all expressions that naturally arise from , when is defined as . Comparing these expressions with the expressions in the assumptions of this proposition, we have that here only a single formal function appears, while in the other expressions we have a separate function for each variable. Let be indices such that if , then the number of different variables among appearing in the expression is exactly .
Fix an increasing sequence . We inductively construct moduli and functions such that for every we have that union of all images of expressions (that is all expressions futuring at most variables) takes at most values (when is substituted in the expressions).
Base case: . By the assumption, for every expression that has only one variable, we have moduli with arbitrarily large distinct prime factors, and a map , such that takes only at most values. Thus, w.l.o.g. are all coprime, with distinct arbitrarily large prime factors. We set and identify with , and we define coordinate-wise as , where is -th coordinate of . Note that union of all values attained by these expressions with this definition of and has size bounded by
[TABLE]
as desired. (Here we write for the resulting image of the expression , and we have a trivial bound for it – the expression may only take at most values on the coordinate.)
Inductive step. Suppose now that we have found such that in total all expressions with at most variables have a small image , i.e. only at most values are attained. We shall construct as a product , where is an auxiliary modulus for the expression , with the property that either takes one of the small number of values on or a value in another small set in . Here we use Ruzsa’s separation of functions idea.
Fix an expression with exactly variables. If we take values of these variables restricted to , and it happens so that at least two such values coincide, then using the map the value of the expression (also restricted to ) is actually a value of one of the expressions we already considered, with at most variables, so it lies in the small set . Hence, we only need to consider the choices of (w.l.o.g. these are the variables that appear) which differ in . We split the expression further into cases on mod , thus into further cases. Pick an arbitrary choice of distinct values in . Look back at and change every appearance of by . By assumptions, we have a choice of an integer with arbitrarily large distinct prime factors and maps such that the modified takes only at most values in . Finally, define as the product of all these , and as follows: for every , take at the coordinate corresponding to to be zero if modulo is not in , otherwise, if it is the -th residue, set , where is the coordinate of corresponding to . It remains to check the size of images.
For every expression and every choice of values of , we either end up in , which has size at most , or one of the coordinates is in a fixed subset of of size at most . Summing everything together, the image has at most values as desired.∎
The rest of the paper is therefore devoted to finding moduli and maps under which the expressions like do not take too many values. Along the way, we also discuss related problems and questions.
Notation. Throughout the paper, greek letters and will be used for the maps appearing in the expressions. The following functions will be frequently used in our construction. For a prime , we use the standard projection homomorphism , which sends integer to . Next, we define by sending to the integer such that . For two primes and , we also define the map given by . Finally, in any abelian group , and functions , from a set to , we write to mean that is a set of size at most . In particular, means that has a bounded size as grows.
4 Sets with small
The main result of this section is the case of the Theorem 1.5.
Theorem 4.1**.**
For any and any , there is a natural number , which is a product of distinct, arbitrarily large primes, and a set such that , while .
Proof.
We start from the Proposition 3.1. To be able to construct with full difference set, but small , we need to handle the expressions that are sums of the quadratic part which is a product of two terms of the form or , and a linear part which is itself a sum of summands, each being of the form or . Note that for the terms in the linear part whose variables do not appear in the quadratic part, we can define the corresponding maps to be affine so that the variables involved cancel out. Therefore, w.l.o.g. we only consider expressions whose variables appear already in the quadratic part. Note also that for the quadratic part we have two cases: either only one variable, w.l.o.g. , appears, or exactly two variables, w.l.o.g. and , appear. We treat these cases separately.
Case 1: only one variable in the quadratic part. Thus, our goal now is to show that if we are given a quadratic expression featuring only one variable, we can find a modulus and function, so that the expression takes a small number of values. In fact, here we do more and prove the claim for expressions of arbitrary degree.
Lemma 4.2**.**
Let be given, and let be a prime. Then, given any maps and any set of size less than , we can find another map such that the expression
[TABLE]
does not take a value in for any that has at least one of non-zero.
Proof.
Suppose that for some , we have that for every choice of we have . By the pigeonhole principle, some value is hit at least times. Thus, the polynomial
[TABLE]
has at least zeros, making it a zero polynomial. Hence are simultaneously zero, proving the lemma.∎
Corollary 4.3**.**
Let be an arbitrary -linear combination of terms of the form , where at least one of such terms with appears. Given any , we can find a modulus , which is a product of distinct arbirtrarily large primes, and a map such that under the expression takes at most values in .
Proof.
Rewrite by grouping together a -linear combination of that appear next to each . Thus, we can write as , where each is a polynomial in over , and at least one of is not a zero polynomial. Let . Pick distinct arbitrarily large primes , all w.l.o.g. larger than and absolute values of coefficients of (so that non-zero polynomials do not become zero modulo ). By the Lemma 4.2, we may find a map for each such that the image of has size at most , when the variable ranges over values such that polynomials are not simlutaneously zero. But there are at most values of such that , so we conclude that modulo each , the expression may take at most values. Finally, set and take to be , where we as usual identify with . Hence, modulo , the expression takes at most values. Taking large enough so that proves the corollary.∎
The case 1 now follows by applying Corollary 4.3.
Case 2: the quadratic part has two variables. The quadratic part must look like a product of two terms, each being either or . By suitably renaming the variables, and adding to if necessary, w.l.o.g. we only need to consider the case when the quadratic part is , and the whole expression is
[TABLE]
where each is a -linear combination of and . Note also that if is nonzero, then appears with a nonzero coefficient.
We have now come to an important point in this paper, and one of the key ideas, which we shall now explain. We have to construct and maps such that takes values. Suppose for a moment that the linear terms are both zero. Then, we have an easy way to make constant, by setting one of the to be zero. However, such an approach cannot work in the case when are not zero, as it would force one of the to be an affine map, which is surjective. As a way to overcome this, we can use both and to get additional freedom. Thus, we set , where are coprime products of distinct primes, identify with , and set to be zero on the first coordinate, and to be zero on the second coordinate. Hence if and , then the expression becomes
[TABLE]
We now want to find and so that the expression (2) does not take too many values in . Suppose for a moment that instead of coprime and we actually had . Then, we could have simply taken
[TABLE]
and
[TABLE]
which ensures that every value taken by the expression is of the form and hence it is in small subset of . It turns out that we can use the same approach even if . We shall refer to this idea as the identification of coordinates, which will appear at other places in this paper as well. The following proposition and its proof formalize this discussion. We slightly change the notation to make the reading easier.
Proposition 4.4**.**
(Basic identification of coordinates.) Let be given and let be primes greater than . Suppose that if then and if then . Then we have such that
[TABLE]
takes at most values, when range over all pairs of values in .
Recall the definition of map as the natural embedding of into , the natural projection , and finally, the composition , given by . Before proceeding with the proof, it is useful to note some easy properties of the maps and .
Lemma 4.5**.**
Let be primes. Then
- (1) Given , we have . Also, , when .
- (2) Given , we have
- (3) Given , we have
[TABLE]
- (4) Provided that , we have
[TABLE]
Proof.
(1) Applying , we have , thus . If , then , so the claim follows.
(2) Let . Note that and . From definition, and . Hence, if we set , we have and , so .
(3) The statement follows by applying to , noting that is an additive homomorphism and recalling that .
(4) From the definition, we have
[TABLE]
Write . Using the previous work, we know that , and , since . So , and the claim follows after applying .∎
Proof of Proposition 4.4.
Observe immediately that if , we can ensure that and , proving the claim. Therefore, we may assume , w.l.o.g. . If holds, then the function becomes , which can be made zero, by choosing zero maps for and . If exactly one of vanishes, say, then we can pick to ensure that , and set to get . From now on, assume that .
Set and . This makes for all choices of . It remains to pick so that takes a small number of values.
Set and . Hence becomes
[TABLE]
Let be given by , noting that . Then,
[TABLE]
Fixing the set , from Lemma 4.5 we have
[TABLE]
or, under our notation introduced earlier,
[TABLE]
Lemma 4.5 also implies that and , when , from which we conclude that
[TABLE]
so the image of the function is a subset of a preimage of of a set of size . Fibers of are of size at most , so the claim follows.∎
Applying the Proposition 4.4 finishes the proof of the Theorem 4.1.∎
4.1 Using affine maps in the case of two variables
In this subsection, we further discuss some quadratic expressions involving two variables. A natural map we can try is an affine map , for constants . However, if we look at expression , which was among the ones necessary to discuss in the proof of Theorem 4.1, it is easy to see that choosing affine maps from to for and yields full image, for every . Here we ask ourselves the question when we can use such maps to get a small image of the function defined by the expression.
As we shall see later in the paper, in the construction of with small , one of the expressions we shall consider has quadratic part of the form , with . It turns out that in this case the affine maps can be used as desired maps. We discuss these maps before the construction of with small , so that we can focus better on the new ideas needed for that case.
Lemma 4.6**.**
(Affine maps solution.) Let and be integers. Then, for any prime greater than absolute values of all the given integers, we can find affine maps such that
[TABLE]
is constant.
Proof.
Let and , with to be determined. With this choice of maps, the expression above becomes
[TABLE]
Hence, we need to make sure that
[TABLE]
[TABLE]
and
[TABLE]
This is equivalent to
[TABLE]
[TABLE]
and
[TABLE]
Hence, we can pick so that affine maps make our expression equal to constant iff are non-zero. ∎
5 Sets with small
This section is devoted to the proof of the case of the Theorem 1.5.
Theorem 5.1**.**
For any and any , there is a natural number , which is a product of distinct, arbitrarily large primes, and a set such that , while .
Proof.
The approach here is similar to the one in the proof of the Theorem 4.1, however the expressions that arise in this case are more complicated and require new ideas. Once again, the proof is based on the Proposition 3.1. As before, we split all expressions in their quadratic and linear parts, and we may assume that if a variable appears at all in an expression, it must appear in the quadratic part. Next, we consider all the possible cases for the quadratic part, and explain how to make the image of the expression small in each case separately. They are listed sorted by the support size and then by structure. We also have the freedom of renaming the variables. Again, we change the notation slightly; instead of and we use and respectively. The possible cases, w.l.o.g. are (all the are in )
Support of size 1.
- (a)
The non-linear part must look like . 2. 2.
Support of size 2. We have a few possibilities here.
- (a)
2. (b)
3. (c)
3. 3.
Support of size 3. We have a few possibilities here.
- (a)
2. (b)
4. 4.
Support of size 4.
- (a)
The non-linear part must look like .
We discuss each of these case separately. However, we use a different order than stated above and deal with easier cases first.
Case 1(a). This is immediate from Corollary 4.3.
Case 2(b). If or , modifying by adding a suitable multiple to it, and modyfing accordingly, we may assume that the quadratic expression is exactly , which we have already done in Proposition 4.4 (notice that the condition on coefficients in that proposition is satisfied). Hence, w.l.o.g. and . Then, (after a suitable modification of by affine maps to make , ), we can apply the Lemma 4.6, to finish the proof in this case.
Case 2(c). The whole expression in this case is of the form , where is a polynomial of degree at most 2 in and and is a polynomial of degree at most 2 in and . Note that we cannot use our arguments about single variable expressions here, as we would only get two sets of size such that always takes values in , so we would only know that the whole expression takes values in which could easily be the whole set of residues. Instead, we recall that the polynomials always attain a small value. This is the content of the next lemma, which is a well-known consequence of Weyl’s inequality on exponential sums. Similar results appear in [5], we include a proof for completness.
Lemma 5.2**.**
Let be fixed. Then there is an absolute constant such that the following holds. Let be a prime, and let be given, with non-zero. Then the polynomial attains a value in .
Write for the function . The proof uses discrete Fourier transforms of functions , which we define as with . We refer readers to [5] for more details.
Proof.
Write for the polynomial . We begin by stating (a special case of) Weyl’s inequality.
Theorem 5.3**.**
(Weyl’s inequality. [14]) For every , and , there is a constant such that for all primes
[TABLE]
holds for every polynomial of degree .
Write for the number of times the polynomial attains the value . Hence, by Weyl’s inequality, there is a constant , independent of such that for , and . Let be interval . Suppose that attains no value in . We have
[TABLE]
Applying Parseval’s formula and noting that , we get that
[TABLE]
Thus,
[TABLE]
From this we conclude that , as desired. ∎
Write for . Now, consider as a polynomial in for every fixed . The lemma guarantees that we can define so that . Similarly, for every , we can pick so that , hence we always have , as desired.
Case 3(a). We shall take of the form , where are coprime, and each is a product of distinct arbitrarily large primes. As always, we identify , and we aim to use the identification of coordinates idea. Thus, we set , so that has second and third coordinates equal to zero. We also set and . Note that we still have freedom of choice for . Let the linear part of the expression be , where the coefficients have the property that implies (since the linear part comes from -linear combination of and , etc.). The expression becomes
[TABLE]
We combine the identification of coordinates idea with the fact that polynomials have relatively dense sets of values in the next proposition.
Proposition 5.4**.**
(Strong version of the identification of coordinates) Fix . Then there are constants such that the following holds. Let all be at most . Let be primes. Write . Next, let be arbitrary maps for every . Let for every , . Finally, let be also arbitrary functions for every . Then, we can find maps such that the expression
[TABLE]
takes at most values as and range over all values in .
Throughout the paper, we will use the prime number theorem ([8]) without explicitly mentioning it.
Proof.
Write for (in fact any prime close to would work). The main idea is to pick so that every value attained by the expression satisfies , for a small subset . Partitioning into cosets of , we see the set of values of the expression can take only at most values on each coset, and thus a small number of values in total.
We use the Lemma 5.2 in order to define . Recall that the lemma gives such that every non-constant polynomial of degree at most in for any , takes a value in (modify the constant coefficient if necessary). For every , we define as follows. We apply the lemma for every fixed to the polynomial
[TABLE]
Hence, we can pick , such that this expression takes value in . We set . Therefore, we have defined , so that
[TABLE]
where . To finish the proof, we apply the Lemma 4.5.
Note that we have
[TABLE]
We conclude that values attained by the expression with the maps defined as above satisfy
[TABLE]
for a set of size at most . Since , the expression takes at most values, as desired.∎
The case 3(a) now follows from a straightforward application of the Proposition 5.4.
We deal with the remaining cases in a similar fashion.
Case 2(a). Let the linear part of the expression be . We shall take , for coprime and , with . We set and . It remains to choose and so that the expression
[TABLE]
takes small number of values. But, recalling that implies , this follows directly from the Proposition 5.4, and we may take to be prime.
Case 3(b). Let the linear part of the expression be . We shall take , for coprime and , with . We set and . It remains to choose and so that the expression
[TABLE]
takes small number of values. Once again, recalling that implies , this follows directly from the Proposition 5.4, and we may take and to be prime.
Case 4(a). Let the linear part of the expression be . We shall take , for coprime and , with . We set
[TABLE]
We use the Proposition 5.4 to find so that the expression
[TABLE]
takes small number of values. This completes the proof of the Theorem 5.1.∎
5.1 Further discussion of the identification of coordinates idea
As we have seen in the proof of the Theorem 5.1, the Proposition 5.4 was used in a very similar fashion for several cases of expressions. The goal of this short subsection is to take this approach further and see what expressions can be handled using this idea.
We temporarily return to the notation of for the variables and for the maps. The value of at coordinate is denoted by . Observe that when we use Proposition 5.4, we have to pick some of the maps to cancel out the mixed quadratic terms like . In the proof of the Theorem 5.1 in the last few cases, given an expression, we used a different coordinate for every variable , and we picked for , so that the mixed quadratic terms dissappear. Our goal now is to put all these ideas together in a single proposition. First, we need to set up some useful definitions.
Fix an expression in variables . Define a graph on vertices by adding an edge for every term of the form with , with multiple edges allowed (so appears the same number of times the relevant terms occur in ).
Proposition 5.5**.**
(Acyclic version of the identification of the coordinates.) Let be a quadratic expression such that has no cycles (in particular, no repeated edges). Then there is an absolute constant such that the following holds. We can find , a product of distinct, arbitrarily large primes, and maps such that takes at most values.
Proof.
As promised, we will take , with coprime products of distinct primes, suitably chosen. As always, view as the direct sum . Let be an arbitrary coordinate. We start from and traverse the graph . (If is disconnected, pick arbitrary vertices in all other components to start the traversal from. For each such starting vertex , , set .) Since the graph is acyclic, we reach every variable at most once, and we visit every edge. When we move along the edge , from to , that means that there is a term in the expression, and we set , to make the term vanish. Since this is the first time we reach , there are no issues with defining .
After this procedure, we have defined for , so that for every coordinate , the expression no longer has mixed quadratic terms. We still have the freedom of choosing , so we now may apply the Proposition 5.4 to finish the proof.∎
As we shall see later, depending on the structure of the graph , it is not always possible to choose some of the maps so that the mixed quadratic terms vanish, so there is no obvious way to make the Proposition 5.5 more general.
6 Sets with small
In this section we prove the final case of the main theorem.
Theorem 6.1**.**
For any and any , there is a natural number , which is a product of distinct, arbitrarily large primes, and a set such that , while .
Proof.
We proceed like in the proofs of Theorems 4.1 and 5.1, except that the details become once again more complicated and the ideas we developed so far, culminating in Proposition 5.5, do not suffice. As usual, the proof is based on Proposition 3.1. We split all expressions in their quadratic and linear parts, and we may assume that if a variable appears at all in an expression, it must appear in the quadratic part. In the first part of the discussion of the possible expressions, we use the notation for variables and for maps, as there can be upto 6 variables involved. Later, we again switch to and notation.
Firstly, by Corollary 4.3, we only need to consider expressions with at least two varaibles. Next, we use the Proposition 5.5 to treat the expressions with at least 4 variables. We look at the graph . Note that if we have an isolated vertex in , since appears in the quadratic part, we must have term of the form in . Hence, the number of isolated vertices plus the number of edges is at most 3, which is the number of quadratic terms in .
Expression with exactly 6 variables. We look at . It is a graph on 6 vertices, with . Hence, it is a perfect matching, which is acyclic, so the Proposition 5.5 applies.
Expression with exactly 5 variables. Looking at , which is a graph on 5 vertices with , we see that at most one vertex can have degree greater than 1. The graph is acyclic, so the Proposition 5.5 applies.
Expression with exactly 4 variables. Once again, we analyse . It is a graph on 4 vertices with . The only way to get a cycle is if the graph has a double edge and an edge (after a suitable renaming of variables). Thus, the quadratic part of is of the form
[TABLE]
where . If or , we can rewrite the quadratic part as a linear combination of only two quadratic terms, so that the graph becomes a matching, and therefore acyclic. Thus, assume that and . But, using the affine maps solution from the Lemma 4.6 we can cancel all the terms in that involve and . Then, w.l.o.g. becomes an expression with quadratic term
[TABLE]
which we have already done using the basic version of the identification of coordinates idea in Lemma 4.4.
Hence, we may assume that the expression has either two or three variables. We treat these cases separately. From now on, we use the notation for the variables and for maps.
6.1 has two variables and
Observe that if there is at most one mixed quadratic term in the quadratic part, then once again Proposition 5.5 applies. Hence, we may assume that there are at least two such terms in . Suppose now that there all three quadratic terms are of this form, hence the quadratic part is
[TABLE]
where . This constraint on coefficients is crucial. By pigeonhole principle, there are at least two equal coefficients among , w.l.o.g. . The quadratic part of may be written as
[TABLE]
which we treat using Lemma 4.4 if this factorizes further, or using Lemma 4.6 otherwise.
It remains to treat the case when there are exactly two mixed terms, so the quadratic part is w.l.o.g.
[TABLE]
However, we can no longer use the affine maps to cancel out quadratic terms to modify the expression and then apply the Proposition 5.5. Instead, we have to use a different argument, which unfortunately gives significantly worse bounds.
Lemma 6.2**.**
Let be a quadratic expression with quadratic part of the form
[TABLE]
with and . Then, for every sufficiently large prime , we can find such that the expression does not attain every value in .
Immediately, we have the following corollary.
Corollary 6.3**.**
Let be a quadratic expression with quadratic part of the form
[TABLE]
with and . Let . Then, there is , product of distinct, arbitrarily large primes, and maps such that the expression attains at most values.
Proof.
Let be the bound in Lemma 6.2 such that for all primes we have such that the expression evades one value, i.e. all values are confined to a set of size . If we now take , a product of distinct primes greater than , then, once again identifying , and defining coordinatewise using , we have that the expression in attains values in . Hence, it takes at most values. A standard calculation reveals that for sufficiently large, the number of values becomes . (The that appears in the sums and products below ranges over primes only.) Indeed,
[TABLE]
as , since .∎
Proof of Lemma 6.2..
Let be the linear part of the expression. We will define essentially by setting each uniformly independently at random (for technical reasons, for every we will forbid one value in ). Our aim is to define accordingly so that the expression evades zero value. Hence, for every , we want to find such that there is no with
[TABLE]
In other words, provided always, we want a value of such that
[TABLE]
for all . Hence, this becomes the requirement that for every fixed , the set
[TABLE]
is not the whole set . We now define by setting each independently to be a uniform random variable on (which is fine, as ).
Let be the event that the set is the whole , i.e. for every there is such that
[TABLE]
Suppose that occurs. We cannot use the same for two values of , so by counting, for every , we have exactly one such that (6) holds. Suppose that we already know this permutation . The equation is further equivalent to
[TABLE]
Hence, for every , we know that must take one of the two values depending only on , since . So, given , there are at most choices for . Hence, the probability of is . By Stirling’s formula,
[TABLE]
By the union bound, the probability , so there is a choice of such that for all we have . For such , we can define so that the expression does not attain every value, proving the lemma. ∎
Returning to our main argument, the case when the quadratic part is of the form
[TABLE]
follows directly from Corollary 6.3, since .
6.2 has three variables
Finally, we address the case when the quadratic part of has exactly three variables. Once again, we only need to consider the situation when has a cycle. We know that is a graph on three vertices, with . The only there such graphs that have cycles are (a repeated edge and an isolated vertex), (a repeated edge and an additional edge) and (a cycle of length 3).
** is a repeated edge.**In this case, the quadratic part of the expression is w.l.o.g.
[TABLE]
If or , we can further factorize the expression and apply the Proposition 5.5, to finish the proof. Thus assume that and .
Let the linear part of the expression be . Fix a prime , and apply Lemma 4.6 to the expression
[TABLE]
to make it constant. Hence, it remains to pick so that the expression
[TABLE]
attains a small number of values, which we can ensure if we apply Lemma 5.2 for each to the polynomial . Provided is large enough, can be chosen so that the value of the polynomial is small. This completes the proof in this case.
** is a 3-cycle.** In this case, the quadratic part of has three mixed terms, one for each pair of variables among . More precisely, it is
[TABLE]
where . Let the linear part be
[TABLE]
First, assume that no further factorization is possible, i.e. and . We set , so that the expression becomes
[TABLE]
Rearranging further,
[TABLE]
Setting , and , the expression becomes constant.
Now, suppose that w.l.o.g. . Assume for now that , we will address the case when this product does not vanish later. The expression becomes
[TABLE]
We use the identification of coordinates approach. We will take , where are arbitrarily large primes. Identify . Our first step is to set
[TABLE]
This way, the quadratic terms vanish in the first two coordinates, and we still have freedom of choosing to cancel the linear terms in . We want to do the same for , so we set . However, with such a choice, the third coordinate of the expression is
[TABLE]
Since , the expression becomes
[TABLE]
We may now apply the identification of coordfinates idea, using Proposition 5.4, to finish the proof in this case.
Now assume that . We shall take and use the additional fourth and fifth coordinates to cancel out the term. Also, using the prime number theorem, we can find arbitrarily large primes such that . In the work below it will be essential that all the primes are close in value (although it will not be important to have them this close). Writing also for the resulting map defined by and the expression, our aim is to show that
[TABLE]
takes few values in .
We use the same choices of as in the case when . Next, we set . Observe that
[TABLE]
Let and . Hence are integers such that and hold. We also have . But the is an integer such that , thus , for . Therefore, with this choice of we have
[TABLE]
Proceeding further, we use the fifth coordinate to approximate . To this end, write , , where , . Observe that is a good approximation to
[TABLE]
for some absolute constant , since . Therefore, we set . Note that are well defined, as depends on only, and depends on only. With and so defined we have
[TABLE]
We also have that is an integer such that , thus , where , for an absolute constant . Therefore,
[TABLE]
Summing up the work done so far we conclude that
[TABLE]
where is the set defined by . In particular . Finally, we put everything together, using the Lemma 4.5. Recall the definitions (the maps and below are slightly modified to cancel the term instead of just )
[TABLE]
Thus,
[TABLE]
Finally, we set to cancel the linear terms respectively:
[TABLE]
With this choice of we have
[TABLE]
which takes small number of values.
** is has a repeated edge and another single edge.** In this case, the quadratic part of the expression is w.l.o.g.
[TABLE]
If or , we can further factorize the expression and apply the Proposition 5.5, to finish the proof. Thus assume that and . Since all , we must have , so w.l.o.g. .
We now discuss a limitation of the usual approach based on the identification of coordinates idea. Basically, we always try to cancel out the quadratic terms by taking some of the to be affine, while we use the rest to cancel out the linear terms in . Let us try the same strategy here. Temporarily we work in to ignore the difficulties that arise from moving from one modulus to another one. For technical reasons, we use a slightly unusual indexing of coordinates by . Start by using the coordinate -1 to get a free which is later used to cancel the linear terms involving . Thus, we set and . Similarly, try to use the coordinate 0 to get a free map. Rewriting the expression as
[TABLE]
we see that we need to set , for a constant and . The issue is that we get a term with a non-zero coefficient. The natural thing to do now is to try to cancel somehow this term. During this digression, we forget about the linear terms (in any case, we can cancel them by remaining free ).
The most natural thing is to set for (as further mixed quadratic terms involving seem even harder to cancel). Hence, the question is whether we can find linear maps , each a linear combination of or such that (w.l.o.g. and )
[TABLE]
Write and . Let equal 1 if and zero otherwise. Expanding the (7) we obtain
[TABLE]
Hence, we require that for every , which are not both zero, we have , while for this expression is non-zero (to cancel the initial term). We now define two matrices , with entries indexed by , by setting when and , and if and . Let be the matrix of all zeros except for , and let be the matrix consisting of zeros only, except . We rewrite (8) as a matrix equation
[TABLE]
for some non-zero . However, this is the same as
[TABLE]
But comparing ranks we have
[TABLE]
which is a contradiction. Hence, this case requires a different approach.
Finally, we construct the desired maps for this expression. By adding linear terms to , we may assume that the expression is
[TABLE]
for some coefficients . Let us begin by observing that in most cases there is a rather simple solution, which strangely we could not generalize to work for all choices of coefficients. Try setting , for some constants and suppose we work in , where is a product of distinct, arbitrarily large primes (so that all the coefficients and related expressions are coprime with ). With these choices, the expression (9) becomes
[TABLE]
Further, set , (recall that so ) so that the coefficient of above vanishes. We try to pick such that coefficient of also becomes zero, setting . If , then we can pick to cancel the term, and the expression actually becomes constant. Otherwise, assume that . In this case, we prove the following proposition, and the full result is then a consequence of a simple number-theoretic calculation.
Proposition 6.4**.**
Let be some fixed coefficients, such that and . Then, for all sufficiently large primes , obeying , we may find maps such that the expression (9) misses at least values.
Proof.
As always, is viewed as . In the first coordinate, we set , with to be chosen and a constant . After a suitable choice of , the first coordinate of the expression becomes .
On the other hand, we shall use the second coordinate to evade some of the values. To this end, we generalize the Lemma 6.2, with a similar proof.
Lemma 6.5**.**
Let be a set, and a prime. Let be any map, and let be any coefficients. Then, provided we may pick for all , such that
[TABLE]
never takes value zero.
Proof of Lemma 6.5..
We proceed similarly as in the proof of Lemma 6.2, starting by defining each independently, uniformly at random in , with this single value omitted for technical reasons.
For each and , we want to pick , so that (10) does not vanish for any . Let be the event that we cannot do this, i.e. that, having fixed for every value , we can find such that
[TABLE]
If occurs, observe that (11) cannot hold for distinct with the same choice of , since this equation can be rewritten as
[TABLE]
and by the choice of , the coefficient of is never zero. Hence, if is the map that sends each to the corresponding value of for which the (11) vanishes, we must have injective, which is thus a bijection.
Suppose furthermore that we know as well. Note that in this case we can almost determine . Indeed, for all we have
[TABLE]
Substituting , we obtain
[TABLE]
for all , so is uniquely determined for all such that , i.e. for values. So there are at most ways to pick , and in conclusion, the probability of is . Finally, we have
[TABLE]
so it is possible to choose for which all other maps can be defined so that (10) never vanishes. ∎
Set . Let . We define , so the first coordinate becomes . We set , by . Apply Lemma 6.5 to the , , and the expression
[TABLE]
to define to make it non-zero always. Note that we may apply the lemma since , whenever , for sufficiently large . We define as . Finally, we show that values are not attained for .
Suppose that and suppose that the expression takes value . Thus, the first coordinate gives , so divides , so either , , or . But, and , so we must have .
Next, let stand for the value of
[TABLE]
By the definition of , we always have . If the second coordinate equals , then we have , which is impossible. ∎
Corollary 6.6**.**
Let be some fixed coefficients, such that and . Let be any small real. Then, we can find , a product of arbitrarily large distinct primes and maps such that the expression (9) takes at most values in .
Proof.
We proceed as follows. Look at all the primes and . For sufficiently large, by the prime number theorem, . For sufficiently large, pairs of primes satisfy the conditions of Proposition 6.4, which we apply to obtain so that the expression (9) misses at least values in . In other words, the expression (9) takes at most values in . Let , and let be the product of all . Viewing as a direct sum of , we can therefore define coordinatewise using , so that the expression (9) attains at most values in , for some positive constant .
Finally, taking , and using the maps on each separately, makes the expression (9) take at most proportion of values in , which goes to zero as goes to infinity, as desired.∎
This finishes the proof of the Theorem 6.1.∎
7 Concluding remarks
We conclude the paper with some problems and several questions related to the intgredients used in our construction. Firstly, the main question here is still the following.
Question 7.1**.**
Suppose that has and let . How small can be? What is the answer when is square-free/product of primes/prime? When can we get a power saving, i.e. ?
The next natural question is about the number of values attained by expressions.
Question 7.2**.**
Let be given. We consider expressions in variables and maps . Let be any -linear combination of products of terms of the form or . Is there a choice of a and maps such that attains only values in ? Is there a choice for which we have a power-saving, i.e. attains only values? What if is square-free/product of primes/prime?
We remark that in our construction, there was a power-saving choice for most of the expressions. In fact, the only ones for which our arguments do not lead to a power-saving are
[TABLE]
and
[TABLE]
(for a specific choice of ).
Returning once again to the identification of coordinates idea, it turns out that Proposition 4.4 is nearly optimal for some expressions, provided and are close. Namely, consider expression . Putting , the expression becomes .
Observation 7.3**.**
Let and be distinct primes. Given any maps , the expression attains at least values in .
Proof.
We begin by observing that if is not invertible for some choice of , viewing as , for some coordinate , we have . Letting vary, we obtain at least values.
Therefore, assume that all are invertible in . Fix some . Consider all values of , (where is evaluation of the expression for the given choice of ), as ranges over . We may assume , otherwise we are done. Hence, we obtain a partition , where if . Call a pair invertible if is invertible in . Observe that in each set , there are at least invertible pairs. However, if for an invertible pair , then , so is invertible, and . Thus, for every invertible pair there is a value such that implies .
For a fixed , take such that , and consider the partition as above. Firstly, take to be the set of indiced such that . Thus, . Hence, . Therefore, we obtain that the number of invertible pairs that have value is at least
[TABLE]
If attains at most values, we simply consider for fixed . The expression then attains at least values, thus the claim follows, so we may assume that attains more than values. But then, for every value of , we have at least invertible pairs with , so the total number of invertible pairs is at least , which is a contradiction.∎
It could be interesting to better understand the minimum image size for this expression. Furthermore, recall that in the case of prime modulus, we only achieved that is not surjective.
Question 7.4**.**
Let be maps and prime. What is the smallest number of values that the expression must attain?
Finally, we pose the question of improving the bounds in Lemma 4.2.
Question 7.5**.**
Suppose that are never simultaneously zero. How large a set can we take?
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Bourgain, The sum-product theorem in ℤ q subscript ℤ 𝑞 \mathbb{Z}_{q} with q 𝑞 q arbitrary, Journal d’Analyse Mathématique 106.1 (2008): 1–93.
- 2[2] J. Bourgain, N. Katz, T. Tao, A sum-product estimate in finite fields, and applications, Geometric and Functional Analysis 14.1 (2004): 27–57.
- 3[3] G.A. Freiman, V.P. Pigaev, The relation between the invariants R and T (in Russian), Kalinin. Gos. Univ. Moscow, (1973), 172-174
- 4[4] A.A. Glibichuk, Sums of powers of subsets of an arbitrary finite field, Izv. Math. 75 253, 2011
- 5[5] W.T. Gowers, A new proof of Szemeredi’s theorem, GAFA 11 (2001), 465–588
- 6[6] J.A. Haight, Difference covers which have small k-sums for any k, Mathematika 20 (1973), 109-118
- 7[7] F. Hennecart, G. Robert, A. Yudin, On the number of sums and differences, Astérisque 258 , 1999, p. 173–178
- 8[8] A. Ivić, The Riemann Zeta-Function. Theory and Applications. Dover Publications , New York, 2003, reprint of the 1985 original
