Product-free sets in the free semigroup
Imre Leader, Shoham Letzter, Bhargav Narayanan, Mark Walters

TL;DR
This paper investigates the maximum density of product-free subsets within the free semigroup over a finite alphabet, establishing that the maximum possible density is exactly 1/2 under a natural measure.
Contribution
It proves that the maximum density of product-free subsets in the free semigroup over a finite alphabet is exactly 1/2, providing a precise measure of their size.
Findings
Maximum density of product-free subsets is 1/2.
The natural measure assigns weight |A|^{-n} to words of length n.
The result characterizes the largest possible product-free sets.
Abstract
In this paper, we study product-free subsets of the free semigroup over a finite alphabet . We prove that the maximum density of a product-free subset of the free semigroup over , with respect to the natural measure that assigns a weight of to each word of length , is precisely .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Product-free sets in the free semigroup
Imre Leader
Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Wilberforce Road, Cambridge CB3 0WB, UK
,
Shoham Letzter
ETH Institute for Theoretical Studies, 8092 Zurich, Switzerland
,
Bhargav Narayanan
Department of Mathematics, Rutgers University, Piscataway NJ 08854, USA
and
Mark Walters
School of Mathematical Sciences, Queen Mary, University of London, London E1 4NS, UK
(Date: 6 December 2018)
Abstract.
In this paper, we study product-free subsets of the free semigroup over a finite alphabet . We prove that the maximum density of a product-free subset of the free semigroup over , with respect to the natural measure that assigns a weight of to each word of length , is precisely .
2010 Mathematics Subject Classification:
Primary 20M05; Secondary 05D05
1. Introduction
A subset of a semigroup is said to be product-free if there do not exist (not necessarily distinct) such that x\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}y=z; it is customary to call sum-free when the underlying semigroup is abelian.
It is a well known fact (and an easy exercise) that any sum-free subset of the integers has upper density at most . Sum-free subsets of the integers, and of abelian groups in general, have been studied by very many researchers over the last fifty years. For example, from the work of Green and Ruzsa [4], there is now a complete picture of how large a sum-free set we can find in any finite abelian group. We refer the reader to the surveys of Tao and Vu [7] and Kedlaya [5] for more information on these questions.
Product-free subsets of finite non-abelian groups were first investigated by Babai and Sós [1]. Following foundational work by Gowers [3] demonstrating so-called ‘product-mixing’ phenomena in groups with no low-dimensional representations, there has been a great deal of recent work in the non-abelian setting; for instance, in a recent breakthrough, Eberhard [2] determined how large a product-free subset of the alternating group can be.
In light of these developments, it is natural to ask what one can say about product-free sets in infinite non-abelian structures, a setting in which our knowledge is a bit more limited. Perhaps the first natural place to look among infinite non-abelian structures is among those that are free, so here, we shall investigate how large product-free subsets of the free semigroup can be.
2. Our results
Let be a finite set. We write for the free semigroup over ; in other words, is the set of all finite words over the alphabet equipped with the associative operation of concatenation. While we state and prove our results for finite alphabets of all possible sizes for the sake of completeness, the reader will lose nothing by supposing that is a two-element set in what follows; indeed, this case captures all the difficulties inherent in the questions we study.
Recall that a set is product-free if, writing \mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}} for the operation of concatenation, there do not exist words (not necessarily distinct) such that x\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}y=z. There is an obvious example of a ‘large’ subset of that is product-free: when for instance, the set of words which contain an odd number of occurrences of the symbol (or , for that matter) is easily seen to be a product-free set that contains, roughly, half the words from . Our aim in this paper is to prove that these sets are, in a precise sense, the largest product-free subsets of . We remark in passing that there are several other product-free sets that are ‘equally large’: for any nonempty subset , the odd-occurrence set generated by , namely the set of words in which the total number of occurrences of symbols from is odd, is easily seen to be a product-free set; in the case where , our earlier example corresponds to taking , and taking gives us the set of all words of odd length, for example.
To formally state our results, we need a way to measure the size of a set . For an integer , the layer is the set of words of length , and the ball is the set of words of length at most . As a first attempt, one might define the density of a set via its densities in balls, namely as the quantity
[TABLE]
However, a little thought should convince the reader that the counting measure is somewhat ill-suited for our purposes. Indeed, when , almost all the words in are long since . Consequently, we may find product-free sets that are intuitively small, and yet have density arbitrarily close to in the above sense; for example, for any sufficiently large , the set
[TABLE]
is product-free and has density at least in the above sense, provided .
A more natural approach is to assign a weight of to each word of , thereby ensuring that the layers have the same total weight for all . To this end, for a subset and an integer , we define the density of in the layer by . With this definition in place, most standard notions of density may now be carried over: we define the upper asymptotic density of by
[TABLE]
and the upper Banach density of by
[TABLE]
Of course, the latter is a weaker notion of density than the former; indeed, it is clear that for any .
It is easy to see that any odd-occurrence set has both an upper asymptotic density and an upper Banach density of . Our aim in this note is to show that product-free sets cannot be any larger; our main result is as follows.
Theorem 1**.**
Let be a finite set. If is product-free, then .
Let us mention that product-free sets in cancellative semigroups have been studied by Łuczak and Schoen [6]; while their results are sharp for such semigroups in general, these results do not give us any effective bounds on the size of a product-free subset of .
Before we turn to the proof of Theorem 1, it is worth pointing out that there is a simple argument that allows us to bound the upper asymptotic density of a product-free subset of away from . Indeed, suppose that is product-free. We then have
[TABLE]
for any since the sets and (S\cap\mathcal{F}(m))\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}(S\cap\mathcal{F}(n)) must be disjoint. Now, consider the set of integers for which , where is the unique positive solution to the equation . It follows from the inequality above that this set of integers must be sum-free. It is now easy to see that .
We shall have to work somewhat harder to prove Theorem 1, which improves this bound of for the upper asymptotic density to the optimal bound of for the upper Banach density. The proof of Theorem 1 is given in Section 3. We conclude this note with a discussion of some open problems in Section 4.
3. Proof of the main result
We begin by fixing our finite alphabet . In the sequel, will always mean , the free semigroup over this fixed alphabet .
It will be helpful to establish some notation. For a pair of words , we say that is a prefix of if w=x\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}y for some , and that is a suffix of if w=y\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}x for some . For a pair of sets , we write S_{1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}S_{2} for their (Minkowski) product; in other words,
[TABLE]
For a set and an integer , we set . One of the key ideas in the proof of Theorem 1 is the following definition. For any sequence of positive integers , we define
[TABLE]
in other words,
[TABLE]
Let us note, for any , that the sets and S(m)\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}\mathcal{F}(n-m) are disjoint for any pair of positive integers . Recall that ; we analogously define
[TABLE]
When the set in question is clear, we write and for and , respectively. Recall that for any product-free set and any , we have
[TABLE]
We start by proving a generalisation of this fact.
Proposition 2**.**
If is product-free, then for any sequence of positive integers , we have
[TABLE]
Proof.
First, consider the products
[TABLE]
These subsets of are by definition disjoint. Let be the union of these sets. Since is product-free, and are disjoint as well. Let ; clearly, the density of in is
[TABLE]
Next, consider the Minkowski products
[TABLE]
These subsets of are again disjoint by definition; let denote their union. Note that and are disjoint. Let ; it is easy to see that the density of in is
[TABLE]
and that this quantity is therefore at most .
To finish the proof, it suffices to show that
[TABLE]
It is easy to see that . Therefore, it is sufficient to show that is a subset of . To see this, note that any word from which has a prefix in is also contained in . In other words, ; the result follows. ∎
With the above observation in hand, we are now ready to prove Theorem 1.
Proof of Theorem 1.
We prove by contradiction that the upper Banach density of a product-free set is at most .
Suppose that is product-free and that for some . We then claim that we may find an increasing sequence of positive integers such that
[TABLE]
for each .
We construct this sequence inductively. Since , it is clear that we may find such that . Having found as required, we choose as follows. Since , there exist arbitrarily long intervals that satisfy
[TABLE]
Choose such an interval whose length is sufficiently larger than ; we may assume, by passing to a sub-interval if necessary, that . We claim that it is possible to choose from ; in other words, we claim that there exists an such that
[TABLE]
We prove this claim by contradiction. Suppose that there is no such . Then, by Proposition 2, we have
[TABLE]
for each . By summing the above inequality over all , we get
[TABLE]
where is the set of with . This implies, by the inductive hypothesis, that
[TABLE]
or equivalently, . Therefore, we have
[TABLE]
which contradicts the fact that , provided .
We now finish the proof of the proposition by showing that the existence of this sequence contradicts our initial assumption that . Fix a large enough to ensure that
[TABLE]
and consider any interval with . We know from Proposition 2 that
[TABLE]
for each with ; summing this inequality over such , we get
[TABLE]
where is the set of with and . Therefore,
[TABLE]
which is a contradiction; this proves the claimed upper bound in Theorem 1. ∎
4. Conclusion
A common line of enquiry in the study of product-free sets is to ask for ‘asymmetric’ versions of results bounding the upper density of product-free sets. In this spirit, it is natural to ask whether an analogue of Theorem 1 continues to hold when one wishes to solve the equation x\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}y=z with , and in specified subsets of . More precisely, if are such that there are no solutions to x\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}y=z with , and , one might ask if one of , or has an upper asymptotic density of at most . However, it is not hard to construct for any , three sets , each of upper asymptotic density at least , where , such that there are no solutions to x\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}y=z with , and . Indeed, pick a suitably large and choose any set such that . Now take to be the set of all words with a prefix in , to be the set of all words with a suffix in , and to be the set \mathcal{F}\setminus(X\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}Y). Clearly, there are no solutions to x\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.6}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.6}{\scriptscriptstyle\bullet}}}}}y=z with , and ; it is also not hard to check that each of , and has an upper asymptotic density at least .
Next, it would be interesting to understand what product-free sets of maximal density look like. As we saw earlier, several non-isomorphic extremal constructions are furnished by the family of odd-occurrence sets. We suspect that these might be the only constructions of maximal density, and make the following conjecture.
Conjecture 3**.**
Let be a finite set. If is product-free and , then for some nonempty subset .
Finally, another natural direction is to study product-free subsets of the free group over a finite alphabet . Similarly to the situation in this paper, the most natural measure to consider in the case of the free group would be the one that assigns a weight of to each irreducible word of length . The different notions of density defined here for the free semigroup then have analogous definitions in the free group, and we believe that an analogue of Theorem 1 should hold in the free group as well; concretely, we conjecture the following.
Conjecture 4**.**
For any finite alphabet , no product-free subset of the free group has upper Banach density exceeding .
Note that, in the proof of Theorem 1, we rely crucially on the fact that there is exactly one way to write a word of length as the concatenation of a word of length with a word of length ; of course, we lose this property when working with free groups, so we believe that some new ideas will be required to understand product-free sets in free groups.
Acknowledgements
The second author would like to acknowledge the support of Dr. Max Rössler, the Walter Haefner Foundation, and the ETH Zurich Foundation. The third author wishes to acknowledge support from NSF grant DMS-1800521.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Babai and V. T. Sós, Sidon sets in groups and induced subgraphs of Cayley graphs , European J. Combin. 6 (1985), 101–114.
- 2[2] S. Eberhard, Product mixing in the alternating group , Discrete Analysis (2016:2), 19 pp.
- 3[3] W. T. Gowers, Quasirandom groups , Combin. Probab. Comput. 17 (2008), 363–387.
- 4[4] B. Green and I. Z. Ruzsa, Sum-free sets in abelian groups , Israel J. Math. 147 (2005), 157–188.
- 5[5] K. S. Kedlaya, Product-free subsets of groups, then and now , Communicating mathematics, Contemp. Math., vol. 479, Amer. Math. Soc., Providence, RI, 2009, pp. 169–177.
- 6[6] T. Łuczak and T. Schoen, Sum-free subsets of right cancellative semigroups , European J. Combin. 22 (2001), 999–1002.
- 7[7] T. Tao and V. Vu, Sumfree sets in groups: a survey , J. Comb. 8 (2017), 541–552.
