Product-free sets in the free semigroup

Imre Leader; Shoham Letzter; Bhargav Narayanan; Mark Walters

arXiv:1812.04749 · math.CO · December 13, 2018 · Eur. J. Comb.


TL;DR

This paper investigates the maximum density of product-free subsets within the free semigroup over a finite alphabet, establishing that the maximum possible density is exactly 1/2 under a natural measure.

Contribution

It proves that the maximum density of product-free subsets in the free semigroup over a finite alphabet is exactly 1/2, providing a precise measure of their size.

Findings

01

Maximum density of product-free subsets is 1/2.

02

The natural measure assigns weight |A|^{-n} to words of length n.

03

Odd-occurrence sets attain this density; the paper conjectures, but does not prove, that every product-free set of density 1/2 is contained in one.

Abstract

In this paper, we study product-free subsets of the free semigroup over a finite alphabet $A$. We prove that the maximum density of a product-free subset of the free semigroup over $A$, with respect to the natural measure that assigns a weight of $|A|^{-n}$ to each word of length $n$, is precisely $1/2$.

Equations

$$\limsup_{n\to\infty}\frac{|S\cap\mathcal{F}_{\leq}(n)|}{|\mathcal{F}_{\leq}(n)|}.$$

$$\bigcup_{n\geq c}\left(\mathcal{F}_{\leq}(2^{n}+c)\setminus\mathcal{F}_{\leq}(2^{n})\right)$$

$$\bar{d}(S)=\limsup_{n\to\infty}\frac{\sum_{i=1}^{n}d_{S}(i)}{n},$$

$$d^{*}(S)=\limsup_{n-m\to\infty}\frac{\sum_{i=m}^{n}d_{S}(i)}{n-m+1}.$$

$$d_{S}(m)d_{S}(n)+d_{S}(m+n)\leq 1$$

$$S_{1}\bullet S_{2}=\{w_{1}\bullet w_{2}:w_{1}\in S_{1},\,w_{2}\in S_{2}\}.$$

$$S(n;\ell_{1},\ell_{2},\dots,\ell_{k})=\left\{w\in S(n):w\text{ has no prefix in }S(\ell_{1})\cup S(\ell_{2})\cup\dots\cup S(\ell_{k})\right\};$$

$$S(n;\ell_{1},\ell_{2},\dots,\ell_{k})=S(n)\setminus\left(\bigcup_{i=1}^{k}S(\ell_{i})\bullet\mathcal{F}(n-\ell_{i})\right).$$

$$d_{S}(n;\ell_{1},\ell_{2},\dots,\ell_{k})=\frac{|S(n;\ell_{1},\ell_{2},\dots,\ell_{k})|}{|\mathcal{F}(n)|}.$$

$$d(m)d(n)+d(m+n)\leq 1.$$

$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n)\leq d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})\leq 1.$$

$$S(\ell_{1})\bullet S(n-\ell_{1}),\ S(\ell_{2};\ell_{1})\bullet S(n-\ell_{2}),\ \dots,\ S(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\bullet S(n-\ell_{k}).$$

$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n).$$

$$S(\ell_{1})\bullet\mathcal{F}(n-\ell_{1}),\ S(\ell_{2};\ell_{1})\bullet\mathcal{F}(n-\ell_{2}),\ \dots,\ S(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\bullet\mathcal{F}(n-\ell_{k}).$$

$$d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})$$

$$L^{\prime}\cup S(n)=L\subset R=R^{\prime}\cup S(n;\ell_{1},\ell_{2},\dots,\ell_{k}).$$

$$d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\geq\frac{1}{2}+\frac{1}{4}+\dots+\frac{1}{2^{k}}=1-\frac{1}{2^{k}}$$

$$\frac{\sum_{n\in I}d(n)}{|I|}>\frac{1}{2}+\varepsilon.$$

$$d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})\geq 1-\frac{1}{2^{k+1}}.$$

$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n)\leq d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})<1-\frac{1}{2^{k+1}}$$

$$\sum_{n\in I^{\prime}}d(n)\left(1+d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\right)<|I|\left(1-\frac{1}{2^{k+1}}\right),$$

$$\sum_{n\in I^{\prime}}d(n)\left(2-\frac{1}{2^{k}}\right)<|I|\left(1-\frac{1}{2^{k+1}}\right),$$

$$\sum_{n\in I}d(n)\leq\sum_{n\in I^{\prime}}d(n)+\ell_{k}+1<\frac{|I|}{2}+\ell_{k}+1,$$

$$\frac{2^{k}}{2^{k+1}-1}<\frac{1+\varepsilon}{2}$$

$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n)\leq 1$$

$$\sum_{n\in I^{\prime}}d(n)\left(2-\frac{1}{2^{k}}\right)\leq|I|,$$

$$\frac{\sum_{n\in I}d(n)}{|I|}\leq\frac{2^{k}}{2^{k+1}-1}+\frac{2(\ell_{k}+1)}{|I|}<\frac{1}{2}+\varepsilon,$$


Full text

Product-free sets in the free semigroup

Imre Leader

Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Wilberforce Road, Cambridge CB3 0WB, UK

Shoham Letzter

ETH Institute for Theoretical Studies, 8092 Zurich, Switzerland

Bhargav Narayanan

Department of Mathematics, Rutgers University, Piscataway NJ 08854, USA

Mark Walters

School of Mathematical Sciences, Queen Mary, University of London, London E1 4NS, UK

(Date: 6 December 2018)

Abstract.

In this paper, we study product-free subsets of the free semigroup over a finite alphabet $\mathscr{A}$. We prove that the maximum density of a product-free subset of the free semigroup over $\mathscr{A}$, with respect to the natural measure that assigns a weight of $|\mathscr{A}|^{-n}$ to each word of length $n$, is precisely $1/2$.

2010 Mathematics Subject Classification:

Primary 20M05; Secondary 05D05

1. Introduction

A subset $S$ of a semigroup is said to be product-free if there do not exist $x,y,z\in S$ (not necessarily distinct) such that $x\bullet y=z$; it is customary to call $S$ sum-free when the underlying semigroup is abelian.

It is a well known fact (and an easy exercise) that any sum-free subset of the integers has upper density at most $1/2$ . Sum-free subsets of the integers, and of abelian groups in general, have been studied by very many researchers over the last fifty years. For example, from the work of Green and Ruzsa [4], there is now a complete picture of how large a sum-free set we can find in any finite abelian group. We refer the reader to the surveys of Tao and Vu [7] and Kedlaya [5] for more information on these questions.

Product-free subsets of finite non-abelian groups were first investigated by Babai and Sós [1]. Following foundational work by Gowers [3] demonstrating so-called ‘product-mixing’ phenomena in groups with no low-dimensional representations, there has been a great deal of recent work in the non-abelian setting; for instance, in a recent breakthrough, Eberhard [2] determined how large a product-free subset of the alternating group can be.

In light of these developments, it is natural to ask what one can say about product-free sets in infinite non-abelian structures, a setting in which our knowledge is a bit more limited. Perhaps the first natural place to look among infinite non-abelian structures is among those that are free, so here, we shall investigate how large product-free subsets of the free semigroup can be.

2. Our results

Let $\mathscr{A}$ be a finite set. We write $\mathcal{F}=\mathcal{F}_{\mathscr{A}}$ for the free semigroup over $\mathscr{A}$ ; in other words, $\mathcal{F}$ is the set of all finite words over the alphabet $\mathscr{A}$ equipped with the associative operation of concatenation. While we state and prove our results for finite alphabets of all possible sizes for the sake of completeness, the reader will lose nothing by supposing that $\mathscr{A}$ is a two-element set in what follows; indeed, this case captures all the difficulties inherent in the questions we study.

Recall that a set $S\subset\mathcal{F}$ is product-free if, writing $\bullet$ for the operation of concatenation, there do not exist words $x,y,z\in S$ (not necessarily distinct) such that $x\bullet y=z$. There is an obvious example of a ‘large’ subset of $\mathcal{F}$ that is product-free: when $\mathscr{A}=\{a,b\}$ for instance, the set of words which contain an odd number of occurrences of the symbol $a$ (or $b$, for that matter) is easily seen to be a product-free set that contains, roughly, half the words from $\mathcal{F}$. Our aim in this paper is to prove that these sets are, in a precise sense, the largest product-free subsets of $\mathcal{F}$. We remark in passing that there are several other product-free sets that are ‘equally large’: for any nonempty subset $\Gamma\subset\mathscr{A}$, the odd-occurrence set $\mathcal{O}_{\Gamma}\subset\mathcal{F}$ generated by $\Gamma$, namely the set of words in which the total number of occurrences of symbols from $\Gamma$ is odd, is easily seen to be a product-free set; in the case where $\mathscr{A}=\{a,b\}$, our earlier example corresponds to taking $\Gamma=\{a\}$, and taking $\Gamma=\{a,b\}$ gives us the set of all words of odd length, for example.
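The claims about odd-occurrence sets can be checked by brute force on short words. The following is a minimal sketch, not part of the paper: the function names are hypothetical, words are modelled as Python strings, and product-freeness is only tested up to a chosen length bound.

```python
from fractions import Fraction
from itertools import product


def layer(alphabet, n):
    """All words of length n over `alphabet`, as strings."""
    return ["".join(t) for t in product(alphabet, repeat=n)]


def odd_occurrence(gamma):
    """Membership test for the odd-occurrence set generated by `gamma`."""
    return lambda word: sum(ch in gamma for ch in word) % 2 == 1


def layer_density(member, alphabet, n):
    """Fraction of the words of length n that lie in the set."""
    words = layer(alphabet, n)
    return Fraction(sum(member(w) for w in words), len(words))


def is_product_free_up_to(member, alphabet, max_len):
    """Test x + y for all members x, y with len(x) + len(y) <= max_len."""
    members = [
        w for n in range(1, max_len) for w in layer(alphabet, n) if member(w)
    ]
    return not any(
        member(x + y)
        for x in members
        for y in members
        if len(x) + len(y) <= max_len
    )


if __name__ == "__main__":
    in_set = odd_occurrence({"a"})
    print(is_product_free_up_to(in_set, "ab", 8))
    print([layer_density(in_set, "ab", n) for n in range(1, 6)])
```

The check is finite, so it only illustrates the parity argument: concatenating two words with an odd count gives a word with an even count.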

To formally state our results, we need a way to measure the size of a set $S\subset\mathcal{F}$ . For an integer $n\in\mathbb{N}$ , the layer $\mathcal{F}(n)\subset\mathcal{F}$ is the set of words of length $n$ , and the ball $\mathcal{F}_{\leq}(n)\subset\mathcal{F}$ is the set of words of length at most $n$ . As a first attempt, one might define the density of a set $S\subset\mathcal{F}$ via its densities in balls, namely as the quantity

$$\limsup_{n\to\infty}\frac{|S\cap\mathcal{F}_{\leq}(n)|}{|\mathcal{F}_{\leq}(n)|}.$$

However, a little thought should convince the reader that the counting measure is somewhat ill-suited for our purposes. Indeed, when $|\mathscr{A}|>1$ , almost all the words in $\mathcal{F}_{\leq}(n)$ are long since $|\mathcal{F}(n)|\geq|\mathcal{F}_{\leq}(n)|/2$ . Consequently, we may find product-free sets that are intuitively small, and yet have density arbitrarily close to $1$ in the above sense; for example, for any sufficiently large $c\in\mathbb{N}$ , the set

$$\bigcup_{n\geq c}\left(\mathcal{F}_{\leq}(2^{n}+c)\setminus\mathcal{F}_{\leq}(2^{n})\right)$$

is product-free and has density at least $1-1/c$ in the above sense, provided $|\mathscr{A}|>1$.
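The counting behind this remark is easy to reproduce with exact arithmetic. The sketch below only illustrates how the counting measure concentrates on long words; the helper names and the parameter `q` (the alphabet size) are assumptions made for the example, and it does not test product-freeness.

```python
from fractions import Fraction


def layer_size(q, n):
    """Number of words of length n over an alphabet with q symbols."""
    return q**n


def ball_size(q, n):
    """Number of words of length 1, ..., n over an alphabet with q symbols."""
    return sum(q**i for i in range(1, n + 1))


def top_band_share(q, n, c):
    """Share of the ball of radius n taken by the c longest layers."""
    band = sum(q**i for i in range(n - c + 1, n + 1))
    return Fraction(band, ball_size(q, n))


if __name__ == "__main__":
    c = 5
    n = 2**c + c
    print(float(top_band_share(2, n, c)))
```

For a two-letter alphabet the last `c` layers already account for roughly a `1 - 2**(-c)` share of the ball, which is why a set supported on a few thin bands of lengths can look dense under this measure.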

A more natural approach is to assign a weight of $|\mathscr{A}|^{-n}$ to each word of $\mathcal{F}(n)$ , thereby ensuring that the layers $\mathcal{F}(n)$ have the same total weight for all $n\in\mathbb{N}$ . To this end, for a subset $S\subset\mathcal{F}$ and an integer $n\in\mathbb{N}$ , we define the density of $S$ in the layer $\mathcal{F}(n)$ by $d_{S}(n)=|S\cap\mathcal{F}(n)|/|\mathcal{F}(n)|$ . With this definition in place, most standard notions of density may now be carried over: we define the upper asymptotic density of $S$ by

$$\bar{d}(S)=\limsup_{n\to\infty}\frac{\sum_{i=1}^{n}d_{S}(i)}{n},$$

and the upper Banach density of $S$ by

$$d^{*}(S)=\limsup_{n-m\to\infty}\frac{\sum_{i=m}^{n}d_{S}(i)}{n-m+1}.$$

Of course, the latter is a weaker notion of density than the former; indeed, it is clear that $\bar{d}(S)\leq d^{*}(S)$ for any $S\subset\mathcal{F}$ .

It is easy to see that any odd-occurrence set has both an upper asymptotic density and an upper Banach density of $1/2$ . Our aim in this note is to show that product-free sets cannot be any larger; our main result is as follows.

Theorem 1.

Let $\mathscr{A}$ be a finite set. If $S\subset\mathcal{F}_{\mathscr{A}}$ is product-free, then $d^{*}(S)\leq 1/2$.

Let us mention that product-free sets in cancellative semigroups have been studied by Łuczak and Schoen [6]; while their results are sharp for such semigroups in general, these results do not give us any effective bounds on the size of a product-free subset of $\mathcal{F}$ .

Before we turn to the proof of Theorem 1, it is worth pointing out that there is a simple argument that allows us to bound the upper asymptotic density of a product-free subset of $\mathcal{F}$ away from $1$ . Indeed, suppose that $S\subset\mathcal{F}$ is product-free. We then have

$$d_{S}(m)d_{S}(n)+d_{S}(m+n)\leq 1$$

for any $m,n\in\mathbb{N}$ since the sets $S\cap\mathcal{F}(m+n)$ and $(S\cap\mathcal{F}(m))\bullet(S\cap\mathcal{F}(n))$ must be disjoint. Now, consider the set of integers $n\in\mathbb{N}$ for which $d_{S}(n)>\phi$, where $\phi=(\sqrt{5}-1)/2\approx 0.618$ is the unique positive solution to the equation $x^{2}+x=1$. It follows from the inequality above that this set of integers must be sum-free. It is now easy to see that $\bar{d}(S)\leq(1+\phi)/2\approx 0.809$.
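The two steps left to the reader here can be spelled out as follows. This is a sketch of the standard argument; the name $T$ for the set of integers $n$ with $d_{S}(n)>\phi$ is introduced only for this illustration. If $m$, $n$ and $m+n$ all belonged to $T$, we would have

$$d_{S}(m)d_{S}(n)+d_{S}(m+n)>\phi^{2}+\phi=1,$$

contradicting the inequality for product-free sets, so $T$ is sum-free and hence has upper density at most $1/2$. Bounding $d_{S}(i)$ by $1$ for $i\in T$ and by $\phi$ otherwise then gives

$$\frac{1}{N}\sum_{i=1}^{N}d_{S}(i)\leq\phi+(1-\phi)\frac{|T\cap\{1,\dots,N\}|}{N},\qquad\text{so}\qquad\bar{d}(S)\leq\phi+\frac{1-\phi}{2}=\frac{1+\phi}{2}.$$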

We shall have to work somewhat harder to prove Theorem 1, which improves this bound of $(1+\phi)/2$ for the upper asymptotic density to the optimal bound of $1/2$ for the upper Banach density. The proof of Theorem 1 is given in Section 3. We conclude this note with a discussion of some open problems in Section 4.

3. Proof of the main result

We begin by fixing our finite alphabet $\mathscr{A}$ . In the sequel, $\mathcal{F}$ will always mean $\mathcal{F}_{\mathscr{A}}$ , the free semigroup over this fixed alphabet $\mathscr{A}$ .

It will be helpful to establish some notation. For a pair of words $x,w\in\mathcal{F}$, we say that $x$ is a prefix of $w$ if $w=x\bullet y$ for some $y\in\mathcal{F}$, and that $x$ is a suffix of $w$ if $w=y\bullet x$ for some $y\in\mathcal{F}$. For a pair of sets $S_{1},S_{2}\subset\mathcal{F}$, we write $S_{1}\bullet S_{2}$ for their (Minkowski) product; in other words,

$$S_{1}\bullet S_{2}=\{w_{1}\bullet w_{2}:w_{1}\in S_{1},\,w_{2}\in S_{2}\}.$$

For a set $S\subset\mathcal{F}$ and an integer $n\in\mathbb{N}$ , we set $S(n)=S\cap\mathcal{F}(n)$ . One of the key ideas in the proof of Theorem 1 is the following definition. For any sequence of positive integers $\ell_{1}<\ell_{2}<\dots<\ell_{k}<n$ , we define

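$$S(n;\ell_{1},\ell_{2},\dots,\ell_{k})=\left\{w\in S(n):w\text{ has no prefix in }S(\ell_{1})\cup S(\ell_{2})\cup\dots\cup S(\ell_{k})\right\};$$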

in other words,

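$$S(n;\ell_{1},\ell_{2},\dots,\ell_{k})=S(n)\setminus\left(\bigcup_{i=1}^{k}S(\ell_{i})\bullet\mathcal{F}(n-\ell_{i})\right).$$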

Let us note, for any $S\subset\mathcal{F}$, that the sets $S(n;m)$ and $S(m)\bullet\mathcal{F}(n-m)$ are disjoint for any pair of positive integers $m<n$. Recall that $d_{S}(n)=|S(n)||\mathcal{F}(n)|^{-1}$; we analogously define

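$$d_{S}(n;\ell_{1},\ell_{2},\dots,\ell_{k})=\frac{|S(n;\ell_{1},\ell_{2},\dots,\ell_{k})|}{|\mathcal{F}(n)|}.$$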

When the set $S$ in question is clear, we write $d(n)$ and $d(n;\ell_{1},\ell_{2},\dots,\ell_{k})$ for $d_{S}(n)$ and $d_{S}(n;\ell_{1},\ell_{2},\dots,\ell_{k})$ , respectively. Recall that for any product-free set $S\subset\mathcal{F}$ and any $m,n\in\mathbb{N}$ , we have

$$d(m)d(n)+d(m+n)\leq 1.$$

We start by proving a generalisation of this fact.

Proposition 2.

If $S\subset\mathcal{F}$ is product-free, then for any sequence of positive integers $\ell_{1}<\ell_{2}<\dots<\ell_{k}<n$, we have

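$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n)\leq d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})\leq 1.$$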

Proof.

First, consider the products

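$$S(\ell_{1})\bullet S(n-\ell_{1}),\ S(\ell_{2};\ell_{1})\bullet S(n-\ell_{2}),\ \dots,\ S(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\bullet S(n-\ell_{k}).$$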

These subsets of $\mathcal{F}(n)$ are by definition disjoint. Let $L^{\prime}$ be the union of these $k$ sets. Since $S$ is product-free, $L^{\prime}$ and $S(n)$ are disjoint as well. Let $L=L^{\prime}\cup S(n)$ ; clearly, the density of $L$ in $\mathcal{F}(n)$ is

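$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n).$$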

Next, consider the Minkowski products

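$$S(\ell_{1})\bullet\mathcal{F}(n-\ell_{1}),\ S(\ell_{2};\ell_{1})\bullet\mathcal{F}(n-\ell_{2}),\ \dots,\ S(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\bullet\mathcal{F}(n-\ell_{k}).$$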

These subsets of $\mathcal{F}(n)$ are again disjoint by definition; let $R^{\prime}$ denote their union. Note that $R^{\prime}$ and $S(n;\ell_{1},\ell_{2},\dots,\ell_{k})$ are disjoint. Let $R=R^{\prime}\cup S(n;\ell_{1},\ell_{2},\dots,\ell_{k})$ ; it is easy to see that the density of $R$ in $\mathcal{F}(n)$ is

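$$d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})$$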

and that this quantity is therefore at most $1$ .

To finish the proof, it suffices to show that

$$L^{\prime}\cup S(n)=L\subset R=R^{\prime}\cup S(n;\ell_{1},\ell_{2},\dots,\ell_{k}).$$

It is easy to see that $L^{\prime}\subset R^{\prime}$. Therefore, it is sufficient to show that $S(n)$ is a subset of $R^{\prime}\cup S(n;\ell_{1},\ell_{2},\dots,\ell_{k})$. To see this, note that any word from $S(n)$ which has a prefix in $S(\ell_{1})\cup S(\ell_{2})\cup\dots\cup S(\ell_{k})$ is also contained in $R^{\prime}$. In other words, $S(n)\setminus S(n;\ell_{1},\ell_{2},\dots,\ell_{k})\subset R^{\prime}$; the result follows. ∎
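To see the quantities in Proposition 2 on a concrete set, one can evaluate both sides exhaustively for short words. The sketch below is an illustration only: it fixes the two-letter alphabet and the odd-occurrence set for the symbol `a` as the product-free set, and every function name is hypothetical.

```python
from fractions import Fraction
from itertools import combinations, product

ALPHABET = "ab"


def layer(n):
    """All words of length n over ALPHABET."""
    return ["".join(t) for t in product(ALPHABET, repeat=n)]


def member(word):
    """Example product-free set: words with an odd number of 'a's."""
    return word.count("a") % 2 == 1


def restricted_layer(n, earlier):
    """Words of length n in the set with no prefix of a length in `earlier` in the set."""
    return [
        w for w in layer(n)
        if member(w) and not any(member(w[:l]) for l in earlier)
    ]


def d(n, earlier=()):
    """Density of the restricted layer inside the layer of length n."""
    return Fraction(len(restricted_layer(n, earlier)), len(ALPHABET) ** n)


def proposition_2_sides(ls, n):
    """Return (lhs, rhs) for a tuple ls = (l_1 < ... < l_k) with l_k < n."""
    prefix_terms = [d(l, ls[:j]) for j, l in enumerate(ls)]
    lhs = sum(t * d(n - l) for t, l in zip(prefix_terms, ls)) + d(n)
    rhs = sum(prefix_terms) + d(n, ls)
    return lhs, rhs


def holds_for_all_sequences(max_n):
    """Check lhs <= rhs <= 1 for every increasing sequence below each n <= max_n."""
    return all(
        lhs <= rhs <= 1
        for n in range(2, max_n + 1)
        for k in range(1, n)
        for ls in combinations(range(1, n), k)
        for lhs, rhs in [proposition_2_sides(ls, n)]
    )


if __name__ == "__main__":
    print(proposition_2_sides((1, 2), 3))
    print(holds_for_all_sequences(6))
```

Exact rational arithmetic via `Fraction` avoids floating-point noise in cases where the two sides coincide.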

With the above observation in hand, we are now ready to prove Theorem 1.

Proof of Theorem 1.

We prove by contradiction that the upper Banach density of a product-free set is at most $1/2$ .

Suppose that $S\subset\mathcal{F}$ is product-free and that $d^{*}(S)>1/2+\varepsilon$ for some $\varepsilon>0$ . We then claim that we may find an increasing sequence of positive integers $(\ell_{k})_{k\in\mathbb{N}}$ such that

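$$d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\geq\frac{1}{2}+\frac{1}{4}+\dots+\frac{1}{2^{k}}=1-\frac{1}{2^{k}}$$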

for each $k\in\mathbb{N}$ .

We construct this sequence inductively. Since $d^{*}(S)>1/2$ , it is clear that we may find $\ell_{1}\in\mathbb{N}$ such that $d(\ell_{1})\geq 1/2$ . Having found $\ell_{1}<\ell_{2}<\dots<\ell_{k}$ as required, we choose $\ell_{k+1}$ as follows. Since $d^{*}(S)>1/2+\varepsilon$ , there exist arbitrarily long intervals $I\subset\mathbb{N}$ that satisfy

$$\frac{\sum_{n\in I}d(n)}{|I|}>\frac{1}{2}+\varepsilon.$$

Choose such an interval $I$ whose length is sufficiently larger than $\ell_{k}$ ; we may assume, by passing to a sub-interval if necessary, that $\min I>\ell_{k}$ . We claim that it is possible to choose $\ell_{k+1}$ from $I$ ; in other words, we claim that there exists an $n\in I$ such that

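$$d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})\geq 1-\frac{1}{2^{k+1}}.$$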

We prove this claim by contradiction. Suppose that there is no such $n\in I$ . Then, by Proposition 2, we have

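$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n)\leq d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})+d(n;\ell_{1},\ell_{2},\dots,\ell_{k})<1-\frac{1}{2^{k+1}}$$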

for each $n\in I$ . By summing the above inequality over all $n\in I$ , we get

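$$\sum_{n\in I^{\prime}}d(n)\left(1+d(\ell_{1})+d(\ell_{2};\ell_{1})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})\right)<|I|\left(1-\frac{1}{2^{k+1}}\right),$$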

where $I^{\prime}\subset I$ is the set of $n\in I$ with $n+\ell_{k}<\max I$ . This implies, by the inductive hypothesis, that

$$\sum_{n\in I^{\prime}}d(n)\left(2-\frac{1}{2^{k}}\right)<|I|\left(1-\frac{1}{2^{k+1}}\right),$$

or equivalently, $\sum_{n\in I^{\prime}}d(n)<|I|/2$ . Therefore, we have

$$\sum_{n\in I}d(n)\leq\sum_{n\in I^{\prime}}d(n)+\ell_{k}+1<\frac{|I|}{2}+\ell_{k}+1,$$

which contradicts the fact that $\sum_{n\in I}d(n)>|I|/2+\varepsilon|I|$, provided $|I|>(\ell_{k}+1)/\varepsilon$.
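The arithmetic behind the last two steps is short and may be written out as follows; this is an expansion of the argument above, not additional material from the paper. The factor multiplying the sum over $I^{\prime}$ is exactly twice the factor multiplying $|I|$:

$$2-\frac{1}{2^{k}}=2\left(1-\frac{1}{2^{k+1}}\right),\qquad\text{so}\qquad\sum_{n\in I^{\prime}}d(n)<|I|\cdot\frac{1-2^{-(k+1)}}{2-2^{-k}}=\frac{|I|}{2}.$$

Moreover, the integers $n\in I\setminus I^{\prime}$ are those with $\max I-\ell_{k}\leq n\leq\max I$, so there are at most $\ell_{k}+1$ of them, and each contributes $d(n)\leq 1$ to the sum over $I$.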

We now finish the proof of the theorem by showing that the existence of this sequence $(\ell_{k})_{k\in\mathbb{N}}$ contradicts our initial assumption that $d^{*}(S)>1/2+\varepsilon$. Fix a $k\in\mathbb{N}$ large enough to ensure that

$$\frac{2^{k}}{2^{k+1}-1}<\frac{1+\varepsilon}{2}$$

and consider any interval $I\subset\mathbb{N}$ with $|I|>4(\ell_{k}+1)/\varepsilon$ . We know from Proposition 2 that

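$$d(\ell_{1})d(n-\ell_{1})+d(\ell_{2};\ell_{1})d(n-\ell_{2})+\dots+d(\ell_{k};\ell_{1},\ell_{2},\dots,\ell_{k-1})d(n-\ell_{k})+d(n)\leq 1$$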

for each $n\in\mathbb{N}$ with $n>\ell_{k}$ ; summing this inequality over such $n\in I$ , we get

$$\sum_{n\in I^{\prime}}d(n)\left(2-\frac{1}{2^{k}}\right)\leq|I|,$$

where $I^{\prime}$ is the set of $n\in I$ with $n>\ell_{k}$ and $n+\ell_{k}<\max I$ . Therefore,

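$$\frac{\sum_{n\in I}d(n)}{|I|}\leq\frac{2^{k}}{2^{k+1}-1}+\frac{2(\ell_{k}+1)}{|I|}<\frac{1}{2}+\varepsilon,$$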

which is a contradiction; this proves the claimed upper bound in Theorem 1. ∎

4. Conclusion

A common line of enquiry in the study of product-free sets is to ask for ‘asymmetric’ versions of results bounding the upper density of product-free sets. In this spirit, it is natural to ask whether an analogue of Theorem 1 continues to hold when one wishes to solve the equation $x\bullet y=z$ with $x$, $y$ and $z$ in specified subsets of $\mathcal{F}$. More precisely, if $X,Y,Z\subset\mathcal{F}$ are such that there are no solutions to $x\bullet y=z$ with $x\in X$, $y\in Y$ and $z\in Z$, one might ask if one of $X$, $Y$ or $Z$ has an upper asymptotic density of at most $1/2$. However, it is not hard to construct, for any $\varepsilon>0$, three sets $X,Y,Z\subset\mathcal{F}$, each of upper asymptotic density at least $\phi-\varepsilon$, where $\phi=(\sqrt{5}-1)/2$, such that there are no solutions to $x\bullet y=z$ with $x\in X$, $y\in Y$ and $z\in Z$. Indeed, pick a suitably large $n\in\mathbb{N}$ and choose any set $W\subset\mathcal{F}(n)$ such that $||W|/|\mathcal{F}(n)|-\phi|<\varepsilon/3$. Now take $X$ to be the set of all words with a prefix in $W$, $Y$ to be the set of all words with a suffix in $W$, and $Z$ to be the set $\mathcal{F}\setminus(X\bullet Y)$. Clearly, there are no solutions to $x\bullet y=z$ with $x\in X$, $y\in Y$ and $z\in Z$; it is also not hard to check that each of $X$, $Y$ and $Z$ has an upper asymptotic density at least $\phi-\varepsilon$.
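The density check can be sketched as follows, writing $\rho=|W|/|\mathcal{F}(n)|$; the symbol $\rho$ is introduced only for this illustration. A word of length $m\geq n$ lies in $X$ exactly when its first $n$ letters form a word of $W$, and similarly for $Y$ and its last $n$ letters, while a word of length $m\geq 2n$ lies in $X\bullet Y$ exactly when both conditions hold. Since the two conditions concern disjoint positions when $m\geq 2n$, we get

$$d_{X}(m)=d_{Y}(m)=\rho\quad(m\geq n),\qquad d_{Z}(m)=1-\rho^{2}\quad(m\geq 2n).$$

Using $\phi^{2}+\phi=1$ and $\rho+\phi\leq 2$, this gives

$$|d_{Z}(m)-\phi|=|\rho^{2}-\phi^{2}|=|\rho-\phi|(\rho+\phi)<\frac{2\varepsilon}{3},$$

so all three layer densities exceed $\phi-\varepsilon$ for every $m\geq 2n$.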

Next, it would be interesting to understand what product-free sets of maximal density look like. As we saw earlier, several non-isomorphic extremal constructions are furnished by the family of odd-occurrence sets. We suspect that these might be the only constructions of maximal density, and make the following conjecture.

Conjecture 3.

Let $\mathscr{A}$ be a finite set. If $S\subset\mathcal{F}_{\mathscr{A}}$ is product-free and $d^{*}(S)=1/2$, then $S\subset\mathcal{O}_{\Gamma}$ for some nonempty subset $\Gamma\subset\mathscr{A}$.

Finally, another natural direction is to study product-free subsets of the free group $\mathbf{F}_{\mathscr{A}}$ over a finite alphabet $\mathscr{A}$ . Similarly to the situation in this paper, the most natural measure to consider in the case of the free group $\mathbf{F}_{\mathscr{A}}$ would be the one that assigns a weight of $|\mathscr{A}|(|\mathscr{A}|-1)^{-(n-1)}$ to each irreducible word of length $n$ . The different notions of density defined here for the free semigroup then have analogous definitions in the free group, and we believe that an analogue of Theorem 1 should hold in the free group as well; concretely, we conjecture the following.

Conjecture 4.

For any finite alphabet $\mathscr{A}$, no product-free subset of the free group $\mathbf{F}_{\mathscr{A}}$ has upper Banach density exceeding $1/2$.

Note that, in the proof of Theorem 1, we rely crucially on the fact that there is exactly one way to write a word of length $m+n$ as the concatenation of a word of length $m$ with a word of length $n$ ; of course, we lose this property when working with free groups, so we believe that some new ideas will be required to understand product-free sets in free groups.

Acknowledgements

The second author would like to acknowledge the support of Dr. Max Rössler, the Walter Haefner Foundation, and the ETH Zurich Foundation. The third author wishes to acknowledge support from NSF grant DMS-1800521.

Bibliography

[1] L. Babai and V. T. Sós, Sidon sets in groups and induced subgraphs of Cayley graphs, European J. Combin. 6 (1985), 101–114.
[2] S. Eberhard, Product mixing in the alternating group, Discrete Analysis (2016:2), 19 pp.
[3] W. T. Gowers, Quasirandom groups, Combin. Probab. Comput. 17 (2008), 363–387.
[4] B. Green and I. Z. Ruzsa, Sum-free sets in abelian groups, Israel J. Math. 147 (2005), 157–188.
[5] K. S. Kedlaya, Product-free subsets of groups, then and now, Communicating mathematics, Contemp. Math., vol. 479, Amer. Math. Soc., Providence, RI, 2009, pp. 169–177.
[6] T. Łuczak and T. Schoen, Sum-free subsets of right cancellative semigroups, European J. Combin. 22 (2001), 999–1002.
[7] T. Tao and V. Vu, Sumfree sets in groups: a survey, J. Comb. 8 (2017), 541–552.
