Circular automata synchronize with high probability

Christoph Aistleitner; Daniele D'Angeli; Abraham Gutierrez; Emanuele; Rodaro; Amnon Rosenmann

arXiv:1906.02602·math.CO·July 9, 2020·J. Comb. Theory A

Circular automata synchronize with high probability

Christoph Aistleitner, Daniele D'Angeli, Abraham Gutierrez, Emanuele, Rodaro, Amnon Rosenmann

PDF

TL;DR

This paper proves that a random circular automaton of size n synchronizes with high probability, using probabilistic methods and properties of associated random matrices, and relates synchronization probability to chromatic polynomials of circulant graphs.

Contribution

It establishes that random circular automata synchronize with high probability and introduces a novel approach using random matrix properties and graph chromatic polynomials.

Findings

01

Synchronization probability approaches 1 as n increases

02

Provides bounds on synchronization probability using chromatic polynomials

03

Connects automaton synchronization to properties of circulant graphs

Abstract

In this paper we prove that a uniformly distributed random circular automaton $A_{n}$ of order $n$ synchronizes with high probability (whp). More precisely, we prove that $P [A_{n} synchronizes] = 1 - O (\frac{1}{n}) .$ The main idea of the proof is to translate the synchronization problem into properties of a random matrix; these properties are then handled with tools of the probabilistic method. Additionally, we provide an upper bound for the probability of synchronization of circular automata in terms of chromatic polynomials of circulant graphs.

Equations245

P [A_{n} synchronizes] = 1 - O (\frac{1}{n}) .

P [A_{n} synchronizes] = 1 - O (\frac{1}{n}) .

γ (q, w) := ((q, q_{i_{1}})_{a_{i_{1}}}, (q_{i_{1}}, q_{i_{2}})_{a_{i_{2}}}, \dots, (q_{i_{k - 1}}, q^{'})_{a_{i_{k}}})

γ (q, w) := ((q, q_{i_{1}})_{a_{i_{1}}}, (q_{i_{1}}, q_{i_{2}})_{a_{i_{2}}}, \dots, (q_{i_{k - 1}}, q^{'})_{a_{i_{k}}})

P [{b \in M_{p} : \leavevmode A_{p} (b) \mbox sy n c h r o ni z es}] = 1 - \frac{p !}{p ^{p}} = 1 - Θ (\frac{p}{e ^{p}}) .

P [{b \in M_{p} : \leavevmode A_{p} (b) \mbox sy n c h r o ni z es}] = 1 - \frac{p !}{p ^{p}} = 1 - Θ (\frac{p}{e ^{p}}) .

P [{b \in M_{n} : \leavevmode A_{n} (b) \mbox sy n c h r o ni z es}] = 1 - O (\frac{1}{n})

P [{b \in M_{n} : \leavevmode A_{n} (b) \mbox sy n c h r o ni z es}] = 1 - O (\frac{1}{n})

\big{|}r\big{|}_{n}:=\min\left\{(r)_{n},(-r)_{n})\right\}\in\left\{0,1,\ldots,\left\lfloor\frac{n}{2}\right\rfloor\right\}.

\big{|}r\big{|}_{n}:=\min\left\{(r)_{n},(-r)_{n})\right\}\in\left\{0,1,\ldots,\left\lfloor\frac{n}{2}\right\rfloor\right\}.

T_{\mathbf{b}}:=\begin{bmatrix}\big{|}b_{0}-b_{1}\big{|}_{n}&\big{|}b_{1}-b_{2}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{k+1}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{0}\big{|}_{n}\\ \big{|}b_{0}-b_{2}\big{|}_{n}&\big{|}b_{1}-b_{3}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{(k+2)_{n}}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{1}\big{|}_{n}\\ \vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \big{|}b_{0}-b_{i}\big{|}_{n}&\big{|}b_{1}-b_{1+i}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{(k+i)_{n}}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{i-1}\big{|}_{n}\\ \vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \big{|}b_{0}-b_{\left\lfloor\frac{n}{2}\right\rfloor}\big{|}_{n}&\big{|}b_{1}-b_{1+\left\lfloor\frac{n}{2}\right\rfloor}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{(k+\left\lfloor\frac{n}{2}\right\rfloor)_{n}}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{\left\lfloor\frac{n}{2}\right\rfloor-1}\big{|}_{n}\end{bmatrix},

T_{\mathbf{b}}:=\begin{bmatrix}\big{|}b_{0}-b_{1}\big{|}_{n}&\big{|}b_{1}-b_{2}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{k+1}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{0}\big{|}_{n}\\ \big{|}b_{0}-b_{2}\big{|}_{n}&\big{|}b_{1}-b_{3}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{(k+2)_{n}}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{1}\big{|}_{n}\\ \vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \big{|}b_{0}-b_{i}\big{|}_{n}&\big{|}b_{1}-b_{1+i}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{(k+i)_{n}}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{i-1}\big{|}_{n}\\ \vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \big{|}b_{0}-b_{\left\lfloor\frac{n}{2}\right\rfloor}\big{|}_{n}&\big{|}b_{1}-b_{1+\left\lfloor\frac{n}{2}\right\rfloor}\big{|}_{n}&\ldots&\big{|}b_{k}-b_{(k+\left\lfloor\frac{n}{2}\right\rfloor)_{n}}\big{|}_{n}&\ldots&\big{|}b_{n-1}-b_{\left\lfloor\frac{n}{2}\right\rfloor-1}\big{|}_{n}\end{bmatrix},

T_{\mathbf{b}}(i,j)=\big{|}b_{j}-b_{(j+i)_{n}}\big{|}_{n}\mbox{ for }1\leq i\leq\left\lfloor\frac{n}{2}\right\rfloor\mbox{ and }0\leq j\leq n-1.

T_{\mathbf{b}}(i,j)=\big{|}b_{j}-b_{(j+i)_{n}}\big{|}_{n}\mbox{ for }1\leq i\leq\left\lfloor\frac{n}{2}\right\rfloor\mbox{ and }0\leq j\leq n-1.

R_{i}(\mathbf{b}):=\#\left\{\big{|}b_{0}-b_{(0+i)_{n}}\big{|}_{n},\big{|}b_{1}-b_{(1+i)_{n}}\big{|}_{n},\ldots,\big{|}b_{n-1}-b_{i-1}\big{|}_{n}\right\}.

R_{i}(\mathbf{b}):=\#\left\{\big{|}b_{0}-b_{(0+i)_{n}}\big{|}_{n},\big{|}b_{1}-b_{(1+i)_{n}}\big{|}_{n},\ldots,\big{|}b_{n-1}-b_{i-1}\big{|}_{n}\right\}.

E_{row} (α) := i = 1 ⋂ ⌊ \frac{n}{2} ⌋ {b \in M_{n} : \leavevmode R_{i} (b) \geq α ⌊ \frac{n}{2} ⌋},

E_{row} (α) := i = 1 ⋂ ⌊ \frac{n}{2} ⌋ {b \in M_{n} : \leavevmode R_{i} (b) \geq α ⌊ \frac{n}{2} ⌋},

E_{row}^{c} (α) := i = 1 ⋃ ⌊ \frac{n}{2} ⌋ {b \in M_{n} : \leavevmode R_{i} (b) < α ⌊ \frac{n}{2} ⌋} .

E_{row}^{c} (α) := i = 1 ⋃ ⌊ \frac{n}{2} ⌋ {b \in M_{n} : \leavevmode R_{i} (b) < α ⌊ \frac{n}{2} ⌋} .

E_{zero} (β) := {b \in M_{n} : \leavevmode D (b) \geq β ⌊ \frac{n}{2} ⌋},

E_{zero} (β) := {b \in M_{n} : \leavevmode D (b) \geq β ⌊ \frac{n}{2} ⌋},

E_{zero}^{c} (β) := {b \in M_{n} : \leavevmode D (b) < β ⌊ \frac{n}{2} ⌋},

E_{zero}^{c} (β) := {b \in M_{n} : \leavevmode D (b) < β ⌊ \frac{n}{2} ⌋},

D_{i}(\mathbf{b}):=\begin{cases}1,&\mbox{ if there exist }\,k,l\in\mathbb{Z}_{n}\mbox{ such that }\big{|}k-l\big{|}_{n}=i\mbox{ and }\big{|}b_{k}-b_{l}\big{|}_{n}=0;\\ 0,&\mbox{ otherwise, }\end{cases}

D_{i}(\mathbf{b}):=\begin{cases}1,&\mbox{ if there exist }\,k,l\in\mathbb{Z}_{n}\mbox{ such that }\big{|}k-l\big{|}_{n}=i\mbox{ and }\big{|}b_{k}-b_{l}\big{|}_{n}=0;\\ 0,&\mbox{ otherwise, }\end{cases}

D (b) := i = 1 \sum ⌊ \frac{n}{2} ⌋ D_{i} (b) .

D (b) := i = 1 \sum ⌊ \frac{n}{2} ⌋ D_{i} (b) .

P [E_{row}^{c} (α)] = O (\frac{1}{n})

P [E_{row}^{c} (α)] = O (\frac{1}{n})

P [E_{zero}^{c} (β)] = O (\frac{1}{n}) .

P [E_{zero}^{c} (β)] = O (\frac{1}{n}) .

P [{b \in M_{n} : \leavevmode A_{n} (b) \mbox sy n c h r o ni z es}] \geq P [E_{row} (α) \cap E_{zero} (β)] = 1 - P [E_{row}^{c} (α) \cup E_{zero}^{c} (β)] \geq 1 - P [E_{row}^{c} (α)] - P [E_{zero}^{c} (β)] .

P [{b \in M_{n} : \leavevmode A_{n} (b) \mbox sy n c h r o ni z es}] \geq P [E_{row} (α) \cap E_{zero} (β)] = 1 - P [E_{row}^{c} (α) \cup E_{zero}^{c} (β)] \geq 1 - P [E_{row}^{c} (α)] - P [E_{zero}^{c} (β)] .

P [{b \in M_{n} : \leavevmode A_{n} (b) \mbox sy n c h r o ni z es}] \geq 1 - = O (\frac{1}{n}) P [E_{row}^{c} (α^{⋆})] - = O (\frac{1}{n}) P [E_{zero}^{c} (β^{⋆})] = 1 - O (\frac{1}{n})

P [{b \in M_{n} : \leavevmode A_{n} (b) \mbox sy n c h r o ni z es}] \geq 1 - = O (\frac{1}{n}) P [E_{row}^{c} (α^{⋆})] - = O (\frac{1}{n}) P [E_{zero}^{c} (β^{⋆})] = 1 - O (\frac{1}{n})

T_{\mathbf{b}}(1,0)=\big{|}b_{0}-b_{1}\big{|}_{n},\quad T_{\mathbf{b}}(1,1)=\big{|}b_{1}-b_{2}\big{|}_{n},\quad T_{\mathbf{b}}(2,0)=\big{|}b_{0}-b_{2}\big{|}_{n}

T_{\mathbf{b}}(1,0)=\big{|}b_{0}-b_{1}\big{|}_{n},\quad T_{\mathbf{b}}(1,1)=\big{|}b_{1}-b_{2}\big{|}_{n},\quad T_{\mathbf{b}}(2,0)=\big{|}b_{0}-b_{2}\big{|}_{n}

S = {(i_{1}, j_{1}), (i_{2}, j_{2}), \dots, (i_{k}, j_{k})}

S = {(i_{1}, j_{1}), (i_{2}, j_{2}), \dots, (i_{k}, j_{k})}

\Big{\{}\{j_{1},(j_{1}+i_{1})_{n}\},\,\{j_{2},(j_{2}+i_{2})_{n}\},\ldots,\,\{j_{k},(j_{k}+i_{k})_{n}\}\Big{\}}.

\Big{\{}\{j_{1},(j_{1}+i_{1})_{n}\},\,\{j_{2},(j_{2}+i_{2})_{n}\},\ldots,\,\{j_{k},(j_{k}+i_{k})_{n}\}\Big{\}}.

P [w = 1 ⋂ k {b \in M_{n} : \leavevmode T_{b} (i_{w}, j_{w}) = s_{w}}] = \frac{\prod _{w = 1}^{k} m _{s_{w}}}{n ^{k}}, \forall k \geq 1,

P [w = 1 ⋂ k {b \in M_{n} : \leavevmode T_{b} (i_{w}, j_{w}) = s_{w}}] = \frac{\prod _{w = 1}^{k} m _{s_{w}}}{n ^{k}}, \forall k \geq 1,

m_{s}=\#\{d\in\mathbb{Z}_{n}:\big{|}d\big{|}_{n}=s\}=\begin{cases}&2,\quad\mbox{ if }0<s<\frac{n}{2};\\ &1,\quad\mbox{ if }s=0;\\ &1,\quad\mbox{ if }s=\frac{n}{2}\mbox{ and }\frac{n}{2}\in\mathbb{N};\\ &0,\quad\mbox{ otherwise}.\end{cases}

m_{s}=\#\{d\in\mathbb{Z}_{n}:\big{|}d\big{|}_{n}=s\}=\begin{cases}&2,\quad\mbox{ if }0<s<\frac{n}{2};\\ &1,\quad\mbox{ if }s=0;\\ &1,\quad\mbox{ if }s=\frac{n}{2}\mbox{ and }\frac{n}{2}\in\mathbb{N};\\ &0,\quad\mbox{ otherwise}.\end{cases}

\mathbb{P}\left[\left\{\mathbf{b}\in\mathcal{M}_{n}:\leavevmode\nobreak\ \big{|}b_{p}-b_{p+q}\big{|}_{n}=s\right\}\right]=\frac{n\cdot m_{s}}{n^{2}}=\frac{m_{s}}{n},

\mathbb{P}\left[\left\{\mathbf{b}\in\mathcal{M}_{n}:\leavevmode\nobreak\ \big{|}b_{p}-b_{p+q}\big{|}_{n}=s\right\}\right]=\frac{n\cdot m_{s}}{n^{2}}=\frac{m_{s}}{n},

j_{1} \to (j_{1} + i_{1})_{n} = j_{2} \to (j_{2} + i_{2})_{n} = j_{3} \to \dots \to (j_{l - 1} + i_{l - 1}) = j_{l} \to (j_{l} + i_{l})_{n} = j_{1} .

j_{1} \to (j_{1} + i_{1})_{n} = j_{2} \to (j_{2} + i_{2})_{n} = j_{3} \to \dots \to (j_{l - 1} + i_{l - 1}) = j_{l} \to (j_{l} + i_{l})_{n} = j_{1} .

T_{b} (i_{1}, j_{1}) = T_{b} (i_{2}, j_{2}) = \dots = T_{b} (i_{l - 1}, j_{l - 1}) = 0,

T_{b} (i_{1}, j_{1}) = T_{b} (i_{2}, j_{2}) = \dots = T_{b} (i_{l - 1}, j_{l - 1}) = 0,

P [{b \in M_{n} : \leavevmode T_{b} (i_{k}, j_{k}) = s_{k}}] = P [{b \in M_{n} : \leavevmode T_{b} (i_{k}, j_{k}) = s_{k} and b_{j_{k}} = r}]

P [{b \in M_{n} : \leavevmode T_{b} (i_{k}, j_{k}) = s_{k}}] = P [{b \in M_{n} : \leavevmode T_{b} (i_{k}, j_{k}) = s_{k} and b_{j_{k}} = r}]

P [w = 1 ⋂ k {b : \leavevmode T_{b} (i_{w}, j_{w}) = s_{w}}]

P [w = 1 ⋂ k {b : \leavevmode T_{b} (i_{w}, j_{w}) = s_{w}}]

E_{i}(\mathbf{b}):=\{\big{|}b_{0}-b_{i}\big{|}_{n},\ldots,\big{|}b_{k}-b_{(k+i)_{n}}\big{|}_{n},\ldots,\big{|}b_{n-1}-b_{i-1}\big{|}_{n}\}.

E_{i}(\mathbf{b}):=\{\big{|}b_{0}-b_{i}\big{|}_{n},\ldots,\big{|}b_{k}-b_{(k+i)_{n}}\big{|}_{n},\ldots,\big{|}b_{n-1}-b_{i-1}\big{|}_{n}\}.

E_{\frac{n}{2}}(\mathbf{b})=\{\big{|}b_{0}-b_{\frac{n}{2}}\big{|}_{n},\ldots,\big{|}b_{k}-b_{(k+\frac{n}{2})_{n}}\big{|}_{n},\ldots,\big{|}b_{\frac{n}{2}-1}-b_{n-1}\big{|}_{n}\}

E_{\frac{n}{2}}(\mathbf{b})=\{\big{|}b_{0}-b_{\frac{n}{2}}\big{|}_{n},\ldots,\big{|}b_{k}-b_{(k+\frac{n}{2})_{n}}\big{|}_{n},\ldots,\big{|}b_{\frac{n}{2}-1}-b_{n-1}\big{|}_{n}\}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Circular automata synchronize with high probability

Christoph Aistleitner Institute of Analysis and Number Theory, TUGraz, Austria. [email protected]

Daniele D’Angeli Università Niccolò Cusano, Via don Gnocchi Roma, Italia. [email protected]

Abraham Gutierrez Institute of Discrete Mathematics, TUGraz, Austria. {a.gutierrez, rosenmann}@math.tugraz.at

Emanuele Rodaro Department of Mathematics, Politecnico di Milano, Italia. [email protected]

Amnon Rosenmann*‡*

Abstract

In this paper we prove that a uniformly distributed random circular automaton $\mathcal{A}_{n}$ of order $n$ synchronizes with high probability (w.h.p.). More precisely, we prove that

[TABLE]

The main idea of the proof is to translate the synchronization problem into a problem concerning properties of a random matrix; these properties are then established with high probability by a careful analysis of the stochastic dependence structure among the random entries of the matrix. Additionally, we provide an upper bound for the probability of synchronization of circular automata in terms of chromatic polynomials of circulant graphs.

Keywords: Automata; Synchronization; Random Matrices; Circulant Graphs; Chromatic Polynomials.

1 Introduction

A complete deterministic finite automaton (DFA) is a tuple $\mathcal{A}=(Q,L)$ , where $Q:=\{q_{1},q_{2},\ldots,q_{n}\}$ is a finite set of states and $L:=\{\mathbf{a_{1}},\mathbf{a_{2}},\ldots,\mathbf{a_{k}}\}$ is a finite set of mappings $\mathbf{a_{i}}:Q\rightarrow Q$ , where $\mathbf{a}(q)=q^{\prime}$ is also written as $q\mathbf{a}=q^{\prime}$ , $q,q^{\prime}\in Q$ , $\mathbf{a}\in L$ . The number of states $n$ is the order of $\mathcal{A}$ . Each $\mathbf{a_{i}}$ is called a letter and a sequence $\mathbf{w}=\mathbf{a_{i_{1}}a_{i_{2}}}\ldots\mathbf{a_{i_{r}}}\in L^{*}$ is a word of length $r$ . The action of $L$ on $Q$ naturally extends to an action of $L^{*}$ on $Q$ , defined recursively by $q(\mathbf{aw})=(q\mathbf{a})\mathbf{w}$ , $q\in Q$ , $\mathbf{a}\in L$ , $\mathbf{w}\in L^{*}$ . This action further extends to an action of $L^{*}$ on subsets of $Q$ by $\{q_{i_{1}},q_{i_{2}},\ldots,q_{i_{k}}\}\mathbf{w}=\{q_{i_{1}}\mathbf{w},q_{i_{2}}\mathbf{w},\ldots,q_{i_{k}}\mathbf{w}\}$ . We say that the subset $S=\{q_{i_{1}},q_{i_{2}},\ldots,q_{i_{k}}\}\subseteq Q$ synchronizes if there exists a word $\mathbf{w}\in L^{*}$ such that $q_{i_{1}}\mathbf{w}=q_{i_{2}}\mathbf{w}=\ldots=q_{i_{k}}\mathbf{w}$ (equivalently, we say that $\mathbf{w}$ synchronizes $S$ ). If the set $Q$ synchronizes then we say that $\mathcal{A}(Q,L)$ synchronizes (or that it is a synchronizing automaton). A word $\mathbf{w}\in L^{*}$ that synchronizes $Q$ is called a synchronizing (or reset) word of $\mathcal{A}$ .

The following simple criterion for synchronization is well known and plays a crucial role throughout the paper:

Claim 1.

$\mathcal{A}=(Q,L)$ * synchronizes $\iff$ every pair of states $q,q^{\prime}\in Q$ synchronizes.*

Proof.

It is clear that if $Q$ synchronizes by a reset word $\mathbf{w}$ then $\mathbf{w}$ synchronizes every pair of states of $Q$ . Conversely, a reset word for $Q$ can be formed by concatenating words $w_{i}$ that synchronize pairs of states until we end up with a single state. ∎

The synchronization property may be described in terms of the graph representation of $\mathcal{A}$ . The set $Q$ of states comprises the vertices of the graph and for each pair of states $q,q^{\prime}$ and a letter $\mathbf{a}\in L$ such that $q\mathbf{a}=q^{\prime}$ there is an arrow $(q,q^{\prime})_{\mathbf{a}}$ labeled with $\mathbf{a}\in L$ and connecting $q$ to $q^{\prime}$ . Each $q\in Q$ and $w=\mathbf{a_{i_{1}}a_{i_{2}}}\ldots\mathbf{a_{i_{k}}}\in L^{*}$ defines a directed path

[TABLE]

that begins in $q$ and ends in $q^{\prime}=q\mathbf{w}$ . $\mathcal{A}$ then synchronizes if and only if there is a word $\mathbf{w}$ , such that the paths $\{\gamma(q,\mathbf{w}):q\in Q\}$ have a common endpoint $q^{\prime}$ , that is, the word $\mathbf{w}$ acts on $Q$ as the constant mapping.

Synchronizing automata have been intensely studied by theoretical computer scientists as well as pure mathematicians since the 1960’s; see [Volkov, 2008] for a detailed introduction on synchronization of automata. A driving force in this research field is the Černỳ conjecture.

*Conjecture 2** (The Černỳ conjecture).*

A synchronizing automaton $\mathcal{A}$ of order $n$ has a shortest synchronizing word of length at most $(n-1)^{2}$ .

The bound in the Černỳ conjecture is tight: in [Cerny, 1964] Černỳ provided a series of synchronizing circular automata $C_{2},C_{3},\ldots$ , such that $C_{n}$ has order $n$ and its shortest synchronizing word is of size exactly $(n-1)^{2}$ (see Fig. 1). Furthermore, the Černỳ series of circular automata $C_{2},C_{3},\ldots$ is the only known infinite series of automata whose shortest synchronizing words are of length $(n-1)^{2}$ [Ananichev et al., 2010].

The best known general upper bounds for the size of shortest synchronizing words of an automaton with $n$ states are of order $O(n^{3})$ [Pin, 1983][Szykuła, 2017][Shitov, 2019]. Nevertheless, there are many classes of automata for which the Černỳ conjecture has been established (see [Volkov, 2008] for examples).

In last decade probabilistic approaches to the synchronization problem have been developed. Typical questions in this setting are: let $\mathcal{A}(\{0,1,\ldots,n-1\},L)$ be a uniformly chosen DFA with $k$ letters on a certain probability space, is it true that with high probability the automaton $\mathcal{A}(\{0,1,\ldots,n-1\},L)$ is synchronizing? Does the Černỳ conjecture hold with high probability? Here we give a (non-comprehensive) list of recent achievements in this probabilistic setting:

$\bullet$

In [Skvortsov and Zaks, 2010] the authors study random automata $\mathcal{A}$ where the number of letters $k$ grow together with $n$ . In particular, they prove that $\mathcal{A}$ synchronizes w.h.p. when $k(n)$ grows fast enough; 2. $\bullet$

In [Berlinkov, 2016] the author proves that $\mathbb{P}\left[\mathcal{A}\text{ synchronizes}\right]=1-O(n^{-k/2})$ , for arbitrary $k\geq 2$ , and $\mathbb{P}\left[\mathcal{A}\text{ synchronizes}\right]=1-\Theta(1/n)$ for $k=2$ ; 3. $\bullet$

In [Nicaud, 2019] the author proves that $\mathcal{A}$ admits w.h.p. a synchronizing word of length $O(n\log^{3}n)$ for arbitrary $k\geq 2;$ 4. $\bullet$

In [Berlinkov and Nicaud, 2018] the authors prove that if $\mathcal{A}$ is uniformly chosen among the strongly-connected almost-group automata then $\mathcal{A}$ synchronizes with probability $1-\Theta((2^{k-1}-1)n^{-2(k-1)})$ for arbitrary $k\geq 2$ .

Since the sequence of circular automata $C_{n}$ depicted in Fig. 1 is the only known infinite series of synchronizing automata reaching Černỳ ’s bound $(n-1)^{2}$ , one might suspect that the class of circular automata is somehow difficult to synchronize. However, as we show in the present paper, it turns out that a random circular automaton is synchronizing with high probability.

The rest of the paper is organized as follows: in Section 2 we present the main result together with its proof and the statement of the two key lemmas for the proof. In Section 3 we study the dependence structure among the entries of the random matrix used in the proof of the main result; the result obtained in this section is crucial for the proof of the key lemmas. In Section 4 we prove the first lemma while in Section 5 we prove the second one. In Section 6 we present some interesting connections between synchronization of circular automata and chromatic polynomials of circulant graphs. Finally, in Section 7 we present some possible directions towards generalizing and improving the results presented in this paper.

2 Main result

Let $n$ be a positive integer. An automaton $\mathcal{A}(\mathbb{Z}_{n},L)$ , where $\mathbb{Z}_{n}:=\{0,1,\ldots,n-1\}$ is the set of states, is called a circular automaton if $L$ contains a permutation that decomposes in exactly one cycle. Let $(i)_{n}:=i\mod n$ . Let $\mathcal{M}_{n}$ denote the set of all mappings from $\mathbb{Z}_{n}$ to itself, and let $\mathbb{P}$ denote the uniform probability measure on $\mathcal{M}_{n}$ . We will write the elements of $\mathcal{M}_{n}$ as vectors by identifying the mapping $\mathbf{b}(i)=b_{i},\leavevmode\nobreak\ i=0,\dots,n-1$ with the vector $\mathbf{b}=(b_{0},\dots,b_{n-1})$ .

In what follows, we denote by $\mathcal{A}_{n}(\mathbf{b}):=(\mathbb{Z}_{n},\{\mathbf{a},\mathbf{b}\})$ a circular automaton of order $n\in\mathbb{N}$ , with $\mathbf{a}:\mathbb{Z}_{n}\rightarrow\mathbb{Z}_{n}$ being the circular right shift permutation $a(i)=(i+1)_{n}$ and $\mathbf{b}:=(b_{0},...,b_{n-1})$ being an element of $\mathcal{M}_{n}$ . We will understand that $\mathbf{b}$ is “randomly” chosen from $\mathcal{M}_{n}$ according to the uniform probability measure $\mathbb{P}$ , making $\mathcal{A}_{n}(\mathbf{b})$ a random circular automaton.

It follows from work of Perrin [Perrin, 1977] that a circular automaton $\mathcal{A}(Q,L)$ of prime order synchronizes if and only if $L$ contains a non-permutation. Pin [Pin, 1978] proved with combinatorial methods that a circular automaton $\mathcal{A}(Q,L)$ of prime order which has a letter of rank $\frac{n-1}{2}\leq k\leq n$ has a minimal word of size at most $(n-k)^{2}$ . For the probability of synchronization of $\mathcal{A}_{p}(\mathbf{b})$ a very precise result is known.

Theorem 3 ([Perrin, 1977][Pin, 1978]).

Let $p$ be a prime number. Then

[TABLE]

Thus, a uniformly distributed random circular automaton of prime order $p$ with $k\geq 2$ letters synchronizes with high probability (w.h.p.).

Theorem 3 is not explicitly stated in [Perrin, 1977], but it is observed in [Pin, 1978] that Perrin’s work implies the theorem.

It is known that the Černỳ conjecture holds true for the class of circular automata [Dubuc, 1998]. In a closely related work, Béal, Berlinkov and Perrin [Béal et al., 2011] gave an $O\left(n^{2}\right)$ upper bound for the shortest words of synchronizing automata with a single cluster.

A natural question arises: do random circular automata of order $n$ (not necessarily prime) synchronize with high probability? We give a positive answer to this question in the following:

Theorem 4 (Main result).

The following holds:

[TABLE]

as $n\to\infty$ . Thus, a randomly chosen $\mathcal{A}_{n}(\mathbf{b})$ synchronizes w.h.p. as $n\to\infty$ .

*Remark 5**.*

Theorem 4 does not follow from the results of Berlinkov or Nicaud. In their models, they use a random automaton $\mathcal{A}(Q,L)$ of order $n$ where $L$ is a collection of $k$ mappings from $Q$ to $Q$ i.i.d. uniformly chosen. For a fixed $k$ , the probability of randomly chosen $k$ mappings to contain a permutation with exactly one cycle is bounded from above by $k\cdot\frac{n!}{n^{n}}\xrightarrow{n\rightarrow\infty}0$ .

Given $n\in\mathbb{N}$ and $r\in\mathbb{Z}$ , we define the $n$ -cyclic absolute value of $r$ to be

[TABLE]

When $r,s\in\mathbb{Z}$ then $\big{|}r-s\big{|}_{n}$ is the $n$ -cyclic distance between $r$ and $s$ . When the numbers $0,1,\ldots,n-1$ are identified with the vertices of a cycle of length $n$ , the $n$ -cyclic distance between two such numbers is the length of the shortest path between them in the cycle. We now introduce the main tool for the proof of the main theorem.

*Definition**.*

Let $\mathcal{A}_{n}(\mathbf{b}):=(\mathbb{Z}_{n},\{\mathbf{a},\mathbf{b}\})$ be a circular automaton with $\mathbf{b}=(b_{0},b_{1},\ldots,b_{n-1})$ . Then we define $T_{\mathbf{b}}$ to be the matrix

[TABLE]

shortly written as

[TABLE]

As before, $b_{i}=\mathbf{b}(i)$ , i.e., the image of state $i$ under $\mathbf{b}$ . To be clear, note that the first row of $T_{\mathbf{b}}$ is formed of the cyclic distances of the images of states $r,s$ such that $\big{|}r-s\big{|}_{n}=1$ ; in general, the $i$ -th row of $T_{\mathbf{b}}$ is formed of the cyclic distances of the images of pairs of states $r,s$ of cyclic distance $i$ . Notice that the columns are counted from [math] to $n-1$ .

For $\mathbf{b}\in\mathcal{M}_{n}$ and $i=1,\ldots,\left\lfloor\frac{n}{2}\right\rfloor$ , let $R_{i}(\mathbf{b})$ denote the number of different entries in row $i$ of $T_{\mathbf{b}}$ :

[TABLE]

Set

[TABLE]

i.e., $\mathcal{E}_{\mathrm{\small row}}(\alpha)$ contains those $\mathbf{b}$ for which every row of $T_{\mathbf{b}}$ has at least $\alpha\left\lfloor\frac{n}{2}\right\rfloor$ different elements. Its complement is

[TABLE]

We also define

[TABLE]

and its complement

[TABLE]

where

[TABLE]

and

[TABLE]

That is, $\mathcal{E}_{\mathrm{\small zero}}(\beta)$ is the set of those $\mathbf{b}$ for which the matrix $T_{\mathbf{b}}$ has at least $\beta\left\lfloor\frac{n}{2}\right\rfloor$ rows containing the entry zero.

The proof of Theorem 4 relies on the following two lemmas.

Lemma 6.

*Let $\varepsilon>0$ and let $\alpha=1-e^{-1}-\varepsilon$ . Then *

[TABLE]

as $n\to\infty$ .

Lemma 7.

*Let $\varepsilon\in(0,1)$ and let $\beta=\frac{1}{2}-\varepsilon$ . Then *

[TABLE]

as $n\to\infty$ .

Proof of Theorem 4.

The main idea of the proof is to transform the question of synchronization of $\mathcal{A}_{n}(\mathbf{b})$ into a question concerning properties of the matrix $T_{\mathbf{b}}$ . The functions $T_{\mathbf{b}}(i,j)$ are random variables over $\mathcal{M}_{n}$ , and to obtain our desired probability estimates we will need to understand the joint stochastic dependence structure of these random variables.

Let $\mathbf{b}\in\mathcal{M}_{n}$ and consider the associated Matrix $T_{\mathbf{b}}$ . The first observation is that a zero in row $i$ of $T_{\mathbf{b}}$ means that two states $r,s$ with cyclic distance $i$ synchronize under $\mathbf{b}$ (i.e., ${\mathbf{b}}(r)={\mathbf{b}}(s)$ ), which implies that any pair $r^{\prime},s^{\prime}$ with cyclic distance $i$ can be synchronized with a word of the form $\mathbf{a}^{l}\mathbf{b}$ because $\{r^{\prime},s^{\prime}\}\mathbf{a}^{l}=\{r,s\}$ for some $l$ . The second observation is that if the $i$ -th row of $T_{\mathbf{b}}$ contains a number $j=|b_{k}-b_{(k+i)_{n}}|_{n}$ and the $j$ -th row contains a zero, then every pair of states $(r,s)$ with cyclic distance $i$ can be synchronized with a word of the form $\mathbf{a}^{l_{1}}\mathbf{b}\mathbf{a}^{l_{2}}\mathbf{b}$ . Indeed, we can proceed as follows: $\{r,s\}\stackrel{{\scriptstyle\mathbf{a}^{l_{1}}}}{{\rightarrow}}\{k,(k+i)_{n}\}\stackrel{{\scriptstyle\mathbf{b}}}{{\rightarrow}}\{b_{k},b_{(k+i)_{n}}\}$ , where this last pair has n-cyclic distance $j$ ; then $\{b_{k},b_{(k+i)_{n}}\}$ synchronizes with a word of the form $\mathbf{a}^{l_{2}}\mathbf{b}$ , for some $l_{2}$ because the $j$ -th row contains a zero. With these two observations, we establish sufficient conditions on $T_{\mathbf{b}}$ for the synchronization of $\mathcal{A}_{n}(\mathbf{b})$ . The sets $\mathcal{E}_{\mathrm{\small row}}(\alpha)$ and $\mathcal{E}_{\mathrm{\small zero}}(\beta)$ which we defined in (3) and (5) play a crucial role.

Let $\mathbf{b}\in\mathcal{M}_{n}$ . If $\mathbf{b}$ is contained in both $\mathcal{E}_{\mathrm{\small row}}(\alpha)$ and $\mathcal{E}_{\mathrm{\small zero}}(\beta)$ for some $\alpha,\beta>0$ such that $\alpha+\beta>1$ , then $\mathcal{A}_{n}(\mathbf{b})$ synchronizes. This follows from the two previous observations together with the union bound. Indeed, let $(r,s)$ be any pair of different states and let $i=|r-s|_{n}$ . If row $i$ contains a zero, we can synchronize $\{r,s\}$ with a word of the form $\mathbf{a}^{l}\mathbf{b}$ ; otherwise, row $i$ contains an entry $j\neq 0$ such that row $j$ contains a zero (because $\alpha+\beta>1$ ), which implies that $\{r,s\}$ can be synchronized with a word of the form $\mathbf{a}^{l_{1}}\mathbf{b}\mathbf{a}^{l_{2}}\mathbf{b}$ . Therefore, every pair of different states synchronizes and $\mathcal{A}_{n}(\mathbf{b})$ synchronizes by Claim 1. Therefore, for any $\alpha,\beta>0$ satisfying $\alpha+\beta>1$ , we have the following bound:

[TABLE]

Now, by the last inequality and by Lemmas 6 and 7 we obtain the bound stated in the main theorem. We can choose, for example, $\varepsilon^{\prime}=0.05$ , $\alpha^{\star}=1-e^{-1}-\varepsilon^{\prime}\approx 0.582$ and $\beta^{\star}=0.5-\varepsilon^{\prime}=0.45$ , so that $\alpha^{\star}>0$ , $\beta^{\star}>0$ and $\alpha^{\star}+\beta^{\star}>1$ . Then we have

[TABLE]

as $n\to\infty$ . ∎

3 Independence among the random variables $T_{\mathbf{b}}(i,j)$

For every pair $(i,j)$ , $1\leq i\leq\left\lfloor\frac{n}{2}\right\rfloor$ and $0\leq j\leq n-1$ , the function $T_{\mathbf{b}}(i,j):\leavevmode\nobreak\ \mathcal{M}_{n}\mapsto\mathbb{Z}_{n}$ is a random variable on the space $\mathcal{M}_{n}$ , equipped with the uniform probability measure $\mathbb{P}$ (and with the power set of $\mathcal{M}_{n}$ as the natural sigma-field). It is crucial for our proof to give a criterion on pairs of indices $(i_{1},j_{1}),\dots,(i_{k},j_{k})$ which guarantees that the random variables $T_{\mathbf{b}}(i_{1},j_{1})$ , …, $T_{\mathbf{b}}(i_{k},j_{k})$ are independent. First, notice that not every subset of random variables $T_{\mathbf{b}}(i,j)$ is independent. For example,

[TABLE]

are clearly dependent: if the first two random variables $T_{\mathbf{b}}(1,0)$ and $T_{\mathbf{b}}(1,1)$ are zero, then $b_{0}=b_{1}=b_{2}$ , which implies that $\big{|}b_{0}-b_{2}\big{|}_{n}=0$ and so $T_{\mathbf{b}}(2,0)$ necessarily is also zero. This dependence comes from the fact that there is a “cycle” of the form $b_{0}\to b_{1}\to b_{2}\to b_{0}$ generated by the indices of these three random variables. Generally, it will turn out that a set of random variables $T_{\mathbf{b}}(i,j)$ is independent if and only if the corresponding indices are “acyclic”. We formalize this in the following

*Definition**.*

Let

[TABLE]

be a multi-set, where $i_{l},j_{l}\in\mathbb{Z}_{n}$ . The associated (multi-)graph $G(S)$ is the (multi-)graph with vertex set $\mathbb{Z}_{n}$ and edge (multi-)set

[TABLE]

We say that $S$ is acyclic if its associated multi-graph $G(S)$ is acyclic. We also say that the edge $\{j,j+i\}$ is associated to the random variable $T_{\mathbf{b}}(i,j)$ .

The relation between acyclic index sets and independent variables is stated in the following

Proposition 8.

The variables $T_{\mathbf{b}}(i_{1},j_{1}),T_{\mathbf{b}}(i_{2},j_{2}),\ldots,T_{\mathbf{b}}(i_{k},j_{k})$ are i.i.d. $\iff$ the (multi-)set $S=\{(i_{1},j_{1}),(i_{2},j_{2}),\ldots,(i_{k},j_{k})\}$ is acyclic. Furthermore, if the variables are independent then

[TABLE]

where $s_{1},s_{2},\ldots,s_{k}$ are arbitrary integers and

[TABLE]

Henceforth in the paper we use the concepts “acyclic” and “independent” interchangeably when we refer to a multi-set of independent random variable entries of $T_{\mathbf{b}}$ , resp. to random variable entries whose associated multi-graph is acyclic.

*Remark 9**.*

Note that different random variables $T_{\mathbf{b}}(i,j),\,T_{\mathbf{b}}(i^{\prime},j^{\prime})$ may be associated with the same edge; since $1\leq i\leq\left\lfloor\frac{n}{2}\right\rfloor$ this only happens when $n$ is even and $i=i^{\prime}=\frac{n}{2}$ and $j\equiv j^{\prime}\mod\frac{n}{2}.$ Thus, for $n$ odd, a pair of different random variables $T_{\mathbf{b}}(i,j),\,T_{\mathbf{b}}(i^{\prime},j^{\prime})$ is always acyclic/independent.

*Remark 10**.*

For a vector $\mathbf{b}\in\mathcal{M}_{n}$ , we can write its entries $b_{0},\dots,b_{n-1}$ as functions of $\mathbf{b}$ . In other words, $b_{0}=b_{0}(\mathbf{b}),\dots,b_{n-1}=b_{n-1}(\mathbf{b})$ are random variables on $\mathcal{M}_{n}$ , equipped with the uniform measure $\mathbb{P}$ . The random variables $b_{0},\dots,b_{n-1}$ are independent and identically distributed over this space; this follows immediately from the fact that the uniform measure on $\mathcal{M}_{n}$ is a product of $n$ one-dimensional uniform measures.

Proof of Proposition 8.

First note that any two random variables $T_{\mathbf{b}}(i,j)=\big{|}b_{j}-b_{{(j+i)}_{n}}\big{|}_{n}$ and $T_{\mathbf{b}}(i^{\prime},j^{\prime})=\big{|}b_{j^{\prime}}-b_{{(j^{\prime}+i^{\prime})}_{n}}\big{|}_{n}$ are always identically distributed since $b_{0},b_{1},\ldots,b_{n-1}$ are i.i.d. (see Remark 10). Note also that for all $s$

[TABLE]

which can seen by an easy counting argument: there are $n$ different possible choices of $b_{p}$ , and then there are $m_{s}$ independent different choices of $b_{(p+q)_{n}}$ such that $\big{|}b_{p}-b_{p+q}\big{|}_{n}=s$ . Thus equation (8) is just a rephrasing of the fact that the random variables are independent. Therefore, what we need to prove is that independence holds if and only if the associated (multi-)graph is acyclic.

$\Rightarrow)$ (by contraposition) Let $S=\{(i_{1},j_{1}),\,(i_{2},j_{2}),\ldots,\,(i_{k},j_{k})\}$ be a (multi-)set which contains a cycle. Thus, its associated multi-graph $G(S)$ has a cycle $C$ of length $l\geq 2$ . Let this cycle be w.l.o.g.

[TABLE]

Recall that $T_{\mathbf{b}}(i,j)=0\iff b_{j}=b_{(j+i)_{n}}$ . Thus if for some $\mathbf{b}\in\mathcal{M}_{n}$ we have

[TABLE]

then $b_{j_{1}}=b_{j_{2}}=\ldots=b_{j_{l}}$ , and so we automatically also have $T_{\mathbf{b}}(i_{l},j_{l})=\big{|}b_{j_{l}}-b_{(j_{l}+i_{l})_{n}}\big{|}_{n}=\big{|}b_{j_{l}}-b_{j_{1}}\big{|}_{n}=0$ . Thus, the variables $T_{\mathbf{b}}(i_{1},j_{1}),\dots,T_{\mathbf{b}}(i_{\ell},j_{\ell})$ are not independent. We conclude that an independent multi-set must be acyclic.

$\Leftarrow)$ (by induction on $k$ ) Let $k\geq 2$ . Assume that the multi-set $S_{k}=\{(i_{1},j_{1}),\,(i_{2},j_{2}),\ldots,\,(i_{k},j_{k})\}$ is acyclic. We want to show that $T_{\mathbf{b}}(i_{k},j_{k})$ is independent of $T_{\mathbf{b}}(i_{1},j_{1}),\dots,T_{\mathbf{b}}(i_{k-1},j_{k-1})$ . This will allow us to factor out the $k$ -th factor on the left-hand side of (8), leading (by induction) to the formula on the right-hand side of (8), which is equivalent to independence.

We distinguish two cases: The first case is when the edge $\{j_{k},(j_{k}+i_{k})_{n}\}$ is a connected component by itself in $G(S)$ . This means that the sets $S_{1}:=\{j_{1},(j_{1}+i_{1})_{n},j_{2},(j_{2}+i_{2})_{n},\dots,j_{k},(j_{k-1}+i_{k-1})_{n}\}$ and $S_{2}:=\{j_{k},(j_{k}+i_{k})_{n}\}$ are disjoint. By construction, the random variables $T_{\mathbf{b}}(i_{1},j_{1}),\dots,T_{\mathbf{b}}(i_{k-1},j_{k-1})$ depend only on $b_{s}$ with $s\in S_{1}$ , while $T_{\mathbf{b}}(i_{k},j_{k})$ depends only on $b_{s}$ with $s\in S_{2}$ . Since $b_{0},\dots,b_{n-1}$ are independent by Remark 10, this implies that $T_{\mathbf{b}}(i_{k},j_{k})$ is independent of $T_{\mathbf{b}}(i_{1},j_{1}),\dots,T_{\mathbf{b}}(i_{k-1},j_{k-1})$ , as desired.

For the second case, the edge $\{j_{k},(j_{k}+i_{k})_{n}\}$ is not a connected component by itself in $G(S)$ . Since it is also not part of a cycle by assumption,we can assume that $(j_{k}+i_{k})_{n}$ is a leaf vertex in $G(S)$ . In principle, $T_{\mathbf{b}}(i_{k},j_{k})$ depends on $b_{j_{k}}$ as well as on $b_{(j_{k}+i_{k})_{n}}$ . However, since $T_{\mathbf{b}}(i_{k},j_{k})$ is defined as a cyclic distance, the conditional distribution of $T_{\mathbf{b}}(i_{k},j_{k})$ given $b_{j_{k}}$ is always the same. In formulas, for every $s_{k}$ we have

[TABLE]

for every $r\in\{0,\dots,n-1\}$ . This fact can be simply established by counting the possible configurations of $b_{j_{k}}$ and $b_{(j_{k}+i_{k})_{n}}$ . By definition, $T_{\mathbf{b}}(i_{k},j_{k})$ is independent of all $b_{\ell}$ with $\ell\neq j_{k},(j_{k}+i_{k})_{n}$ . Thus for every numbers $s_{1},\dots,s_{k}$ we have, using the independence of $b_{0},\dots,b_{n-1}$ and (9), that

[TABLE]

This is exactly the independence property that we wanted to establish. ∎

4 Proof of Lemma 6

The overview of the proof is as follows. Recall that we understand the entries of the matrix $T_{\mathbf{b}}$ as random variables. We will prove that every row of $T_{\mathbf{b}}$ contains a “large” number of independent random variables. Then we give a lower bound for the expected value of the number of different elements in each row. Then we apply McDiarmid’s inequality to each row and finally we use the union bound together with the exponential decay delivered by McDiarmid’s inequality to guarantee that w.h.p. every row of $T_{\mathbf{b}}$ has at least $\sim(1-e^{-1})\left\lfloor\frac{n}{2}\right\rfloor$ different elements. We denote by $C_{n}(i)$ the circulant graph on $n$ vertices, i.e., the graph with vertex set $\mathbb{Z}_{n}$ where two vertices $r,s$ are adjacent if and only if $\big{|}r-s\big{|}_{n}=i.$

We need the following property.

Claim 11.

For every $i$ , the $i$ -th row of $T_{\mathbf{b}}$ contains a set of at least $n-\gcd(n,i)$ random variables which are i.i.d.

Proof.

The variables in row $i$ are given by the multi-set

[TABLE]

Let $i\neq\frac{n}{2}$ . By Remark 9, the corresponding multi-set $E_{i}(\mathbf{b})$ does not have repeated elements and the associated multi-graph $G(E_{i}(\mathbf{b}))$ is isomorphic to the circulant graph $C_{n}(i)$ . It is well known and easy to show that $C_{n}(i)$ is a disjoint union of $\gcd(n,i)$ cycles of length $\frac{n}{\gcd(n,i)}$ [Boesch and Tindell, 1984]. We can then obtain an acyclic set of variables by removing one edge from each of the cycles of $G(S_{i})$ . The resulting set of variables is i.i.d. by Proposition 8. In the case $i=\frac{n}{2}$ , the first $\frac{n}{2}$ variables in row $\frac{n}{2}$

[TABLE]

have an associated multi-graph that is isomorphic to the circulant graph $C_{n}(\frac{n}{2})$ , which is a disjoint union of $\frac{n}{2}=\gcd(n,\frac{n}{2})$ edges. This last graph is acyclic, thus the variables are i.i.d. by Proposition 8. ∎

We prove the following lower bound

Claim 12.

We have $\mathbb{E}\left[R_{i}\right]\geq\left\lfloor\frac{n}{2}\right\rfloor(1-e^{-1})-1$ , where for all $\mathbf{b}\in\mathcal{M}_{n}$

[TABLE]

(see (2)) is the cardinality of different elements in row $i$ of $T_{\mathbf{b}}$ .

Proof.

First, for every $d\in\{0,\dots,\left\lfloor\frac{n}{2}\right\rfloor\}$ , we define the random variables

[TABLE]

and

[TABLE]

Note that $r_{d}^{(i)}(\mathbf{b})$ is zero if the number $d$ is included in the $i$ -th row of $T_{\mathbf{b}}$ , and that it is one otherwise. Recalling that the entries of $T_{\mathbf{b}}$ can only have values in $\{0,1,\ldots,\left\lfloor\frac{n}{2}\right\rfloor\}$ , we write the number of distinct elements in row $i$ as

[TABLE]

By Claim 11, there is a subset $I$ of $\mathbb{Z}_{n}$ of cardinality $n-\gcd(n,i)$ such that the variables $\{\delta_{w}^{(i)}:w\in I\}$ are i.i.d., and thus

[TABLE]

Furthermore, by Proposition 8, we have $\mathbb{E}\left[\delta_{0}^{(i)}(\mathbf{b},d)\right]=1-\frac{m_{d}}{n},$ and thus

[TABLE]

for $d\in\{0,1,\ldots,\left\lfloor\frac{n}{2}\right\rfloor\}$ . Using the inequality $1-x\leq e^{-x}$ , which is valid for any real number $x$ , we obtain

[TABLE]

Plugging this inequality into (11) yields

[TABLE]

This proves Claim 12. ∎

We introduce McDiarmid’s inequality to prove Claim 14.

*Definition**.*

Let $L:\left(\mathbb{Z}_{n}\right)^{n}\rightarrow\mathbb{R}$ be a function. We say that $L$ has Lipschitz coefficient $r\in\mathbb{R}^{+}$ if

[TABLE]

for every $\overrightarrow{v},\overrightarrow{w}\in\left(\mathbb{Z}_{n}\right)^{n}$ such that $\overrightarrow{v}(j)=\overrightarrow{w}(j)$ for all $j$ except for at most one index.

Proposition 13 (McDiarmid’s Inequality [McDiarmid, 1989]).

Let $\bar{X}:=(X_{1},X_{2},\ldots,X_{n})\in\left(\mathbb{Z}_{n}\right)^{n}$ be a random vector, where the variables $X_{1},X_{2},\ldots,X_{n}$ are independent, and let $L:\left(\mathbb{Z}_{n}\right)^{n}\rightarrow\mathbb{R}$ be a function with bounded Lipschitz coefficient $r$ . Then

[TABLE]

for all $\lambda\geq 0$ .

*Remark**.*

This is just a special case of the general form of McDiarmid’s inequality. The general inequality also bounds the upper tail, and allows different Lipschitz coefficients in the respective components.

In the following claim we use Proposition 13 to estimate the probability that row $i$ of $T_{\mathbf{b}}$ has less than $\sim(1-e^{-1})\left\lfloor\frac{n}{2}\right\rfloor$ different elements.

Claim 14.

Let $\varepsilon>0$ . Then

[TABLE]

for $i=1,2,\ldots,\left\lfloor\frac{n}{2}\right\rfloor$ .

Proof.

Let $\mathbf{b}=(b_{0},b_{1},\ldots,b_{n-1})$ . Let $E_{i}(\mathbf{b})$ be defined as in (10). The function $R_{i}(\mathbf{b}):=\#E_{i}(\mathbf{b})$ has Lipschitz coefficient 2: changing one $b_{j}$ affects at most two entries, namely $\big{|}b_{j}-b_{(j+i)_{n}}\big{|}_{n}$ and $\big{|}b_{(j-i)_{n}}-b_{j}\big{|}_{n}$ . Using McDiarmid’s inequality, we deduce that

[TABLE]

Using the lower bound $\mathbb{E}\left[R_{i}\right]\geq\left\lfloor\frac{n}{2}\right\rfloor(1-e^{-1})-1$ of Claim 12 we obtain

[TABLE]

Let $\varepsilon>0$ and let

[TABLE]

we observe that $\lambda_{\varepsilon}(n)$ is independent of $i.$ Let $n>\frac{2}{\varepsilon}$ , then plugging $\lambda=\lambda_{\varepsilon}(n)$ into the previous inequality yields

[TABLE]

∎

Recall that $\mathcal{E}_{\mathrm{\small row}}(\alpha)$ contains those $\mathbf{b}\in\mathcal{M}_{n}$ for which every row of $T_{\mathbf{b}}$ has at least $\alpha\left\lfloor\frac{n}{2}\right\rfloor$ different elements, so that

[TABLE]

Let $\varepsilon>0$ be arbitrary and let $\alpha^{*}=1-e^{-1}-\varepsilon$ . Then

[TABLE]

where we use Claim 14 for the second inequality. The proof of Lemma 6 then follows by noticing that

[TABLE]

5 Proof of Lemma 7

The overview of the proof is as follows. We will define two random variables $\mathcal{Z}_{0}(\mathbf{b})$ and $\mathcal{Z}_{1}(\mathbf{b})$ such that

[TABLE]

Then we will show that $\mathcal{Z}_{0}$ and $\mathcal{Z}_{1}$ concentrate around their respective means, and use this fact to give an upper bound on the probability that $D$ is small. For this purpose, we note the following property.

Claim 15.

*Let $\mathcal{Z}_{0},\mathcal{Z}_{1}$ and $D$ be random variables which take non-negative values, such that $D\geq\mathcal{Z}_{0}-\mathcal{Z}_{1}$ . Let $\nu>0$ and let $\delta\leq\mathbb{E}\left[\mathcal{Z}_{0}-\mathcal{Z}_{1}\right]-2\nu$ . Then *

[TABLE]

Proof.

This follows easily from the assumption that $\mathcal{Z}_{0}-\mathcal{Z}_{1}\leq D$ and the union bound. ∎

To prove concentration of $\mathcal{Z}_{0}$ and $\mathcal{Z}_{1}$ around their respective means, we use Chebyschev’s inequality. Notice that $D:\mathbb{Z}_{n}^{n}\rightarrow\mathbb{Z}_{n}$ does not have a bounded Lipschitz coefficient, so we cannot use McDiarmid’s inequality to guarantee its concentration.

5.1 Lower bound for $D(b)$

Recall that $D(\mathbf{b})$ counts the number of rows of $T_{\mathbf{b}}$ that contain at least one zero. Let

[TABLE]

and

[TABLE]

Then

[TABLE]

It is easy to verify that the number of non-ordered pairs of entries in the $i$ -th row with zero value is

[TABLE]

therefore

[TABLE]

From this and (15), we conclude that

Claim 16.

$D(\mathbf{b})\geq\mathcal{Z}_{0}(\mathbf{b})-\mathcal{Z}_{1}(\mathbf{b}),\quad\forall\,\mathbf{b}:\mathbb{Z}_{n}\rightarrow\mathbb{Z}_{n}.$ **

5.2 Estimates for $\mathbb{E}\left[\mathcal{Z}_{0}\right]$ , $\mathbb{E}\left[\mathcal{Z}_{1}\right]$ , $\mathbb{E}\left[\mathcal{Z}_{0}-\mathcal{Z}_{1}\right]$ , $\mathbb{V}\left[\mathcal{Z}_{0}\right]$ , $\mathbb{V}\left[\mathcal{Z}_{1}\right]$

In this subsection we prove that

•

$\mathbb{E}\left[\mathcal{Z}_{0}-\mathcal{Z}_{1}\right]\sim\frac{n}{2},$

•

$\mathbb{E}\left[\mathcal{Z}_{0}\right]=\Theta(n),$

•

$\mathbb{E}\left[\mathcal{Z}_{1}\right]=\Theta(n),$

•

$\mathbb{V}\left[\mathcal{Z}_{0}\right]=O(n),$ and

•

$\mathbb{V}\left[\mathcal{Z}_{1}\right]=O(n)$ .

For the rest of this subsection, we use the notation

[TABLE]

for $1\leq i\leq\left\lfloor\frac{n}{2}\right\rfloor$ and $0\leq j\leq n-1.$

*Definition**.*

The variables $y_{i_{1},j_{1}},y_{i_{2},j_{2}}\ldots,y_{i_{k},j_{k}}$ are called acyclic if the multi-set $\bigcup_{w=1}^{k}\{(i_{w},j_{w})\}$ is acyclic. Let

[TABLE]

be the associated multi-graph of the multi-set $\{y_{i_{1},j_{1}},y_{i_{2},j_{2}}\ldots,y_{i_{k},j_{k}}\}$ and let $e(y_{i,j}):=\{j,(j+i)_{n}\}$ be the associated edge to $y_{i,j}$ . The length of $e(y_{i,j})$ is $\big{|}j-(j+i)_{n}\big{|}_{n}=i.$

*Remark 17**.*

If the variables $y_{i_{1},j_{1}},y_{i_{2},j_{2}}\ldots,y_{i_{k},j_{k}}$ are acyclic then they are i.i.d.; this is an immediate consequence of Proposition 8.

We begin with the easy part: the bounds for the expected values.

Claim 18.

Let $n\in\mathbb{N}$ . We have $\mathbb{E}\left[\mathcal{Z}_{0}\right]=\Theta(n)$ , $\mathbb{E}\left[\mathcal{Z}_{1}\right]=\Theta(n)$ , and $\mathbb{E}\left[\mathcal{Z}_{0}-\mathcal{Z}_{1}\right]\geq\frac{1}{2}\left\lfloor\frac{n}{2}\right\rfloor-1.$

Proof.

Using the linearity of the expectation, we get that

[TABLE]

where for the second equality we use that

[TABLE]

Now we calculate an upper bound for $\mathbb{E}\left[\mathcal{Z}_{1}\right]$ , depending on the parity of $n$ .

Case 1: $n$ odd. Every product $y_{i,j}y_{i,j^{\prime}}$ in the sum

[TABLE]

is formed of independent random variables $y_{i,j}$ , $y_{i,j^{\prime}}$ by Remarks 9,17. Thus

[TABLE]

Case 2: $n$ even. Using Remark 9, we write $\mathcal{Z}_{1}$ as

[TABLE]

Every product $y_{i,j}y_{i,j^{\prime}}$ in the first sum is formed of independent variables $y_{i,j}$ , $y_{i,j^{\prime}}$ by Remark 9 and the same is valid for the products $y_{\frac{n}{2},r}y_{\frac{n}{2},r^{\prime}}$ in the second sum, therefore

[TABLE]

We deduce from the previous cases that $\mathbb{E}\left[\mathcal{Z}_{1}\right]=\Theta(n)$ and $\mathbb{E}\left[\mathcal{Z}_{1}\right]\leq\frac{1}{2}\left\lfloor\frac{n}{2}\right\rfloor+1$ for all $n$ . Using this last inequality and (16), we conclude that

[TABLE]

This concludes the proof of Claim 18. ∎

Now we estimate the variance of $\mathcal{Z}_{0}$ and $\mathcal{Z}_{1}$ .

Claim 19.

Let $n\in\mathbb{N}$ , then $\mathbb{V}\left[\mathcal{Z}_{0}\right]=O(n)$ and $\mathbb{V}\left[\mathcal{Z}_{1}\right]=O(n)$ .

Proof.

Here we also divide the calculations according to the parity of $n$ .

Case 1: $n$ odd. We expand the variance of $\mathcal{Z}_{0}$ to get that

[TABLE]

where the covariances are calculated among pairs of independent variables $y_{i,j},y_{i^{\prime},j^{\prime}}$ due to Remark 9. Thus

[TABLE]

We notice that $y_{i,j}^{2}=y_{i,j}$ because $y_{i,j}\in\{0,1\}$ , therefore

[TABLE]

where we use (17) in the last equality. Then, for all $n$ odd, we get that

[TABLE]

Now we calculate

[TABLE]

We first note that

[TABLE]

this follows since the variables $y_{i,j}$ and $y_{i,j^{\prime}}$ are different and therefore independent (see Remark 9). Thus

[TABLE]

For the sum of the covariances, we proceed as follows: if the variables $y_{i,j},y_{i,j^{\prime}},y_{r,s},y_{r,s^{\prime}}$ are acyclic then they are independent (see Proposition 8), therefore

[TABLE]

On the other hand, if the variables $y_{i,j},y_{i,j^{\prime}},y_{r,s},y_{r,s^{\prime}}$ are not acyclic, let

[TABLE]

and let

[TABLE]

Then $G(Y)$ is a multi-graph with four edges $e(y_{i,j}),e(y_{i,j^{\prime}}),e(y_{r,s}),e(y_{r,s^{\prime}})$ such that $e(y_{i,j})\neq e(y_{i,j^{\prime}})$ and $e(y_{r,s})\neq e(y_{r,s^{\prime}})$ (see Remark 9). In particular, there cannot be 3 equal edges. If $G(Y)$ has at least one cycle, it is isomorphic to one of the multi-graphs in Figure 2 below.

We will now estimate the contribution of each of these possible non-acyclic multi-graphs.

Claim 20.

Let $n\in\mathbb{N}$ , then

[TABLE]

Proof.

The cases $c=1,2,5,6,7$ can be bounded by the trivial bound $O(n^{4})$ , and the same for the cases $c=4,8$ with the bound $O(n^{3})$ . The remaining cases $c=3,9,10,11,12$ require better estimates than their respective trivial bounds.

First, notice that for all cases, the four edges of the multi-graph $G(\{y_{i,j},y_{i,j^{\prime}},y_{r,s},y_{r,s^{\prime}}\})$ are divided into two pairs: $e(y_{i,j}),e(y_{i,j^{\prime}})$ of length $i$ and $e(y_{r,s}),e(y_{r,s^{\prime}})$ of length $r$ . The case $G_{3}$ is bounded by ${n\choose 3}*2n=O(n^{4})$ because three vertices can be chosen freely to form a triangle whose edges have at most two different lengths $i,r$ , then we choose a vertex $v$ for the free edge and finally we choose $v^{\prime}$ such that $\big{|}v-v^{\prime}\big{|}_{n}=i$ or $\big{|}v-v^{\prime}\big{|}_{n}=r$ depending on the lengths of the edges in the triangle, therefore $v^{\prime}$ has only two choices.

The case $G_{12}$ is also bounded by $O(n^{4})$ . To show this, we distinguish between two subcases. In the first subcase, the multi-edge is formed of the associated edges of the same pair, w.l.o.g. $e(y_{i,j})=e(y_{i,j^{\prime}})$ (this can only happen in the case $n$ even). Then the free edges are formed of the edges $e(y_{r,s}),e(y_{r,s^{\prime}})$ , which have length $r$ ; we choose two vertices for the multi-edge and two more vertices $v_{1},v_{2}$ (one for each of the free edges), but then the two missing vertices $v_{1}^{\prime},v_{2}^{\prime}$ have at most two options each, because $\big{|}v-v_{1}\big{|}_{n}=\big{|}v_{2}-v_{2}^{\prime}\big{|}_{n}=r$ . Thus this subcase is bounded by $O(n^{4})$ . The second subcase is when $e(y_{i,j})\neq e(y_{i,j^{\prime}})$ and $e(y_{r,s})\neq e(y_{r,s^{\prime}})$ . Then w.l.o.g. the multi-edge is formed of the $e(y_{i,j})=e(y_{r,s})$ then $i=r$ , thus all edges have the same length; we choose two vertices $v,v^{\prime}$ for the multi-edge and two more vertices $v_{1},v_{2}$ (one for each of the free edges). The missing vertices $v_{1}^{\prime},v_{2}^{\prime}$ have at most two choices each because $\big{|}v_{1}-v_{1}^{\prime}\big{|}_{n}=\big{|}v_{2}-v_{2}^{\prime}\big{|}_{n}=\big{|}v-v^{\prime}\big{|}_{n}$ , which gives again a $O(n^{4})$ bound.

For $G_{9}$ , if we are in the case $n$ odd, then the multi-edge is formed of edges of different groups, w.l.o.g. $e(y_{i,j})=e(y_{r,s})$ and $i=r$ . Therefore the edge attached to the multi-edge is uniquely defined because its length is determined, and the isolated edge is almost uniquely defined once one of the end points is chosen, because the other end has at most two choices. Overall, this gives the $O(n^{3})$ bound. In the case $n$ even, it can happen that w.l.o.g. $e(y_{i,j})=e(y_{i,j^{\prime}})$ but this can only happen when $i=n/2$ . Then the multi-edge is uniquely defined by choosing one end, the isolated edge is defined by choosing two end points, and the last edge has at most four options since its length is already determined by the length of the isolated edge. This gives again a $O(n^{3})$ bound.

For $G_{10}$ , in the case $n$ odd we can assume as before $e(y_{i,j})=e(y_{r,s})$ . Then $i=r$ , and the multi-edge is determined by choosing two vertices and the remaining two edges are uniquely defined by the central vertex. This yields the bound $O(n^{3})$ . In the other case, w.l.o.g. $e(y_{i,j})=e(y_{i,j^{\prime}})$ , and $i=n/2$ . The multi-edge can be defined by choosing only one vertex, and the isolated path can be defined by choosing two vertices for one edge, while the remaining edge will have at most two options. This yields again a $O(n^{3})$ bound.

For $G_{11}$ , if $e(y_{i,j})=e(y_{r,s})$ , then all edges have the same length $i=r$ , we can choose two vertices for the first multi-edge and one vertices for the second multi-edge, while the remaining vertex has at most two options. This yields a $O(n^{3})$ bound. In the case when $e(y_{i,j})=e(y_{i,j^{\prime}})$ then $e(y_{r,s})=e(y_{r,s^{\prime}})$ and $i=r=n/2$ . In this case we can choose two vertices (one for each multi-edge), and the remaining two vertices are automatically determined. This yields a $O(n^{2})=O(n^{3})$ bound. Thus we have established Claim 20. ∎

We continue with the proof of Claim 19 in the case when $n$ is odd. We observe that

[TABLE]

and thus for $Y=\{y_{i,j},y_{i,j^{\prime}},y_{r,s},y_{r,s^{\prime}}\}\in\mathcal{Y}$ , we have that

[TABLE]

The last equation, combined with Claim 20, implies that

[TABLE]

Using the previous inequality and (22) we get that

[TABLE]

This completes the proof of Claim 19 in the case when $n$ is odd.

Case 2: $n$ even. We estimate the variances of $\mathcal{Z}_{0}$ and $\mathcal{Z}_{1}$ . For $n$ even, we can write $\mathcal{Z}_{0}$ as

[TABLE]

where all variables involved in the sums are mutually independent (see Remark 9). Thus

[TABLE]

Using (17), we deduce that

[TABLE]

for all $n$ even. By Remark 9, we can write $\mathcal{Z}_{1}$ as

[TABLE]

Therefore

[TABLE]

We divide the analysis into three parts: the first two sums, the third sum, and the fourth sum. Using Remark 9, we write the first two sums in (26) as

[TABLE]

The third sum in (26) can be bounded above in the same way as in the odd case: the associated graphs of variables $y_{i,j},y_{i,j^{\prime}},y_{r,s},y_{r,s^{\prime}}$ with non-zero covariance in the third sum, are isomorphic to one of the graphs in Figure 2. Thus we can use Claim 20 and (23) to obtain

[TABLE]

In the fourth sum in (26), the variables with non-zero covariance have an associated multi-graph which is isomorphic to one of the following multi-graphs.

Let $\mathcal{X}:=\left\{\{y_{u,v},y_{u,v^{\prime}},y_{\frac{n}{2},w}\}:1\leq u\leq\frac{n}{2};0\leq v<v^{\prime}\leq n-1;v\not\equiv_{\frac{n}{2}}v^{\prime};0\leq w\leq\frac{n}{2}-1\right\}$ . In the same way as Claim 20, we can prove that

[TABLE]

As in (23), we can prove that $\mathbb{E}\left[y_{u,v}y_{u,v^{\prime}}y_{\frac{n}{2},w}\right]=\frac{1}{n^{2}}$ for all $X=\{y_{u,v},y_{u,v^{\prime}},y_{\frac{n}{2},w}\}\in\mathcal{X}$ . Thus

[TABLE]

Plugging (27),(28),(29) into (26) finally yields

[TABLE]

for all $n$ even. Equations (19),(24),(25) and (30) together yield Claim 19 in the case when $n$ is even. Thus we have fully established Claim 19. ∎

5.3 $\mathcal{E}_{\mathrm{\small zero}}(\frac{1}{2}-\varepsilon)$ has high probability

Using Chebyshev’s inequality, we obtain that

[TABLE]

for every $\lambda_{0},\lambda_{1}>0$ . In particular, this implies that

[TABLE]

Let $\varepsilon\in(0,1)$ be the constant from the statement of Lemma 7, and set $\nu=\varepsilon n/8$ . Choosing $\lambda_{0}=\lambda_{1}=\nu$ and using Claims 18 and 19 we get that

[TABLE]

By Claim 18 we have

[TABLE]

for $n$ sufficiently large. Thus, using Claim 15 we can conclude that

[TABLE]

This concludes the proof of Lemma 7.

6 Connections with chromatic polynomials of circulant graphs

As we have already seen in the proof of Claim 11, the multi-graph associated with the variables in row $i\neq\frac{n}{2}$ of $T_{\mathbf{b}}$ is the circulant graph $C_{n}(i)$ , and the same holds for the variables in row $n/2$ if we consider the associated graph and not the associated multi-graph. Furthermore, we can express the probability of synchronization of circular automata in terms of chromatic polynomials of circulant graphs: this is a consequence of the close connection of the moments of $D(\mathbf{b})$ to chromatic polynomials of circulant graphs. We formalize this in the following results.

*Definition**.*

The circulant graph $C_{n}(i_{1},i_{2},\ldots,i_{k})$ is a graph with vertex set $\mathbb{Z}_{n}$ where two vertices $r,s$ are adjacent if $\big{|}r-s\big{|}_{n}\in\{i_{1},i_{2},\ldots,i_{k}\}.$

*Definition**.*

Let $G$ be a graph with vertex set $\{0,1,\ldots,n-1\}$ . The chromatic polynomial $P(G;x):\mathbb{N}\rightarrow\mathbb{N}$ of $G$ is defined by

[TABLE]

*Remark 21**.*

Let $G$ be of order $n$ . Then $P(G;x)=\sum_{j=1}^{n}\lambda_{j}x^{j},$ where $\lambda_{j}\in\mathbb{Z}$ (see, for instance, [Fengming et al., 2005]).

Claim 22.

Let $D(\mathbf{b})$ and $\mathbf{b}=(b_{0},b_{1},\ldots,b_{n-1})\in\mathcal{M}_{n}$ be as in Lemma 7. Then

[TABLE]

and

[TABLE]

where $P_{i}$ is the chromatic polynomial of the circulant graph $C_{n}(i)$ and $P_{i,j}$ is the chromatic polynomial of the circulant graph $C_{n}(i,j)$ .

*Remark 23**.*

$\bullet$ It is easy to derive that $P_{i}(x)=\left((x-1)^{l_{i}}+(-1)^{l_{i}}(x-1)\right)^{\frac{n}{l_{i}}}$ where $l_{i}=\frac{n}{\gcd(n,i)}$ , because $C_{n}(i)$ is a collection of $\gcd(n,i)$ many disjoint cycles of length $\frac{n}{\gcd(n,i)}$ [Boesch and Tindell, 1984]. With this explicit expression, an easy corollary of Claim 22 is the estimate $\mathbb{E}\left[D\right]\sim(1-e^{-1})\left\lfloor\frac{n}{2}\right\rfloor$ .

$\bullet$ We could not find an explicit expression for $P_{i,j}$ . The calculation of the chromatic number of circulant graphs with an arbitrary number of parameters is an NP-Hard problem [Codenotti et al., 1998]. This implies that the calculation of chromatic polynomials of circulant graphs is also NP-Hard since

$\chi(G)=\text{argmin}_{w\in\mathbb{N}}P(G;w)>0$ – we believe that our unfruitful attempts to estimate $\mathbb{V}\left[D\right]$ are connected to this. To circumvent these issues, the variables $\mathcal{Z}_{0}$ and $\mathcal{Z}_{1}$ in Section 5 were introduced.

Proof of Claim 22.

Let us recall that $D(\mathbf{b})=\sum_{i=1}^{\left\lfloor\frac{n}{2}\right\rfloor}D_{i}(\mathbf{b})$ , where

[TABLE]

Then $D_{i}(\mathbf{b})=1-x_{i}(\mathbf{b})$ , where

[TABLE]

We observe that $x_{i}(\mathbf{b})=1$ if and only if every two numbers $r,s\in\mathbb{Z}_{n}$ at cyclic distance $i$ have different images under $\mathbf{b}$ and $x_{i}(\mathbf{b})=0$ otherwise. If we consider $\mathbf{b}$ as a random coloring of $C_{n}(i)$ , then $x_{i}(\mathbf{b})=1$ if and only if $C_{n}(i)$ is properly colored by $\mathbf{b}$ . Thus

[TABLE]

In a similar way

[TABLE]

Therefore

[TABLE]

as well as

[TABLE]

and

[TABLE]

Plugging the two previous equations into

[TABLE]

yields Claim 22. ∎

We get the following relation between chromatic polynomials of circulant graphs and synchronization of circular automata. The number $\frac{1}{2}-e^{-1}$ in the statement of Theorem 24 has the approximate value $0.13$ .

Theorem 24.

Let $\mathcal{A}_{n}(\mathbf{b})$ be a circulant graph as introduced in Section 2. Let $\varepsilon\in(0,\frac{1}{2}-e^{-1})$ , then there exist $n_{\epsilon}\in\mathbb{N}$ such that for all $n\geq n_{\epsilon}$ it holds that

[TABLE]

where $\mathbb{V}\left[D\right]$ is as given in Claim 22.

Proof.

By (12),(14) we know that

[TABLE]

for all $\varepsilon>0$ and $n$ large enough, where $\alpha^{\star}=1-e^{-1}-\varepsilon$ . Using the expression for $P_{i}$ in Remark 23 together with the inequality $1-x\leq e^{-x},\leavevmode\nobreak\ x\in\mathbb{R}$ , we bound $P_{i}(n)/n^{n}$ from above

[TABLE]

and thus

[TABLE]

Using Equation 32 and the equation $\mathbb{E}\left[D\right]=\left\lfloor\frac{n}{2}\right\rfloor-\sum_{i=1}^{\left\lfloor\frac{n}{2}\right\rfloor}\frac{P_{i}(n)}{n^{n}}$ from Claim 22 we get that

[TABLE]

By Chebyshev’s inequality and elementary manipulations, we get that

[TABLE]

for all $\lambda>0$ . Let $\varepsilon>0$ . Setting $\lambda=\lambda_{\varepsilon}^{\prime}(n)=\eta_{\star}-\left\lfloor\frac{n}{2}\right\rfloor(1-e^{-1}-\varepsilon)+1$ and noting that $\lambda>0$ for $n$ large enough, we get that

[TABLE]

for $n$ sufficiently large, where $\tilde{\beta}=1-e^{-1}-\varepsilon-\frac{1}{\left\lfloor\frac{n}{2}\right\rfloor}$ . Using the previous inequalities, we conclude that

[TABLE]

for $n$ large enough where the relations $\alpha^{\star},\tilde{\beta}>0$ and $\alpha^{\star}+\tilde{\beta}>1$ are valid when $\varepsilon\in(0,\frac{1}{2}-e^{-1})$ and $n$ is large enough. ∎

Actually, we formulate the following conjecture:

*Conjecture 25**.*

$\mathbb{V}\left[D\right]=O(n).$

To prove this conjecture it is sufficient to prove that there is $g:\mathbb{N}\rightarrow\mathbb{R}$ such that $|\frac{P_{i,j}(n)}{n^{n}}-\frac{P_{i}(n)P_{j}(n)}{n^{2n}}|\leq g(n)=O(1/n)$ for all $i,j$ . From (32) we see that $0\leq P_{i}(n)/n^{n}\leq f(n)=O(1)$ for all $i$ , therefore the first part of the sum of $\mathbb{V}\left[D\right]$ given in Claim 22 is $|\sum_{i=1}^{n}\left(\frac{P_{i}(n)}{n^{n}}-\frac{P_{i}^{2}(n)}{n^{2n}}\right)|\leq nf(n)=O(n)$ . The second part of the sum $\sum_{1\leq i<j\leq\left\lfloor\frac{n}{2}\right\rfloor}\left(\frac{P_{i,j}(n)}{n^{n}}-\frac{P_{i}(n)P_{j}(n)}{n^{2n}}\right)$ has a quadratic number of elements of the form $\frac{P_{i,j}(n)}{n^{n}}-\frac{P_{i}(n)P_{j}(n)}{n^{2n}}$ , and it can be bounded by $O(n^{2})g(n)=O(n)$ if the assumption $|\frac{P_{i,j}(n)}{n^{n}}-\frac{P_{i}(n)P_{j}(n)}{n^{2n}}|\leq g(n)=O(1/n)$ for all $i,j$ is true, making $\mathbb{V}\left[D\right]=O(n)+O(n)=O(n).$ In particular, a positive answer to this chromatic-polynomial question would give an alternative proof of Theorem 4.

7 Future work

Let $\mathcal{A}_{n}(\mathbf{a},\mathbf{b})$ be an automaton where $\mathbf{a}:\mathbb{Z}_{n}\rightarrow\mathbb{Z}_{n}$ is fixed and $\mathbf{b}\in\mathcal{M}_{n}$ . These are natural lines of research to extend/improve the results in this paper:

$\bullet$ We want to explore in more detail the strengths and limitations in the ideas presented in this paper. For example, we think that these ideas can extend Theorem 4 to the case where $\mathbf{a}:\mathbb{Z}_{n}\rightarrow\mathbb{Z}_{n}$ is in the form of a finite number of pairwise disjoint cycles of almost-equal length. We also think that (probabilistic) upper bounds for the length of the synchronizing minimal words can be given with our techniques, in the spirit of the results of [Nicaud, 2019].

$\bullet$ Theorem 3 has a decay rate in $\Theta\left(\frac{\sqrt{p}}{e^{p}}\right)$ . We believe that this can be extended in a weaker form to the case of circular automata of composite order:

*Conjecture 26**.*

[TABLE]

for some $0<\alpha<1$ , as $n\to\infty$ .

Acknowledgments

CA acknowledges the financial support of the Austrian Science Fund (FWF), projects F-5512, I-3466 and Y-901. DD, AG and AR acknowledge the financial support of the FWF project P29355-N35. AR acknowledges also the partial support of the FWF project P25510-N26. We want to thank two anonymous referees who read our paper very carefully, and whose suggestions greatly helped us to improve the presentation of this paper.

Bibliography18

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[Ananichev et al., 2010] Ananichev, D. S., Gusev, V. V., and Volkov, M. V. (2010). Slowly synchronizing automata and digraphs. Co RR , abs/1005.0129.
2[Béal et al., 2011] Béal, M.-P., Berlinkov, M. V., and Perrin, D. (2011). A quadratic upper bound on the size of a synchronizing word in one-cluster automata. International Journal of Foundations of Computer Science , 22(02):277–288.
3[Berlinkov, 2016] Berlinkov, M. V. (2016). On the probability of being synchronizable. In Algorithms and discrete applied mathematics , volume 9602 of Lecture Notes in Comput. Sci. , pages 73–84. Springer, [Cham].
4[Berlinkov and Nicaud, 2018] Berlinkov, M. V. and Nicaud, C. (2018). Synchronizing random almost-group automata. In International Conference on Implementation and Application of Automata , pages 84–96. Springer.
5[Boesch and Tindell, 1984] Boesch, F. and Tindell, R. (1984). Circulants and their connectivities. J. Graph Theory , 8(4):487–499.
6[Cerny, 1964] Cerny, J. (1964). Poznamka k homogenym eksperimentom s konechnymi automatami. Math.-Fyz. Cas , 14:208–215.
7[Codenotti et al., 1998] Codenotti, B., Gerace, I., and Vigna, S. (1998). Hardness results and spectral techniques for combinatorial problems on circulant graphs. Linear Algebra and its Applications , 285(1):123 – 142.
8[Dubuc, 1998] Dubuc, L. (1998). Sur les automates circulaires et la conjecture de Černỳ. RAIRO-Theoretical Informatics and Applications , 32(1-3):21–34.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Circular automata synchronize with high probability

Abstract

1 Introduction

Claim 1**.**

Proof.

Conjecture 2* (The Černỳ conjecture).*

2 Main result

Theorem 3** ([Perrin, 1977][Pin, 1978]).**

Theorem 4** (Main result).**

Remark 5*.*

Definition*.*

Lemma 6**.**

Lemma 7**.**

Proof of Theorem 4.

3 Independence among the random variables Tb(i,j)T_{\mathbf{b}}(i,j)Tb​(i,j)

Definition*.*

Proposition 8**.**

Remark 9*.*

Remark 10*.*

Proof of Proposition 8.

4 Proof of Lemma 6

Claim 11**.**

Proof.

Claim 12**.**

Proof.

Definition*.*

Proposition 13** (McDiarmid’s Inequality [McDiarmid, 1989]).**

Remark*.*

Claim 14**.**

Proof.

5 Proof of Lemma 7

Claim 15**.**

Proof.

5.1 Lower bound for D(b)D(b)D(b)

Claim 16**.**

Definition*.*

Remark 17*.*

Claim 18**.**

Proof.

Claim 19**.**

Proof.

Claim 20**.**

Proof.

5.3 Ezero(12−ε)\mathcal{E}_{\mathrm{\small zero}}(\frac{1}{2}-\varepsilon)Ezero​(21​−ε) has high probability

6 Connections with chromatic polynomials of circulant graphs

Definition*.*

Definition*.*

Remark 21*.*

Claim 22**.**

Remark 23*.*

Proof of Claim 22.

Theorem 24**.**

Proof.

Conjecture 25*.*

7 Future work

Conjecture 26*.*

Acknowledgments

Claim 1.

*Conjecture 2** (The Černỳ conjecture).*

Theorem 3 ([Perrin, 1977][Pin, 1978]).

Theorem 4 (Main result).

*Remark 5**.*

*Definition**.*

Lemma 6.

Lemma 7.

3 Independence among the random variables $T_{\mathbf{b}}(i,j)$

*Definition**.*

Proposition 8.

*Remark 9**.*

*Remark 10**.*

Claim 11.

Claim 12.

*Definition**.*

Proposition 13 (McDiarmid’s Inequality [McDiarmid, 1989]).

*Remark**.*

Claim 14.

Claim 15.

5.1 Lower bound for $D(b)$

Claim 16.

*Definition**.*

*Remark 17**.*

Claim 18.

Claim 19.

Claim 20.

5.3 $\mathcal{E}_{\mathrm{\small zero}}(\frac{1}{2}-\varepsilon)$ has high probability

*Definition**.*

*Definition**.*

*Remark 21**.*

Claim 22.

*Remark 23**.*

Theorem 24.

*Conjecture 25**.*

*Conjecture 26**.*