Circular automata synchronize with high probability
Christoph Aistleitner, Daniele D'Angeli, Abraham Gutierrez, Emanuele, Rodaro, Amnon Rosenmann

TL;DR
This paper proves that a random circular automaton of size n synchronizes with high probability, using probabilistic methods and properties of associated random matrices, and relates synchronization probability to chromatic polynomials of circulant graphs.
Contribution
It establishes that random circular automata synchronize with high probability and introduces a novel approach using random matrix properties and graph chromatic polynomials.
Findings
Synchronization probability approaches 1 as n increases
Provides bounds on synchronization probability using chromatic polynomials
Connects automaton synchronization to properties of circulant graphs
Abstract
In this paper we prove that a uniformly distributed random circular automaton of order synchronizes with high probability (whp). More precisely, we prove that The main idea of the proof is to translate the synchronization problem into properties of a random matrix; these properties are then handled with tools of the probabilistic method. Additionally, we provide an upper bound for the probability of synchronization of circular automata in terms of chromatic polynomials of circulant graphs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Circular automata synchronize with high probability
Christoph Aistleitner Institute of Analysis and Number Theory, TUGraz, Austria. [email protected]
Daniele D’Angeli Università Niccolò Cusano, Via don Gnocchi Roma, Italia. [email protected]
Abraham Gutierrez Institute of Discrete Mathematics, TUGraz, Austria. {a.gutierrez, rosenmann}@math.tugraz.at
Emanuele Rodaro Department of Mathematics, Politecnico di Milano, Italia. [email protected]
Amnon Rosenmann*‡*
Abstract
In this paper we prove that a uniformly distributed random circular automaton of order synchronizes with high probability (w.h.p.). More precisely, we prove that
[TABLE]
The main idea of the proof is to translate the synchronization problem into a problem concerning properties of a random matrix; these properties are then established with high probability by a careful analysis of the stochastic dependence structure among the random entries of the matrix. Additionally, we provide an upper bound for the probability of synchronization of circular automata in terms of chromatic polynomials of circulant graphs.
Keywords: Automata; Synchronization; Random Matrices; Circulant Graphs; Chromatic Polynomials.
1 Introduction
A complete deterministic finite automaton (DFA) is a tuple , where is a finite set of states and is a finite set of mappings , where is also written as , , . The number of states is the order of . Each is called a letter and a sequence is a word of length . The action of on naturally extends to an action of on , defined recursively by , , , . This action further extends to an action of on subsets of by . We say that the subset synchronizes if there exists a word such that (equivalently, we say that synchronizes ). If the set synchronizes then we say that synchronizes (or that it is a synchronizing automaton). A word that synchronizes is called a synchronizing (or reset) word of .
The following simple criterion for synchronization is well known and plays a crucial role throughout the paper:
Claim 1**.**
* synchronizes every pair of states synchronizes.*
Proof.
It is clear that if synchronizes by a reset word then synchronizes every pair of states of . Conversely, a reset word for can be formed by concatenating words that synchronize pairs of states until we end up with a single state. ∎
The synchronization property may be described in terms of the graph representation of . The set of states comprises the vertices of the graph and for each pair of states and a letter such that there is an arrow labeled with and connecting to . Each and defines a directed path
[TABLE]
that begins in and ends in . then synchronizes if and only if there is a word , such that the paths have a common endpoint , that is, the word acts on as the constant mapping.
Synchronizing automata have been intensely studied by theoretical computer scientists as well as pure mathematicians since the 1960’s; see [Volkov, 2008] for a detailed introduction on synchronization of automata. A driving force in this research field is the Černỳ conjecture.
Conjecture 2* (The Černỳ conjecture).*
A synchronizing automaton of order has a shortest synchronizing word of length at most .
The bound in the Černỳ conjecture is tight: in [Cerny, 1964] Černỳ provided a series of synchronizing circular automata , such that has order and its shortest synchronizing word is of size exactly (see Fig. 1). Furthermore, the Černỳ series of circular automata is the only known infinite series of automata whose shortest synchronizing words are of length [Ananichev et al., 2010].
The best known general upper bounds for the size of shortest synchronizing words of an automaton with states are of order [Pin, 1983][Szykuła, 2017][Shitov, 2019]. Nevertheless, there are many classes of automata for which the Černỳ conjecture has been established (see [Volkov, 2008] for examples).
In last decade probabilistic approaches to the synchronization problem have been developed. Typical questions in this setting are: let be a uniformly chosen DFA with letters on a certain probability space, is it true that with high probability the automaton is synchronizing? Does the Černỳ conjecture hold with high probability? Here we give a (non-comprehensive) list of recent achievements in this probabilistic setting:
In [Skvortsov and Zaks, 2010] the authors study random automata where the number of letters grow together with . In particular, they prove that synchronizes w.h.p. when grows fast enough; 2.
In [Berlinkov, 2016] the author proves that , for arbitrary , and for ; 3.
In [Nicaud, 2019] the author proves that admits w.h.p. a synchronizing word of length for arbitrary 4.
In [Berlinkov and Nicaud, 2018] the authors prove that if is uniformly chosen among the strongly-connected almost-group automata then synchronizes with probability for arbitrary .
Since the sequence of circular automata depicted in Fig. 1 is the only known infinite series of synchronizing automata reaching Černỳ ’s bound , one might suspect that the class of circular automata is somehow difficult to synchronize. However, as we show in the present paper, it turns out that a random circular automaton is synchronizing with high probability.
The rest of the paper is organized as follows: in Section 2 we present the main result together with its proof and the statement of the two key lemmas for the proof. In Section 3 we study the dependence structure among the entries of the random matrix used in the proof of the main result; the result obtained in this section is crucial for the proof of the key lemmas. In Section 4 we prove the first lemma while in Section 5 we prove the second one. In Section 6 we present some interesting connections between synchronization of circular automata and chromatic polynomials of circulant graphs. Finally, in Section 7 we present some possible directions towards generalizing and improving the results presented in this paper.
2 Main result
Let be a positive integer. An automaton , where is the set of states, is called a circular automaton if contains a permutation that decomposes in exactly one cycle. Let . Let denote the set of all mappings from to itself, and let denote the uniform probability measure on . We will write the elements of as vectors by identifying the mapping with the vector .
In what follows, we denote by a circular automaton of order , with being the circular right shift permutation and being an element of . We will understand that is “randomly” chosen from according to the uniform probability measure , making a random circular automaton.
It follows from work of Perrin [Perrin, 1977] that a circular automaton of prime order synchronizes if and only if contains a non-permutation. Pin [Pin, 1978] proved with combinatorial methods that a circular automaton of prime order which has a letter of rank has a minimal word of size at most . For the probability of synchronization of a very precise result is known.
Theorem 3** ([Perrin, 1977][Pin, 1978]).**
Let be a prime number. Then
[TABLE]
Thus, a uniformly distributed random circular automaton of prime order with letters synchronizes with high probability (w.h.p.).
Theorem 3 is not explicitly stated in [Perrin, 1977], but it is observed in [Pin, 1978] that Perrin’s work implies the theorem.
It is known that the Černỳ conjecture holds true for the class of circular automata [Dubuc, 1998]. In a closely related work, Béal, Berlinkov and Perrin [Béal et al., 2011] gave an upper bound for the shortest words of synchronizing automata with a single cluster.
A natural question arises: do random circular automata of order (not necessarily prime) synchronize with high probability? We give a positive answer to this question in the following:
Theorem 4** (Main result).**
The following holds:
[TABLE]
as . Thus, a randomly chosen synchronizes w.h.p. as .
Remark 5*.*
Theorem 4 does not follow from the results of Berlinkov or Nicaud. In their models, they use a random automaton of order where is a collection of mappings from to i.i.d. uniformly chosen. For a fixed , the probability of randomly chosen mappings to contain a permutation with exactly one cycle is bounded from above by .
Given and , we define the -cyclic absolute value of to be
[TABLE]
When then \big{|}r-s\big{|}_{n} is the -cyclic distance between and . When the numbers are identified with the vertices of a cycle of length , the -cyclic distance between two such numbers is the length of the shortest path between them in the cycle. We now introduce the main tool for the proof of the main theorem.
Definition*.*
Let be a circular automaton with . Then we define to be the matrix
[TABLE]
shortly written as
[TABLE]
As before, , i.e., the image of state under . To be clear, note that the first row of is formed of the cyclic distances of the images of states such that \big{|}r-s\big{|}_{n}=1; in general, the -th row of is formed of the cyclic distances of the images of pairs of states of cyclic distance . Notice that the columns are counted from [math] to .
For and , let denote the number of different entries in row of :
[TABLE]
Set
[TABLE]
i.e., contains those for which every row of has at least different elements. Its complement is
[TABLE]
We also define
[TABLE]
and its complement
[TABLE]
where
[TABLE]
and
[TABLE]
That is, is the set of those for which the matrix has at least rows containing the entry zero.
The proof of Theorem 4 relies on the following two lemmas.
Lemma 6**.**
*Let and let . Then *
[TABLE]
as .
Lemma 7**.**
*Let and let . Then *
[TABLE]
as .
Proof of Theorem 4.
The main idea of the proof is to transform the question of synchronization of into a question concerning properties of the matrix . The functions are random variables over , and to obtain our desired probability estimates we will need to understand the joint stochastic dependence structure of these random variables.
Let and consider the associated Matrix . The first observation is that a zero in row of means that two states with cyclic distance synchronize under (i.e., ), which implies that any pair with cyclic distance can be synchronized with a word of the form because for some . The second observation is that if the -th row of contains a number and the -th row contains a zero, then every pair of states with cyclic distance can be synchronized with a word of the form . Indeed, we can proceed as follows: , where this last pair has n-cyclic distance ; then synchronizes with a word of the form , for some because the -th row contains a zero. With these two observations, we establish sufficient conditions on for the synchronization of . The sets and which we defined in (3) and (5) play a crucial role.
Let . If is contained in both and for some such that , then synchronizes. This follows from the two previous observations together with the union bound. Indeed, let be any pair of different states and let . If row contains a zero, we can synchronize with a word of the form ; otherwise, row contains an entry such that row contains a zero (because ), which implies that can be synchronized with a word of the form . Therefore, every pair of different states synchronizes and synchronizes by Claim 1. Therefore, for any satisfying , we have the following bound:
[TABLE]
Now, by the last inequality and by Lemmas 6 and 7 we obtain the bound stated in the main theorem. We can choose, for example, , and , so that , and . Then we have
[TABLE]
as . ∎
3 Independence among the random variables
For every pair , and , the function is a random variable on the space , equipped with the uniform probability measure (and with the power set of as the natural sigma-field). It is crucial for our proof to give a criterion on pairs of indices which guarantees that the random variables , …, are independent. First, notice that not every subset of random variables is independent. For example,
[TABLE]
are clearly dependent: if the first two random variables and are zero, then , which implies that \big{|}b_{0}-b_{2}\big{|}_{n}=0 and so necessarily is also zero. This dependence comes from the fact that there is a “cycle” of the form generated by the indices of these three random variables. Generally, it will turn out that a set of random variables is independent if and only if the corresponding indices are “acyclic”. We formalize this in the following
Definition*.*
Let
[TABLE]
be a multi-set, where . The associated (multi-)graph is the (multi-)graph with vertex set and edge (multi-)set
[TABLE]
We say that is acyclic if its associated multi-graph is acyclic. We also say that the edge is associated to the random variable .
The relation between acyclic index sets and independent variables is stated in the following
Proposition 8**.**
The variables are i.i.d. the (multi-)set is acyclic. Furthermore, if the variables are independent then
[TABLE]
where are arbitrary integers and
[TABLE]
Henceforth in the paper we use the concepts “acyclic” and “independent” interchangeably when we refer to a multi-set of independent random variable entries of , resp. to random variable entries whose associated multi-graph is acyclic.
Remark 9*.*
Note that different random variables may be associated with the same edge; since this only happens when is even and and Thus, for odd, a pair of different random variables is always acyclic/independent.
Remark 10*.*
For a vector , we can write its entries as functions of . In other words, are random variables on , equipped with the uniform measure . The random variables are independent and identically distributed over this space; this follows immediately from the fact that the uniform measure on is a product of one-dimensional uniform measures.
Proof of Proposition 8.
First note that any two random variables T_{\mathbf{b}}(i,j)=\big{|}b_{j}-b_{{(j+i)}_{n}}\big{|}_{n} and T_{\mathbf{b}}(i^{\prime},j^{\prime})=\big{|}b_{j^{\prime}}-b_{{(j^{\prime}+i^{\prime})}_{n}}\big{|}_{n} are always identically distributed since are i.i.d. (see Remark 10). Note also that for all
[TABLE]
which can seen by an easy counting argument: there are different possible choices of , and then there are independent different choices of such that \big{|}b_{p}-b_{p+q}\big{|}_{n}=s. Thus equation (8) is just a rephrasing of the fact that the random variables are independent. Therefore, what we need to prove is that independence holds if and only if the associated (multi-)graph is acyclic.
(by contraposition) Let be a (multi-)set which contains a cycle. Thus, its associated multi-graph has a cycle of length . Let this cycle be w.l.o.g.
[TABLE]
Recall that . Thus if for some we have
[TABLE]
then , and so we automatically also have T_{\mathbf{b}}(i_{l},j_{l})=\big{|}b_{j_{l}}-b_{(j_{l}+i_{l})_{n}}\big{|}_{n}=\big{|}b_{j_{l}}-b_{j_{1}}\big{|}_{n}=0. Thus, the variables are not independent. We conclude that an independent multi-set must be acyclic.
(by induction on ) Let . Assume that the multi-set is acyclic. We want to show that is independent of . This will allow us to factor out the -th factor on the left-hand side of (8), leading (by induction) to the formula on the right-hand side of (8), which is equivalent to independence.
We distinguish two cases: The first case is when the edge is a connected component by itself in . This means that the sets and are disjoint. By construction, the random variables depend only on with , while depends only on with . Since are independent by Remark 10, this implies that is independent of , as desired.
For the second case, the edge is not a connected component by itself in . Since it is also not part of a cycle by assumption,we can assume that is a leaf vertex in . In principle, depends on as well as on . However, since is defined as a cyclic distance, the conditional distribution of given is always the same. In formulas, for every we have
[TABLE]
for every . This fact can be simply established by counting the possible configurations of and . By definition, is independent of all with . Thus for every numbers we have, using the independence of and (9), that
[TABLE]
This is exactly the independence property that we wanted to establish. ∎
4 Proof of Lemma 6
The overview of the proof is as follows. Recall that we understand the entries of the matrix as random variables. We will prove that every row of contains a “large” number of independent random variables. Then we give a lower bound for the expected value of the number of different elements in each row. Then we apply McDiarmid’s inequality to each row and finally we use the union bound together with the exponential decay delivered by McDiarmid’s inequality to guarantee that w.h.p. every row of has at least different elements. We denote by the circulant graph on vertices, i.e., the graph with vertex set where two vertices are adjacent if and only if \big{|}r-s\big{|}_{n}=i.
We need the following property.
Claim 11**.**
For every , the -th row of contains a set of at least random variables which are i.i.d.
Proof.
The variables in row are given by the multi-set
[TABLE]
Let . By Remark 9, the corresponding multi-set does not have repeated elements and the associated multi-graph is isomorphic to the circulant graph . It is well known and easy to show that is a disjoint union of cycles of length [Boesch and Tindell, 1984]. We can then obtain an acyclic set of variables by removing one edge from each of the cycles of . The resulting set of variables is i.i.d. by Proposition 8. In the case , the first variables in row
[TABLE]
have an associated multi-graph that is isomorphic to the circulant graph , which is a disjoint union of edges. This last graph is acyclic, thus the variables are i.i.d. by Proposition 8. ∎
We prove the following lower bound
Claim 12**.**
We have , where for all
[TABLE]
(see (2)) is the cardinality of different elements in row of .
Proof.
First, for every , we define the random variables
[TABLE]
and
[TABLE]
Note that is zero if the number is included in the -th row of , and that it is one otherwise. Recalling that the entries of can only have values in , we write the number of distinct elements in row as
[TABLE]
By Claim 11, there is a subset of of cardinality such that the variables are i.i.d., and thus
[TABLE]
Furthermore, by Proposition 8, we have and thus
[TABLE]
for . Using the inequality , which is valid for any real number , we obtain
[TABLE]
Plugging this inequality into (11) yields
[TABLE]
This proves Claim 12. ∎
We introduce McDiarmid’s inequality to prove Claim 14.
Definition*.*
Let be a function. We say that has Lipschitz coefficient if
[TABLE]
for every such that for all except for at most one index.
Proposition 13** (McDiarmid’s Inequality [McDiarmid, 1989]).**
Let be a random vector, where the variables are independent, and let be a function with bounded Lipschitz coefficient . Then
[TABLE]
for all .
Remark*.*
This is just a special case of the general form of McDiarmid’s inequality. The general inequality also bounds the upper tail, and allows different Lipschitz coefficients in the respective components.
In the following claim we use Proposition 13 to estimate the probability that row of has less than different elements.
Claim 14**.**
Let . Then
[TABLE]
for .
Proof.
Let . Let be defined as in (10). The function has Lipschitz coefficient 2: changing one affects at most two entries, namely \big{|}b_{j}-b_{(j+i)_{n}}\big{|}_{n} and \big{|}b_{(j-i)_{n}}-b_{j}\big{|}_{n}. Using McDiarmid’s inequality, we deduce that
[TABLE]
Using the lower bound of Claim 12 we obtain
[TABLE]
Let and let
[TABLE]
we observe that is independent of Let , then plugging into the previous inequality yields
[TABLE]
∎
Recall that contains those for which every row of has at least different elements, so that
[TABLE]
Let be arbitrary and let . Then
[TABLE]
where we use Claim 14 for the second inequality. The proof of Lemma 6 then follows by noticing that
[TABLE]
5 Proof of Lemma 7
The overview of the proof is as follows. We will define two random variables and such that
[TABLE]
Then we will show that and concentrate around their respective means, and use this fact to give an upper bound on the probability that is small. For this purpose, we note the following property.
Claim 15**.**
*Let and be random variables which take non-negative values, such that . Let and let . Then *
[TABLE]
Proof.
This follows easily from the assumption that and the union bound. ∎
To prove concentration of and around their respective means, we use Chebyschev’s inequality. Notice that does not have a bounded Lipschitz coefficient, so we cannot use McDiarmid’s inequality to guarantee its concentration.
5.1 Lower bound for
Recall that counts the number of rows of that contain at least one zero. Let
[TABLE]
and
[TABLE]
Then
[TABLE]
It is easy to verify that the number of non-ordered pairs of entries in the -th row with zero value is
[TABLE]
therefore
[TABLE]
From this and (15), we conclude that
Claim 16**.**
**
5.2 Estimates for , , , ,
In this subsection we prove that
- •
- •
- •
- •
and
- •
.
For the rest of this subsection, we use the notation
[TABLE]
for and
Definition*.*
The variables are called acyclic if the multi-set is acyclic. Let
[TABLE]
be the associated multi-graph of the multi-set and let be the associated edge to . The length of is \big{|}j-(j+i)_{n}\big{|}_{n}=i.
Remark 17*.*
If the variables are acyclic then they are i.i.d.; this is an immediate consequence of Proposition 8.
We begin with the easy part: the bounds for the expected values.
Claim 18**.**
Let . We have , , and
Proof.
Using the linearity of the expectation, we get that
[TABLE]
where for the second equality we use that
[TABLE]
Now we calculate an upper bound for , depending on the parity of .
Case 1: odd. Every product in the sum
[TABLE]
is formed of independent random variables , by Remarks 9,17. Thus
[TABLE]
Case 2: even. Using Remark 9, we write as
[TABLE]
Every product in the first sum is formed of independent variables , by Remark 9 and the same is valid for the products in the second sum, therefore
[TABLE]
We deduce from the previous cases that and for all . Using this last inequality and (16), we conclude that
[TABLE]
This concludes the proof of Claim 18. ∎
Now we estimate the variance of and .
Claim 19**.**
Let , then and .
Proof.
Here we also divide the calculations according to the parity of .
Case 1: odd. We expand the variance of to get that
[TABLE]
where the covariances are calculated among pairs of independent variables due to Remark 9. Thus
[TABLE]
We notice that because , therefore
[TABLE]
where we use (17) in the last equality. Then, for all odd, we get that
[TABLE]
Now we calculate
[TABLE]
We first note that
[TABLE]
this follows since the variables and are different and therefore independent (see Remark 9). Thus
[TABLE]
For the sum of the covariances, we proceed as follows: if the variables are acyclic then they are independent (see Proposition 8), therefore
[TABLE]
On the other hand, if the variables are not acyclic, let
[TABLE]
and let
[TABLE]
Then is a multi-graph with four edges such that and (see Remark 9). In particular, there cannot be 3 equal edges. If has at least one cycle, it is isomorphic to one of the multi-graphs in Figure 2 below.
We will now estimate the contribution of each of these possible non-acyclic multi-graphs.
Claim 20**.**
Let , then
[TABLE]
Proof.
The cases can be bounded by the trivial bound , and the same for the cases with the bound . The remaining cases require better estimates than their respective trivial bounds.
First, notice that for all cases, the four edges of the multi-graph are divided into two pairs: of length and of length . The case is bounded by because three vertices can be chosen freely to form a triangle whose edges have at most two different lengths , then we choose a vertex for the free edge and finally we choose such that \big{|}v-v^{\prime}\big{|}_{n}=i or \big{|}v-v^{\prime}\big{|}_{n}=r depending on the lengths of the edges in the triangle, therefore has only two choices.
The case is also bounded by . To show this, we distinguish between two subcases. In the first subcase, the multi-edge is formed of the associated edges of the same pair, w.l.o.g. (this can only happen in the case even). Then the free edges are formed of the edges , which have length ; we choose two vertices for the multi-edge and two more vertices (one for each of the free edges), but then the two missing vertices have at most two options each, because \big{|}v-v_{1}\big{|}_{n}=\big{|}v_{2}-v_{2}^{\prime}\big{|}_{n}=r. Thus this subcase is bounded by . The second subcase is when and . Then w.l.o.g. the multi-edge is formed of the then , thus all edges have the same length; we choose two vertices for the multi-edge and two more vertices (one for each of the free edges). The missing vertices have at most two choices each because \big{|}v_{1}-v_{1}^{\prime}\big{|}_{n}=\big{|}v_{2}-v_{2}^{\prime}\big{|}_{n}=\big{|}v-v^{\prime}\big{|}_{n}, which gives again a bound.
For , if we are in the case odd, then the multi-edge is formed of edges of different groups, w.l.o.g. and . Therefore the edge attached to the multi-edge is uniquely defined because its length is determined, and the isolated edge is almost uniquely defined once one of the end points is chosen, because the other end has at most two choices. Overall, this gives the bound. In the case even, it can happen that w.l.o.g. but this can only happen when . Then the multi-edge is uniquely defined by choosing one end, the isolated edge is defined by choosing two end points, and the last edge has at most four options since its length is already determined by the length of the isolated edge. This gives again a bound.
For , in the case odd we can assume as before . Then , and the multi-edge is determined by choosing two vertices and the remaining two edges are uniquely defined by the central vertex. This yields the bound . In the other case, w.l.o.g. , and . The multi-edge can be defined by choosing only one vertex, and the isolated path can be defined by choosing two vertices for one edge, while the remaining edge will have at most two options. This yields again a bound.
For , if , then all edges have the same length , we can choose two vertices for the first multi-edge and one vertices for the second multi-edge, while the remaining vertex has at most two options. This yields a bound. In the case when then and . In this case we can choose two vertices (one for each multi-edge), and the remaining two vertices are automatically determined. This yields a bound. Thus we have established Claim 20. ∎
We continue with the proof of Claim 19 in the case when is odd. We observe that
[TABLE]
and thus for , we have that
[TABLE]
The last equation, combined with Claim 20, implies that
[TABLE]
Using the previous inequality and (22) we get that
[TABLE]
This completes the proof of Claim 19 in the case when is odd.
Case 2: even. We estimate the variances of and . For even, we can write as
[TABLE]
where all variables involved in the sums are mutually independent (see Remark 9). Thus
[TABLE]
Using (17), we deduce that
[TABLE]
for all even. By Remark 9, we can write as
[TABLE]
Therefore
[TABLE]
We divide the analysis into three parts: the first two sums, the third sum, and the fourth sum. Using Remark 9, we write the first two sums in (26) as
[TABLE]
The third sum in (26) can be bounded above in the same way as in the odd case: the associated graphs of variables with non-zero covariance in the third sum, are isomorphic to one of the graphs in Figure 2. Thus we can use Claim 20 and (23) to obtain
[TABLE]
In the fourth sum in (26), the variables with non-zero covariance have an associated multi-graph which is isomorphic to one of the following multi-graphs.
Let . In the same way as Claim 20, we can prove that
[TABLE]
As in (23), we can prove that for all . Thus
[TABLE]
Plugging (27),(28),(29) into (26) finally yields
[TABLE]
for all even. Equations (19),(24),(25) and (30) together yield Claim 19 in the case when is even. Thus we have fully established Claim 19. ∎
5.3 has high probability
Using Chebyshev’s inequality, we obtain that
[TABLE]
for every . In particular, this implies that
[TABLE]
Let be the constant from the statement of Lemma 7, and set . Choosing and using Claims 18 and 19 we get that
[TABLE]
By Claim 18 we have
[TABLE]
for sufficiently large. Thus, using Claim 15 we can conclude that
[TABLE]
This concludes the proof of Lemma 7.
6 Connections with chromatic polynomials of circulant graphs
As we have already seen in the proof of Claim 11, the multi-graph associated with the variables in row of is the circulant graph , and the same holds for the variables in row if we consider the associated graph and not the associated multi-graph. Furthermore, we can express the probability of synchronization of circular automata in terms of chromatic polynomials of circulant graphs: this is a consequence of the close connection of the moments of to chromatic polynomials of circulant graphs. We formalize this in the following results.
Definition*.*
The circulant graph is a graph with vertex set where two vertices are adjacent if \big{|}r-s\big{|}_{n}\in\{i_{1},i_{2},\ldots,i_{k}\}.
Definition*.*
Let be a graph with vertex set . The chromatic polynomial of is defined by
[TABLE]
Remark 21*.*
Let be of order . Then where (see, for instance, [Fengming et al., 2005]).
Claim 22**.**
Let and be as in Lemma 7. Then
[TABLE]
and
[TABLE]
where is the chromatic polynomial of the circulant graph and is the chromatic polynomial of the circulant graph .
Remark 23*.*
It is easy to derive that where , because is a collection of many disjoint cycles of length [Boesch and Tindell, 1984]. With this explicit expression, an easy corollary of Claim 22 is the estimate .
We could not find an explicit expression for . The calculation of the chromatic number of circulant graphs with an arbitrary number of parameters is an NP-Hard problem [Codenotti et al., 1998]. This implies that the calculation of chromatic polynomials of circulant graphs is also NP-Hard since
– we believe that our unfruitful attempts to estimate are connected to this. To circumvent these issues, the variables and in Section 5 were introduced.
Proof of Claim 22.
Let us recall that , where
[TABLE]
Then , where
[TABLE]
We observe that if and only if every two numbers at cyclic distance have different images under and otherwise. If we consider as a random coloring of , then if and only if is properly colored by . Thus
[TABLE]
In a similar way
[TABLE]
Therefore
[TABLE]
as well as
[TABLE]
and
[TABLE]
Plugging the two previous equations into
[TABLE]
yields Claim 22. ∎
We get the following relation between chromatic polynomials of circulant graphs and synchronization of circular automata. The number in the statement of Theorem 24 has the approximate value .
Theorem 24**.**
Let be a circulant graph as introduced in Section 2. Let , then there exist such that for all it holds that
[TABLE]
where is as given in Claim 22.
Proof.
[TABLE]
for all and large enough, where . Using the expression for in Remark 23 together with the inequality , we bound from above
[TABLE]
and thus
[TABLE]
Using Equation 32 and the equation from Claim 22 we get that
[TABLE]
By Chebyshev’s inequality and elementary manipulations, we get that
[TABLE]
for all . Let . Setting and noting that for large enough, we get that
[TABLE]
for sufficiently large, where . Using the previous inequalities, we conclude that
[TABLE]
for large enough where the relations and are valid when and is large enough. ∎
Actually, we formulate the following conjecture:
Conjecture 25*.*
To prove this conjecture it is sufficient to prove that there is such that for all . From (32) we see that for all , therefore the first part of the sum of given in Claim 22 is . The second part of the sum has a quadratic number of elements of the form , and it can be bounded by if the assumption for all is true, making In particular, a positive answer to this chromatic-polynomial question would give an alternative proof of Theorem 4.
7 Future work
Let be an automaton where is fixed and . These are natural lines of research to extend/improve the results in this paper:
We want to explore in more detail the strengths and limitations in the ideas presented in this paper. For example, we think that these ideas can extend Theorem 4 to the case where is in the form of a finite number of pairwise disjoint cycles of almost-equal length. We also think that (probabilistic) upper bounds for the length of the synchronizing minimal words can be given with our techniques, in the spirit of the results of [Nicaud, 2019].
Theorem 3 has a decay rate in . We believe that this can be extended in a weaker form to the case of circular automata of composite order:
Conjecture 26*.*
[TABLE]
for some , as .
Acknowledgments
CA acknowledges the financial support of the Austrian Science Fund (FWF), projects F-5512, I-3466 and Y-901. DD, AG and AR acknowledge the financial support of the FWF project P29355-N35. AR acknowledges also the partial support of the FWF project P25510-N26. We want to thank two anonymous referees who read our paper very carefully, and whose suggestions greatly helped us to improve the presentation of this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Ananichev et al., 2010] Ananichev, D. S., Gusev, V. V., and Volkov, M. V. (2010). Slowly synchronizing automata and digraphs. Co RR , abs/1005.0129.
- 2[Béal et al., 2011] Béal, M.-P., Berlinkov, M. V., and Perrin, D. (2011). A quadratic upper bound on the size of a synchronizing word in one-cluster automata. International Journal of Foundations of Computer Science , 22(02):277–288.
- 3[Berlinkov, 2016] Berlinkov, M. V. (2016). On the probability of being synchronizable. In Algorithms and discrete applied mathematics , volume 9602 of Lecture Notes in Comput. Sci. , pages 73–84. Springer, [Cham].
- 4[Berlinkov and Nicaud, 2018] Berlinkov, M. V. and Nicaud, C. (2018). Synchronizing random almost-group automata. In International Conference on Implementation and Application of Automata , pages 84–96. Springer.
- 5[Boesch and Tindell, 1984] Boesch, F. and Tindell, R. (1984). Circulants and their connectivities. J. Graph Theory , 8(4):487–499.
- 6[Cerny, 1964] Cerny, J. (1964). Poznamka k homogenym eksperimentom s konechnymi automatami. Math.-Fyz. Cas , 14:208–215.
- 7[Codenotti et al., 1998] Codenotti, B., Gerace, I., and Vigna, S. (1998). Hardness results and spectral techniques for combinatorial problems on circulant graphs. Linear Algebra and its Applications , 285(1):123 – 142.
- 8[Dubuc, 1998] Dubuc, L. (1998). Sur les automates circulaires et la conjecture de Černỳ. RAIRO-Theoretical Informatics and Applications , 32(1-3):21–34.
