Reachability Problem in Non-uniform Cellular Automata

Sumit Adak; Sukanya Mukherjee; Sukanta Das

arXiv:1901.08246·cs.CC·January 25, 2019

Reachability Problem in Non-uniform Cellular Automata

Sumit Adak, Sukanya Mukherjee, Sukanta Das

PDF

TL;DR

This paper introduces an algorithm to determine configuration reachability in non-uniform cellular automata, utilizing a reachability tree, with good average performance despite exponential worst-case complexity.

Contribution

It presents the first decision algorithm for the configuration reachability problem in non-uniform cellular automata using a novel reachability tree method.

Findings

01

Algorithm effectively decides reachability in non-uniform CAs.

02

Reachability tree provides a useful characterization tool.

03

Average performance of the algorithm is very good despite exponential worst-case complexity.

Abstract

This paper deals with the CREP (Configuration REachability Problem) for non-uniform cellular automata (CAs). The cells of non-uniform CAs, we have considered here, can use different Wolfram's rules to generate their next states. We report an algorithm which decides whether or not a configuration of a given (non-uniform) cellular automaton is reachable from another configuration. A characterization tool, named Reachability tree, is used to develop theories and the decision algorithm for the CREP. Though the worst case complexity of the algorithm is exponential in time and space, but the average performance is very good.

Tables5

Table 1. Table 1: Rules 9, 170, 195 and 80

Present state	111	110	101	100	011	010	001	000	Rule
(RMT)	(7)	(6)	(5)	(4)	(3)	(2)	(1)	(0)
(i) Next state	0	0	0	0	1	0	0	1	9
(ii) Next state	1	0	1	0	1	0	1	0	170
(iii) Next state	1	1	0	0	0	0	1	1	195
(iv) Next state	0	1	0	1	0	0	0	0	80

Table 2. Table 2: Relationship between i t h superscript 𝑖 𝑡 ℎ i^{th} and ( i + 1 ) t h superscript 𝑖 1 𝑡 ℎ (i+1)^{th} RMTs.

$i^{t h}$ RMT	0	1	2	3	4	5	6	7
${(i + 1)}^{t h}$ RMT	0, 1	2, 3	4, 5	6, 7	0, 1	2, 3	4, 5	6, 7

Table 3. Table 3: An experimental study

Rule Vector	CA Size	Source	Destination	Reachable	Decision	Remarks
		$S$	$D$	or Not	Level
$⟨$ 8, 58, $ℛ_{2}$ , $ℛ_{3}$ ,	$n$ $\geq$ $2$	$10$	$11$	Not	1	satisfies
$\dots$ , $ℛ_{n - 1}$ $⟩$		${(0 + 1)}^{n - 2}$	${(0 + 1)}^{n - 2}$	reachable		Condition 1
$⟨$ 10, 164, $ℛ_{2}$ , $ℛ_{3}$ ,	$n$ $\geq$ $2$	$00$	$10$	Not	1	satisfies
$\dots$ , $ℛ_{n - 1}$ $⟩$		${(0 + 1)}^{n - 2}$	${(0 + 1)}^{n - 2}$	reachable		Condition 2
$⟨$ 7, 72, 254, $ℛ_{3}$ ,	$n$ $\geq$ $3$	$111$	$011$	Not	2	no path
$ℛ_{4}$ , $\dots$ , $ℛ_{n - 1}$ $⟩$		${(0 + 1)}^{n - 3}$	${(0 + 1)}^{n - 3}$	reachable		exists
$⟨$ 15, 213, 5, 196, 124,	$n$ = $10$	01001	11001	Reachable	9	satisfies
243, 218, 99, 184, $85$ $⟩$		00101	11011			Theorem IV.1

Table 4. Table 4: Experimental results

CA size	10	20	30	40	50	60	70	80	90	100
Average number of
edges to be	50	344	1085	2428	4536	7612	11704	17012	23742	31923
explore

Table 5. Table 5: Rate of growth with respect to CA size

CA size	10	20	30	40	50	60	70	80	90	100
Explored edges	50	344	1085	2428	4536	7612	11704	17012	23742	31923
Rate of growth ( $a$ )	-	2.78	2.83	2.80	2.80	2.84	2.79	2.80	2.83	2.81

Equations16

\overline{X_{1}} = \overline{X_{1}}

\overline{X_{1}} = \overline{X_{1}}

\overline{X_{2}} = \frac{X _{1} + X _{2}}{2} = \frac{1}{2} \overline{X_{1}} + \frac{1}{2} \overline{X_{2}}

\overline{X_{3}} = \frac{X _{1} + X _{2} + X _{3}}{3} = \frac{2}{3} \frac{( X _{1} + X _{2} )}{2} + \frac{1}{3} \overline{X_{3}} = \frac{2}{3} \overline{X_{2}} + \frac{1}{3} \overline{X_{3}}

\dots

\overline{X_{k}} = \frac{k - 1}{k} \overline{X_{k - 1}} + \frac{1}{k} \overline{X_{k}}

n_{2} = \frac{S _{1}^{2}}{C μ _{1}^{2}} (1 + 8 C + \frac{S _{1}^{2}}{n _{1} μ _{1}^{2}} + \frac{2}{n _{1}})

n_{2} = \frac{S _{1}^{2}}{C μ _{1}^{2}} (1 + 8 C + \frac{S _{1}^{2}}{n _{1} μ _{1}^{2}} + \frac{2}{n _{1}})

C = \frac{r ^{2}}{t ^{2}}

C = \frac{r ^{2}}{t ^{2}}

m_{0} = \frac{t ^{2} S _{2}^{2}}{r ^{2} μ _{2}^{2}}

m_{0} = \frac{t ^{2} S _{2}^{2}}{r ^{2} μ _{2}^{2}}

m = \frac{m _{0}}{1 + \frac{m _{0}}{N}}

m = \frac{m _{0}}{1 + \frac{m _{0}}{N}}

a \approx \frac{lo g ( e _{2} / e _{1} )}{lo g ( n _{2} / n _{1} )}

a \approx \frac{lo g ( e _{2} / e _{1} )}{lo g ( n _{2} / n _{1} )}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Reachability Problem in Non-uniform Cellular Automata

Sumit Adak

Department of Information Technology

Indian Institute of Engineering Science and Technology

Shibpur

Howrah-711103

India

[email protected]

Sukanya Mukherjee

Department of Computer Science and Engineering

Institute of Engineering and Management

Kolkata

West Bengal 700091

India

[email protected]

Sukanta Das

Department of Information Technology

Indian Institute of Engineering Science and Technology

Shibpur

Howrah-711103

India

[email protected]

Abstract

This paper deals with the CREP (Configuration REachability Problem) for non-uniform cellular automata (CAs). The cells of non-uniform CAs, we have considered here, can use different Wolfram’s rules to generate their next states. We report an algorithm which decides whether or not a configuration of a given (non-uniform) cellular automaton is reachable from another configuration. A characterization tool, named Reachability tree, is used to develop theories and the decision algorithm for the CREP. Though the worst case complexity of the algorithm is exponential in time and space, but the average performance is very good.

keywords:

Non-uniform Cellular Automata (CAs), reachability tree, link, rule, rule min term (RMT).

Reachability Problem in Non-uniform Cellular Automata

I Introduction

Cellular automata (CAs) are discrete dynamical systems which produce complex global behaviour using simple local computation [11, 16]. The Configuration REachability Problem (CREP) in CAs asks to decide whether a (destination) configuration $D$ of a given cellular automaton (CA) is reachable from another (source) configuration $S$ of the CA [3]. The CREP is undecidable for 1-d infinite CAs [14], so researchers considered this problem for finite CAs [14, 3]. CREP is P-complete, NP-complete and PSPACE-complete depending on the types of CAs [14]. It has also been shown that CREP is NP-intermediate for the CAs with additive rules [3]. Wolfram’s rule 90, for example, is an additive rule [9], and so to decide reachability of $D$ of rule 90 CA with $n$ cells from $S$ , we need superpolynomial time.

However, all the works on CREP consider the classical CAs, where the cells follow same next state function (that is, $rule$ ) to generate their next states. In recent time, a new class of CAs, known as non-uniform CAs, are under the focus of CAs research where the cells of a CA can follow different next state functions [2, 8, 13]. Obviously, classical CAs are proper subset of these non-uniform CAs. Primary focus of the non-uniform CA research was on the one-dimensional CAs, where the cells follow Wolfram’s CA rules [2]. Researchers already studied the reachability problem [14, 3] for finite classical CAs. However, for non-linear non-uniform CAs, there is no method to deal with the reachability problem. In this work, we propose a method to deal with the reachability problem for 1-d finite non-uniform CAs.

We use here a characterization tool, named Reachability tree, to discover the properties of non-uniform CAs. An algorithm to decide reachability of $D$ from $S$ of a given $n$ -cell non-uniform CA is reported. The algorithm can obviously deal with classical CAs as well. Worst case time complexity of the algorithm, however, is exponential, because CREP is itself PSPACE-complete [3]. But, the average case time requirement of the algorithm is polynomial.

To understand average case performance, we conduct an experimentation. And through experimentation, we determine that the average case complexity of the algorithm is $O(n^{3})$ , where $n$ is the size of automaton.

Hereafter, by “CA”, we will mean “non-uniform” CA. We next proceed with some useful definitions about CAs.

II Definitions

The CAs, we consider here, consist of a finite number of cells which are organized as a 1-dimensional lattice $\mathcal{L}$ . The cells can be in state 0 or state 1. A configuration or (global) state of the CA is a mapping $c$ : $\mathcal{L}$ $\mapsto$ $\{0,1\}$ . Let us consider that $\mathcal{C}$ is the collection of all possible configurations of an $n$ -cell CA (that is $|\mathcal{C}|$ = $2^{n}$ ). Then, a CA is a function $F$ : $\mathcal{C}$ $\rightarrow$ $\mathcal{C}$ , which satisfies the following conditions: $y=F(x)$ , $x,y\in{\mathcal{C}}$ , where $x=(x_{i})_{0\leq i\leq{n-1}}$ , $y=(y_{i})_{0\leq i\leq{n-1}}$ and $y_{i}=f_{i}(x_{i-1},x_{i},x_{i+1})$ . The $f_{i}:\{0,1\}^{3}\mapsto\{0,1\}$ is a next state function for the cell $i$ . In this work, we consider null boundary condition where left and right neighbors of cell 0 and cell $n-1$ are always in state 0. That is, $y_{0}=f_{0}(0,x_{1},x_{2})$ and $y_{n-1}=f_{i}(x_{n-2},x_{n-1},0)$ .

The next state function $f_{i}$ can be expressed in tabular form (Table 1). Decimal equivalents of 8-next states are conventionally called as “rule” ( $\mathcal{R}_{i}$ ) [15]. We name each of the 8 combinations of $x_{i-1}$ , $x_{i}$ and $x_{i+1}$ as Rule Min Term (RMT), which is generally presented in its decimal equivalent. The 001 of the first row of Table 1 is the RMT 1, next state against which is 0 for rule 9, 1 for rule 170. If $r$ is an RMT of ${\mathcal{R}_{i}}$ , we write ${\mathcal{R}_{i}}[r]$ to denote its next state. Hence, 9[1]=0, 170[1]=1 (see Table 1).

Now, we introduce a set $Z_{8}^{i}$ that contains the valid RMTs of ${\mathcal{R}_{i}}$ . That is, $Z_{8}^{i}=\{k\leavevmode\nobreak\ |$ RMT $k$ of ${\mathcal{R}_{i}}$ is valid}. Generally, $|Z_{8}^{i}|=8$ . However, only four RMTs are valid for the first and last rules of a null boundary CA, and $Z^{0}_{8}=\{0,1,2,3\}$ and $Z^{n-1}_{8}=\{0,2,4,6\}$ .

Traditionally, the cells of a CA follow same rule. Such a CA is uniform CA. In a non-uniform CA, the cells may follow different rules. We, therefore, need a rule vector ${\mathcal{R}}=\langle{\mathcal{R}_{0}},\leavevmode\nobreak\ {\mathcal{R}_{1}},\cdots,{\mathcal{R}_{i}},\cdots,{\mathcal{R}_{n-1}}\rangle$ to define an $n$ -cell non-uniform CA, where the cell $i$ follows ${\mathcal{R}_{i}}$ . The uniform CA, hence, is a special case of non-uniform CA; where ${\mathcal{R}_{0}}={\mathcal{R}_{1}}=\cdots={\mathcal{R}_{i}}=\cdots={\mathcal{R}_{n-1}}$ .

State Transition Diagram: The sequence of configurations or states of a CA generated (state transitions), during its evolution (with time), directs the CA behaviour. The state transition diagram of an automaton shows the transition of states, and depicts the relations among states of the automaton. As a proof of concept, Fig. 1 shows the state transition diagram of a 4-cell CA $\langle$ 9, 170, 195, 80 $\rangle$ . In this work, however, we have used the terms “configuration” and “state of a CA” interchangeably.

Definition 1

A state $c\in\mathcal{C}$ of a CA is reachable if there exists at least one state $x\in\mathcal{C}$ so that $c=F(x)$ . If no such $x$ exists, $c$ is non-reachable.

For example, state 0011 of Fig. 1 is non-reachable whereas state 1101 is reachable.

Definition 2

A state of a CA $D$ * is reachable from $S$ , $D,S\in\mathcal{C}$ , if there exists a finite $t\in\mathbb{N}$ so that $D=F^{t}(S)$ . If no such $t$ exists, then $D$ is not reachable from $S$ **.*

For example, state 1010 of Fig. 1 is reachable from the state 0100. However, 0100 is not reachable from the state 1010. Please note here that “ $D$ is not reachable from $S$ ” does not necessarily imply that “ $D$ is non-reachable”. $D$ may be reachable from other configuration, but not from $S$ .

RMT Sequence (RS): A CA state can also be viewed as a sequence of RMTs. For example, the state 0101 in null boundary condition can be viewed as $\langle 1252\rangle$ , where 1, 2, 5 and 2 are the RMTs on which the transition of first, second, third and fourth cells can be made. For an $n$ -bit state, we get a sequence of $n$ RMTs. Obviously, two consecutive RMTs in an RS, $r_{i}$ and $r_{i+1}$ are related, and $r_{i+1}$ = $2r_{i}$ or $2r_{i}+1\pmod{8}$ (Table 2).

Definition 3

Two RMTs $r$ and $s$ ( $r\neq s$ ) are said to be equivalent to each other if $2r\equiv 2s\pmod{8}$ . [6]

Definition 4

Two RMTs $r$ and $s$ ( $r\neq s$ ) are said to be sibling to each other if $\lfloor r/2\rfloor$ = $\lfloor s/2\rfloor$ . [6]

Therefore, RMT 2 is equivalent to RMT 6, whereas RMTs 2 and 3 are sibling to each other.

Now to decide whether a configuration or a state $D$ of a (non-uniform) CA reachable from another configuration $S$ , we next introduce a tool, named reachability tree.

III Reachability Tree and Configuration Tracing

Reachability Tree [7, 1], a characterization tool for 1-dimensional CA, is a rooted and edge-labelled binary tree that represents the reachable states of a CA. For an $n$ -cell CA, there are $n+1$ levels - root at level 0, and leaves at level $n$ . We represent a node of the tree by $N_{i.j}$ , where $i$ ( $0\leq i\leq n$ ) is the level index, and $j$ ( $0\leq j\leq 2^{i}-1$ ) is the node number at $i^{th}$ level. The numbering of nodes in each level starts from left side. In the reachability tree, the nodes are the subset of RMTs of rules – $N_{i.j}\subseteq Z_{8}^{i}$ .

The root is formed with the RMTs of ${\mathcal{R}}_{0}$ , the nodes of level $(n-1)$ are formed with RMTs of ${\mathcal{R}}_{n-1}$ , and the leaf nodes are empty. We represent an edge of the tree by $E_{i.j}$ , where $i$ ( $0\leq i\leq n-1$ ) is the level index, and $j$ ( $0\leq j\leq 2^{i+1}-1$ ) is the edge number at $i^{th}$ level. Here, we define the level of an edge. An edge is said to be edge of $i^{th}$ level, if it connects the nodes of $i^{th}$ and $(i+1)^{th}$ levels. So, we can write, $E_{i.2j}=(N_{i.j},N_{i+1.2j},l_{i.2j})$ and $E_{i.2j+1}=(N_{i.j},N_{i+1.2j+1},l_{i.2j+1})$ ( $0\leq i\leq n-1$ , $0\leq j\leq 2^{i}-1$ ), where $l_{i.2j}\subseteq N_{i.j}$ and $l_{i.2j+1}\subseteq N_{i.j}$ are the labels of the edges, and $l_{i.2j}\cup l_{i.2j+1}=N_{i.j}$ . If $l_{i.k}=\varnothing$ for any $k$ , the edge $E_{i.k}$ (hence, $N_{i+1.k}$ ) does not exit. We call such an edge as a non-reachable edge. However, for each $r\in l_{i.2j}$ (resp. $r\in l_{i.2j+1}$ ), RMT $r$ of ${\mathcal{R}}_{i}$ is 0 (resp. 1) and we get two RMTs $2r\pmod{8}$ and $2r+1\pmod{8}$ of ${\mathcal{R}}_{i+1}$ in $N_{i+1.2j}$ (resp. $N_{i+1.2j+1}$ ), and the edge is called 0-edge (resp. 1-edge). Following is the formal definition of the reachability tree.

Definition 5

Reachability tree of an $n$ -cell CA with rule vector $\langle$${\mathcal{R}}_{0}$ , ${\mathcal{R}}_{1}$ , $\cdots$ , ${\mathcal{R}}_{i}$ , $\cdots$ , ${\mathcal{R}}_{n-1}$$\rangle$ under null boundary condition is a rooted and edge-labelled binary tree with $n+1$ levels, where $E_{i.2j}=(N_{i.j},N_{i+1.2j},l_{i.2j})$ and $E_{i.2j+1}=(N_{i.j},N_{i+1.2j+1},l_{i.2j+1})$ are the edges between nodes $N_{i.j}\subseteq Z_{8}^{i}$ and $N_{i+1.2j}\subseteq Z_{8}^{i+1}$ with label $l_{i.2j}\subseteq N_{i.j}$ , and between nodes $N_{i.j}$ and $N_{i+1.2j+1}\subseteq Z_{8}^{i+1}$ with label $l_{i.2j+1}\subseteq N_{i.j}$ respectively $(0\leq i\leq n-1$ , $0\leq j\leq 2^{i}-1)$ . Following are the relations which exist in the tree:

[For root] $N_{0.0}=Z_{8}^{0}=\{0,1,2,3\}$ . 2. 2.

$\forall r\in N_{i.j}$ , RMT $r$ of ${\mathcal{R}}_{i}$ is in $l_{i.2j}$ (resp. $l_{i.2j+1}$ ), if ${\mathcal{R}}_{i}[r]$ = 0 (resp. 1). That means, $l_{i.2j}\cup l_{i.2j+1}=N_{i.j}$ ( $0\leq i\leq n-1$ , $0\leq j\leq 2^{i}-1$ ). 3. 3.

$\forall r\in l_{i.j}$ , RMTs $2r\pmod{8}$ and $2r+1\pmod{8}$ of ${\mathcal{R}}_{i+1}$ are in $N_{i+1.j}$ ( $0\leq i\leq n-3$ , $0\leq j\leq 2^{i+1}-1$ ). 4. 4.

[For level $n-1$ ] $\forall r\in l_{n-2.j}$ , RMT $2r\pmod{8}$ of ${\mathcal{R}}_{n-1}$ is in $N_{i+1.j}$ ( $0\leq j\leq 2^{n-1}-1$ ). 5. 5.

[For level $n$ ] $N_{n.j}=\varnothing$ , for any $j$ , $0\leq j\leq 2^{n}-1$ .

Fig. 2 is the reachability tree of the CA of Fig. 1. According to the null boundary condition, only 4 RMTs (0, 1, 2 and 3) of ${\mathcal{R}}_{0}$ are valid, and so the root is formed with these 4 RMTs. That is, $N_{0.0}=Z_{8}^{0}=\{0,1,2,3\}$ . Similarly, $Z_{8}^{n-1}=Z_{8}^{3}=\{0,2,4,6\}$ and $N_{3.j}\subseteq Z_{8}^{3}$ for all $j$ , $0\leq j\leq 3$ . However, the label of edge $E_{0.1}$ is {0, 3}, as RMTs 0 and 3 of rule 9 are 1. We write RMTs of a label on the edge. Note that, the label of $E_{3.1}$ is empty, that is, $l_{3.1}=\varnothing$ . This edge is non-reachable, and it can not connect any node of next level. Fig. 2 uses dotted line for them. Since $Z_{8}^{n}=\varnothing$ for an $n$ -cell CA, the leaves are empty. The number of leaves (excluding dotted leaves) in Fig. 2 is 8, which is the number of reachable states. We call edge $E_{i.j}$ as 0-edge when $j$ is even, and 1-edge otherwise. We further call the edge $E_{i.j}$ as an edge of level $i$ . A sequence of edges from the root to a leaf node represents a reachable state, when 0-edge and 1-edge are replaced by 0 and 1 respectively. For example, 0000 is a reachable state in Fig. 2, but the state 0001 is non-reachable.

From the reachability tree, we can get the information about reachable and non-reachable states. A sequence of edges $\langle$$E_{0.j_{0}}$ $E_{1.j_{1}}$ $\cdots$ $E_{i.j_{i}}$ $E_{i+1.j_{i+1}}$ $\cdots$ $E_{n-1.j_{n-1}}$$\rangle$ from root to a leaf associates a reachable state and at least one RS $\langle r_{0}r_{1}\cdots r_{i}r_{i+1}\cdots r_{n-1}\rangle$ , where $r_{i}\in l_{i.j_{i}}$ and $r_{i+1}\in l_{i+1.j_{i+1}}$ ( $0\leq i<n-1$ , $0\leq j_{i}\leq 2^{i}-1$ , and $j_{i+1}=2j_{i}$ or $2j_{i}+1$ ). That is, the sequence of edges represents at least two CA states. Note that if RMT $r_{i}$ is 0 (resp. 1) then $E_{i.j_{i}}$ is 0-edge (resp. 1-edge). Therefore, the reachable state is the next (resp. present) state of the state (resp. predecessor), represented as RS. Interestingly, there are $2^{n}$ RSs in the tree, but number of reachable states may be less than $2^{n}$ . A sequence of edges may associate $m$ -number of RSs ( $m\geq 1$ ), which implies, this state is reachable from $m$ -number of different states.

Obtaining only reachable or non-reachable states using reachability tree is not enough to make the decision about reachability of one state from another. We need to find out the predecessor(s) of each state in the reachability tree. Then only we can trace in the tree if a CA state $D$ is reachable from another state $S$ . However, the tree guides us to find the predecessors of the CA states by establishing relation among edges. To find the relations among the edges, we introduce the concept of “link” in the next section.

III.1 Links

As we have discussed before, a CA state/configuration can be represented as a bit sequence, and as an RMT sequence. Reachability tree uses both the representations - bit sequences to represent the reachable states, and RMT sequences to represent their predecessors. Now, the predecessors, which are also CA states, can be observed in the tree as bit sequence. Intuitively, the “links” link the states represented as bit sequences to their predecessor.

The links are formed for each RMT $r\in l_{i.j}$ , present on edge $E_{i.j}$ ( $0\leq i\leq n-1$ , $0\leq j\leq 2^{i+1}-1$ ). By the processing of reachability tree, we find the links among the edges for each individual RMT on the tree. The links are formed depending on whether the RMTs are self replicating (defined below) or not.

Definition 6

An RMT $r=4x+2y+z$ of a rule ${\mathcal{R}_{i}}$ is said to be self replicating if ${\mathcal{R}_{i}}[r]=y$ where $x,y,z\in\{0,1\}$ .

For example, RMT 1 (001) and RMT 3 (011) of rule 9 is self replicating, whereas RMTs 4, 5, 6 and 7 of rule 195 are self replicating (see Table 1). If an RMT $r\in l_{i.j}$ is not self replicating, then there is a link from the edge $E_{i.j}$ to $E_{i.k}$ ( $j\neq k$ ). Depending on the values of $j$ and $k$ , we can classify the links in the following way: forward link (when $j<k$ ), backward link (when $j>k$ ) and self link (when $j=k$ ). We represent this link as $E_{i.j}(r)\longrightarrow E_{i.k}$ . The rules, followed to form links in a reachability tree, are noted below:

R1) If RMT $r\in l_{0.j}$ is self replicating ( $j=0$ or $1$ ), the edge $E_{0.j}$ is self linked for RMT $r$ . Otherwise, if $j=0$ , there is a forward link from $E_{0.0}$ to $E_{0.1}$ for RMT $r$ ; else, there is a backward link from $E_{0.1}$ to $E_{0.0}$ for RMT $r$ .

R2) If $E_{i-1.j}$ is self linked for RMT $r\in l_{i-1.j}$ , and if $s$ is self replicating where $s\in l_{i.2j}$ (resp. $s\in l_{i.2j+1}$ ) is $2r$ or $2r+1\pmod{8}$ , then $E_{i.2j}$ (resp. $E_{i.2j+1}$ ) is self linked. But if $s$ is not self replicating, then there is a forward link from $E_{i.2j}$ to $E_{i.2j+1}$ (resp. backward link from $E_{i.2j+1}$ to $E_{i.2j}$ ).

R3) If there is a link from $E_{i-1.j}$ to $E_{i-1.k}$ ( $j\neq k$ ) for RMT $r\in l_{i-1.j}$ , and $s\in l_{i.2j}$ (resp. $s\in l_{i.2j+1}$ ) is $2r$ or $2r+1\pmod{8}$ , then there is a link from $E_{i.2j}$ (resp. $E_{i.2j+1}$ ) to $E_{i.2k}$ while $s\in\{0,1,4,5\}$ or to $E_{i.2k+1}$ while $s\in\{2,3,6,7\}$ . It is forward link if $j<k$ , backward link if $j>k$ .

Example III.1

Fig. 3 shows the links of edges caused by RMTs of the CA $\langle$ 9, 170, 195, 80 $\rangle$ . There is a (forward) link from $E_{0.0}$ to $E_{0.1}$ for RMT 2, so we write the link within a bracket beside the RMT 2. Now, we get a forward link from $E_{1.1}$ to $E_{1.2}$ for RMT 5. Now, we get $E_{2.2}(2)$ $\rightarrow$ $E_{2.5}$ , and $E_{3.5}(4)$ $\rightarrow$ $E_{3.10}$ . Therefore, for the RS $\langle 2524\rangle$ , we can get a sequence of links, hence a sequence of edges $\langle E_{0.1}E_{1.2}E_{2.5}E_{3.10}\rangle$ , which represents 1010. Note that the RS $\langle 2524\rangle$ corresponds to the state 1010. The sequence $\langle E_{0.0}E_{1.1}E_{2.2}E_{3.5}\rangle$ associates the state 0101, as well as the RS $\langle 2524\rangle$ . The RS $\langle 2524\rangle$ , hence the state 1010, is the predecessor of the state 0101. See Fig. 1 for verification.

The links help us to trace state transitions in reachability tree by identifying the predecessor(s) of each state. Through the links, we can identify the predecessor of predecessor of a state. If $E_{i.j}$ is linked with $E_{i.k}$ for RMT $r_{1}\in l_{i.j}$ , and $E_{i.k}$ is linked with $E_{i.p}$ ( $0\leq j<k<p\leq 2^{i}-1$ for forward link, $2^{i}-1\geq j>k>p\geq 0$ for backward link) for RMT $r_{2}\in l_{i.k}$ , we say there exists a link (forward or backward) from $E_{i.j}$ to $E_{i.p}$ , where $1\leq i\leq n-1$ . Therefore, we get the following property (transitivity property) of the links. We write $E_{i.j}(r_{1})\rightarrow E_{i.k}$ , if there is a link from $E_{i.j}$ to $E_{i.k}$ for RMT $r_{1}\in l_{i.j}$ .

•

If $E_{i.j}(r_{1})\rightarrow E_{i.k}$ and $E_{i.k}(r_{2})\rightarrow E_{i.p}$ , then

•

$E_{i.j}(r_{1})\rightarrow E_{i.k}(r_{2})\rightarrow E_{i.p}$ .

Now, we define length of the links. If from edge $E_{i.j_{1}}$ to $E_{i.j_{2}}$ , there are $k$ number of RMTs (or $k$ number of edges), then we write: $length(E_{i.j_{1}},E_{i.j_{2}})=k$ . We write, $length(E_{i.j_{1}},E_{i.j_{2}})=\infty$ if there is no link between $E_{i.j_{1}}$ and $E_{i.j_{2}}$ . In Fig. 3, following connection between $E_{1.0}$ and $E_{1.3}$ exists: $E_{1.0}(4)\rightarrow E_{1.2}(6)\rightarrow E_{1.3}$ . That is, $length(E_{1.0},E_{1.3})=2$ .

Lemma III.2

There exist only two links to $E_{i.j}$ from any one or two edges for RMTs $r$ and $s$ when $0\leq i<n-1$ and $r$ and $s$ are sibling to each other, and only one link when $i=n-1$ in a reachability tree ( $0\leq j\leq 2^{i+1}-1$ ). [1]

Property 1

A link present at $i^{th}$ level triggers two links at level $i+1$ , where $0\leq i\leq n-3$ , a link of $(n-2)^{th}$ level derives one link at $(n-1)^{th}$ level.

This is obvious, because an RMT $r$ at a node/label of level $i$ contributes two RMTs - $2r\pmod{8}$ and $2r+1\pmod{8}$ in node/label(s) of level $i+1$ . Both the RMTs participate in links, depending upon the link caused by RMT $r$ . For example, the link $E_{0.1}(0)\rightarrow E_{0.0}$ triggers two links $E_{1.2}(0)\rightarrow E_{1.0}$ and $E_{1.3}(1)\rightarrow E_{1.0}$ . However, a link at level $n-2$ triggers only one link at last level, as RMT $2r+1\pmod{8}$ is invalid in that level.

Let us now define path between two edges of a level - $E_{i.j_{1}}$ and $E_{i.j_{k}}$ . We say that there exists a path between $E_{i.j_{1}}$ and $E_{i.j_{k}}$ if $E_{i.j_{1}}$ is linked to $E_{i.j_{k}}$ , that is, if $length(E_{i.j_{1}},E_{i.j_{k}})$ is finite. Otherwise, there is no path between $E_{i.j_{1}}$ and $E_{i.j_{k}}$ . If a path exists, we write it as the following: $E_{i.j_{1}}(r_{1})$ $\rightarrow$ $E_{i.j_{2}}(r_{2})$ $\rightarrow$ $\cdots$ $\rightarrow$ $E_{i.j_{k}}$ . Now, the question is, can we say that there exist a path between $E_{i+1.p}$ and $E_{i+1.q}$ where $p\in\{2j_{1},2j_{1}+1\}$ and $q\in\{2j_{k},2j_{k}+1\}$ ? No, not always. Following Property 1, if a path is formed from $E_{i+1.p}$ and $E_{i+1.q}$ due to the path between $E_{i.j_{1}}$ and $E_{i.j_{k}}$ , we say the path between $E_{i+1.p}$ and $E_{i+1.q}$ is triggered by the path between $E_{i.j_{1}}$ and $E_{i.j_{k}}$ . However, no path may be triggered at level $i+1$ . Obviously, a path from $E_{n-1.j_{1}}$ to $E_{n-1.j_{k}}$ is triggered by the paths above.

Example III.3

In Fig. 3, following path is formed at level [math], which and triggers a path at level $3$ : $E_{0.1}(0)\rightarrow E_{0.0}$ , $E_{1.2}(0)\rightarrow E_{1.0}$ , $E_{2.5}(0)\rightarrow E_{2.0}$ , $E_{3.10}(0)\rightarrow E_{3.0}$ .

Now, we explore the reachability tree to check that if there exists any path or not from destination edge ( $D$ ) to source edge ( $S$ ) at leaf level.

IV Reachability Analysis

To check whether a configuration $D$ of an $n$ -cell CA is reachable from another configuration $S$ , we rewrite the configurations as following: $S=(s_{i})_{0\leq i\leq n-1}$ and $D=(d_{i})_{0\leq i\leq n-1}$ . The configurations can also be identified in the reachability tree as sequences of edges. For ease of understanding, let us rename the sequences of edges as $(s_{i})_{0\leq i\leq n-1}$ representing $S$ , and as $(d_{i})_{0\leq i\leq n-1}$ representing D. Now, we search in the reachability tree for a path from $d_{i}$ to $s_{i}$ . If no path exists, we declare that $D$ is not reachable from $S$ .

Theorem IV.1

For an $n$ -cell CA, $D$ is reachable from $S$ , if and only if there exists a path from $d_{n-1}$ to $s_{n-1}$ .

Proof IV.2

Let us consider, there is a path from $d_{n-1}$ to $s_{n-1}$ at leaf level of length $m$ : $E_{n-1.j_{1}}(r_{1})$ $\rightarrow$ $\cdots$ $\rightarrow$ $E_{n-1.j_{q}}(r_{q})$ $\rightarrow$ $\cdots$ $\rightarrow$ $E_{n-1.j_{m}}$ where $d_{n-1}$ = $E_{n-1.j_{1}}$ and $s_{n-1}$ = $E_{n-1.j_{m}}$ . Now, we can proof $D$ is reachable from $S$ . Hence, we can get a sequence of edges from root to $E_{n-1.k}$ for each $k\in\{j_{1},j_{2},\cdots,j_{m}\}$ which represents a reachable state. Here, two reachable states which are represented by edge sequences that end with $E_{n-1.j_{p}}$ and $E_{n-1.j_{p+1}}$ respectively are two consecutive states. Hence, we can get a sequence of consecutive states. Since there is a path, the sequence of states forms a path involving the RMTs $r_{1},r_{2},\cdots,r_{m-1}$ . Hence, $D$ is reachable from $S$ .

Now suppose, $D$ is reachable from $S$ . Obviously, there is a path from $d_{i}$ to $s_{i}$ , $0\leq i\leq n$ . Hence the proof.

Example IV.3

Suppose, $S=0000$ and $D=0101$ for the CA $\langle 9,170,195,80\rangle$ . Now, from Fig. 3, we see that $d_{3}$ = $E_{3.5}$ and $s_{3}$ = $E_{3.0}$ . From the linked tree, we can get the path - $E_{3.5}(4)\rightarrow E_{3.10}(0)\rightarrow E_{3.0}$ ( $length(d_{3},s_{3})=2$ ). Therefore, $D$ is reachable from $S$ , and $D=F^{2}(S)$ (check it from Fig. 1).

For the same CA, if $S=0000$ and $D=1101$ , then $d_{3}$ = $E_{3.13}$ and $s_{3}$ = $E_{3.0}$ . From Fig. 3, we can see that there is no path from $d_{3}$ to $s_{3}$ . So, $D$ is not reachable from $S$ .

To decide the reachability, we first form the root of the reachability tree (using $\mathcal{R}_{0}$ ), get edges from the root, identify links between edges following rule R1 of link formation. Then, check if there exist any path from $d_{0}$ to $s_{0}$ . If it exists then we continue, otherwise conclude that $D$ is not reachable from $S$ . If it exists then form the next level (using $\mathcal{R}_{1}$ ) and get the links, and again check whether there exists any path from $d_{1}$ to $s_{1}$ . If no path exists, then $D$ is not reachable from $S$ . Otherwise, continue the same process. Finally, if there exist a path from $d_{n-1}$ to $s_{n-1}$ , then declare that $D$ is reachable from $S$ .

By definition, reachability tree grows exponentially, in general. In this particular problem, however, we do not deal with all the edges. The edges, not in the path of $d_{i}$ and $s_{i}$ , are irrelevant to us. To reduce the number of edges/nodes in the proposed decision procedure, we remove such irrelevant edges.

Example IV.4

Let us consider the CA $\langle 9,170,195,80\rangle$ and $S=1010$ and $D=0000$ . Fig. 4 explains that $D$ is reachable from $S$ . The paths of $d_{i}$ and $s_{i}$ are shown in the figure. The edge $E_{1.3}$ is not in the path of $d_{1}$ and $s_{1}$ , and so it is irrelevant in this particular case. Hence, $E_{1.3}$ is removed, and the corresponding sub tree is not further developed. Similarly, $E_{2.1}$ , $E_{2.3}$ , $E_{2.4}$ , $E_{3.1}$ , etc are irrelevant, and hence removed. Obviously, we need not to deal with a good number of edges/nodes here.

However, we can sometime decide the non-reachability of $D$ from $S$ without tracing path of $d_{i}$ and $s_{i}$ , but by observing some conditions related to $d_{i}$ and $s_{i}$ . We next report these conditions.

Condition 1

For an $n$ -cell CA, if the edge $d_{i}$ is non-reachable where $0\leq i\leq n-1$ , then $S$ to $D$ is not reachable.

**Reason: ** From Theorem IV.1, we know that, $D$ is reachable from $S$ if there exist a path from $d_{n-1}$ to $s_{n-1}$ . And from Property 1, we can say that the path at leaf level is triggered from the root. That is, if there is a path from $d_{n-1}$ to $s_{n-1}$ , then there are the paths from $d_{0}$ to $s_{0}$ , $d_{1}$ to $s_{1}$ , $\cdots$ , and $d_{n-2}$ to $s_{n-2}$ . If at any level, $d_{i}$ is non-reachable, then there is no link from this edge. Hence, there is no path from $d_{i+1}$ to $s_{i+1}$ , $\cdots$ , and $d_{n-1}$ to $s_{n-1}$ . Therefore, there is no path at leaf level and we can conclude that $S$ to $D$ is non-reachable. $\Box$

Example IV.5

Suppose, $S=0000$ and $D=1011$ for the CA $\langle 9,170,195,80\rangle$ . Now, from Fig. 3, we get $d_{0}$ = $E_{0.1}$ and $s_{0}$ = $E_{0.0}$ , and there is a path $E_{0.1}(0)\rightarrow E_{0.0}$ . At the second level, $d_{1}$ = $E_{1.2}$ and $s_{1}$ = $E_{1.0}$ and there is also a path: $E_{1.2}(0)\rightarrow E_{1.0}$ . Now, at the third level, $d_{2}$ = $E_{2.5}$ and $s_{2}$ = $E_{2.0}$ and there is also a path: $E_{2.5}(0)\rightarrow E_{2.0}$ . Now at the leaf level, $d_{3}$ = $E_{3.11}$ and $s_{3}$ = $E_{3.0}$ , but the edge $d_{3}$ is non-reachable edge. So, there is no path from $d_{3}$ to anywhere. Therefore, $D$ is not reachable from $S$ (see Fig. 1).

Condition 2

For an $n$ -cell CA, if the edge $s_{i}$ is self linked for two sibling RMTs and $d_{i}\neq s_{i}$ , then $S$ to $D$ is not reachable ( $0\leq i\leq n-1$ ).

**Reason: ** From Theorem IV.1, we know that, $D$ is reachable from $S$ if there exists a path from $d_{n-1}$ to $s_{n-1}$ , which immediately implies the paths from $d_{0}$ to $s_{0}$ , $d_{1}$ to $s_{1}$ , $\cdots$ , and $d_{n-2}$ to $s_{n-2}$ . From Lemma III.2, we get that there exist two links to $E_{i.j}$ from any edges (except leaf level). If the edge $s_{i}$ is self linked for two RMTs, then no other edge can link to $s_{i}$ . So, we can reach to $s_{i}$ from only the edge $s_{i}$ and if $d_{i}\neq s_{i}$ , there is no path from $d_{i}$ to $s_{i}$ . $\Box$

Example IV.6

Consider, $S=1111$ and $D=0000$ of the CA $\langle 9,170,195,80\rangle$ . Now, from Fig. 3, we get that $d_{0}$ = $E_{0.0}$ and $s_{0}$ = $E_{0.1}$ and there is path $E_{0.0}(2)\rightarrow E_{1.0}$ . Now, at the next level, $d_{1}$ = $E_{1.0}$ and $s_{1}$ = $E_{1.3}$ and there is also have path: $E_{1.0}(4)\rightarrow E_{1.2}(6)\rightarrow E_{1.3}$ . At the next level, $d_{2}$ = $E_{2.0}$ and $s_{2}$ = $E_{2.7}$ and there is no path from $d_{2}$ to $s_{2}$ ( $d_{i}\neq s_{i}$ ). The edge $s_{2}$ is self linked for RMTs 6 and 7 of rule 195. Hence, the edge is not reachable from any other edge.

V Decision Algorithm

Now, we present an algorithm to decide whether $S$ to $D$ is reachable or not. The following algorithm uses the theories framed in the earlier sections, to decide the same. However, the algorithm deals only with the labels of edges. Moreover, the algorithm does not form the whole tree at a time, but it deals with two sets of labels - { $l_{i.0},l_{i.1},\cdots l_{i.2^{i}-1}$ } and { $l_{i+1.0},l_{i+1.1},\cdots l_{i+1.2^{i+1}-1}$ }. We proceed with only non-empty labels, $l_{0},\leavevmode\nobreak\ l_{1},\cdots$ and $l^{\prime}_{0},\leavevmode\nobreak\ l^{\prime}_{1},\cdots$ . Here, $l_{j}$ corresponds to the label of $E_{i.j}$ and $l^{\prime}_{k}$ correspond to the label of $E_{i+1.k}s$ ( $0\leq i\leq n-1$ ). The input of the algorithm is the CA (rule vector), $S$ (Source) and $D$ (Destination). The output is ‘Yes’ if $D$ is reachable from $S$ ; ‘No’ otherwise.

Example V.1

Let us consider the CA $\langle$ 9, 170, 195, 80 $\rangle$ , $S=1010$ and $D=0000$ (Fig.4) as input to Algorithm 1. Here $l^{\prime}_{0}=\{1,2\}$ , $l^{\prime}_{1}=\{0,3\}$ , $s=1$ and $d=0$ . A path from $l^{\prime}_{0}$ to $l^{\prime}_{1}$ exists (Step 7). Since there is no irrelevant label, so $Count=2$ . Next, we get 4 labels (Fig. 3) $l^{\prime}_{0}=\{2,4\}$ , $l^{\prime}_{1}=\{3,5\}$ , $l^{\prime}_{2}=\{0,6\}$ and $l^{\prime}_{3}=\{1,7\}$ (Step 4). Now, $s=2$ and $d=0$ . The conditions of Step 6 are not satisfied, so the algorithm searches for a path from $l^{\prime}_{0}$ to $l^{\prime}_{2}$ . There exists a path involving $l^{\prime}_{0}$ , $l^{\prime}_{1}$ and $l^{\prime}_{2}$ (see Fig.4). Obviously $l^{\prime}_{3}$ is irrelevant in this case. Hence, $Count=3$ (Step 8(b)). Now, we assign the following: $l_{0}\leftarrow l^{\prime}_{0}$ , $l_{1}\leftarrow l^{\prime}_{1}$ , $l_{2}\leftarrow l^{\prime}_{2}$ , and further we update $s=2$ and $d=0$ (Step 9(a)). As a next step, the algorithm finds $l^{\prime}_{0}$ , $l^{\prime}_{1}$ , $\cdots$ , $l^{\prime}_{5}$ (Step 4) and sets $s=5$ and $d=0$ . There exists a path involving $l^{\prime}_{0}$ , $l^{\prime}_{2}$ and $l^{\prime}_{5}$ . So, $l^{\prime}_{1}$ , $l^{\prime}_{3}$ and $l^{\prime}_{4}$ are irrelevant in this case. Hence, $Count=3$ (Step 8(b)). Now, we assign following: $l_{0}\leftarrow l^{\prime}_{0}$ , $l_{1}\leftarrow l^{\prime}_{2}$ , $l_{2}\leftarrow l^{\prime}_{5}$ , and further we update $s=2$ and $d=0$ (Step 9(a)). In this way, the algorithm proceeds, and finally reports “Yes”.

Correctness of Algorithm 1: The correctness of the algorithm is directly connected to the theorems, lemmas and conditions reported before. The algorithm conceptually forms reachability tree for the given CA and finds the links at each level. From the root to leaf, at any level, if the destination edge is non-reachable or the source edge is self linked for two sibling RMTs, then according to Condition 1 or Condition 2, the algorithm terminates with output Non-reachable. At any level, if there does not exist any path, then according to Property 1 and Theorem IV.1, the algorithm terminates with output Non-reachable. Otherwise, it forms a new level and checks the paths. At leaf level, if there exists any path, then according to Theorem IV.1, the algorithm terminates with output Reachable.

Theorem V.2

The upper bound running time of Algorithm 1 is proportional to the number of edges explored by the algorithm.

Proof V.3

Algorithm 1 contains main loop enclosing Steps 4-9. Hence, the time complexity of the algorithm is dependent on the time requirements of the steps. However, Step 4 finds the labels of edges of a level, and Steps 5-9 work on those labels. That is, if $k$ number of labels, hence edges, are explored at Step 4, then the other labels work only with them. Therefore, the upper bound of the time requirement for single execution of Steps 4 to 9 is proportional to $k$ . Now, before halting of the algorithm, it repeatedly explores the edges in each run of the main loop. Hence, upper bound of the running time is proportional to the total number of edges explored by the algorithm.

Worst case analysis: The worst case in Algorithm 1 occurs if $D$ is reachable from $S$ and no labels (hence, edges) can be removed. That is, the reachability tree contains all the possible leaves. In that case, space requirement, which is determined by two arrays - $l_{i}$ and $l^{\prime}_{i}$ , is exponential. The time requirement is then obviously exponential.

However, the algorithm performs well on an average. Because, in many cases, many edges are removed, and before reaching to the leaf of the tree, non-reachability can be decided. A sample result of another experimentation is shown Table 3, which speaks about the fact that in many cases, we need not to deal with all the of a CA. The first rule vector of Table 3 says that if $S=10(0+1)^{n-2}$ and $D=11(0+1)^{n-2}$ , and if first two rules of the CA are 8 and 58, then $D$ is not reachable from $S$ for any value of $n\geq 2$ . Table 3 gives us an idea that reachability can be decided much before than encountering the last rule. To understand the average performance of the algorithm, we have arranged a detailed experimental study which is reported in the next section.

VI Average Case Analysis

We find the upper bound of average running time of Algorithm 1 experimentally. Theorem V.2 points out the fact that the running time of the algorithm is proportional to the number of edges explored in corresponding reachability tree. By the proposed experimentation, we, therefore, find the average number of edges explored by Algorithm 1 for a given CA size. We next proceed with experimental setup.

VI.1 Experimental Setup

In this experiment, we use simple random sampling with replacement to calculate the population mean ( $\mu$ ) [4, 12]. In the estimation process, ${\overline{X_{k}}}$ denotes the mean of $k^{th}$ sample, and ${\widehat{\overline{X_{k}}}}$ denotes the $k^{th}$ estimate to the population mean ( $k\geq 1$ ). Let us consider that the sample size is $m$ . So, ${\overline{X_{k}}}=\frac{1}{m}\sum_{i=1}^{m}{x_{i}}$ , where $x_{i}$ is an element of the population which is chosen randomly and uniformly.

In the experiment, we first find ${\overline{X_{1}}}$ which is considered as the first estimate ${\widehat{\overline{X_{1}}}}$ to population mean ( $\mu$ ). Next we take the second sample of size $m$ , and find ${\overline{X_{2}}}$ . Then, we find the next estimate ${\widehat{\overline{X_{2}}}}$ to $\mu$ in the following way. And, this process continues.

[TABLE]

As the mean of all possible samples’ means is the population mean, the series $({\widehat{\overline{X_{k}}}})_{k\in{\mathbb{N}}}$ approaches to $\mu$ . For our study, population size is normally large. So, neither consideration of all possible samples nor finding of $\mu$ is possible. We, therefore, declare ${\widehat{\overline{X_{k}}}}$ as our final estimate to the population mean if $\frac{|\widehat{\overline{X_{k}}}-\widehat{\overline{X_{k-1}}}|}{\widehat{\overline{X_{k}}}}<\delta$ , where $\delta$ is a small threshold value and specifies the precision we desire to achieve. We consider here $\delta=0.01$ .

Now, fixing of the ‘ $m$ ’ value is another important task of this calculation. Here, we use another statistical method for choosing $m$ . For calculating the sample size ( $m$ ), we first take a random sample of size $n_{1}$ . Then, we find another sample size $n_{2}$ using the following equation [4].

[TABLE]

where $\mu_{1}$ and $S_{1}^{2}$ are the mean and variance of the first sample of size $n_{1}$ , and

[TABLE]

where $t$ is the constant and $r$ is the relative error. For our experimental setup, we consider $t=2$ and $r=0.05$ [4].

As a next step, we randomly and uniformly take the second sample of size $n_{2}$ . Then, we find $\mu_{2}$ and $S^{2}_{2}$ as the mean and variance of the second sample. Using these parameters, we find another sample size $m_{0}$ , which finally leads us to get the ‘m’:

[TABLE]

Now, the desired sample size is calculated as following, where $N$ is the population size.

[TABLE]

VI.2 The Method of Experiment

Though Algorithm 1 is a decision algorithm, a slight modification in Algorithm 1 enables us to get the total number of edges explored by it. To do that, we initialize a variable $Total\_count$ ( $Total\_count\leftarrow 0$ ) in the Step 2 of Algorithm 1, and rewrite the Step 8 as following:

**Step 8:

**(a) Mark the labels ( $l^{\prime}_{j}$ s) which are not in any path, computed in Step 7, as irrelevant.

(b) $Total\_count$ $\leftarrow$ $Total\_count$ $+$ 2*Count.

(c) $Count\leftarrow 2*Count$ $-$ $\#$ $irrelevant$ $labels$ .

So, we just add an extra step (Step 8(b)) in Algorithm 1 to get the number ( $Total\_count$ ) of explored edges. We use this modified algorithm in our experimentation. However, Algorithm 1 demands two input parameters - one is a CA (that is, a rule vector) and the other is a pair of states (source and destination). For the experiment, therefore, we need to find out sample size twice. One for the pairs of states when a CA is given, and the other for the CAs of a given size. Let us consider that $m^{\prime\prime}$ be the number of CAs to be sampled for a given size, and $m^{\prime}$ be the number of pairs to be sampled for a given CA.

Example VI.1

This example illustrates, the calculation of $m^{\prime}$ . Let us consider the 20-cell CA $\langle$ 106, 110, 191, 148, 71, 118, 189, 147, 164, 141, 90, 183, 201, 73, 106, 103, 230, 207, 73, 36 $\rangle$ . To find $m^{\prime}$ , we first randomly choose 500 ( $=n_{1}$ ) pairs of source and destination states. By Algorithm 1, we can calculate $\mu_{1}=359$ (mean of explored edges), $S^{2}_{1}=11025$ . Using the values of mean and variance, we find $n_{2}=138$ (using Equation 1). For the sample size $n_{2}$ , we get the $\mu_{2}=352$ and $S^{2}_{2}=9978$ . Now using Equation 3, we get the value of $m_{0}=129$ . Finally, we get the sample size $m^{\prime}$ , which is also 129 (using Equation 4) where $N=2^{20}$ .

Now, using the value of $m^{\prime}$ and $m^{\prime\prime}$ , we can get the average number of explored edges. For a given CA size, we randomly and uniformly synthesize $m^{\prime\prime}$ number of (non-uniform) CAs, and for each CA we randomly and uniformly choose $m^{\prime}$ number of source and destination pairs. However, for each case, we use the modified Algorithm 1 to get total number of edges explored to decide the reachability. For ease of reference, the method is summarized in Algorithm 2. This method takes the CA size as input, and reports the average number of explored edges. We use this algorithm to get our further results.

VI.3 The Results

Using Algorithm 2, we have extensively experimented with various CA sizes to get the average number of explored edges against a CA size. In Table 4, we report a sample experiment to show the average number of explored edges with respect to the size of automaton. The table points out the fact that with increase of CA size, explored number of edges also increases, but it is not exponential.

Therefore, we need another experiment for finding the rate of growth with respect to CA size. However, the worst case time complexity is exponential for this problem. Therefore, we can compare the average number of explored edges (experimentally) with the worst case of reachability problem for different size of automaton. In Figure 5, we plot the logarithm of number of edges explored against the CA size. The worst case scenario is shown by the dotted line and experimental result is shown by continuous curve in the figure. It is obvious from the graph that the edges explored on average is much less than that on worst case.

VI.4 The Rate of Growth

Experimental results indicate that the rate of growth of average number of explored edges is not exponential. In this sub-section, we find the rate of growth of explored edges to mathematically feel the change in explored edges with respect to the size of automaton. To find the rate of growth, we use the empirical curve bounding technique [10]. Assuming the explored edges ( $e$ ) follows power rule, that is, $e\approx kn^{a}$ [10], the coefficient ‘ $a$ ’ can be found by taking empirical measurements of explored edges $\{e_{1},e_{2}\}$ at some input CA size $\{n_{1},n_{2}\}$ , and calculating $\frac{e_{2}}{e_{1}}\approx(\frac{n_{2}}{n_{1}})^{a}$ . So,

[TABLE]

Now, after taking the value of ‘ $e$ ’ for different size of automaton, we can find rate of growth using the Equation 5. In Table 5, we are showing the rate of growth with respect to CA size.

From the experimentation, we have also observed that the growth rate of explored edges always lies under some upper bound. To represent this fact asymptotically, we are using the big-oh ( $O$ ) notation. From the definition of big-oh ( $O$ ) notation, we can get that for a given function $g(n)$ , $T(n)=O(g(n))$ , if there exist two positive constant $c$ and $n_{0}$ , such that $0\leq T(n)\leq cg(n)$ , for all $n\geq n_{0}$ [5]. As the average number of edges, explored of the non-uniform CAs satisfies the definition of big-oh, so we represent the rate of growth by big-oh notation. From Table 5, we can show that, the value of ‘ $a$ ’ is nearly $3$ for all value of $n$ ( $n$ is the size of automaton). So, we estimate $g(n)=n^{3}$ . Hence, we can say, the average number of edges to be explore of these CAs as $O(n^{3})$ . This is validated in Fig. 6.

VII Conclusion

This paper has presented an in-depth analysis on the non-uniform CAs for reachability problem. The reachability tree has been utilized to develop theories for this class of CAs. We have introduced here a technique to trace the state transition diagram in reachability tree. This technique has helped us to design the decision algorithm for the reachability problem. The average case analysis of our algorithm is done experimentally. The average case performance is $O(n^{3})$ of our algorithm, where the worst case time complexity is exponential.

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Adak, S., Naskar, N., Maji, P., Das, S.: On Synthesis of Non-uniform Cellular Automata Having Only Point Attractors, Journal of Cellular Automata, Special issue on Cellular Automata in Theoretical Computer Science , 12 (1-2), 2016, 81–100.
2[2] Chaudhuri, P. P., Chowdhury, D. R., Nandi, S., Chatterjee, S.: Additive Cellular Automata – Theory and Applications , vol. 1, IEEE Computer Society Press, USA, ISBN 0-8186-7717-1, 1997.
3[3] Clementi, A. E. F., Impagliazzo, R.: The Reachability Problem for Finite Cellular Automata., Inf. Process. Lett. , 53 (1), 1995, 27–31.
4[4] Cochran, W. G.: Sampling Techniques , John Wiley, 1977, ISBN 0-471-16240-X.
5[5] Cormen, T. H., Stein, C., Rivest, R. L., Leiserson, C. E.: Introduction to Algorithms , 2nd edition, Mc Graw-Hill Higher Education, 2001, ISBN 0070131511.
6[6] Das, S.: Theory and Applications of Nonlinear Cellular Automata In VLSI Design , Ph.D. Thesis, Bengal Engineering and Science University, Shibpur, India, 2007.
7[7] Das, S., Sikdar, B. K., Chaudhuri, P. P.: Characterization of Reachable/Nonreachable Cellular Automata States, Proceedings of 6 t h superscript 6 𝑡 ℎ 6^{th} International Conference on Cellular Automata for Research and Industry (ACRI) , October 2004.
8[8] Dennunzio, A., Formenti, E., Provillard, J.: Computational Complexity of Rule Distributions of Non-uniform Cellular Automata, Proceedings of the 6th International Conference on Language and Automata Theory and Applications , LATA’12, 2012, ISBN 978-3-642-28331-4.