The number of languages with maximum state complexity

Bjørn Kjos-Hanssen; Lei Liu

arXiv:1902.00815 · cs.FL · May 3, 2022


TL;DR

This paper provides a formula for counting the number of finite languages with maximum state complexity and generalizes the concept from languages to functions on finite sets.

Contribution

It introduces a formula for the number of maximum-complexity languages and extends the analysis from languages to functions on finite sets.

Findings

01. Derived a formula for counting maximum-complexity languages.

02. Generalized the maximum complexity analysis from languages to functions.

03. Enhanced understanding of the distribution of maximum-complexity languages.

Abstract

Câmpeanu and Ho (2004) determined the maximum finite state complexity of finite languages, building on work of Champarnaud and Pin (1989). They stated that it is very difficult to determine the number of maximum-complexity languages. Here we give a formula for this number. We also generalize their work from languages to functions on finite sets.

Tables

Table 1: The number of maximally complex functions from $[2]^{n}$ to $[2]$ for $n\leq 6$.

$n$	$\mathrm{nmcf}(n)$	$\frac{1}{n}\log_{2}\log_{2}(\mathrm{nmcf}(n))$
$0$	$1$	-
$1$	$3$	0.664
$2$	$6$	0.685
$3$	$60$	0.854
$4$	$27720$	0.971
$5$	$259338240$	0.961
$6$	$177843714048000$	0.927

Equations

$$\sum_{i=0}^{n}\min(2^{i},2^{2^{n-i}}-1)$$

$$\frac{k^{r}-1}{k-1}+\sum_{j=0}^{n-r}(2^{k^{j}}-1)+1$$

$$\mathbb{I}_{A}(x)=\begin{cases}x&\text{if }x\in A,\\0&\text{if }x\not\in A.\end{cases}$$

$$f(x)=\mathbb{I}_{F}(\overline{\delta}(q_{0},x)),\quad\text{if }\overline{\delta}(q_{0},x)\text{ is defined,}$$

$$L(M)=\{x\in\Sigma^{*}:f(x)>0\}=\{x\in\Sigma^{*}:\overline{\delta}(q_{0},x)\in F\}.$$

$$xR_{f}y\iff\text{for all }z\in\Sigma^{*},\ f(xz)=f(yz).$$

$$\varphi(q)=[x]_{f}\quad\text{where }x\text{ is such that }\overline{\delta}(q_{0},x)=q.$$

$$\delta(q_{0},xz)=\delta(\delta(q_{0},x),z)=\delta(q,z)=\delta(\delta(q_{0},y),z)=\delta(q_{0},yz)$$

$$\delta(q_{0},x_{1}z)=\delta(q_{1},z)\neq\delta(q_{2},z)=\delta(q_{0},x_{2}z)$$

$$\mathfrak{C}_{k}=\{g\in[c]^{[b]^{n-k}}:\exists f\in\mathfrak{C},\ w\in[b]^{k}\ \forall x\ g(x)=f(wx)\}.$$

$$\mathfrak{C}_{k}=\{f\circ\tau_{w}\in[c]^{[b]^{n-k}}:f\in\mathfrak{C},\ w\in[b]^{k}\}.$$

$$Q=\{q_{w}:w\in[b]^{i},\ i\leq j\}\cup\{r_{g}:g\in\mathfrak{C}_{i}^{-},\ i>j\}.$$

$$\delta(q_{w},a)$$

$$\delta(r_{g},a)$$

$$\#(\{f\circ\tau_{w}:w\in[b]^{j}\})\leq\#(\{h\circ\tau_{w}:w\in[b]^{j},\ h\in\mathfrak{C}\})$$

$$\#([b]^{i_{0}})=b^{i_{0}}\geq c^{b^{n-i_{0}}}-1=\#(\mathfrak{C}_{i_{0}}^{-}),$$

$$Q=\{q_{w}:w\in[b]^{i},\ i\leq i_{0}\}\cup\{r_{g}:g\in\mathfrak{C}_{i}^{-},\ i>i_{0}\}$$

$$\binom{v^{b}-1}{k}-\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(v-j)^{b}-1}{k}.$$

$$\#\left(([v]^{[b]}\setminus\{Z\})\cap([v]\setminus J)^{[b]}\right)=\#\left(([v]\setminus J)^{[b]}\setminus\{Z\}\right)=(v-j)^{b}-1.$$

$$\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(v-j)^{b}-1}{k}.$$

$$\{f\circ\tau_{a}\mid f\in X,\ a\in[b]\}\supseteq[c]^{[b]^{n-(j+1)}}\setminus\{Z_{j+1}\}.$$

$$f(x)=\varphi_{f,j}(x_{1})(x_{2})=\varphi_{g,j}(x_{1})(x_{2})=g(x).$$

$$k!\left(\binom{c^{b^{d+1}}-1}{k}-\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(c^{b^{d}}-j)^{b}-1}{k}\right).$$

$$\beta:[c]^{[b]^{n-i_{0}}}\to[c^{b^{n-(i_{0}+1)}}]^{[b]}$$

$$\alpha_{i}=\binom{c^{b^{d+1}}-1}{k}-\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(c^{b^{d}}-j)^{b}-1}{k}.$$


Taxonomy

Topics: Semigroups and automata theory · Coding theory and cryptography · Advanced Combinatorial Mathematics

Full text

The number of languages with maximum state complexity

Bjørn Kjos-Hanssen

Lei Liu

This work was partially supported by grants from the Simons Foundation (#315188 and #704836 to Bjørn Kjos-Hanssen) and Decision Research Corporation (University of Hawai‘i Foundation Account #129-4770-4). We are grateful to the gracious referee who persisted through seven revisions of the paper.


1 Introduction

At some point in the 1980s, Howard Straubing posed a problem that was subsequently solved in Champarnaud and Pin (1989) [2]. They showed that the minimal incomplete deterministic finite automaton of a language $L\subseteq\Sigma^{n}$ , where $\Sigma=\{0,1\}$ , has at most

$$\sum_{i=0}^{n}\min(2^{i},2^{2^{n-i}}-1)$$

states. Moreover, for each $n$ there exists an $L$ attaining this bound. Câmpeanu and Ho (2004) [1] showed more generally that the tight upper bound for $\Sigma$ of cardinality $k$ and for complete automata is

$$\frac{k^{r}-1}{k-1}+\sum_{j=0}^{n-r}(2^{k^{j}}-1)+1$$

where $r=\min\{m:k^{m}\geq 2^{k^{n-m}}-1\}$ . (In these results, requiring totality of the transition function adds 1 to the state count.) Câmpeanu and Ho’s result can be viewed as concerning functions $f:[k]^{n}\to[2]$ where $[k]=\{0,\dots,k-1\}$ is a set of cardinality $k$ . We generalize their result to arbitrary functions $f:[k]^{n}\to[c]$ where $c$ is a positive integer. Equivalently, we consider functions $f:[k]^{*}\to[c]$ , where $\{x:f(x)>0\}\subseteq[k]^{n}$ for some $n$ , and where automata have $c-1$ accept states corresponding to nonzero values of $f$ .

The function $+$ on $\mathbb{Z}/5\mathbb{Z}$ may seem rather complicated as functions on that set go. On the other hand, $f(x,y,z)=x+y+z$ mod 5 is less so, in that we can decompose it as $(x+y)+z$ , so that after seeing $x$ and $y$ , we need not remember the pair $(x,y)$ , but only their sum. Out of the $5^{5^{3}}$ ternary functions on a 5-element set, at most $5^{2\cdot 5^{2}}$ can be decomposed as $(x*_{1}y)*_{2}z$ for some binary functions $*_{1}$ , $*_{2}$ . This idea of the state complexity of functions has been applied in bioinformatics [5]. In Section 2 we make precise a sense in which such functions are not the most complex ternary functions. We do this by extending a result of Câmpeanu and Ho [1] to functions taking values in a set of size larger than two. Rising to an implicit challenge posed by Câmpeanu and Ho, we give a formula for the number of maximally complex languages.

The structure of the paper is as follows. In Section 2 we obtain an upper bound in Theorem 2.14 for the complexity of a function $f:[b]^{n}\to[c]$ , and a matching lower bound in Theorem 2.18. In Section 3 we obtain the number of maximal complexity functions in Theorem 3.10. Then we look at asymptotics in Section 4, culminating in Theorem 4.12.

2 Complexity of languages and operations

Let $\lambda$ denote the empty word. Let the cardinality of a finite set $A$ be denoted by $\#(A)$ , and the length of a finite word $w$ by $|w|$ . We define a function $\mathbb{I}_{A}:B\to A\cup\{0\}$ for any sets $A\subseteq B$ with $0\not\in A$ by

$$\mathbb{I}_{A}(x)=\begin{cases}x&\text{if }x\in A,\\0&\text{if }x\not\in A.\end{cases}$$

Definition 2.1.

Let $b$ and $c$ be positive integers and let $\Sigma$ be an alphabet with $\#\left(\Sigma\right)=b$ . An incomplete deterministic finite automaton (IDFA) $M$ is a 5-tuple $(Q,\Sigma,\delta,q_{0},F)$ , where $Q$ is a finite set of states, $\Sigma$ is a finite alphabet, $q_{0}\in Q$ is the start state, $F\subseteq Q$ is the set of accept states, and $\delta:D\to Q$ , where $D\subseteq Q\times\Sigma$ , is the transition function.

We also require $F=\{1,\dots,c-1\}=[c]\setminus\{0\}$ , where $c-1=\#\left(F\right)$ . If $D=Q\times\Sigma$ , i.e., $\delta$ is total, then $M$ is moreover a deterministic finite automaton (DFA).

We define $\overline{\delta}:D\to Q$ , where $D\subseteq Q\times\Sigma^{*}$ , by $\overline{\delta}(q,\lambda)=q$ , and recursively $\overline{\delta}(q,xu)=\delta(\overline{\delta}(q,x),u)$ for $x\in\Sigma^{*}$ and $u\in\Sigma$ . We say that states $q_{1},q_{2}$ are $M$ -distinguishable if there is a $z$ with $\overline{\delta}(q_{1},z)\neq\overline{\delta}(q_{2},z)$ and $\{\overline{\delta}(q_{1},z),\overline{\delta}(q_{2},z)\}\cap F\neq\emptyset$ .

The function accepted by $M$ is the function $f:\Sigma^{*}\to[c]$ defined by

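$$f(x)=\mathbb{I}_{F}(\overline{\delta}(q_{0},x)),\quad\text{if }\overline{\delta}(q_{0},x)\text{ is defined,}$$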

and $f(x)=0$ otherwise. Thus $f(x)=0$ if $\overline{\delta}(q_{0},x)\not\in F$ , and $f(x)=\overline{\delta}(q_{0},x)$ if $\overline{\delta}(q_{0},x)\in F$ . The language accepted by $M$ is

$$L(M)=\{x\in\Sigma^{*}:f(x)>0\}=\{x\in\Sigma^{*}:\overline{\delta}(q_{0},x)\in F\}.$$

Note that in the case $c=2$ , accepting a language is equivalent to accepting its indicator (characteristic) function.
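As a minimal sketch of Definition 2.1 (not code from the paper), an IDFA can be modelled in Python with a partial transition table. The class name `IDFA` and the methods `run` and `accepted_value` are hypothetical. The sketch assumes accept states are the integers $1,\dots,c-1$ and that all other states are labelled by strings, so that they cannot collide with accept states. The example automaton is one possible IDFA for the function $f(x,y)=x+y$ with $b=2$, $c=3$, $n=2$.

```python
from typing import Dict, Hashable, Optional, Sequence, Tuple

State = Hashable


class IDFA:
    """Incomplete DFA: delta is a partial map from Q x Sigma to Q."""

    def __init__(self, delta: Dict[Tuple[State, int], State],
                 start: State, c: int) -> None:
        self.delta = delta
        self.start = start
        # Accept states are exactly the integers 1, ..., c-1.
        self.accept = set(range(1, c))

    def run(self, word: Sequence[int]) -> Optional[State]:
        """Extended transition function; None stands for 'undefined'."""
        q = self.start
        for a in word:
            if (q, a) not in self.delta:
                return None
            q = self.delta[(q, a)]
        return q

    def accepted_value(self, word: Sequence[int]) -> int:
        """f(x) = I_F(delta-bar(q0, x)) if defined, and 0 otherwise."""
        q = self.run(word)
        return q if q in self.accept else 0


# Hypothetical IDFA for f(x, y) = x + y; the input 00 is rejected
# by a missing transition out of state "q0".
delta = {("q", 0): "q0", ("q", 1): "q1",
         ("q0", 1): 1, ("q1", 0): 1, ("q1", 1): 2}
M = IDFA(delta, "q", c=3)
values = {(x, y): M.accepted_value((x, y)) for x in (0, 1) for y in (0, 1)}
```

Words of any other length are mapped to 0, either because the run ends in a non-accept state or because it falls off the partial transition table.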

Definition 2.2 (state complexity).

We call an IDFA $M=(Q,\Sigma,\delta,q_{0},F)$ minimal (for $L(M)$ ) if $\#\left(Q\right)\leq\#\left(Q^{\prime}\right)$ for all IDFAs $M^{\prime}=(Q^{\prime},\Sigma,\delta^{\prime},q_{0}^{\prime},F^{\prime})$ with $L(M^{\prime})=L(M)$ . Moreover, $M$ is minimal for $f$ if $M$ accepts $f$ and $\#\left(Q\right)\leq\#\left(Q^{\prime}\right)$ for all $M^{\prime}$ accepting $f$ . In this case we define the state complexity $\mathsf{sc}(f)$ by $\mathsf{sc}(f)=\#\left(Q\right)$ .

Champarnaud and Pin [2] obtained the following result.

Theorem 2.3 ([2, Theorem 4]).

A minimal IDFA for a language $L\subseteq\{0,1\}^{n}$ has at most

$$\sum_{i=0}^{n}\min(2^{i},2^{2^{n-i}}-1)$$

states, and for each $n$ there exists a language $L$ attaining this bound.

Theorem 2.3 was generalized by Câmpeanu and Ho [1]:

Theorem 2.4 ([1, Corollary 10]).

Let $k\geq 1$ and $l\geq 0$ be integers, and let $M$ be a minimal DFA for a language $L\subseteq[k]^{l}$ . Let $Q$ be the set of states of $M$ . Then we have:

(i) $\#\left(Q\right)\leq\frac{k^{r}-1}{k-1}+\sum_{j=0}^{l-r}(2^{k^{j}}-1)+1$ , where $r=\min\{m\mid k^{m}\geq 2^{k^{l-m}}-1\}$ .

(ii) There is an $M$ such that the upper bound given by Item (i) is attained.

Both of these results involve an upper bound which can be viewed as a special case of Theorem 2.14 below.
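The two bounds can be evaluated directly. The following Python sketch is an illustration under stated assumptions, not the authors' code: the function names are hypothetical, and the term $\frac{k^{r}-1}{k-1}$ is computed as the number of words of length less than $r$ so that $k=1$ does not cause a division by zero.

```python
def champarnaud_pin_bound(n: int) -> int:
    """Theorem 2.3: sum over i = 0..n of min(2^i, 2^(2^(n-i)) - 1)."""
    return sum(min(2 ** i, 2 ** (2 ** (n - i)) - 1) for i in range(n + 1))


def campeanu_ho_bound(k: int, l: int) -> int:
    """Theorem 2.4(i): bound for complete DFAs over an alphabet of size k."""
    # r = min{m : k^m >= 2^(k^(l-m)) - 1}; m = l always satisfies this.
    r = next(m for m in range(l + 1) if k ** m >= 2 ** (k ** (l - m)) - 1)
    # (k^r - 1)/(k - 1) equals the number of words of length < r.
    short_words = sum(k ** m for m in range(r))
    return short_words + sum(2 ** (k ** j) - 1 for j in range(l - r + 1)) + 1


binary_bounds = [(champarnaud_pin_bound(n), campeanu_ho_bound(2, n))
                 for n in range(7)]
```

For the binary alphabet, each pair in `binary_bounds` differs by exactly 1, which matches the remark in the Introduction that requiring totality of the transition function adds 1 to the state count.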

We now develop a function version of the Myhill–Nerode theorem, by following and generalizing the presentation in Shallit’s textbook [6].

Definition 2.5.

Let $\Sigma$ be an alphabet and let $c\in\mathbb{N}$ . A relation $R\subseteq\Sigma^{*}\times\Sigma^{*}$ is right invariant if for all $x,y,z\in\Sigma^{*}$ , we have $xRy\implies xzRyz$ . An equivalence relation $E$ on $\Sigma^{*}$ is a congruence relation for $f:\Sigma^{*}\to[c]$ if for all $x,y\in\Sigma^{*}$ , $xEy\implies f(x)=f(y).$ For an equivalence relation $E$ , the index of $E$ , denoted $\mathrm{index}(E)$ , is the number of equivalence classes of $E$ . An equivalence relation has finite index if $\mathrm{index}(E)<\infty$ . The Myhill–Nerode equivalence relation for $f:\Sigma^{*}\to[c]$ is the relation $R_{f}$ defined by

$$xR_{f}y\iff\text{for all }z\in\Sigma^{*},\ f(xz)=f(yz).$$

Let $[x]_{f}$ denote the $R_{f}$ -equivalence class of $x$ .

Lemma 2.6.

Let $f:\Sigma^{*}\to[c]$ .

1. $R_{f}$ is an equivalence relation.

2. $R_{f}$ is right invariant.

Proof.

Item 1 is a standard observation. For Item 2: If we extend $xz$ and $yz$ by the same string $w$ , then we have also extended $x$ and $y$ by the same string $zw$ , and hence $f(xzw)=f(yzw)$ . ∎

Lemma 2.7.

Let $f:\Sigma^{*}\to[c]$ . Suppose that $E$ is a right invariant equivalence relation on $\Sigma^{*}$ which is a congruence relation for $f$ . Then $E$ is a refinement of $R_{f}$ .

Proof.

We must show that $xEy\implies xR_{f}y$ . Suppose $xEy$ and let $z\in\Sigma^{*}$ . Since $E$ is right invariant, $xzEyz$ . Since $E$ is a congruence relation for $f$ , $f(xz)=f(yz)$ . Thus we have shown that $xR_{f}y$ . ∎

Every function is onto its range. When the range is a finite subset of $\mathbb{N}$ , we may assume, for the purpose of studying complexity under our definitions, that the range is an initial segment of $\mathbb{N}$ . Thus we restrict attention to onto functions in Theorem 2.8.

Theorem 2.8.

Let $f:\Sigma^{*}\to[c]$ be onto. The following are equivalent:

1. $f$ is accepted by some IDFA.

2. There exists a right invariant congruence relation for $f$ of finite index.

3. $R_{f}$ has finite index.

4. $f$ is accepted by some DFA.

Proof.

We prove this in the usual round-robin fashion.

(1) $\implies$ (2):

Let $M$ be an IDFA that accepts $f$ . Define a relation $R_{M}$ by $xR_{M}y$ iff $\overline{\delta}(q_{0},x)=\overline{\delta}(q_{0},y)$ , or both are undefined. Since $M$ has finitely many states, $R_{M}$ has finite index. From the definition of $\overline{\delta}$ it follows that $R_{M}$ is right invariant. Finally, since $f(x)=\mathbb{I}_{F}(\overline{\delta}(q_{0},x))$ if defined, and 0 otherwise, $f(x)$ is determined by $\overline{\delta}(q_{0},x)$ . Thus $R_{M}$ is a congruence relation for $f$ .

(2) $\implies$ (3):

Let $R$ be a right invariant congruence relation for $f$ , of finite index. By Lemma 2.7, $R$ is a refinement of $R_{f}$ . Then $\mathrm{index}(R_{f})\leq\mathrm{index}(R)<\infty$ , as desired.

(3) $\implies$ (4):

Suppose $R_{f}$ has finite index. Define $Q^{\prime}=\{[x]_{f}:x\in\Sigma^{*}\}$ , $q_{0}^{\prime}=[\lambda]_{f}$ , $F^{\prime}=\{[x]_{f}:f(x)>0\}$ , and $\delta^{\prime}([x]_{f},a)=[xa]_{f}$ . Then $\#(Q^{\prime})=\mathrm{index}(R_{f})<\infty$ . Since $R_{f}$ is right invariant, $\delta^{\prime}$ is well-defined. Thus $M^{\prime}=(Q^{\prime},\Sigma,\delta^{\prime},q_{0}^{\prime},F^{\prime})$ is an IDFA. We must show that $f(x)=\mathbb{I}_{F^{\prime}}(\overline{\delta^{\prime}}(q_{0}^{\prime},x))$ for each $x$ . Case 1: $f(x)=0$ . Since $R_{f}$ is a congruence relation for $f$ , $[x]_{f}\not\in F^{\prime}$ and hence $\overline{\delta^{\prime}}(q_{0}^{\prime},x)=[\lambda x]_{f}=[x]_{f}\not\in F^{\prime}$ which means that $\mathbb{I}_{F^{\prime}}(\overline{\delta^{\prime}}(q_{0}^{\prime},x))=0$ . Case 2: $f(x)>0$ . Then by definition $[x]_{f}\in F^{\prime}$ and so $\overline{\delta^{\prime}}(q_{0}^{\prime},x)=[\lambda x]_{f}=[x]_{f}\in F^{\prime}$ which means that $\mathbb{I}_{F^{\prime}}(\overline{\delta^{\prime}}(q_{0}^{\prime},x))=\overline{\delta^{\prime}}(q_{0}^{\prime},x)$ . Finally, let $\pi:F^{\prime}\to[c]\setminus\{0\}$ be a bijection and formally replace each $q\in F^{\prime}$ by $\pi(q)\in[c]$ .

(4) $\implies$ (1):

This is immediate since each DFA is an IDFA.

∎

Theorem 2.9.

Let $f:\Sigma^{*}\to[c]$ . Let $M$ be an IDFA accepting $f$ . Let $q$ be the number of states of $M$ . Suppose that all states of $M$ are reachable and that any two states of $M$ are $M$ -distinguishable. Then $\mathsf{sc}(f)=q$ .

Proof.

Let $M^{\prime}$ be the automaton in Theorem 2.8 for $f$ and let $Q^{\prime}$ be its set of states. We claim that $M^{\prime}$ is minimal. Note that $\#(Q^{\prime})=\mathrm{index}(R_{f})$ . Let $N$ be any automaton accepting $f$ , let $Q$ be its set of states and $\delta$ its transition function. Since $N$ accepts $f$ , for all $x,y,z$ , if $f(xz)\neq f(yz)$ then $\overline{\delta}(q_{0},x)\neq\overline{\delta}(q_{0},y)$ . Thus $[x]_{f}\mapsto\overline{\delta}(q_{0},x)$ is injective, and we have established that $\mathrm{index}(R_{f})\leq\#(Q)$ , and hence that $M^{\prime}$ is minimal.

Now let $M=(Q,\Sigma,\delta,q_{0},F)$ be any IDFA accepting $f$ for which all states are reachable and any two states are $M$ -distinguishable. It suffices to show that $\#\left(Q\right)\leq\#\left(Q^{\prime}\right)$ , and for this it suffices to give an injective map $\varphi:Q\to Q^{\prime}$ . For each $q\in Q$ we let

$$\varphi(q)=[x]_{f}\quad\text{where }x\text{ is such that }\overline{\delta}(q_{0},x)=q.\tag{1}$$

Such an $x$ must exist, or else $q$ is not reachable.

Claim: $\varphi$ is well-defined by (1).

Proof of claim.

Suppose that $\overline{\delta}(q_{0},y)=q$ and let us show $[x]_{f}=[y]_{f}$ . Let $z\in\Sigma^{*}$ . Since $M$ accepts $f$ ,

•

for $i>0$ , $f(xz)=i$ iff $\delta(q_{0},xz)=i$ and $f(yz)=i$ iff $\delta(q_{0},yz)=i$ ; and

•

for $i=0$ , $f(xz)=0$ iff $\delta(q_{0},xz)$ is undefined or is not in $F$ , and $f(yz)=0$ iff $\delta(q_{0},yz)$ is undefined or is not in $F$ .

We have

$$\delta(q_{0},xz)=\delta(\delta(q_{0},x),z)=\delta(q,z)=\delta(\delta(q_{0},y),z)=\delta(q_{0},yz)$$

in the sense that $\delta(q_{0},xz)$ and $\delta(q_{0},yz)$ are both definitionally equal to $\delta(q,z)$ , which may or may not be defined or in $F$ . So in all cases $f(xz)=f(yz)$ . ∎

Finally, let us show that $\varphi$ is one-to-one. If $\varphi(q_{1})=\varphi(q_{2})$ then $[x_{1}]_{f}=[x_{2}]_{f}$ where $\delta(q_{0},x_{i})=q_{i}$ . We will show, using $M$ -distinguishability, that $q_{1}=q_{2}$ .

Suppose $q_{1}\neq q_{2}$ . Then there is some $z$ with

$$\delta(q_{0},x_{1}z)=\delta(q_{1},z)\neq\delta(q_{2},z)=\delta(q_{0},x_{2}z)$$

and $\{\delta(q_{0},x_{1}z),\delta(q_{0},x_{2}z)\}\cap F\neq\emptyset.$ Hence since $M$ accepts $f$ , $f(x_{1}z)\neq f(x_{2}z)$ , which contradicts $[x_{1}]_{f}=[x_{2}]_{f}$ . ∎

We write $A^{B}$ for the set of all functions from $B$ to $A$ .

Definition 2.10.

Let $b$ and $c$ be positive integers and let ${[c]}^{{[b]}^{n}}$ be the set of $n$ -ary functions $f:[b]^{n}\to[c]$ . Let $\mathfrak{C}\subseteq{[c]}^{{[b]}^{n}}$ . The Champarnaud–Pin family of $\mathfrak{C}$ is the family of sets $\{\mathfrak{C}_{k}\}_{0\leq k\leq n}$ , where $\mathfrak{C}_{k}\subseteq{[c]}^{{[b]}^{n-k}}$ , $0\leq k\leq n$ , given by

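$$\mathfrak{C}_{k}=\{g\in[c]^{[b]^{n-k}}:\exists f\in\mathfrak{C},\ w\in[b]^{k}\ \forall x\ g(x)=f(wx)\}.$$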

In terms of the function $\tau_{w}(x)=wx$ , this can be restated as

$$\mathfrak{C}_{k}=\{f\circ\tau_{w}\in[c]^{[b]^{n-k}}:f\in\mathfrak{C},\ w\in[b]^{k}\}.$$

So $\mathfrak{C}_{0}=\mathfrak{C}$ , $\mathfrak{C}_{1}$ is obtained from $\mathfrak{C}_{0}$ by plugging in constants for the first input, and so forth. We write $\mathfrak{C}_{n}^{-}=\{f\in\mathfrak{C}_{n}:f(x)>0\text{ for some }x\}$ . Note that $\#\left(\mathfrak{C}_{n}^{-}\right)\geq\#\left(\mathfrak{C}_{n}\right)-1$ .
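To make the Champarnaud–Pin family concrete, here is a minimal Python sketch (not taken from the paper). It assumes that a function $f:[b]^{n}\to[c]$ is stored as the tuple of its values in lexicographic order of the inputs, so that $f\circ\tau_{w}$ is a contiguous slice of that tuple. The names `restrict` and `champarnaud_pin_family` are hypothetical.

```python
from itertools import product
from typing import FrozenSet, Iterable, List, Tuple

Func = Tuple[int, ...]  # values of f on [b]^n, in lexicographic order


def restrict(f: Func, b: int, w: Tuple[int, ...]) -> Func:
    """Return f o tau_w, i.e. the function x -> f(wx)."""
    size = len(f) // b ** len(w)      # number of inputs x that remain
    index = 0
    for a in w:                       # read w as a base-b numeral
        index = index * b + a
    return f[index * size:(index + 1) * size]


def champarnaud_pin_family(C: Iterable[Func], b: int,
                           n: int) -> List[FrozenSet[Func]]:
    """Return [C_0, ..., C_n] with C_k = {f o tau_w : f in C, w in [b]^k}."""
    funcs = list(C)
    return [frozenset(restrict(f, b, w)
                      for f in funcs
                      for w in product(range(b), repeat=k))
            for k in range(n + 1)]


# Majority of three bits, listed for inputs 000, 001, ..., 111.
majority = tuple(int(x + y + z >= 2)
                 for x, y, z in product(range(2), repeat=3))
family = champarnaud_pin_family([majority], b=2, n=3)
sizes = [len(Ck) for Ck in family]
```

For the majority function, fixing the first input to 0 gives the AND of the remaining two bits, and fixing it to 1 gives their OR, so $\mathfrak{C}_{1}$ has two elements.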

Definition 2.11.

Let us say that an IDFA $M$ accepts $f:[b]^{n}\to[c]$ if $M$ accepts the function $f^{+}:\Sigma^{*}\to[c]$ with $f^{+}(x)=f(x)$ if $x\in[b]^{n}$ , and $f^{+}(x)=0$ otherwise. The state complexity of $f:[b]^{n}\to[c]$ is the minimum number of states of an IDFA accepting $f:[b]^{n}\to[c]$ , and is denoted $\mathsf{sc}(f)$ .

Note that Definition 2.11 says that $\mathsf{sc}(f)=\mathsf{sc}(f^{+})$ . For $c>b=2$ , $\mathsf{sc}(f)$ corresponds to automatic complexity of equivalence relations on binary strings as studied in [3]. The case $b=c$ is that of $n$ -ary operations on a given finite set, which is of interest in universal algebra.
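For small parameters, $\mathsf{sc}(f)$ can be computed by brute force. The sketch below is not the authors' procedure. It relies on the assumption, in the spirit of the Myhill–Nerode development above, that a minimal IDFA for $f^{+}$ has one state for each distinct function $f\circ\tau_{w}$ with $|w|\leq n$ that is not identically zero. The name `state_complexity` and the tuple encoding of $f$ (values in lexicographic order of inputs) are choices made for illustration.

```python
from itertools import product
from typing import Tuple

Func = Tuple[int, ...]  # values of f on [b]^n, in lexicographic order


def state_complexity(f: Func, b: int, n: int) -> int:
    """Count the distinct not-identically-zero functions f o tau_w, |w| <= n."""
    total = 0
    for k in range(n + 1):
        size = b ** (n - k)
        # The slices of length b^(n-k) are exactly the f o tau_w with |w| = k.
        residuals = {f[i:i + size] for i in range(0, len(f), size)}
        total += sum(1 for g in residuals if any(g))
    return total


majority = (0, 0, 0, 1, 0, 1, 1, 1)
sc_majority = state_complexity(majority, b=2, n=3)

# Number of f : [2]^3 -> [2] reaching the value 7 = 1 + 2 + 3 + 1.
count_max = sum(1 for f in product(range(2), repeat=8)
                if state_complexity(f, 2, 3) == 7)
```

Under this assumption the 3-bit majority function has complexity 6, and exhaustive search finds 60 functions with complexity 7, in agreement with the row $n=3$ of Table 1.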

We also define $\mathsf{maxsc}_{b,c,n}=\sum_{i=0}^{n}\min(b^{i},c^{b^{n-i}}-1)$ , which shall turn out to be the maximum of $\mathsf{sc}(f)$ over all $f$ .

Definition 2.12.

We define a crossover function $\chi(b,c,n)=\max\{i\in[0,n]\mid b^{i}\leq c^{b^{n-i}}-1\}$ .
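Both $\mathsf{maxsc}_{b,c,n}$ and $\chi(b,c,n)$ are straightforward to evaluate. The following Python sketch uses the hypothetical names `maxsc` and `crossover` and assumes $c\geq 2$, so that $i=0$ always satisfies the defining inequality of $\chi$.

```python
def maxsc(b: int, c: int, n: int) -> int:
    """maxsc_{b,c,n} = sum over i = 0..n of min(b^i, c^(b^(n-i)) - 1)."""
    return sum(min(b ** i, c ** (b ** (n - i)) - 1) for i in range(n + 1))


def crossover(b: int, c: int, n: int) -> int:
    """chi(b,c,n) = max{i in [0, n] : b^i <= c^(b^(n-i)) - 1}, for c >= 2."""
    return max(i for i in range(n + 1) if b ** i <= c ** (b ** (n - i)) - 1)


# Level-by-level terms for b = c = 2, n = 4: words win up to level chi,
# functions win afterwards.
terms = [(2 ** i, 2 ** (2 ** (4 - i)) - 1) for i in range(5)]
```

For $b=c=2$ and $n=4$ the pairs in `terms` are $(1,65535)$, $(2,255)$, $(4,15)$, $(8,3)$, $(16,1)$, so the minimum switches from the first to the second component after level 2.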

Definition 2.13.

Let $f\in[c]^{[b]^{n}}$ and $0\leq j\leq n$ . We define an IDFA $M_{f,j}$ . Its set of states is the disjoint union

$$Q=\{q_{w}:w\in[b]^{i},\ i\leq j\}\cup\{r_{g}:g\in\mathfrak{C}_{i}^{-},\ i>j\},$$

where all $q_{w}$ , $r_{g}$ are distinct. The transition function $\delta$ of $M_{f,j}$ is given by

[TABLE]

Theorem 2.14.

Let $b$ and $c$ be positive integers. Let $f\in[c]^{[b]^{n}}$ . Then $\mathsf{sc}(f)\leq\mathsf{maxsc}_{b,c,n}.$

Proof.

Let $f\in\mathfrak{C}$ . We must show that there is an IDFA $M_{f}$ accepting $f$ with at most the given number of states. Let $i_{0}=\chi(b,c,n)$ and let $M_{f}=M_{f,i_{0}}$ (Definition 2.13). Then $\min(b^{i},\#\left(\mathfrak{C}^{-}_{i}\right))=b^{i}$ for $i\leq i_{0}$ and $\min(b^{i},\#\left(\mathfrak{C}^{-}_{i}\right))=\#\left(\mathfrak{C}^{-}_{i}\right)$ for $i>i_{0}$ . Note that for each $q\in Q$ there is an integer $i(q)$ such that $i(q)\leq i_{0}\implies q=q_{w}$ for some $w$ and $i(q)>i_{0}\implies q=r_{g}$ for some $g$ . The transition function $\delta$ is given by Definition 2.13 and also described in Figure 1. Note that if $b^{n}+1<c$ , we may not have $i_{0}\leq n$ , but this is ruled out because then no $f:[b]^{n}\to[c]$ can be onto (Definition 2.12). (We may assume that $f$ is onto, since otherwise a smaller IDFA can be found.)

Since $f\in\mathfrak{C}$ , we have

$$\#(\{f\circ\tau_{w}:w\in[b]^{j}\})\leq\#(\{h\circ\tau_{w}:w\in[b]^{j},\ h\in\mathfrak{C}\}),$$

although this need not be strict (for instance, when $j=n$ , we are comparing the range of $f$ to the union of ranges of $h$ , $h\in\mathfrak{C}$ , which may both equal $[c]$ ). By construction, $M_{f}$ accepts $f$ ; see also Example 2.15, Example 2.16, and Example 2.17. ∎

Example 2.15.

The following example shows the case $b=c=2$ and $n=3$ , with $f$ the majority function. It has $\chi(b,c,n)=1$ :

[TABLE]

The states $r_{g}$ for $g\in\mathfrak{C}^{-}_{n}\subseteq[c]\setminus\{0\}$ serve as our final states and are indicated by a rectangular box. Here $\top_{k}$ is the constant 1 function of $k$ variables, whereas $1_{j}$ is defined by $1_{j}(x)=1$ if $j=x$ , 0 otherwise. There is no arrow labeled 0 between the states $q_{0}$ and $r_{1_{0}}$ . This is because after seeing $x=y=0$ we already know the majority of $x,y,z$ is 0, so we “reject by missing transition”.

Example 2.16.

A slightly larger example: the case $b=c=2$ and $n=4$ , with $f$ the majority function. It has $\chi(b,c,n)=2$ :

[TABLE]

In this case, the upper bound is strict: $q_{01}$ and $q_{10}$ are equivalent. Thus a smaller automaton suffices:

[TABLE]

Example 2.17.

As an example for the case $c>2$ , let $b=2$ , $c=3$ , $n=2$ , and let $f(x,y)=x+y$ . Then our automaton $M_{f}$ is:

[TABLE]

Theorem 2.18 is a generalization of Câmpeanu and Ho’s theorem. The construction is similar to that of [1, Figure 1 and Theorem 8].

Theorem 2.18.

Let $b,c\geq 2$ and $n\geq 1$ be integers. There exists a function $f:[b]^{n}\to[c]$ such that $\mathsf{sc}(f)=\mathsf{maxsc}_{b,c,n}.$

Proof.

Let $\mathfrak{C}=[c]^{[b]^{n}}$ . To define $f\in\mathfrak{C}$ , we first note that it suffices to fix an $i$ with $0\leq i\leq n$ and define $f\circ\tau_{w}$ for each $w\in[b]^{i}$ . To that end, we fix $i_{0}=\chi(b,c,n)$ . Since

$$\#([b]^{i_{0}})=b^{i_{0}}\geq c^{b^{n-i_{0}}}-1=\#(\mathfrak{C}_{i_{0}}^{-}),$$

there exists a surjective function $\phi:[b]^{i_{0}}\to\mathfrak{C}_{i_{0}}^{-}$ . Define $f$ by $f\circ\tau_{w}=\phi(w)$ for each $w\in[b]^{i_{0}}$ . We claim that $f$ attains the bound, i.e., there is no smaller automaton than that given in Theorem 2.14. By Theorem 2.9, an IDFA to accept $f$ is minimal if all states are reachable (from the start state) and any two states are $M$ -distinguishable.

Thus, it remains to show that the states for $f$ as given in the proof of Theorem 2.14 are reachable and $M$ -distinguishable.

By choice of $i_{0}$ it is easy to see that each state is reachable. For an example of what can go wrong with a different choice of $i_{0}$ , see Figure 2.

As for distinguishability, all states have a path to an accepting state, so it suffices to show that states that are the same distance from the start state are $M$ -distinguishable. Recall that the set of states of $M_{f}$ is

$$Q=\{q_{w}:w\in[b]^{i},\ i\leq i_{0}\}\cup\{r_{g}:g\in\mathfrak{C}_{i}^{-},\ i>i_{0}\}.$$

For two states $q_{v}$ , $q_{w}$ where $|v|=|w|$ , it suffices to consider the case $|w|=i_{0}$ . Then $q_{v}$ and $q_{w}$ are $M$ -distinguishable precisely because we chose $i_{0}$ and $f$ so that each extension by adding one more symbol to $v$ , $w$ does not give the same set of possible extensions, i.e., precisely to distinguish $v$ and $w$ . Similarly $r_{g}$ and $r_{h}$ for $g,h\in\mathfrak{C}^{-}_{i},i>i_{0}$ have the sets of possible extensions given by $g,h$ and therefore are $M$ -distinguishable. ∎

3 The number of maximally complex languages

A $k$ -set is a set of cardinality $k$ . For a function $f:A\to B$ we denote the range and domain by $\operatorname{ran}(f)=\{f(x)\mid x\in A\}$ and $\operatorname{dom}(f)=A$ , respectively. The collection of all subsets of $A$ of cardinality $k$ is denoted $\binom{A}{k}$ .

Lemma 3.1.

Let $k,b,v,i$ be positive integers with $i<v$ . Let $Z:[b]\to[v]$ be the constant function defined by $Z(b^{\prime})=v-1$ for all $b^{\prime}\in[b]$ . The number of $k$ -sets $X\subseteq[v]^{[b]}\setminus\{Z\}$ such that $\bigcup_{f\in X}\operatorname{ran}(f)\supseteq[i]$ is

$$\binom{v^{b}-1}{k}-\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(v-j)^{b}-1}{k}.\tag{2}$$

Proof.

There are $v^{b}-1$ elements of $[v]^{[b]}\setminus\{Z\}$ and hence $\binom{v^{b}-1}{k}$ total $k$ -sets.

Since $i<v$ , $v-1\not\in[i]=\{0,\dots,i-1\}$ . Thus the range of $Z$ is disjoint from $[i]$ .

Given $J\subseteq[i]$ , $\#\left(J\right)=j$ , it follows that $Z\in([v]\setminus J)^{[b]}$ and so there are $(v-j)^{b}-1$ functions in $[v]^{[b]}\setminus\{Z\}$ whose range is disjoint from $J$ , i.e.,

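$$\#\left(([v]^{[b]}\setminus\{Z\})\cap([v]\setminus J)^{[b]}\right)=\#\left(([v]\setminus J)^{[b]}\setminus\{Z\}\right)=(v-j)^{b}-1.$$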

Here $([v]\setminus J)^{[b]}\subseteq[v]^{[b]}$ .

For the union of ranges to not contain $[i]$ means that there is some $i^{\prime}\in[i]$ that is missed. The number of $k$ -sets that miss some $i^{\prime}$ is then given by inclusion-exclusion in terms of $j$ , the cardinality of a set $J\subseteq[i]$ that is disjoint from $\bigcup_{f\in X}\operatorname{ran}(f)$ . Thus the number of $k$ -sets with $\bigcup_{f\in X}\operatorname{ran}(f)\not\supseteq[i]$ is

$$\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(v-j)^{b}-1}{k}.$$

∎
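The inclusion-exclusion count of Lemma 3.1 can be checked against direct enumeration for small parameters. The sketch below is an illustration only; the function names are hypothetical, and a function in $[v]^{[b]}$ is represented as a tuple of $b$ values.

```python
from itertools import combinations, product
from math import comb


def covering_k_sets(b: int, v: int, k: int, i: int) -> int:
    """Lemma 3.1: k-sets X in [v]^[b] minus {Z} whose ranges cover [i]."""
    # k-sets that miss at least one element of [i], by inclusion-exclusion.
    missed = sum((-1) ** (j + 1) * comb(i, j) * comb((v - j) ** b - 1, k)
                 for j in range(1, i + 1))
    return comb(v ** b - 1, k) - missed


def covering_k_sets_brute(b: int, v: int, k: int, i: int) -> int:
    """Direct enumeration; feasible only for very small parameters."""
    Z = (v - 1,) * b
    funcs = [f for f in product(range(v), repeat=b) if f != Z]
    target = set(range(i))
    return sum(1 for X in combinations(funcs, k)
               if target <= set().union(*X))


formula_value = covering_k_sets(b=2, v=4, k=4, i=3)
brute_value = covering_k_sets_brute(b=2, v=4, k=4, i=3)
```

With $b=2$, $v=4$, $k=4$, $i=3$ the formula evaluates to $\binom{15}{4}-3\binom{8}{4}=1155$, since the terms with $j=2$ and $j=3$ vanish.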

For fixed $b$ and $c$ , let $\mathcal{B}_{d}$ ( $\mathcal{B}_{d}^{+}$ ) be the set of all (not constant zero) functions from $[b]^{d}$ to $[c]$ .

Definition 3.2.

For a function $f:[b]^{n}\to[c]$ and $0\leq j\leq n$ , define a function $\varphi_{f,j}:[b]^{j}\to[c]^{[b]^{n-j}}$ by $\varphi_{f,j}(w)=f\circ\tau_{w}$ for all $w$ .

Note that $\varphi_{f,j}$ is the function $\phi$ in the proof of Theorem 2.18.

Definition 3.3.

For each $0\leq j\leq n$ , let $Z_{j}:[b]^{n-j}\to[c]$ be the constant zero function. A set $X\subseteq[c]^{[b]^{n-j}}\setminus\{Z_{j}\}$ is $j$ -adequate if

$$\{f\circ\tau_{a}\mid f\in X,\ a\in[b]\}\supseteq[c]^{[b]^{n-(j+1)}}\setminus\{Z_{j+1}\}.$$

A function $\varphi:[b]^{j}\to[c]^{[b]^{n-j}}$ is called $j$ -adequate if its range is a $j$ -adequate $b^{j}$ -set, i.e.:

1. $\varphi(w)\neq Z_{j}$ for each $w$ ,

2. $\varphi$ is injective, and

3. $\{(\varphi(w))\circ\tau_{a}\mid w\in[b]^{j},a\in[b]\}\supseteq[c]^{[b]^{n-(j+1)}}\setminus\{Z_{j+1}\}.$

We say that $\varphi$ is adequate if it is $j$ -adequate for $j=\chi(b,c,n)$ .

Proposition 3.4.

If $\varphi$ is $j$ -adequate then $b^{j}\leq c^{b^{n-j}}-1$ and $b^{j+1}\geq c^{b^{n-(j+1)}}-1$ .

The proof of Proposition 3.4 is immediate. It follows that $\varphi$ can only be $j$ -adequate if $j=\chi(b,c,n)$ , unless we happen to have $b^{j+1}=c^{b^{n-(j+1)}}-1$ .

Proposition 3.5.

For all $j$ , we have $f=g\iff\varphi_{f,j}=\varphi_{g,j}$ .

Proof.

$\implies$ is immediate. Conversely, suppose $\varphi_{f,j}=\varphi_{g,j}$ . Fix $x$ and write $x=x_{1}x_{2}$ , $|x_{1}|=j$ . Then

$$f(x)=\varphi_{f,j}(x_{1})(x_{2})=\varphi_{g,j}(x_{1})(x_{2})=g(x).$$

∎

Definition 3.6.

For each $f$ and $j$ we defined the associated automaton $M_{f,j}$ in Definition 2.13. Let $M^{-}_{f,j}$ be $M_{f,j}$ with unreachable states removed and indistinguishable states merged. Let $Q^{-}_{f,j}$ be the set of states of $M^{-}_{f,j}$ .

Theorem 3.7.

The following are equivalent:

1. $\varphi_{f,\chi(b,c,n)}$ is adequate.

2. $\#(Q^{-}_{f,\chi(b,c,n)})=\mathsf{maxsc}_{b,c,n}$ ; all states of $Q^{-}_{f,\chi(b,c,n)}$ are reachable and distinguishable; and $M^{-}_{f,\chi(b,c,n)}$ accepts $f$ .

3. It is not the case that: $\#(Q^{-}_{f,\chi(b,c,n)})<\mathsf{maxsc}_{b,c,n},$ and $M^{-}_{f,\chi(b,c,n)}$ accepts $f$ .

Proof.

(2) $\implies$ (1): If $\varphi_{f}$ is not adequate then by definition some states of $M_{f,\chi(b,c,n)}$ are not reachable.

(1) $\implies$ (2): Theorem 2.18.

(2) $\implies$ (3) is immediate.

(3) $\implies$ (2): Assume (3). Since $M^{-}_{f,\chi(b,c,n)}$ always accepts $f$ , it follows that it has $\geq\mathsf{maxsc}_{b,c,n}$ states. By Theorem 2.14 it has exactly $\mathsf{maxsc}_{b,c,n}$ states. ∎

Theorem 3.8.

The following are equivalent:

1. $\varphi_{f,\chi(b,c,n)}$ is adequate.

2. $\mathsf{sc}(f)=\mathsf{maxsc}_{b,c,n}$ .

Proof.

(1) $\implies$ (2): by (1) $\implies$ (2) of Theorem 3.7 and then by Theorem 2.9.

(2) $\implies$ (1): Suppose $\neg$ (1). Then $\neg$ (1) in Theorem 3.7. Therefore $\neg$ (3) in Theorem 3.7, and so $\mathsf{sc}(f)<\mathsf{maxsc}_{b,c,n}$ . ∎

Proposition 3.9.

Let $b,c,n$ be given, $i_{0}=\chi(b,c,n)$ , $i=c^{b^{n-(i_{0}+1)}}-1$ , and $k=b^{i_{0}}$ . The number of adequate functions $\varphi:[b]^{i_{0}}\to[c]^{[b]^{n-i_{0}}}$ is

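$$k!\left(\binom{c^{b^{d+1}}-1}{k}-\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(c^{b^{d}}-j)^{b}-1}{k}\right).$$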

Proof.

If $\alpha_{i}$ is the number of adequate sets then the number of adequate functions $\varphi$ is $k!\,\alpha_{i}$ .

The range of an adequate map $\varphi$ consists of functions whose union of ranges covers the next set of functions. As in Lemma 3.1, such ranges correspond to $k$ -sets $X=\{f_{1},\dots,f_{k}\}\subseteq[v]^{[b]}\setminus\{Z\}$ such that $\bigcup_{f\in X}\operatorname{ran}(f)\supseteq[i]$ where $i=c^{b^{n-(i_{0}+1)}}-1$ .

Let $Z_{0}:[b]^{n-i_{0}}\to[c]$ be the constant zero function. Let $Z(a)=c^{b^{n-(i_{0}+1)}}-1$ for all $a$ . Let

$$\beta:[c]^{[b]^{n-i_{0}}}\to[c^{b^{n-(i_{0}+1)}}]^{[b]}$$

be an arbitrary bijection for which $\beta(Z_{0})=Z$ . By Lemma 3.1, applying $\beta$ , and with $v=c^{b^{d}}$ ,

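$$\alpha_{i}=\binom{c^{b^{d+1}}-1}{k}-\sum_{j=1}^{i}(-1)^{j+1}\binom{i}{j}\binom{(c^{b^{d}}-j)^{b}-1}{k}.$$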

Thus, the number of maps $\varphi$ is

[TABLE]

Theorem 3.10.

Let integers $b,c\geq 2$ and $n\geq 1$ be given. Let $k=b^{j_{0}}$ where $j_{0}=\chi(b,c,n)$ . Let $d+1=n-j_{0}$ and $i=c^{b^{d}}-1$ . Then $\#\left(\{f\mid\mathsf{sc}(f)=\mathsf{maxsc}_{b,c,n}\}\right)$ is given by (3) and equals

[TABLE]

Proof.

By Proposition 3.5,

[TABLE]

By Theorem 3.8 this equals $\#\{\varphi_{f}:f\text{ is adequate}\}$ , which by Proposition 3.9 equals (3). ∎

Example 3.11.

For $n=4$ and $b=c=2$ , we have $\chi(b,c,n)=2$ , as illustrated in the following table:

[TABLE]

A maximal complexity function is determined by an injective function $\phi$ from $[2]^{2}$ to $\mathcal{B}_{2}^{+}$ , such that $\bigcup\{\operatorname{ran}\phi(w)\mid w\in[2]^{2}\}\supseteq\mathcal{B}_{1}^{+}$ . Associating each $\phi$ with the set $X_{\phi}=\{\phi(w)\mid w\in[2]^{2}\}\in\binom{\mathcal{B}_{2}^{+}}{4}$ , we see that the number of functions $\phi$ is $4!$ times the number of four-element subsets $X$ of $\mathcal{B}_{2}^{+}$ for which $\bigcup\{\operatorname{ran}f\mid f\in X\}\supseteq\mathcal{B}_{1}^{+}$ . By Lemma 3.1 that number is 1155: let $b=2$ , $k=4$ , $i=3$ , and $v=2^{2}=4$ and calculate that (2) is $\binom{15}{4}-3\binom{8}{4}=1155$ . Thus the total number of maximum complexity functions is $1155\cdot 24=27720$ .
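The calculation in Example 3.11 can be repeated for other parameters. The Python sketch below evaluates $k!$ times the count of Lemma 3.1 with $k=b^{j_{0}}$, $d=n-j_{0}-1$, $v=c^{b^{d}}$ and $i=v-1$, as in the example. It is an illustration, not the authors' code: the function names are hypothetical, and the sketch handles only the case $\chi(b,c,n)<n$ (which holds for $b=c=2$ and $n\geq 1$), raising an error otherwise.

```python
from math import comb, factorial


def crossover(b: int, c: int, n: int) -> int:
    """chi(b,c,n) = max{i in [0, n] : b^i <= c^(b^(n-i)) - 1}, for c >= 2."""
    return max(i for i in range(n + 1) if b ** i <= c ** (b ** (n - i)) - 1)


def count_max_complexity(b: int, c: int, n: int) -> int:
    """Number of f : [b]^n -> [c] with sc(f) = maxsc, via Lemma 3.1."""
    j0 = crossover(b, c, n)
    if j0 >= n:
        raise ValueError("this sketch only handles chi(b, c, n) < n")
    k = b ** j0                 # number of words at the crossover level
    d = n - j0 - 1
    v = c ** (b ** d)           # so that v^b = c^(b^(d+1))
    i = v - 1
    adequate_sets = comb(v ** b - 1, k) - sum(
        (-1) ** (j + 1) * comb(i, j) * comb((v - j) ** b - 1, k)
        for j in range(1, i + 1))
    return factorial(k) * adequate_sets


table = [count_max_complexity(2, 2, n) for n in range(1, 7)]
```

The list `table` reproduces the entries of Table 1 for $1\leq n\leq 6$, including $27720=24\cdot 1155$ for $n=4$.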

4 Asymptotics

In this section we demonstrate (Theorem 4.3) that while most functions do not have maximum complexity, the growth rate of the number of maximally complex functions is similar to that of the total number of functions $f:[b]^{n}\to[c]$ for $b=c=2$ .

Proposition 4.1.

Suppose $i\leq v$ and $k$ are positive integers, and $A$ is a set. Suppose $(k-1)\#\left(A\right)<i$ . Then we have

[TABLE]

Suppose that additionally $i<v$ , and $Z:A\to[v]$ is a constant function with $Z(a)=z\not\in[i]$ for all $a\in A$ . Then (4) also equals

[TABLE]

Proof.

(4)=(5): Let $\varphi$ be given and let $X=\operatorname{ran}(\varphi)$ . It suffices to show that $\#\left(X\right)=k$ . Since

[TABLE]

for each $f\in X$ , we have

[TABLE]

If $\#\left(X\right)\neq k$ then $\#\left(X\right)\leq k-1$ , and we have the contradiction

[TABLE]

(4)=(6): When $Z$ is constant equal to a value not in $[i]$ , $Z\not\in\operatorname{ran}(\varphi)$ follows from the other condition: if $Z\in\operatorname{ran}(\varphi)$ then let $X=\operatorname{ran}(\varphi)\setminus\{Z\}$ . Then $\#\left(X\right)=k-1$ and we get a contradiction as in (7). ∎

Definition 4.2.

Let $b$ and $c$ be positive integers and let $0\leq t\leq n$ . Let $O_{t}=O_{t}^{(b,c,n)}$ be the number of functions from $[b^{t}]$ to $[c^{b^{n-t}}]$ that are onto $[c^{b^{n-t}}-1]$ :

$$O_{t}=\#\left(\{f:[b^{t}]\to[c^{b^{n-t}}]\mid\operatorname{ran}(f)\supseteq[c^{b^{n-t}}-1]\}\right).$$

Theorem 4.3.

Let $b$ and $c$ be positive integers and let $n\geq 0$ . Let $j_{0}=\chi(b,c,n)$ . If the condition

[TABLE]

holds, then $\#\left(\{f:[b]^{n}\to[c]\mid\mathsf{sc}(f)=\mathsf{maxsc}_{b,c,n}\}\right)=O_{t}$ , where $0\leq t\leq n$ is minimal such that $O_{t}>0$ .

Proof.

By Theorem 3.8,

[TABLE]

Let $Z_{0}:[b]^{n-j_{0}}\to[c]$ be the constant zero function. Let $Z(a)=c^{b^{n-(j_{0}+1)}}-1$ for all $a$ . Let

[TABLE]

be an arbitrary bijection for which $\beta(Z_{0})=Z$ .

Given $\varphi$ define $\psi$ by $\psi(x,b^{\prime})=\varphi(x)\circ\tau_{b^{\prime}}$ . The following are equivalent:

•

$\bigcup_{x\in[k]}\operatorname{ran}(\varphi(x))\supseteq[i]$ ;

•

$\psi$ is onto $[i]$ .

Thus $O_{t}$ is equal to (4), where $t=\chi(b,c,n)+1$ . By Proposition 4.1 under the bijection $\beta$ , with $k=b^{j_{0}}$ , $A=[b]$ , $i=c^{b^{n-j_{0}}}-1$ , and $v=c^{b^{n-j_{0}}}$ , $O_{t}$ is moreover equal to (6), as desired. ∎

Remark 4.4.

The authors regret that in [4], the condition (8) in Theorem 4.3 was erroneously omitted. By definition $b^{j_{0}+1}>c^{b^{n-(j_{0}+1)}}-1$ , so $b^{j_{0}+1}\geq c^{b^{n-(j_{0}+1)}}$ , but the condition fails when $b^{j_{0}+1}\geq c^{b^{n-(j_{0}+1)}}+b-1$ .

Example 4.5.

Consider the case $n=1$ , $b=c=2$ of Theorem 4.3. Then $i_{0}=1$ , where $i_{0}$ is the least $i$ such that $O_{i}>0$ . $O_{i}^{(2,2,1)}$ is the number of functions from $[2^{i}]$ to $[2^{2^{1-i}}]$ that are onto $[2^{2^{1-i}}-1]$ . For $i=0$ , there are no such functions. For $i=1$ , there are three such functions. And indeed, this is the number of maximal complexity functions in this case: the functions $f:\{0,1\}\to\{0,1\}$ that are onto $\{1\}$ .
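The two values of $O_{i}^{(2,2,1)}$ in this example can be reproduced by direct enumeration. The sketch below assumes the convention $[m]=\{0,\dots,m-1\}$ and only compares counts; the helper names `count_onto_prefix` and `O` are illustrative, and the enumeration is practical only for very small parameters.

```python
from itertools import product


def count_onto_prefix(u, v, i):
    """Count maps from range(u) to range(v) whose image contains range(i)."""
    needed = set(range(i))
    return sum(1 for f in product(range(v), repeat=u) if needed <= set(f))


def O(t, b, c, n):
    """O_t^{(b,c,n)} as described in Definition 4.2, by direct enumeration."""
    u = b ** t                # size of the domain [b^t]
    v = c ** (b ** (n - t))   # size of the codomain [c^{b^{n-t}}]
    return count_onto_prefix(u, v, v - 1)


# The case n = 1, b = c = 2, for i = 0 and i = 1.
values = [O(t, 2, 2, 1) for t in range(2)]
print(values)
```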

Definition 4.6.

Let $O_{m,n}$ be the number of onto functions from $[m]$ to $[n]$ . Stirling numbers of the second kind are denoted ${m\brace n}$ and equal the number of equivalence relations on $[m]$ with $n$ equivalence classes.

The following result is well known.

Lemma 4.7.

Let $m,n$ be positive integers. Then $O_{m,n}=n!{m\brace n}.$
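As an illustration, the identity can be checked for small $m$ and $n$ by comparing an enumeration of onto functions with Stirling numbers computed from the standard recurrence ${m\brace n}=n{m-1\brace n}+{m-1\brace n-1}$. This is a sanity check under the convention $[m]=\{0,\dots,m-1\}$, not a proof; the function names are illustrative.

```python
from itertools import product
from math import factorial


def stirling2(m, n):
    """Stirling number of the second kind via the standard recurrence."""
    if m == n:
        return 1
    if n == 0 or n > m:
        return 0
    return n * stirling2(m - 1, n) + stirling2(m - 1, n - 1)


def onto_count(m, n):
    """Number of onto maps from range(m) to range(n), by enumeration."""
    return sum(1 for f in product(range(n), repeat=m) if len(set(f)) == n)


# Tuples (m, n, enumerated count, n! * S(m, n)) for small parameters.
checks = [(m, n, onto_count(m, n), factorial(n) * stirling2(m, n))
          for m in range(1, 6) for n in range(1, m + 1)]
print(all(count == formula for _, _, count, formula in checks))
```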

Lemma 4.8.

Let $u$ and $v$ be positive integers. The number of functions from $[u]$ to $[v]$ that are onto the first $v-1$ elements of $[v]$ is

[TABLE]

The number of functions from $[u]$ to $[v]$ that are onto $[i]$ is

[TABLE]

Proof.

Let $m$ be the number of elements going to $[v]\setminus[i]$ . Then we see that the number of such functions is

[TABLE]

by Lemma 4.7. ∎
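The counting argument in the proof above can be sketched as follows. One way to write the resulting sum, assumed here and not necessarily the exact displayed form of the lemma, is $\sum_{m=0}^{u}\binom{u}{m}(v-i)^{m}\,i!\,{u-m\brace i}$: choose the $m$ inputs sent outside $[i]$, choose their values, and map the remaining inputs onto $[i]$. The code compares this sum with direct enumeration for small parameters, with $[m]=\{0,\dots,m-1\}$.

```python
from itertools import product
from math import comb, factorial


def stirling2(m, n):
    """Stirling number of the second kind via the standard recurrence."""
    if m == n:
        return 1
    if n == 0 or n > m:
        return 0
    return n * stirling2(m - 1, n) + stirling2(m - 1, n - 1)


def by_formula(u, v, i):
    """Sum over m, the number of inputs sent outside range(i) (assumed form)."""
    return sum(comb(u, m) * (v - i) ** m * factorial(i) * stirling2(u - m, i)
               for m in range(u + 1))


def by_enumeration(u, v, i):
    """Count maps from range(u) to range(v) whose image contains range(i)."""
    needed = set(range(i))
    return sum(1 for f in product(range(v), repeat=u) if needed <= set(f))


agree = all(by_formula(u, v, i) == by_enumeration(u, v, i)
            for u in range(1, 5) for v in range(1, 5) for i in range(1, v + 1))
print(agree)
```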

Lemma 4.9.

Let $u$ be a positive integer. The number of functions from $[u]$ to $[u]$ that are onto $[u-1]$ is $(u+1)!/2$ .

Proof.

Note that for any $m$ , ${m\brace m}=1$ and ${m\brace m-1}={m\choose 2}$ . By Lemma 4.8, the number of such functions is

$$(u-1)!{u\brace u-1}+u\,(u-1)!{u-1\brace u-1}=(u-1)!\binom{u}{2}+u!=\frac{(u+1)!}{2}.$$ ∎
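Lemma 4.9 can be spot-checked for small $u$ by enumerating all maps from $[u]$ to $[u]$ and keeping those whose image contains $[u-1]$. The sketch assumes $[m]=\{0,\dots,m-1\}$ and uses an illustrative function name.

```python
from itertools import product
from math import factorial


def count_onto_all_but_last(u):
    """Count maps from range(u) to range(u) whose image contains range(u - 1)."""
    needed = set(range(u - 1))
    return sum(1 for f in product(range(u), repeat=u) if needed <= set(f))


# Pairs (enumerated count, (u + 1)!/2) for u = 1, ..., 5.
pairs = [(count_onto_all_but_last(u), factorial(u + 1) // 2) for u in range(1, 6)]
print(pairs)
```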

The following Lemma 4.10 will only be applied in the case $\gamma=0$ .

Lemma 4.10.

Let $j$ be a nonnegative integer, let $p\geq 2$, and let $0\leq\gamma\leq j$ be an integer. Let $n=p^{j}+j-\gamma$, $b=p$, and $c=p^{p^{\gamma}}$. Then $O_{i}:=O^{(b,c,n)}_{i}$, where $i$ is minimal such that $O_{i}>0$, equals

$$\frac{(p^{p^{j}}+1)!}{2}.$$

Proof.

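First, $O_{i}^{(b,c,n)}>0$ for some $i$. Indeed, this condition says that $b^{i}\geq c^{b^{n-i}}-1$ for some $0\leq i\leq n$, i.e., $p^{i}\geq p^{p^{\gamma}p^{n-i}}-1$ for some such $i$. Since the left side increases and the right side decreases with $i$, this is equivalent to the case $i=n$, namely $p^{n}\geq p^{p^{\gamma}}-1$, which holds exactly when either $n\geq p^{\gamma}$ (i.e., $\gamma\leq j$) or $n=\gamma=0$, $p=2$. It therefore follows from the assumption $\gamma\leq j$.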

By Lemma 4.8, with $u=b^{i},v=c^{b^{n-i}}$ ,

[TABLE]

The condition $O_{i}>0$ is equivalent to $b^{i}\geq c^{b^{n-i}}-1$ . When $b=c=p$ and $i>0$ , this is equivalent to

[TABLE]

Let $k=p^{j}$ . Since by assumption $n=k+j-\gamma$ , (9) becomes

[TABLE]

Since the map $i\mapsto ip^{i}$ is increasing, the requirement for $i$ is that $i\geq k$ . Note that setting $i=k$ now makes $u=v$ . Therefore by Lemma 4.9, $O_{i}$ is $(u+1)!/2=(p^{i}+1)!/2$ as desired. ∎
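The conclusion that the minimal index is $i=k=p^{j}$, and that $u=v$ at that index, can be checked numerically for small parameters. The sketch below searches for the least $i$ with $b^{i}\geq c^{b^{n-i}}-1$ using exact integer arithmetic; the function names are illustrative, and only $\gamma=0$ is exercised in the table it builds.

```python
def minimal_index(b, c, n):
    """Least i in 0..n with b**i >= c**(b**(n - i)) - 1, or None if there is none."""
    for i in range(n + 1):
        if b ** i >= c ** (b ** (n - i)) - 1:
            return i
    return None


def check(p, j, gamma=0):
    """Return (i, u, v) for the parameters of Lemma 4.10."""
    n = p ** j + j - gamma
    b, c = p, p ** (p ** gamma)
    i = minimal_index(b, c, n)
    u, v = b ** i, c ** (b ** (n - i))
    return i, u, v


# Map (p, j) to (minimal i, u, v) for a few small cases with gamma = 0.
results = {(p, j): check(p, j) for p in (2, 3) for j in (0, 1, 2)}
for key, value in results.items():
    print(key, value)
```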

Lemma 4.11.

Let $j$ be a nonnegative integer. Let $n=p^{j}+j$ and $b=c=p\geq 2$ . Then

$$\#\left(\{f:[b]^{n}\to[c]\mid\mathsf{sc}(f)=\mathsf{maxsc}_{b,c,n}\}\right)=\frac{(p^{p^{j}}+1)!}{2}.$$

Proof.

We have $p^{n-j}=p^{p^{j}}$ and hence $p^{m}=p^{p^{n-m}}$ for $m=n-j$ , so that $p^{m}>p^{p^{n-m}}-1$ but $p(p^{m-1}-1)<p^{p^{n-m}}-1$ . Thus Theorem 4.3 applies and the number of such functions is $O_{i}:=O^{b,c,n}_{i}$ , where $i$ is minimal such that $O_{i}>0$ . By Lemma 4.10 with $\gamma=0$ we are done. ∎

Using Theorem 3.10 for $b=c=2$ we calculate some values for

$$\mathrm{nmcf}(n)=\#\left(\{f:[2]^{n}\to[2]\mid\mathsf{sc}(f)=\mathsf{maxsc}_{2,2,n}\}\right),$$

the number of maximally complex functions from $[2]^{n}$ to $[2]$ , in Table 1. In Theorem 4.12 we shall study the limiting behavior suggested by Table 1.
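The normalized column of Table 1 can be recomputed from the listed counts. The values of $\mathrm{nmcf}(n)$ below are copied from Table 1 rather than derived, and the dictionary names are illustrative.

```python
from math import log2

# Values of nmcf(n) as listed in Table 1, for n = 1, ..., 6.
nmcf = {1: 3, 2: 6, 3: 60, 4: 27720, 5: 259338240, 6: 177843714048000}

# The quantity (1/n) * log2(log2(nmcf(n))) shown in the third column.
normalized = {n: log2(log2(value)) / n for n, value in nmcf.items()}
for n, x in normalized.items():
    print(n, round(x, 3))
```

The values rise toward 1 up to $n=4$ and then dip slightly, which is the behavior that motivates taking a limit superior in the next theorem.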

Theorem 4.12.

The number of maximal complexity functions satisfies

$$\limsup_{n\to\infty}\frac{1}{n}\log_{2}\log_{2}(\mathrm{nmcf}(n))=1.$$

Proof.

It is immediate that $\limsup_{n\to\infty}\frac{1}{n}\log_{2}\log_{2}(\mathrm{nmcf}(n))\leq 1$ . For the other direction, consider the case where $n=2^{j}+j$ for some $j$ . By Stirling’s approximation,

[TABLE]

and hence $\lim_{n\to\infty}\frac{1}{n}\log_{2}\log_{2}(2^{2^{j}}!)=1$ . By Lemma 4.11,

[TABLE]

In Lemma 4.11, $(p^{p^{j}}+1)!/2$ may seem like a large number but it is relatively small: in terms of $w:=p^{k}$ ,

[TABLE]

Example 4.13.

For $n=6$ and $b=c=2$ , then, we get $i=k=2^{j}=4$ , and

$$\mathrm{nmcf}(6)=\frac{(2^{4}+1)!}{2}=\frac{17!}{2}=177{,}843{,}714{,}048{,}000.$$

So there are more than 177 trillion maximum-complexity 6-ary Boolean functions, which is however a small fraction of the total number of such functions,

$$2^{2^{6}}=2^{64}=18{,}446{,}744{,}073{,}709{,}551{,}616,$$

or over 18 quintillion.
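The arithmetic of this example can be reproduced with exact integers. The sketch below evaluates $(2^{k}+1)!/2$ for $k=4$ and compares it with the number $2^{2^{6}}$ of all functions from $[2]^{6}$ to $[2]$; the variable names are illustrative.

```python
from math import factorial

j = 2
k = 2 ** j                                    # i = k = 4, so n = k + j = 6
max_complexity = factorial(2 ** k + 1) // 2   # (2^4 + 1)!/2 = 17!/2
all_functions = 2 ** (2 ** 6)                 # all functions from [2]^6 to [2]
fraction = max_complexity / all_functions     # share of maximum-complexity functions
print(max_complexity, all_functions, fraction)
```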

Remark 4.14.

For future work, it would be interesting (but difficult) to determine the distribution of $\mathsf{sc}(f)$ over $f\in[c]^{[b]^{n}}$ .

Bibliography


[1] Cezar Câmpeanu and Wing Hong Ho. The maximum state complexity for finite languages. J. Autom. Lang. Comb., 9(2-3):189–202, 2004.
[2] J.-M. Champarnaud and J.-E. Pin. A maxmin problem on finite automata. Discrete Applied Mathematics, 23(1):91–96, 1989.
[3] Bjørn Kjos-Hanssen. On the complexity of automatic complexity. Theory Comput. Syst., 61(4):1427–1439, 2017.
[4] Bjørn Kjos-Hanssen and Lei Liu. The number of languages with maximum state complexity. In Theory and applications of models of computation, volume 11436 of Lecture Notes in Comput. Sci., pages 394–409. Springer, Cham, 2019.
[5] S. V. Poluyan and N. M. Ershov. Quantile transform in structural bioinformatics problems. Computational nanotechnology, (4):29–43, 2019.
[6] Jeffrey Shallit. A Second Course in Formal Languages and Automata Theory. Cambridge University Press, New York, NY, USA, 1 edition, 2008.
