Stronger Separation of Analog Neuron Hierarchy by Deterministic Context-Free Languages
Ji\v{r}\'i \v{S}\'ima

TL;DR
This paper investigates the computational capabilities of discrete-time recurrent neural networks with saturated-linear activation functions, establishing a hierarchy of analog neuron models and demonstrating their ability to recognize certain context-free languages beyond regular languages.
Contribution
The paper proves a stronger separation between neural network models recognizing regular and non-regular deterministic context-free languages, especially for 1ANNs and 2ANNs, advancing understanding of their computational power.
Findings
1ANNs cannot recognize non-regular DCFLs.
2ANNs can recognize all DCFLs.
The language $L_#=\{0^n1^n\}$ is the simplest non-regular DCFL recognized by 2ANNs.
Abstract
We analyze the computational power of discrete-time recurrent neural networks (NNs) with the saturated-linear activation function within the Chomsky hierarchy. This model restricted to integer weights coincides with binary-state NNs with the Heaviside activation function, which are equivalent to finite automata (Chomsky level 3) recognizing regular languages (REG), while rational weights make this model Turing-complete even for three analog-state units (Chomsky level 0). For the intermediate model ANN of a binary-state NN that is extended with extra analog-state neurons with rational weights, we have established the analog neuron hierarchy 0ANNs 1ANNs 2ANNs 3ANNs. The separation 1ANNs 2ANNs has been witnessed by the non-regular deterministic context-free language (DCFL) which cannot be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
