Stronger Separation of Analog Neuron Hierarchy by Deterministic   Context-Free Languages

Ji\v{r}\'i \v{S}\'ima

arXiv:2102.01633·cs.NE·February 3, 2021

Stronger Separation of Analog Neuron Hierarchy by Deterministic Context-Free Languages

Ji\v{r}\'i \v{S}\'ima

PDF

TL;DR

This paper investigates the computational capabilities of discrete-time recurrent neural networks with saturated-linear activation functions, establishing a hierarchy of analog neuron models and demonstrating their ability to recognize certain context-free languages beyond regular languages.

Contribution

The paper proves a stronger separation between neural network models recognizing regular and non-regular deterministic context-free languages, especially for 1ANNs and 2ANNs, advancing understanding of their computational power.

Findings

01

1ANNs cannot recognize non-regular DCFLs.

02

2ANNs can recognize all DCFLs.

03

The language $L_#=\{0^n1^n\}$ is the simplest non-regular DCFL recognized by 2ANNs.

Abstract

We analyze the computational power of discrete-time recurrent neural networks (NNs) with the saturated-linear activation function within the Chomsky hierarchy. This model restricted to integer weights coincides with binary-state NNs with the Heaviside activation function, which are equivalent to finite automata (Chomsky level 3) recognizing regular languages (REG), while rational weights make this model Turing-complete even for three analog-state units (Chomsky level 0). For the intermediate model $α$ ANN of a binary-state NN that is extended with $α \geq 0$ extra analog-state neurons with rational weights, we have established the analog neuron hierarchy 0ANNs $\subset$ 1ANNs $\subset$ 2ANNs $\subseteq$ 3ANNs. The separation 1ANNs $⫋$ 2ANNs has been witnessed by the non-regular deterministic context-free language (DCFL) $L_{#} = {0^{n} 1^{n} ∣ n \geq 1}$ which cannot be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.