A provably stable neural network Turing Machine
John Stogin, Ankur Mali, C Lee Giles

TL;DR
This paper introduces a provably stable neural network architecture that can simulate Turing machines and pushdown automata, providing theoretical bounds on the computational power of bounded precision neural networks with memory.
Contribution
It presents a neural stack, neural pushdown automaton, and neural Turing machine with proven stability and the ability to simulate classical automata and Turing machines, including a universal Turing machine with minimal precision.
Findings
Neural stack architecture is stable and closely approximates discrete stacks.
Neural pushdown automaton can simulate any PDA.
Neural Turing machine can simulate any Turing machine with only seven neurons.
Abstract
We introduce a neural stack architecture, including a differentiable parametrized stack operator that approximates stack push and pop operations for suitable choices of parameters that explicitly represents a stack. We prove the stability of this stack architecture: after arbitrarily many stack operations, the state of the neural stack still closely resembles the state of the discrete stack. Using the neural stack with a recurrent neural network, we introduce a neural network Pushdown Automaton (nnPDA) and prove that nnPDA with finite/bounded neurons and time can simulate any PDA. Furthermore, we extend our construction and propose new architecture neural state Turing Machine (nnTM). We prove that differentiable nnTM with bounded neurons can simulate Turing Machine (TM) in real-time. Just like the neural stack, these architectures are also stable. Finally, we extend our construction to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing
