Parallelizable Neural Turing Machines
Gabriel Faria, Arnaldo Candido Junior

TL;DR
The paper presents P-NTM, a parallelizable and simplified neural architecture that efficiently solves algorithmic problems with length generalization and significantly faster training compared to standard NTMs.
Contribution
It introduces P-NTM, a parallelizable version of NTM that maintains performance while enabling efficient scan-based parallel execution.
Findings
Achieves length generalization comparable to standard NTM.
Up to tenfold faster training due to parallel execution.
Successfully solves algorithmic problems involving state tracking and memorization.
Abstract
We introduce a parallelizable simplification of Neural Turing Machine (NTM), referred to as P-NTM, which redesigns the core operations of the original architecture to enable efficient scan-based parallel execution. We evaluate the proposed architecture on a synthetic benchmark of algorithmic problems involving state tracking, memorization, and basic arithmetic, solved via autoregressive decoding. We compare it against a revisited stable implementation of the standard NTM, as well as conventional recurrent and attention-based architectures. Results show that, despite its simplifications, the proposed model attains length generalization performance comparable to the original, learning to solve all problems, including unseen sequence lengths, with perfect accuracy. It also improves training efficiency, with parallel execution of P-NTM being up to an order of magnitude faster than the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Neural Networks and Applications · Neural Networks and Reservoir Computing
