Parallelizable Neural Turing Machines

Gabriel Faria; Arnaldo Candido Junior

arXiv:2602.18508·cs.NE·February 24, 2026

Parallelizable Neural Turing Machines

Gabriel Faria, Arnaldo Candido Junior

PDF

Open Access

TL;DR

The paper presents P-NTM, a parallelizable and simplified neural architecture that efficiently solves algorithmic problems with length generalization and significantly faster training compared to standard NTMs.

Contribution

It introduces P-NTM, a parallelizable version of NTM that maintains performance while enabling efficient scan-based parallel execution.

Findings

01

Achieves length generalization comparable to standard NTM.

02

Up to tenfold faster training due to parallel execution.

03

Successfully solves algorithmic problems involving state tracking and memorization.

Abstract

We introduce a parallelizable simplification of Neural Turing Machine (NTM), referred to as P-NTM, which redesigns the core operations of the original architecture to enable efficient scan-based parallel execution. We evaluate the proposed architecture on a synthetic benchmark of algorithmic problems involving state tracking, memorization, and basic arithmetic, solved via autoregressive decoding. We compare it against a revisited stable implementation of the standard NTM, as well as conventional recurrent and attention-based architectures. Results show that, despite its simplifications, the proposed model attains length generalization performance comparable to the original, learning to solve all problems, including unseen sequence lengths, with perfect accuracy. It also improves training efficiency, with parallel execution of P-NTM being up to an order of magnitude faster than the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Neural Networks and Applications · Neural Networks and Reservoir Computing