Algorithm Development in Neural Networks: Insights from the Streaming Parity Task

Loek van Rossem; Andrew M. Saxe

arXiv:2507.09897·cs.LG·August 7, 2025

Algorithm Development in Neural Networks: Insights from the Streaming Parity Task

Loek van Rossem, Andrew M. Saxe

PDF

Open Access

TL;DR

This paper investigates how recurrent neural networks develop algorithms capable of infinite generalization on the streaming parity task, revealing a phase transition and an implicit automaton construction that explains this phenomenon.

Contribution

It provides a theoretical framework for understanding how neural networks learn algorithms that enable infinite generalization from finite training data.

Findings

01

RNNs exhibit a phase transition to perfect infinite generalization.

02

An implicit automaton is constructed through representational merger.

03

The study offers a new perspective on neural network generalization mechanisms.

Abstract

Even when massively overparameterized, deep neural networks show a remarkable ability to generalize. Research on this phenomenon has focused on generalization within distribution, via smooth interpolation. Yet in some settings neural networks also learn to extrapolate to data far beyond the bounds of the original training set, sometimes even allowing for infinite generalization, implying that an algorithm capable of solving the task has been learned. Here we undertake a case study of the learning dynamics of recurrent neural networks (RNNs) trained on the streaming parity task in order to develop an effective theory of algorithm development. The streaming parity task is a simple but nonlinear task defined on sequences up to arbitrary length. We show that, with sufficient finite training experience, RNNs exhibit a phase transition to perfect infinite generalization. Using an effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications