Exact Learning of Arithmetic with Differentiable Agents

Hristo Papazov; Francesco D'Angelo; Nicolas Flammarion

arXiv:2511.22751·cs.LG·December 1, 2025

Exact Learning of Arithmetic with Differentiable Agents

Hristo Papazov, Francesco D'Angelo, Nicolas Flammarion

PDF

Open Access

TL;DR

This paper introduces a differentiable framework using Finite-State Transducers for exact algorithmic learning, demonstrating strong length generalization in arithmetic tasks with gradient-based methods.

Contribution

It presents a novel differentiable model family, DFSTs, enabling exact learning of algorithms with length generalization using structured supervision.

Findings

01

Models trained on small datasets generalize to much longer inputs.

02

DFSTs achieve error-free performance on arithmetic tasks.

03

End-to-end differentiable training of algorithmic skills is feasible.

Abstract

We explore the possibility of exact algorithmic learning with gradient-based methods and introduce a differentiable framework capable of strong length generalization on arithmetic tasks. Our approach centers on Differentiable Finite-State Transducers (DFSTs), a Turing-complete model family that avoids the pitfalls of prior architectures by enabling constant-precision, constant-time generation, and end-to-end log-parallel differentiable training. Leveraging policy-trajectory observations from expert agents, we train DFSTs to perform binary and decimal addition and multiplication. Remarkably, models trained on tiny datasets generalize without error to inputs thousands of times longer than the training examples. These results show that training differentiable agents on structured intermediate supervision could pave the way towards exact gradient-based learning of algorithmic skills. Code…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks