A Group Theoretic Analysis of the Symmetries Underlying Base Addition and Their Learnability by Neural Networks

Cutter Dawes; Simon Segert; Kamesh Krishnamurthy; Jonathan D. Cohen

arXiv:2507.10678·cs.LG·July 17, 2025

A Group Theoretic Analysis of the Symmetries Underlying Base Addition and Their Learnability by Neural Networks

Cutter Dawes, Simon Segert, Kamesh Krishnamurthy, Jonathan D. Cohen

PDF

TL;DR

This paper uses group theory to analyze the symmetries in base addition, focusing on carry functions, and investigates how neural networks learn these symmetries, revealing that carry structure significantly influences learnability and generalization.

Contribution

It introduces a group theoretic framework for analyzing carry functions in base addition and examines neural network learnability of these symmetries based on carry structure.

Findings

01

Neural networks can achieve radical generalization with proper input format and carry function.

02

Carry function structure significantly affects neural network learnability.

03

Different carry functions lead to varying rates of learning in neural networks.

Abstract

A major challenge in the use of neural networks both for modeling human cognitive function and for artificial intelligence is the design of systems with the capacity to efficiently learn functions that support radical generalization. At the roots of this is the capacity to discover and implement symmetry functions. In this paper, we investigate a paradigmatic example of radical generalization through the use of symmetry: base addition. We present a group theoretic analysis of base addition, a fundamental and defining characteristic of which is the carry function -- the transfer of the remainder, when a sum exceeds the base modulus, to the next significant place. Our analysis exposes a range of alternative carry functions for a given base, and we introduce quantitative measures to characterize these. We then exploit differences in carry functions to probe the inductive biases of neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.