Studying number theory with deep learning: a case study with the M\"obius and squarefree indicator functions

David Lowry-Duda

arXiv:2502.10335·math.NT·July 29, 2025

Studying number theory with deep learning: a case study with the M\"obius and squarefree indicator functions

David Lowry-Duda

PDF

Open Access

TL;DR

This paper explores the application of deep learning, specifically small transformer models, to predict number-theoretic functions like the Möbius function and squarefree indicator, providing insights into their structure.

Contribution

It demonstrates that transformer models can learn and predict number-theoretic functions, offering a novel intersection of deep learning and number theory with theoretical explanations.

Findings

01

Transformers achieve nontrivial accuracy in predicting μ(n) and μ²(n)

02

Model analysis offers theoretical insights into number-theoretic functions

03

Deep learning models reveal structural properties of number theory functions

Abstract

Building on work of Charton, we train small transformer models to calculate the M\"{o}bius function $μ (n)$ and the squarefree indicator function $μ^{2} (n)$ . The models attain nontrivial predictive power. We apply a mixture of additional models and feature scoring to give a theoretical explanation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnalytic Number Theory Research · Statistical and numerical algorithms