Studying number theory with deep learning: a case study with the M\"obius and squarefree indicator functions
David Lowry-Duda

TL;DR
This paper explores the application of deep learning, specifically small transformer models, to predict number-theoretic functions like the Möbius function and squarefree indicator, providing insights into their structure.
Contribution
It demonstrates that transformer models can learn and predict number-theoretic functions, offering a novel intersection of deep learning and number theory with theoretical explanations.
Findings
Transformers achieve nontrivial accuracy in predicting μ(n) and μ²(n)
Model analysis offers theoretical insights into number-theoretic functions
Deep learning models reveal structural properties of number theory functions
Abstract
Building on work of Charton, we train small transformer models to calculate the M\"{o}bius function and the squarefree indicator function . The models attain nontrivial predictive power. We apply a mixture of additional models and feature scoring to give a theoretical explanation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnalytic Number Theory Research · Statistical and numerical algorithms
