Neural Prime Sieves: Density-Driven Generalization and Empirical Evidence for Hardy-Littlewood Asymptotics
Manik Kakkar

TL;DR
This paper introduces PrimeFamilyNet, a neural network model that probabilistically filters prime families across large scales, demonstrating improved recall and generalization consistent with Hardy-Littlewood asymptotics.
Contribution
The authors develop a multi-head residual network conditioned on prime properties, achieving scalable, density-driven prime family identification and empirical validation of Hardy-Littlewood predictions.
Findings
Recall for isolated primes increased from 80.9% to 98.4% from scale 10^8 to 10^16.
Model trained up to 10^9 generalized correctly to 10^16 without density supervision.
Causal model retained over 95% recall at 10^10, reducing search space by up to 88%.
Abstract
Special prime families (twin, Sophie Germain, safe, cousin, sexy, Chen, and isolated primes) are central objects of analytic number theory, yet no efficiently computable probabilistic filter exists for identifying likely members among known primes at large scale. Classical sieves assign no probability weights to surviving candidates, and prior machine learning approaches are limited by the algorithmic randomness of the prime indicator sequence, yielding near-zero true positive rates. We present PrimeFamilyNet, a multi-head residual network conditioned on the backward prime gap and modular primorial residues of a known prime , learning probabilistic filters for all seven families simultaneously and generalising across nine orders of magnitude from training (--) to evaluation at . Isolated prime recall increased monotonically from at to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
