Bayesian Modeling of Collatz Stopping Times: A Probabilistic Machine Learning Perspective
Nicol\`o Bonacorsi, Matteo Bordoni

TL;DR
This paper models the Collatz total stopping time using Bayesian hierarchical regression and a mechanistic probabilistic generator, revealing modular structure influences its variability and improving prediction accuracy.
Contribution
Introduces a Bayesian hierarchical Negative Binomial model and a mechanistic stochastic generator for Collatz stopping times, highlighting the role of modular arithmetic in heterogeneity.
Findings
The NB2-GLM outperforms the mechanistic generator in predictive likelihood.
Conditioning on residue classes improves the generator's fit.
Modular structure significantly influences stopping time variability.
Abstract
We study the Collatz total stopping time over from a probabilistic machine learning viewpoint. Empirically, is a skewed and heavily overdispersed count with pronounced arithmetic heterogeneity. We develop two complementary models. First, a Bayesian hierarchical Negative Binomial regression (NB2-GLM) predicts from simple covariates ( and residue class ), quantifying uncertainty via posterior and posterior predictive distributions. Second, we propose a mechanistic generative approximation based on the odd-block decomposition: for odd , write with odd and ; randomizing these block lengths yields a stochastic approximation calibrated via a Dirichlet-multinomial update. On held-out data, the NB2-GLM achieves substantially higher predictive likelihood than the odd-block generators.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBenford’s Law and Fraud Detection · Probability and Statistical Research · Pharmacovigilance and Adverse Drug Reactions
