Parsimonious Inference
Jed A. Duersch, Thomas A. Catanach

TL;DR
Parsimonious inference offers an information-theoretic framework for Bayesian inference that emphasizes simplicity, formalizes Occam's Razor, and develops algorithms for small or skewed datasets, improving predictive modeling without cross-validation.
Contribution
It introduces a universal hyperprior based on program length and Kolmogorov complexity, formalizes inference as information minimization, and develops novel algorithms for polynomial regression and random forests.
Findings
Effective algorithms for small datasets demonstrated
Ensembles constructed without cross-validation
Framework quantifies memorization phenomena
Abstract
Bayesian inference provides a uniquely rigorous approach to obtain principled justification for uncertainty in predictions, yet it is difficult to articulate suitably general prior belief in the machine learning context, where computational architectures are pure abstractions subject to frequent modifications by practitioners attempting to improve results. Parsimonious inference is an information-theoretic formulation of inference over arbitrary architectures that formalizes Occam's Razor; we prefer simple and sufficient explanations. Our universal hyperprior assigns plausibility to prior descriptions, encoded as sequences of symbols, by expanding on the core relationships between program length, Kolmogorov complexity, and Solomonoff's algorithmic probability. We then cast learning as information minimization over our composite change in belief when an architecture is specified,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Philosophy and History of Science · Race, Genetics, and Society
