Minimum Encoding Approaches for Predictive Modeling
Peter D Grunwald, Petri Kontkanen, Petri Myllymaki, Tomi Silander,, Henry Tirri

TL;DR
This paper compares two information-theoretic approaches, MDL and MML, for statistical inference, introduces revised MML estimators, and evaluates their performance on small data sets.
Contribution
It presents two new MML estimators, a pointwise and a volumewise version, improving upon previous MML methods and analyzing their performance relative to MDL.
Findings
MDL yields more accurate predictions on small data sets.
Revised MML estimators outperform original MML methods.
Empirical results favor MDL over MML in certain scenarios.
Abstract
We analyze differences between two information-theoretically motivated approaches to statistical inference and model selection: the Minimum Description Length (MDL) principle, and the Minimum Message Length (MML) principle. Based on this analysis, we present two revised versions of MML: a pointwise estimator which gives the MML-optimal single parameter model, and a volumewise estimator which gives the MML-optimal region in the parameter space. Our empirical results suggest that with small data sets, the MDL approach yields more accurate predictions than the MML estimators. The empirical results also demonstrate that the revised MML estimators introduced here perform better than the original MML estimator suggested by Wallace and Freeman.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Statistical Methods and Inference · Machine Learning and Algorithms
