Minimum Description Length Revisited
Peter Gr\"unwald, Teemu Roos

TL;DR
This paper provides an updated comprehensive overview of the Minimum Description Length (MDL) principle, highlighting recent advances, new methods, and its unifying perspective across various statistical and machine learning techniques.
Contribution
It introduces the latest developments in MDL, including new model selection, averaging, hypothesis testing methods, and a general definition of MDL estimators, unifying different approaches.
Findings
MDL now encompasses new model selection and averaging techniques.
A general definition of MDL estimators has been established.
MDL offers a unified view of methods like AIC, BIC, cross-validation, and Bayesian approaches.
Abstract
This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle, a theory of inductive inference that can be applied to general problems in statistics, machine learning and pattern recognition. While MDL was originally based on data compression ideas, this introduction can be read without any knowledge thereof. It takes into account all major developments since 2007, the last time an extensive overview was written. These include new methods for model selection and averaging and hypothesis testing, as well as the first completely general definition of {\em MDL estimators}. Incorporating these developments, MDL can be seen as a powerful extension of both penalized likelihood and Bayesian approaches, in which penalization functions and prior distributions are replaced by more general luckiness functions, average-case methodology is replaced by a more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Fault Detection and Control Systems · Gaussian Processes and Bayesian Inference
MethodsMinimum Description Length
