Foundation of Calculating Normalized Maximum Likelihood for Continuous Probability Models
Atsushi Suzuki, Kota Fukuzawa, Kenji Yamanishi

TL;DR
This paper proves that the method for calculating normalized maximum likelihood (NML) code length, previously validated for discrete models, is also correct for continuous models using a novel geometric measure theory approach.
Contribution
The paper provides the first rigorous proof that the NML code length calculation method applies to continuous probability models, introducing a new decomposition technique based on the coarea formula.
Findings
Confirmed the correctness of NML code length calculation for continuous models.
Developed a novel decomposition method using the coarea formula.
Bridged the theoretical gap between discrete and continuous NML calculations.
Abstract
The normalized maximum likelihood (NML) code length is widely used as a model selection criterion based on the minimum description length principle, where the model with the shortest NML code length is selected. A common method to calculate the NML code length is to use the sum (for a discrete model) or integral (for a continuous model) of a function defined by the distribution of the maximum likelihood estimator. While this method has been proven to correctly calculate the NML code length of discrete models, no proof has been provided for continuous cases. Consequently, it has remained unclear whether the method can accurately calculate the NML code length of continuous models. In this paper, we solve this problem affirmatively, proving that the method is also correct for continuous cases. Remarkably, completing the proof for continuous cases is non-trivial in that it cannot be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
