Upper Bound on Normalized Maximum Likelihood Codes for Gaussian Mixture Models
So Hirai, Kenji Yamanishi

TL;DR
This paper establishes that the previously computed normalized maximum likelihood (NML) code-length serves as an upper bound for the true NML code-length in Gaussian Mixture Models, and demonstrates the universality of the model selection algorithm despite data scaling.
Contribution
It introduces an upper bound for NML code-length in Gaussian Mixture Models and shows the universality of the model selection algorithm across data scales, also correcting NML for generalized logistic distributions.
Findings
The NML code-length in [1] is an upper bound for the true NML in GMMs.
Model selection algorithm is scale-invariant in GMMs.
Corrected NML code-length for generalized logistic distributions.
Abstract
This paper shows that the normalized maximum likelihood~(NML) code-length calculated in [1] is an upper bound on the NML code-length strictly calculated for the Gaussian Mixture Model. When we use this upper bound on the NML code-length, we must change the scale of the data sequence to satisfy the restricted domain. However, we also show that the algorithm for model selection is essentially universal, regardless of the scale conversion of the data in Gaussian Mixture Models, and that, consequently, the experimental results in [1] can be used as they are. In addition to this, we correct the NML code-length in [1] for generalized logistic distributions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Distribution Estimation and Applications
