Asymptotic Log-loss of Prequential Maximum Likelihood Codes

Peter Grunwald; Steven de Rooij

arXiv:cs/0502004·cs.LG·July 16, 2007

Asymptotic Log-loss of Prequential Maximum Likelihood Codes

Peter Grunwald, Steven de Rooij

PDF

Open Access

TL;DR

This paper investigates the asymptotic redundancy of prequential maximum likelihood codes within exponential family models, revealing that their growth rate differs from other universal codes and may be problematic for MDL model selection.

Contribution

It provides a precise characterization of the redundancy growth rate for prequential ML codes and compares it to other universal coding schemes, highlighting potential issues for model selection.

Findings

01

Redundancy grows at rate (v1/v2)/2 ln n, where v1 and v2 are variances.

02

Prequential codes differ from MDL, Shtarkov, and Bayes codes in their asymptotic behavior.

03

This difference can be undesirable for MDL-based model selection.

Abstract

We analyze the Dawid-Rissanen prequential maximum likelihood codes relative to one-parameter exponential family models M. If data are i.i.d. according to an (essentially) arbitrary P, then the redundancy grows at rate c/2 ln n. We show that c=v1/v2, where v1 is the variance of P, and v2 is the variance of the distribution m* in M that is closest to P in KL divergence. This shows that prequential codes behave quite differently from other important universal codes such as the 2-part MDL, Shtarkov and Bayes codes, for which c=1. This behavior is undesirable in an MDL model selection setting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsError Correcting Code Techniques · Algorithms and Data Compression · Advanced Wireless Communication Techniques