Sequential Learning Of Neural Networks for Prequential MDL
Jorg Bornschein, Yazhe Li, Marcus Hutter

TL;DR
This paper explores efficient methods for computing prequential MDL for neural networks in image classification, proposing techniques like forward-calibration and replay-streams to improve description length estimates and model evaluation.
Contribution
It introduces replay-streams and forward-calibration techniques for better prequential MDL estimation in neural networks, enhancing evaluation on image datasets.
Findings
Online-learning with rehearsal outperforms block-wise estimation.
Proposed methods significantly improve description length estimates.
Efficient incremental training reduces computational costs.
Abstract
Minimum Description Length (MDL) provides a framework and an objective for principled model evaluation. It formalizes Occam's Razor and can be applied to data from non-stationary sources. In the prequential formulation of MDL, the objective is to minimize the cumulative next-step log-loss when sequentially going through the data and using previous observations for parameter estimation. It thus closely resembles a continual- or online-learning problem. In this study, we evaluate approaches for computing prequential description lengths for image classification datasets with neural networks. Considering the computational cost, we find that online-learning with rehearsal has favorable performance compared to the previously widely used block-wise estimation. We propose forward-calibration to better align the models predictions with the empirical observations and introduce replay-streams, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Neural Networks and Applications
MethodsALIGN · Minimum Description Length
