Sequential Learning Of Neural Networks for Prequential MDL

Jorg Bornschein; Yazhe Li; Marcus Hutter

arXiv:2210.07931 · stat.ML · October 17, 2022 · 1 citation

TL;DR

This paper explores efficient methods for computing prequential MDL for neural networks in image classification, proposing techniques like forward-calibration and replay-streams to improve description length estimates and model evaluation.

Contribution

It introduces replay-streams and forward-calibration techniques for better prequential MDL estimation in neural networks, enhancing evaluation on image datasets.

Findings

01. Considering computational cost, online learning with rehearsal compares favorably with the previously widely used block-wise estimation.

02. Forward-calibration and replay-streams improve description length estimates.

03. Efficient incremental training reduces computational costs.

Abstract

Minimum Description Length (MDL) provides a framework and an objective for principled model evaluation. It formalizes Occam's Razor and can be applied to data from non-stationary sources. In the prequential formulation of MDL, the objective is to minimize the cumulative next-step log-loss when sequentially going through the data and using previous observations for parameter estimation. It thus closely resembles a continual- or online-learning problem. In this study, we evaluate approaches for computing prequential description lengths for image classification datasets with neural networks. Considering the computational cost, we find that online-learning with rehearsal has favorable performance compared to the previously widely used block-wise estimation. We propose forward-calibration to better align the model's predictions with the empirical observations and introduce replay-streams, a…
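The prequential objective described in the abstract, the cumulative next-step log-loss, can be illustrated with a small sketch. This is not the paper's neural-network procedure: as a stand-in predictor it assumes a simple count-based estimator with additive smoothing, and the function name `prequential_code_length` and the parameter `alpha` are hypothetical, chosen only for this illustration. The one property it shares with the setting in the abstract is that each label is predicted using previous observations only.

```python
import math
from collections import Counter


def prequential_code_length(labels, num_classes, alpha=1.0):
    """Return the cumulative next-step log-loss (in bits) of a label sequence.

    Assumption: the predictor is a smoothed frequency estimate, used here
    as a toy replacement for a neural network trained on past data.
    """
    counts = Counter()   # label counts from observations seen so far
    total_bits = 0.0
    for t, y in enumerate(labels):
        # Predict label t from the first t observations only.
        p = (counts[y] + alpha) / (t + alpha * num_classes)
        total_bits += -math.log2(p)
        # Only after paying the coding cost is the observation used for estimation.
        counts[y] += 1
    return total_bits


if __name__ == "__main__":
    sequence = [0, 0, 1, 0]
    print(prequential_code_length(sequence, num_classes=2))
```

Under these assumptions, a predictor that adapts faster to the data yields a shorter total code length, which is why the quantity can serve as a model evaluation criterion.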

Videos

Sequential Learning of Neural Networks for Prequential MDL · SlidesLive

Taxonomy

Topics: Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Neural Networks and Applications

Methods: ALIGN · Minimum Description Length