Information-Theoretic Generalization Bounds for Sequential Decision Making

Futoshi Futami; Masahiro Fujisawa

arXiv:2605.12190·stat.ML·May 13, 2026

Information-Theoretic Generalization Bounds for Sequential Decision Making

Futoshi Futami, Masahiro Fujisawa

PDF

TL;DR

This paper develops a new information-theoretic framework for analyzing generalization in sequential decision-making, extending existing bounds to settings like online learning and bandits.

Contribution

It introduces a sequential supersample framework and sequential CMI bounds that handle adaptive data revealing and causal learner trajectories.

Findings

01

Sequential CMI bounds control generalization gap in adaptive settings.

02

Bernstein-type refinement achieves faster convergence rates.

03

Framework applies to online learning, streaming active learning, and bandits.

Abstract

Information-theoretic generalization bounds based on the supersample construction are a central tool for algorithm-dependent generalization analysis in the batch i.i.d.~setting. However, existing supersample conditional mutual information (CMI) bounds do not directly apply to sequential decision-making problems such as online learning, streaming active learning, and bandits, where data are revealed adaptively and the learner evolves along a causal trajectory. To address this limitation, we develop a sequential supersample framework that separates the learner filtration from a proof-side enlargement used for ghost-coordinate comparisons. Under a row-wise exchangeability assumption, the sequential generalization gap is controlled by sequential CMI, a sum of roundwise selector--loss information terms. We also establish a Bernstein-type refinement that yields faster rates under suitable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.