Generalization Bounds for Markov Algorithms through Entropy Flow Computations

Benjamin Dupuis; Maxime Haddouche; George Deligiannidis; Umut Simsekli

arXiv:2502.07584·stat.ML·March 6, 2026

Generalization Bounds for Markov Algorithms through Entropy Flow Computations

Benjamin Dupuis, Maxime Haddouche, George Deligiannidis, Umut Simsekli

PDF

Open Access

TL;DR

This paper extends entropy flow methods to derive generalization bounds for all Markov process-based learning algorithms, connecting their ergodic properties to generalization error.

Contribution

It introduces a new exact entropy flow formula for Markov algorithms and links it to modified logarithmic Sobolev inequalities, broadening the applicability of existing techniques.

Findings

01

Derived new generalization bounds for several algorithms

02

Extended entropy flow analysis to all Markov process-based algorithms

03

Connected ergodic properties of Markov processes to generalization error

Abstract

Many learning algorithms can be represented as Markov processes, and understanding their generalization error is a central topic in learning theory. For specific continuous-time noisy algorithms, a prominent analysis technique relies on information-theoretic tools and the so-called ``entropy flow'' method. This technique is compatible with a broad range of assumptions and leverages the convergence properties of learning dynamics to produce meaningful generalization bounds, which can also be informative or extend to discrete-time settings. Despite their success, existing entropy flow formulations are limited to specific noise and algorithm structures (\eg, Langevin dynamics). In this work, we exploit new technical tools to extend its applicability to all learning algorithms whose iterative dynamics is governed by a time-homogeneous Markov process. Our approach builds on a principled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications