A DPI-PAC-Bayesian Framework for Generalization Bounds

Muhan Guan; Farhad Farokhi; Jingge Zhu

arXiv:2507.14795·cs.IT·August 26, 2025

A DPI-PAC-Bayesian Framework for Generalization Bounds

Muhan Guan, Farhad Farokhi, Jingge Zhu

PDF

Open Access

TL;DR

This paper introduces a unified DPI-PAC-Bayesian framework that derives tighter generalization bounds in supervised learning by integrating data processing inequalities with PAC-Bayesian methods, applicable to various divergences.

Contribution

It presents a novel framework combining DPI and PAC-Bayesian techniques to obtain explicit, tighter generalization bounds for multiple divergence measures, improving upon classical bounds.

Findings

01

Derived bounds for Rényi, Hellinger, and Chi-Squared divergences.

02

Unified framework connects data processing and PAC-Bayesian bounds.

03

Achieves tighter bounds by removing extraneous logarithmic slack.

Abstract

We develop a unified Data Processing Inequality PAC-Bayesian framework -- abbreviated DPI-PAC-Bayesian -- for deriving generalization error bounds in the supervised learning setting. By embedding the Data Processing Inequality (DPI) into the change-of-measure technique, we obtain explicit bounds on the binary Kullback-Leibler generalization gap for both R\'enyi divergence and any $f$ -divergence measured between a data-independent prior distribution and an algorithm-dependent posterior distribution. We present three bounds derived under our framework using R\'enyi, Hellinger $p$ and Chi-Squared divergences. Additionally, our framework also demonstrates a close connection with other well-known bounds. When the prior distribution is chosen to be uniform, our bounds recover the classical Occam's Razor bound and, crucially, eliminate the extraneous $\log(2\sqrt{n})/n$ slack present in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference