A DPI-PAC-Bayesian Framework for Generalization Bounds
Muhan Guan, Farhad Farokhi, Jingge Zhu

TL;DR
This paper introduces a unified DPI-PAC-Bayesian framework that derives tighter generalization bounds in supervised learning by integrating data processing inequalities with PAC-Bayesian methods, applicable to various divergences.
Contribution
It presents a novel framework combining DPI and PAC-Bayesian techniques to obtain explicit, tighter generalization bounds for multiple divergence measures, improving upon classical bounds.
Findings
Derived bounds for Rényi, Hellinger, and Chi-Squared divergences.
Unified framework connects data processing and PAC-Bayesian bounds.
Achieves tighter bounds by removing extraneous logarithmic slack.
Abstract
We develop a unified Data Processing Inequality PAC-Bayesian framework -- abbreviated DPI-PAC-Bayesian -- for deriving generalization error bounds in the supervised learning setting. By embedding the Data Processing Inequality (DPI) into the change-of-measure technique, we obtain explicit bounds on the binary Kullback-Leibler generalization gap for both R\'enyi divergence and any -divergence measured between a data-independent prior distribution and an algorithm-dependent posterior distribution. We present three bounds derived under our framework using R\'enyi, Hellinger \(p\) and Chi-Squared divergences. Additionally, our framework also demonstrates a close connection with other well-known bounds. When the prior distribution is chosen to be uniform, our bounds recover the classical Occam's Razor bound and, crucially, eliminate the extraneous \(\log(2\sqrt{n})/n\) slack present in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference
