Sharpened Generalization Bounds based on Conditional Mutual Information and an Application to Noisy, Iterative Algorithms
Mahdi Haghifam, Jeffrey Negrea, Ashish Khisti, Daniel M. Roy, Gintare, Karolina Dziugaite

TL;DR
This paper develops sharper generalization bounds using conditional mutual information, improving upon previous bounds, and applies these to analyze noisy, iterative algorithms like Langevin dynamics.
Contribution
It introduces tighter generalization bounds based on conditional mutual information and applies them to analyze Langevin dynamics algorithms.
Findings
Conditional mutual information bounds are tighter than unconditional ones.
Bounds leverage information from optimization trajectories.
Application to Langevin dynamics shows improved generalization analysis.
Abstract
The information-theoretic framework of Russo and J. Zou (2016) and Xu and Raginsky (2017) provides bounds on the generalization error of a learning algorithm in terms of the mutual information between the algorithm's output and the training sample. In this work, we study the proposal, by Steinke and Zakynthinou (2020), to reason about the generalization error of a learning algorithm by introducing a super sample that contains the training sample as a random subset and computing mutual information conditional on the super sample. We first show that these new bounds based on the conditional mutual information are tighter than those based on the unconditional mutual information. We then introduce yet tighter bounds, building on the "individual sample" idea of Bu, S. Zou, and Veeravalli (2019) and the "data dependent" ideas of Negrea et al. (2019), using disintegrated mutual information.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Remote-Sensing Image Classification · Image and Signal Denoising Methods
