Information-theoretic Analysis of the Gibbs Algorithm: An Individual   Sample Approach

Youheng Zhu,Yuheng Bu

arXiv:2410.12623·cs.IT·October 17, 2024

Information-theoretic Analysis of the Gibbs Algorithm: An Individual Sample Approach

Youheng Zhu,Yuheng Bu

PDF

Open Access

TL;DR

This paper introduces an individual sample approach to analyze the generalization error of the Gibbs algorithm, providing asymptotic equivalence results and tighter bounds through information-theoretic measures.

Contribution

It establishes the asymptotic equivalence between sample-wise and dataset-wise symmetrized KL information for the Gibbs algorithm, with explicit non-asymptotic bounds.

Findings

01

Asymptotic equivalence of information measures

02

Explicit non-asymptotic bounds for the gap

03

Tighter generalization error bounds

Abstract

Recent progress has shown that the generalization error of the Gibbs algorithm can be exactly characterized using the symmetrized KL information between the learned hypothesis and the entire training dataset. However, evaluating such a characterization is cumbersome, as it involves a high-dimensional information measure. In this paper, we address this issue by considering individual sample information measures within the Gibbs algorithm. Our main contribution lies in establishing the asymptotic equivalence between the sum of symmetrized KL information between the output hypothesis and individual samples and that between the hypothesis and the entire dataset. We prove this by providing explicit expressions for the gap between these measures in the non-asymptotic regime. Additionally, we characterize the asymptotic behavior of various information measures in the context of the Gibbs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Statistical Mechanics and Entropy