Jensen-Shannon Information Based Characterization of the Generalization   Error of Learning Algorithms

Gholamali Aminian; Laura Toni; Miguel R. D. Rodrigues

arXiv:2010.12664·cs.IT·January 11, 2021

Jensen-Shannon Information Based Characterization of the Generalization Error of Learning Algorithms

Gholamali Aminian, Laura Toni, Miguel R. D. Rodrigues

PDF

TL;DR

This paper introduces a new information-theoretic generalization error bound using Jensen-Shannon information, which can be tighter than existing mutual information bounds and unifies previous bounds.

Contribution

It proposes a novel Jensen-Shannon information-based bound for generalization error that generalizes and improves upon existing mutual information bounds.

Findings

01

The bound can specialize to previous bounds.

02

Under certain conditions, the bound is tighter than mutual information bounds.

03

The approach unifies various existing generalization bounds.

Abstract

Generalization error bounds are critical to understanding the performance of machine learning models. In this work, we propose a new information-theoretic based generalization error upper bound applicable to supervised learning scenarios. We show that our general bound can specialize in various previous bounds. We also show that our general bound can be specialized under some conditions to a new bound involving the Jensen-Shannon information between a random variable modelling the set of training samples and another random variable modelling the hypothesis. We also prove that our bound can be tighter than mutual information-based bounds under some conditions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.