Tighter expected generalization error bounds via Wasserstein distance

Borja Rodr\'iguez-G\'alvez; Germ\'an Bassi; Ragnar Thobaben; and; Mikael Skoglund

arXiv:2101.09315·stat.ML·March 29, 2022·20 cites

Tighter expected generalization error bounds via Wasserstein distance

Borja Rodr\'iguez-G\'alvez, Germ\'an Bassi, Ragnar Thobaben, and, Mikael Skoglund

PDF

Open Access 1 Video

TL;DR

This paper develops new expected generalization error bounds using Wasserstein distance, bridging geometric and entropy-based approaches, and introduces bounds based on various information measures.

Contribution

It introduces novel generalization bounds based on Wasserstein distance that unify geometric and entropy-based methods, including bounds with different information measures.

Findings

01

Bounds are tighter than existing entropy-based bounds when loss is bounded.

02

New bounds based on Wasserstein distance can be derived for various information measures.

03

The bounds recover from below, providing non-vacuous estimates in certain settings.

Abstract

This work presents several expected generalization error bounds based on the Wasserstein distance. More specifically, it introduces full-dataset, single-letter, and random-subset bounds, and their analogues in the randomized subsample setting from Steinke and Zakynthinou [1]. Moreover, when the loss function is bounded and the geometry of the space is ignored by the choice of the metric in the Wasserstein distance, these bounds recover from below (and thus, are tighter than) current bounds based on the relative entropy. In particular, they generate new, non-vacuous bounds based on the relative entropy. Therefore, these results can be seen as a bridge between works that account for the geometry of the hypothesis space and those based on the relative entropy, which is agnostic to such geometry. Furthermore, it is shown how to produce various new bounds based on different information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Tighter Expected Generalization Error Bounds via Wasserstein Distance· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques