On the Validation of Gibbs Algorithms: Training Datasets, Test Datasets   and their Aggregation

Samir M. Perlaza; I\~naki Esnaola; Gaetan Bisson; H. Vincent Poor

arXiv:2306.12380·cs.LG·June 22, 2023·1 cites

On the Validation of Gibbs Algorithms: Training Datasets, Test Datasets and their Aggregation

Samir M. Perlaza, I\~naki Esnaola, Gaetan Bisson, H. Vincent Poor

PDF

Open Access

TL;DR

This paper analytically characterizes the dependence of Gibbs algorithms on training data, providing explicit formulas for sensitivity and insights into dataset aggregation and generalization performance.

Contribution

It introduces a closed-form sensitivity analysis of Gibbs algorithms and explores dataset aggregation effects on their generalization capabilities.

Findings

01

Explicit expressions linking training and test errors of GAs.

02

Sensitivity of GAs to training data characterized in closed form.

03

Connection established between Jeffrey's divergence and generalization metrics.

Abstract

The dependence on training data of the Gibbs algorithm (GA) is analytically characterized. By adopting the expected empirical risk as the performance metric, the sensitivity of the GA is obtained in closed form. In this case, sensitivity is the performance difference with respect to an arbitrary alternative algorithm. This description enables the development of explicit expressions involving the training errors and test errors of GAs trained with different datasets. Using these tools, dataset aggregation is studied and different figures of merit to evaluate the generalization capabilities of GAs are introduced. For particular sizes of such datasets and parameters of the GAs, a connection between Jeffrey's divergence, training and test errors is established.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Evolutionary Algorithms and Applications

MethodsGenetic Algorithms