Empirical Risk Minimization with Relative Entropy Regularization

Samir M. Perlaza; Gaetan Bisson; I\~naki Esnaola; Alain Jean-Marie,; Stefano Rini

arXiv:2211.06617·math.ST·April 9, 2024

Empirical Risk Minimization with Relative Entropy Regularization

Samir M. Perlaza, Gaetan Bisson, I\~naki Esnaola, Alain Jean-Marie,, Stefano Rini

PDF

TL;DR

This paper explores ERM with relative entropy regularization under a general measure framework, establishing properties of solutions, their generalization guarantees, and connections to information theory.

Contribution

It generalizes ERM-RER to non-probability reference measures, proving solution uniqueness, generalization bounds, and linking sensitivity to lautum information.

Findings

01

Solution is a unique, absolutely continuous probability measure.

02

Empirical risk is sub-Gaussian under certain conditions.

03

Sensitivity analysis relates generalization error to lautum information.

Abstract

The empirical risk minimization (ERM) problem with relative entropy regularization (ERM-RER) is investigated under the assumption that the reference measure is a $σ$ -finite measure, and not necessarily a probability measure. Under this assumption, which leads to a generalization of the ERM-RER problem allowing a larger degree of flexibility for incorporating prior knowledge, numerous relevant properties are stated. Among these properties, the solution to this problem, if it exists, is shown to be a unique probability measure, mutually absolutely continuous with the reference measure. Such a solution exhibits a probably-approximately-correct guarantee for the ERM problem independently of whether the latter possesses a solution. For a fixed dataset and under a specific condition, the empirical risk is shown to be a sub-Gaussian random variable when the models are sampled from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsEntropy Regularization