Optimal regularizations for data generation with probabilistic graphical   models

Arnaud Fanthomme (ENS Paris); F Rizzato; S Cocco; R Monasson

arXiv:2112.01292·cs.LG·May 25, 2022

Optimal regularizations for data generation with probabilistic graphical models

Arnaud Fanthomme (ENS Paris), F Rizzato, S Cocco, R Monasson

PDF

TL;DR

This paper analyzes how optimal L1 and L2 regularizations improve the quality of generative models in graphical models by balancing likelihoods of training, test, and generated data, revealing a universal optimal regularization strength.

Contribution

It provides analytical and numerical evidence that the optimal regularization strength is approximately the inverse of the sum of squared couplings, largely independent of network structure.

Findings

01

Optimal regularization strength is about the inverse sum of squared couplings.

02

Test and generated data likelihoods are close at optimal regularization.

03

Results are robust across different interaction structures and regularization schemes.

Abstract

Understanding the role of regularization is a central question in Statistical Inference. Empirically, well-chosen regularization schemes often dramatically improve the quality of the inferred models by avoiding overfitting of the training data. We consider here the particular case of L 2 and L 1 regularizations in the Maximum A Posteriori (MAP) inference of generative pairwise graphical models. Based on analytical calculations on Gaussian multivariate distributions and numerical experiments on Gaussian and Potts models we study the likelihoods of the training, test, and 'generated data' (with the inferred models) sets as functions of the regularization strengths. We show in particular that, at its maximum, the test likelihood and the 'generated' likelihood, which quantifies the quality of the generated samples, have remarkably close values. The optimal value for the regularization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.