A Mathematical Framework for Learning Probability Distributions

Hongkang Yang

arXiv:2212.11481·stat.ML·January 2, 2023

A Mathematical Framework for Learning Probability Distributions

Hongkang Yang

PDF

TL;DR

This paper introduces a unified mathematical framework for understanding probability distribution learning models, explaining their success, generalization, and phenomena like mode collapse through theoretical analysis.

Contribution

It provides a comprehensive framework that derives various models from simple principles and analyzes their approximation, training, and generalization errors.

Findings

01

Models enjoy implicit regularization during training

02

Early stopping helps avoid curse of dimensionality

03

New insights into landscape analysis and mode collapse

Abstract

The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and texts. Nevertheless, a theoretical understanding of its success is still incomplete. One mystery is the paradox between memorization and generalization: In theory, the model is trained to be exactly the same as the empirical distribution of the finite samples, whereas in practice, the trained model can generate new samples or estimate the likelihood of unseen samples. Likewise, the overwhelming diversity of distribution learning models calls for a unified perspective on this subject. This paper provides a mathematical framework such that all the well-known models can be derived based on simple principles. To demonstrate its efficacy, we present a survey…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.