Learning and Generalization in Overparameterized Normalizing Flows
Kulin Shah, Amit Deshpande, Navin Goyal

TL;DR
This paper investigates the effects of overparameterization in normalizing flows, revealing that it can hinder training in constrained models but enables efficient learning in unconstrained models, both theoretically and empirically.
Contribution
It provides the first theoretical and empirical analysis of overparameterization effects in normalizing flows, distinguishing between constrained and unconstrained models.
Findings
Overparameterization hampers training in constrained NFs.
Unconstrained NFs can efficiently learn data distributions when overparameterized.
Empirical results support theoretical claims.
Abstract
In supervised learning, it is known that overparameterized neural networks with one hidden layer provably and efficiently learn and generalize, when trained using stochastic gradient descent with a sufficiently small learning rate and suitable initialization. In contrast, the benefit of overparameterization in unsupervised learning is not well understood. Normalizing flows (NFs) constitute an important class of models in unsupervised learning for sampling and density estimation. In this paper, we theoretically and empirically analyze these models when the underlying neural network is a one-hidden-layer overparametrized network. Our main contributions are two-fold: (1) On the one hand, we provide theoretical and empirical evidence that for constrained NFs (this class of NFs underlies many NF constructions) with the one-hidden-layer network, overparametrization hurts training. (2) On the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning
MethodsNormalizing Flows
