Deep Generative Modeling on Limited Data with Regularization by Nontransferable Pre-trained Models
Yong Zhong, Hongtao Liu, Xiaodong Liu, Fan Bao, Weiran Shen, Chongxuan, Li

TL;DR
This paper introduces Reg-DGM, a regularization method for deep generative models that uses nontransferable pre-trained models to reduce variance and improve performance on limited data, supported by theoretical analysis and empirical results.
Contribution
It proposes a novel regularization framework for DGMs leveraging nontransferable pre-trained models, with theoretical guarantees and practical improvements.
Findings
Reg-DGM improves generative quality with limited data.
Theoretical analysis shows existence and convergence of the method.
Empirical results outperform state-of-the-art on various datasets.
Abstract
Deep generative models (DGMs) are data-eager because learning a complex model on limited data suffers from a large variance and easily overfits. Inspired by the classical perspective of the bias-variance tradeoff, we propose regularized deep generative model (Reg-DGM), which leverages a nontransferable pre-trained model to reduce the variance of generative modeling with limited data. Formally, Reg-DGM optimizes a weighted sum of a certain divergence and the expectation of an energy function, where the divergence is between the data and the model distributions, and the energy function is defined by the pre-trained model w.r.t. the model distribution. We analyze a simple yet representative Gaussian-fitting case to demonstrate how the weighting hyperparameter trades off the bias and the variance. Theoretically, we characterize the existence and the uniqueness of the global minimum of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Face recognition and analysis
MethodsConvolution · Path Length Regularization · R1 Regularization · Weight Demodulation · HuMan(Expedia)||How do I get a human at Expedia?
