Information Theoretic Structured Generative Modeling
Bo Hu, Shujian Yu, Jose C. Principe

TL;DR
This paper introduces the structured generative model (SGM), a novel parametric framework leveraging Rénnyi's information for efficient density estimation and generative modeling, with improved training stability and performance.
Contribution
It extends Rénnyi's information to parametric models, proposing SGM with new variational costs and neural network implementation for better density and mutual information estimation.
Findings
SGM outperforms MINE in data efficiency and variance reduction
SGM improves density estimation and generative adversarial network performance
The framework enables scalable and stable optimization in parametric information theoretic models
Abstract
R\'enyi's information provides a theoretical foundation for tractable and data-efficient non-parametric density estimation, based on pair-wise evaluations in a reproducing kernel Hilbert space (RKHS). This paper extends this framework to parametric probabilistic modeling, motivated by the fact that R\'enyi's information can be estimated in closed-form for Gaussian mixtures. Based on this special connection, a novel generative model framework called the structured generative model (SGM) is proposed that makes straightforward optimization possible, because costs are scale-invariant, avoiding high gradient variance while imposing less restrictions on absolute continuity, which is a huge advantage in parametric information theoretic optimization. The implementation employs a single neural network driven by an orthonormal input appended to a single white noise source adapted to learn an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks
MethodsTest
