TL;DR
This paper explores alternative divergence measures for training GFlowNets, demonstrating that proper minimization leads to faster convergence and more effective generative modeling.
Contribution
It introduces statistically efficient estimators for various divergence measures and shows their benefits over traditional methods in GFlowNets training.
Findings
Proper divergence minimization improves convergence speed.
New estimators reduce gradient variance.
Training schemes based on these divergences are empirically effective.
Abstract
Generative Flow Networks (GFlowNets) are amortized inference models designed to sample from unnormalized distributions over composable objects, with applications in generative modeling for tasks in fields such as causal discovery, NLP, and drug discovery. Traditionally, the training procedure for GFlowNets seeks to minimize the expected log-squared difference between a proposal (forward policy) and a target (backward policy) distribution, which enforces certain flow-matching conditions. While this training procedure is closely related to variational inference (VI), directly attempting standard Kullback-Leibler (KL) divergence minimization can lead to proven biased and potentially high-variance estimators. Therefore, we first review four divergence measures, namely, Renyi-'s, Tsallis-'s, reverse and forward KL's, and design statistically efficient estimators for their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
