On Divergence Measures for Training GFlowNets

Tiago da Silva; Eliezer de Souza da Silva; Diego Mesquita

arXiv:2410.09355·cs.LG·April 13, 2026

On Divergence Measures for Training GFlowNets

Tiago da Silva, Eliezer de Souza da Silva, Diego Mesquita

PDF

1 Video

TL;DR

This paper explores alternative divergence measures for training GFlowNets, demonstrating that proper minimization leads to faster convergence and more effective generative modeling.

Contribution

It introduces statistically efficient estimators for various divergence measures and shows their benefits over traditional methods in GFlowNets training.

Findings

01

Proper divergence minimization improves convergence speed.

02

New estimators reduce gradient variance.

03

Training schemes based on these divergences are empirically effective.

Abstract

Generative Flow Networks (GFlowNets) are amortized inference models designed to sample from unnormalized distributions over composable objects, with applications in generative modeling for tasks in fields such as causal discovery, NLP, and drug discovery. Traditionally, the training procedure for GFlowNets seeks to minimize the expected log-squared difference between a proposal (forward policy) and a target (backward policy) distribution, which enforces certain flow-matching conditions. While this training procedure is closely related to variational inference (VI), directly attempting standard Kullback-Leibler (KL) divergence minimization can lead to proven biased and potentially high-variance estimators. Therefore, we first review four divergence measures, namely, Renyi- $α$ 's, Tsallis- $α$ 's, reverse and forward KL's, and design statistically efficient estimators for their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On Divergence Measures for Training GFlowNets· slideslive