Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Rui Hu, Yifan Zhang, Zhuoran Li, Longbo Huang

TL;DR
This paper introduces a theoretical framework linking regression losses to divergence measures in GFlowNets, proposing new loss functions that improve exploration, exploitation, and overall training performance.
Contribution
It provides a rigorous analysis of loss functions in GFlowNets, introduces novel losses based on divergence properties, and demonstrates their effectiveness across multiple benchmarks.
Findings
New loss functions improve convergence speed.
Enhanced sample diversity and robustness.
Theoretical insights guide better loss design.
Abstract
Generative Flow Networks (GFlowNets) are a novel class of generative models designed to sample from unnormalized distributions and have found applications in various important tasks, attracting great research interest in their training algorithms. In general, GFlowNets are trained by fitting the forward flow to the backward flow on sampled training objects. Prior work focused on the choice of training objects, parameterizations, sampling and resampling strategies, and backward policies, aiming to enhance credit assignment, exploration, or exploitation of the training process. However, the choice of regression loss, which can highly influence the exploration and exploitation behavior of the under-training policy, has been overlooked. Due to the lack of theoretical understanding for choosing an appropriate regression loss, most existing algorithms train the flow network by minimizing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Simulation Techniques and Applications
