Beyond Squared Error: Exploring Loss Design for Enhanced Training of   Generative Flow Networks

Rui Hu; Yifan Zhang; Zhuoran Li; Longbo Huang

arXiv:2410.02596·cs.LG·October 4, 2024

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Rui Hu, Yifan Zhang, Zhuoran Li, Longbo Huang

PDF

Open Access

TL;DR

This paper introduces a theoretical framework linking regression losses to divergence measures in GFlowNets, proposing new loss functions that improve exploration, exploitation, and overall training performance.

Contribution

It provides a rigorous analysis of loss functions in GFlowNets, introduces novel losses based on divergence properties, and demonstrates their effectiveness across multiple benchmarks.

Findings

01

New loss functions improve convergence speed.

02

Enhanced sample diversity and robustness.

03

Theoretical insights guide better loss design.

Abstract

Generative Flow Networks (GFlowNets) are a novel class of generative models designed to sample from unnormalized distributions and have found applications in various important tasks, attracting great research interest in their training algorithms. In general, GFlowNets are trained by fitting the forward flow to the backward flow on sampled training objects. Prior work focused on the choice of training objects, parameterizations, sampling and resampling strategies, and backward policies, aiming to enhance credit assignment, exploration, or exploitation of the training process. However, the choice of regression loss, which can highly influence the exploration and exploitation behavior of the under-training policy, has been overlooked. Due to the lack of theoretical understanding for choosing an appropriate regression loss, most existing algorithms train the flow network by minimizing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Simulation Techniques and Applications