Pre-Training and Fine-Tuning Generative Flow Networks
Ling Pan, Moksh Jain, Kanika Madan, Yoshua Bengio

TL;DR
This paper introduces a reward-free pre-training method for Generative Flow Networks (GFlowNets) using self-supervised learning, enabling efficient adaptation to new tasks and improved mode discovery.
Contribution
The paper proposes an outcome-conditioned GFlowNet (OC-GFN) framework for unsupervised pre-training and a novel approximation method for efficient fine-tuning on downstream tasks.
Findings
Pre-trained OC-GFN can sample from new reward functions effectively.
The approach accelerates adaptation to downstream tasks.
It improves mode discovery efficiency in generative modeling.
Abstract
Generative Flow Networks (GFlowNets) are amortized samplers that learn stochastic policies to sequentially generate compositional objects from a given unnormalized reward distribution. They can generate diverse sets of high-reward objects, which is an important consideration in scientific discovery tasks. However, as they are typically trained from a given extrinsic reward function, it remains an important open challenge about how to leverage the power of pre-training and train GFlowNets in an unsupervised fashion for efficient adaptation to downstream tasks. Inspired by recent successes of unsupervised pre-training in various domains, we introduce a novel approach for reward-free pre-training of GFlowNets. By framing the training as a self-supervised problem, we propose an outcome-conditioned GFlowNet (OC-GFN) that learns to explore the candidate space. Specifically, OC-GFN learns to…
Peer Reviews
Decision·ICLR 2024 spotlight
1. The paper introduces a novel approach for reward-free pre-training and fine-tuning of GFlowNets, which can serve as a foundation for further research of GFlowNet pretraining. 2. The paper provides a thorough description of the proposed approach, including the formulation of the problem, the training procedures, and the evaluation metrics. The experiments are well-designed and conducted, and the results are presented clearly.
1. The paper lacks a comparison with existing approaches for pre-trained models or goal-conditioned RL methods.
The concept presented in this paper is both simple and elegant. The unsupervised fine-tuning approach offers a significant contribution, adeptly addressing the training challenges associated with GFlowNet. Overall, the paper is well-structured and easy to follow, making it a valuable addition to the literature.
See questions.
- The exposition is generally clear, and I enjoyed reading the paper. The authors first present the goal-conditioning idea and how it applies to GFNs, then walk the reader through their derivation and assumptions for amortized adaptation. I especially appreciated Section 2 which gave a clear and concise background. - The paper tackles an impactful problem for GFNs. While the pretraining solution is not particularly novel, it’s a neat application of goal-condition RL to an amortized sampling prob
- There should be a discussions of assumptions behind the OC-GFNs pretraining. Namely, that transfer is only possible when the reward function changes but not if the action-space or the state-space change. Moreover, the goal-conditioning requires a well specified set of outcomes Y — presumably not all states s are terminal states — which makes the proposed method not truly unsupervised. These limitations (together with the applicability mentioned at the end of A.2) could be stated explicitly in
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis
