Pre-Training and Fine-Tuning Generative Flow Networks

Ling Pan; Moksh Jain; Kanika Madan; Yoshua Bengio

arXiv:2310.03419·cs.LG·October 6, 2023·1 cites

Pre-Training and Fine-Tuning Generative Flow Networks

Ling Pan, Moksh Jain, Kanika Madan, Yoshua Bengio

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a reward-free pre-training method for Generative Flow Networks (GFlowNets) using self-supervised learning, enabling efficient adaptation to new tasks and improved mode discovery.

Contribution

The paper proposes an outcome-conditioned GFlowNet (OC-GFN) framework for unsupervised pre-training and a novel approximation method for efficient fine-tuning on downstream tasks.

Findings

01

Pre-trained OC-GFN can sample from new reward functions effectively.

02

The approach accelerates adaptation to downstream tasks.

03

It improves mode discovery efficiency in generative modeling.

Abstract

Generative Flow Networks (GFlowNets) are amortized samplers that learn stochastic policies to sequentially generate compositional objects from a given unnormalized reward distribution. They can generate diverse sets of high-reward objects, which is an important consideration in scientific discovery tasks. However, as they are typically trained from a given extrinsic reward function, it remains an important open challenge about how to leverage the power of pre-training and train GFlowNets in an unsupervised fashion for efficient adaptation to downstream tasks. Inspired by recent successes of unsupervised pre-training in various domains, we introduce a novel approach for reward-free pre-training of GFlowNets. By framing the training as a self-supervised problem, we propose an outcome-conditioned GFlowNet (OC-GFN) that learns to explore the candidate space. Specifically, OC-GFN learns to…

Peer Reviews

Decision·ICLR 2024 spotlight

Reviewer 01Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

1. The paper introduces a novel approach for reward-free pre-training and fine-tuning of GFlowNets, which can serve as a foundation for further research of GFlowNet pretraining. 2. The paper provides a thorough description of the proposed approach, including the formulation of the problem, the training procedures, and the evaluation metrics. The experiments are well-designed and conducted, and the results are presented clearly.

Weaknesses

1. The paper lacks a comparison with existing approaches for pre-trained models or goal-conditioned RL methods.

Reviewer 02Rating 8· accept, good paperConfidence 3

Strengths

The concept presented in this paper is both simple and elegant. The unsupervised fine-tuning approach offers a significant contribution, adeptly addressing the training challenges associated with GFlowNet. Overall, the paper is well-structured and easy to follow, making it a valuable addition to the literature.

Weaknesses

See questions.

Reviewer 03Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

- The exposition is generally clear, and I enjoyed reading the paper. The authors first present the goal-conditioning idea and how it applies to GFNs, then walk the reader through their derivation and assumptions for amortized adaptation. I especially appreciated Section 2 which gave a clear and concise background. - The paper tackles an impactful problem for GFNs. While the pretraining solution is not particularly novel, it’s a neat application of goal-condition RL to an amortized sampling prob

Weaknesses

- There should be a discussions of assumptions behind the OC-GFNs pretraining. Namely, that transfer is only possible when the reward function changes but not if the action-space or the state-space change. Moreover, the goal-conditioning requires a well specified set of outcomes Y — presumably not all states s are terminal states — which makes the proposed method not truly unsupervised. These limitations (together with the applicability mentioned at the end of A.2) could be stated explicitly in

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis