Avoid What You Know: Divergent Trajectory Balance for GFlowNets
Pedro Dall'Antonia, Tiago da Silva, Daniel Csillag, Salem Lahlou, Diego Mesquita

TL;DR
This paper introduces ACE, a new exploration algorithm for GFlowNets that enhances their ability to discover diverse high-reward states by explicitly training an exploration GFlowNet, leading to better approximation and diversity.
Contribution
The paper proposes Adaptive Complementary Exploration (ACE), a novel method that trains an exploration GFlowNet to improve the exploration of high-reward regions in GFlowNets.
Findings
ACE outperforms prior methods in distribution approximation accuracy.
ACE increases the discovery rate of diverse high-reward states.
Experiments demonstrate significant improvements in exploration efficiency.
Abstract
Generative Flow Networks (GFlowNets) are a flexible family of amortized samplers trained to generate discrete and compositional objects with probability proportional to a reward function. However, learning efficiency is constrained by the model's ability to rapidly explore diverse high-probability regions during training. To mitigate this issue, recent works have focused on incentivizing the exploration of unvisited and valuable states via curiosity-driven search and self-supervised random network distillation, which tend to waste samples on already well-approximated regions of the state space. In this context, we propose Adaptive Complementary Exploration (ACE), a principled algorithm for the effective exploration of novel and high-probability regions when learning GFlowNets. To achieve this, ACE introduces an exploration GFlowNet explicitly trained to search for high-reward states in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)
