Embarrassingly Parallel GFlowNets
Tiago da Silva, Luiz Max Carvalho, Amauri Souza, Samuel Kaski, Diego, Mesquita

TL;DR
EP-GFlowNet is a divide-and-conquer approach that enables efficient parallel and federated sampling from complex distributions using GFlowNets, reducing communication and computation costs.
Contribution
This paper introduces EP-GFlowNet, a provably correct parallel GFlowNet method with a novel aggregation technique for product distributions, enhancing scalability and applicability.
Findings
Effective in parallel Bayesian phylogenetics
Supports multi-objective optimization tasks
Reduces communication in federated learning setups
Abstract
GFlowNets are a promising alternative to MCMC sampling for discrete compositional random variables. Training GFlowNets requires repeated evaluations of the unnormalized target distribution or reward function. However, for large-scale posterior sampling, this may be prohibitive since it incurs traversing the data several times. Moreover, if the data are distributed across clients, employing standard GFlowNets leads to intensive client-server communication. To alleviate both these issues, we propose embarrassingly parallel GFlowNet (EP-GFlowNet). EP-GFlowNet is a provably correct divide-and-conquer method to sample from product distributions of the form -- e.g., in parallel or federated Bayes, where each is a local posterior defined on a data partition. First, in parallel, we train a local GFlowNet targeting each and send the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications
