Torsional-GFN: a conditional conformation generator for small molecules
Alexandra Volokhova, L\'ena N\'ehale Ezzine, Piotr Gai\'nski, Luca Scimeca, Emmanuel Bengio, Prudencio Tossou, Yoshua Bengio, Alex Hernandez-Garcia

TL;DR
Torsional-GFN is a novel conditional GFlowNet model that efficiently samples molecular conformations proportional to their Boltzmann distribution, enabling zero-shot generalization and promising scalability for drug discovery applications.
Contribution
It introduces Torsional-GFN, a conditional GFlowNet that samples conformations based on a reward function, with zero-shot generalization to unseen molecules and local structures.
Findings
Samples conformations proportional to Boltzmann distribution
Achieves zero-shot generalization to unseen bond lengths and angles
Works effectively across multiple molecules with a single model
Abstract
Generating stable molecular conformations is crucial in several drug discovery applications, such as estimating the binding affinity of a molecule to a target. Recently, generative machine learning methods have emerged as a promising, more efficient method than molecular dynamics for sampling of conformations from the Boltzmann distribution. In this paper, we introduce Torsional-GFN, a conditional GFlowNet specifically designed to sample conformations of molecules proportionally to their Boltzmann distribution, using only a reward function as training signal. Conditioned on a molecular graph and its local structure (bond lengths and angles), Torsional-GFN samples rotations of its torsion angles. Our results demonstrate that Torsional-GFN is able to sample conformations approximately proportional to the Boltzmann distribution for multiple molecules with a single model, and allows for…
Peer Reviews
Decision·Submitted to ICLR 2026
1. The paper begins by clearly articulating the importance of efficiently sampling molecular conformations from the Boltzmann distribution and its applications in drug discovery. It accurately identifies the limitations of existing methods (e.g., computational cost of MD and data dependency of diffusion models) and makes a compelling case for GFlowNets as an alternative that can be trained solely on a reward function (energy). 2. The paper includes multiple metrics and good visualizations.
1. The dataset is in very limited scale. There is no demonstration of scalability to larger molecules or varying numbers of torsion angles. 2. One test molecule shows very poor performance and the other test molecule shows "limited coverage of some modes". This suggests generalization to unseen molecules is not reliably achieved. 3. No comparison with recent diffusion-based methods (TorsionalDiffusion, etc.) or other generative approaches. 4. In Equation (2), the authors simplify the problem by
- Introduces forward and backward policies based on a von Mises mixture, with well-defined training objectives. - Shows that the conditional sampler adapts to unseen local environments, indicating potential for amortized conformer generation workflows.
- While the combination of "torsional neural sampler with GFlowNet for molecules" is new, the work itself is primarily a combination of existing tasks and methods: molecular torsional neural sampler is already formalized in Adjoint Sampling (ICML 2025) and ASBS (NeurIPS 2025); molecular sampling with GFlowNet with similar goals have been explored in the cited works and more (e.g., arXiv:2505.19552), while with different training objectives. - Compared to amortized conformer generation in AS/ASBS
While the use of GFlowNets for conformation generation is not entirely new, the scaling of GFlowNets to a unified model across molecules represents a moderately significant contribution. The authors improve both the theoretical formulation and architectural design of the framework, proposing meaningful modifications to make GFlowNets more suitable for this problem. The paper is clearly written and reasonably easy to follow, with good explanations of the methodology and analysis of generated co
Despite promising direction, the practical evaluation remains limited and raises questions regarding the framework's scalability and general applicability. Although scalability is claimed as a key advantage (“our work presents a promising avenue for scaling the proposed approach for larger molecular system”), experiments are conducted on only 8 molecules (6 from training and 2 from testing). This small sample size does not convincingly support claims about scalability—in terms of either dataset
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer therapeutics and mechanisms · Synthesis and Biological Activity
