Efficient Regression-Based Training of Normalizing Flows for Boltzmann Generators
Danyal Rehman, Oscar Davis, Jiarui Lu, Jian Tang, Michael Bronstein, Yoshua Bengio, Alexander Tong, Avishek Joey Bose

TL;DR
This paper introduces RegFlow, a regression-based training method for normalizing flows that improves stability and efficiency in Boltzmann Generators, enabling better molecular conformation sampling.
Contribution
RegFlow offers a novel, scalable regression training objective for normalizing flows, overcoming instability and computational challenges of maximum likelihood methods in Boltzmann Generators.
Findings
RegFlow outperforms maximum likelihood training in stability and efficiency.
It enables training of broader architectures for Boltzmann Generators.
Demonstrated improved sampling in molecular systems like peptides.
Abstract
Simulation-free training frameworks have been at the forefront of the generative modelling revolution in continuous spaces, leading to large-scale diffusion and flow matching models. However, such modern generative models suffer from expensive inference, inhibiting their use in numerous scientific applications like Boltzmann Generators (BGs) for molecular conformations that require fast likelihood evaluation. In this paper, we revisit classical normalizing flows in the context of BGs that offer efficient sampling and likelihoods, but whose training via maximum likelihood is often unstable and computationally challenging. We propose Regression Training of Normalizing Flows (RegFlow), a novel and scalable regression-based training objective that bypasses the numerical instability and computational challenge of conventional maximum likelihood training in favour of a simple…
Peer Reviews
Decision·ICLR 2026 Poster
The paper addresses a chronic problem with normalizing flows. The described regression loss is intuitive an simple to implement (once couplings have been determined). The approach results in dramatic improvements compared to MLE training using the same models and data. Sensitivity to some parameters (e.g. regularization) is explored. I appreciate the evaluation of free energy. The paper is well written and easy to follow.
Although the improvement compared to NF models is extreme, the results aren't necessarily state-of-the-art compared to other models.
- Conceptually clear and well written manuscript. Numerous insightful comments about normalizing flows. - Well thought-out experiments and evaluations. All fairly standard in the field now, but still well done. - Clear performance gain, in terms of compute-time, over most of the included baselines.
- Claims and attribution. The proposed TFEP method is closely related to the ambient thermodynamic interpolant approach by Moqvist et al https://arxiv.org/abs/2411.10075 - Sample quality and scaling. ESS remain fairly low. Scaling to tetra peptides is nice several other recent works e.g. https://arxiv.org/abs/2502.18462 demonstrate scaling to significantly larger systems. - Lack of error estimates on evaluation statistics.
1. The proposed method is simple and effective, which improves the training stability of NFs 2. The author provides mathematical justification for using the CNF-coupling, by showing the wasserstein distance between trained model and target. 3. Though the CNF-coupling sounds expensive at the first glance, the author shows that it requires much less computational overhead in table 4. 4. The experimental results are good
1. The main concern lies in the invertibility of coupling, as both OT-coupling and CNF-coupling are approximations. It would be great if the author could elaborate on when this approximation would be broken and, in such a case, how the classic MLE objective would help. Intuitively speaking, if $p_0$ and $p_1$ are too separate, the true velocity might be less smooth and the (t-dependent) Lipschitz constants can be large, which means the Wasserstein bound can be very loose. 2. The experiments in
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Adversarial Robustness in Machine Learning · Flow Measurement and Analysis
