TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Haowei Lin, Shanda Li, Haotian Ye, Yiming Yang, Stefano, Ermon, Yitao Liang, Jianzhu Ma

TL;DR
TFG-Flow is a training-free guidance method for multimodal generative flow models that effectively handles both continuous and discrete data, demonstrated on molecular design tasks for drug discovery.
Contribution
It introduces a novel training-free guidance approach for multimodal generative flows, addressing high-dimensionality and discrete data challenges.
Findings
Successfully guided molecule generation with desired properties
Effective in handling both continuous and discrete data
Shows promise for drug design applications
Abstract
Given an unconditional generative model and a predictor for a target property (e.g., a classifier), the goal of training-free guidance is to generate samples with desirable target properties without additional training. As a highly efficient technique for steering generative models toward flexible outcomes, training-free guidance has gained increasing attention in diffusion models. However, existing methods only handle data in continuous spaces, while many scientific applications involve both continuous and discrete data (referred to as multimodality). Another emerging trend is the growing use of the simple and general flow matching framework in building generative foundation models, where guided generation remains under-explored. To address this, we introduce TFG-Flow, a novel training-free guidance method for multimodal generative flow. TFG-Flow addresses the curse-of-dimensionality…
Peer Reviews
Decision·ICLR 2025 Poster
Training-free guidance makes it computationally advantageous over other flow-based molecular generation models.
1. The performance of this method may heavily depend on the quality and accuracy of the target predictor. 2. Conditional generation performance does not seem to be improved over the baselines
* Conditional generation is an important and practical problem in molecular design. In fact, previous work in this space (recent trend of diffusion-based generative models) has under-studied how to do conditional generation and molecular optimization. * The proposed method is straightforward and effective based on validations from experiments.
* Even though conditional generation is under-studied in the small molecular generation domain, it is well-studied in other domains (especially on exact guidance; see questions). [1,2,3] * Molecular optimization has been a long-standing challenge in this field and unfortunately, almost none of them are mentioned or compared, see the review [4] for references. [1] Lu, C., Chen, H., Chen, J., Su, H., Li, C. and Zhu, J., 2023, July. Contrastive energy prediction for exact energy-guided diffusion s
* I think that the paper is very well written. The problem setup and motivation are clearly explained; the method and contributions are described in a way that is easy to understand. * Theoretical contributions, which are precisely stated and proven. * Encouraging experimental results. * Submitted code for improved reproducibility.
* To me, the biggest weakness lies in the following missing piece of the explanation of the "mutlimodal flow model". My understanding of the base model is the following. We train the model $p_{1|t}(G_1 | G_t)$, we then use it to estimate marginal velocities described in Equations (7), (9) and (10). This allows us to take a small step to estimate $G_{t + dt}$ based on $G_t$. This implicitly implies that $p_{1|t}$ is efficient to sample from. Otherwise, this procedure would be very expensive. My q
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Human Motion and Animation
MethodsSoftmax · Attention Is All You Need · Diffusion
