TL;DR
LeapAlign introduces a two-step trajectory method for fine-tuning flow matching models, enabling efficient, stable gradient updates at any generation step, improving image quality and alignment.
Contribution
The paper proposes LeapAlign, a novel two-step leap method that reduces computational costs and stabilizes gradients for better fine-tuning of flow matching models.
Findings
LeapAlign outperforms state-of-the-art methods in image quality.
It enables stable gradient propagation at any generation step.
The method improves image-text alignment in fine-tuned models.
Abstract
This paper focuses on the alignment of flow matching models with human preferences. A promising way is fine-tuning by directly backpropagating reward gradients through the differentiable generation process of flow matching. However, backpropagating through long trajectories results in prohibitive memory costs and gradient explosion. Therefore, direct-gradient methods struggle to update early generation steps, which are crucial for determining the global structure of the final image. To address this issue, we introduce LeapAlign, a fine-tuning method that reduces computational cost and enables direct gradient propagation from reward to early generation steps. Specifically, we shorten the long trajectory into only two steps by designing two consecutive leaps, each skipping multiple ODE sampling steps and predicting future latents in a single step. By randomizing the start and end…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
