TL;DR
This paper demonstrates that Latent Flow-Matching models are inherently stable under various perturbations, enabling more efficient training and inference without sacrificing output quality.
Contribution
It reveals the stability properties of LFM models and leverages them to develop faster training methods and a two-model inference approach for reduced computational cost.
Findings
Training on reduced datasets maintains performance.
Two-model approach reduces inference cost by over 50%.
Stability allows for faster convergence and less annotation effort.
Abstract
In this work, we show that Latent Flow-Matching (LFM) models are robust to different types of perturbations, including data reduction and model capacity shrinkage. We characterize this stability by their tendency to generate similar outputs under identical noise seeds. We provide a perspective relating this phenomenon to flow matching theory, which indicates that this stability is inherent to the FM objective. We further exploit this stability to derive practical algorithms for more efficient training and inference. Concretely, first, we show that by training LFM models on significantly reduced datasets, the performance does not degrade perceptually or quantitatively. This yields multiple advantages, such as reducing training time by converging faster under limited compute budget, and alleviating annotation effort when training conditional models. Second, LFM stability under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
