TL;DR
Posterior-Augmented Flow Matching (PAFM) enhances flow matching models by incorporating multiple plausible target completions, reducing training variance and improving image generation quality across various scales and conditions.
Contribution
PAFM introduces a theoretically grounded method that replaces single-target supervision with an expectation over an approximate posterior, improving flow matching training stability and performance.
Findings
PAFM reduces gradient variance during training.
PAFM achieves up to 3.4 FID improvement on ImageNet.
PAFM generalizes well across different model scales and architectures.
Abstract
Flow matching (FM) trains a time-dependent vector field that transports samples from a simple prior to a complex data distribution. However, for high-dimensional images, each training sample supervises only a single trajectory and intermediate point, yielding an extremely sparse and high-variance training signal. This under-constrained supervision can cause flow collapse, where the learned dynamics memorize specific source-target pairings, mapping diverse inputs to overly similar outputs, failing to generalize. We introduce Posterior-Augmented Flow Matching (PAFM), a theoretically grounded generalization of FM that replaces single-target supervision with an expectation over an approximate posterior of valid target completions for a given intermediate state and condition. PAFM factorizes this intractable posterior into (i) the likelihood of the intermediate under a hypothesized endpoint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
