Coreset-Induced Conditional Velocity Flow Matching
Xiao Wang, Zihua She, Jianxi Su

TL;DR
CCVFM introduces a data-informed coreset approach to improve hierarchical flow matching in generative models, enabling efficient sampling and competitive results on standard datasets.
Contribution
The paper presents a novel coreset-based surrogate source for hierarchical flow models, reducing training complexity and improving sampling efficiency.
Findings
Achieves competitive few-step generation on MNIST, CIFAR-10, ImageNet-32, and CelebA-HQ.
Proves the surrogate transport cost equals the Wasserstein gap under certain conditions.
Demonstrates that a lightweight correction flow refines the residual effectively.
Abstract
We propose Coreset-Induced Conditional Velocity Flow Matching (CCVFM), a generative model that augments hierarchical rectified flow with a data-informed source distribution. Hierarchical flow matching models the full conditional velocity law in velocity space, but its inner flow is asked to transport isotropic Gaussian noise to a multimodal target velocity distribution from scratch. Our key observation is that this inner source can be replaced by a closed-form surrogate built from a coreset of the target. CCVFM first compresses the target into weighted atoms using an entropic Sinkhorn coreset and lifts them to a Gaussian mixture. The induced conditional velocity law is then a closed-form Gaussian mixture that can be sampled without a learned neural sampler. A lightweight correction flow, trained from this exact surrogate source, then refines the remaining surrogate-to-target residual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
