Contrastive Flow Matching

George Stoica; Vivek Ramanujan; Xiang Fan; Ali Farhadi; Ranjay Krishna; Judy Hoffman

arXiv:2506.05350·cs.CV·June 6, 2025

TL;DR

Contrastive Flow Matching enhances conditional diffusion models by explicitly enforcing flow uniqueness across conditions, leading to faster training, fewer denoising steps, and improved image quality.

Contribution

The paper adds a contrastive objective to flow matching that maximizes the dissimilarity between predicted flows from arbitrary sample pairs, improving condition separation in conditional generation.

Findings

01. Training speed improved by up to 9x
02. Up to 5x fewer denoising steps needed
03. FID scores lowered by up to 8.9

Abstract

Unconditional flow-matching trains diffusion models to transport samples from a source distribution to a target distribution by enforcing that the flows between sample pairs are unique. However, in conditional settings (e.g., class-conditioned models), this uniqueness is no longer guaranteed--flows from different conditions may overlap, leading to more ambiguous generations. We introduce Contrastive Flow Matching, an extension to the flow matching objective that explicitly enforces uniqueness across all conditional flows, enhancing condition separation. Our approach adds a contrastive objective that maximizes dissimilarities between predicted flows from arbitrary sample pairs. We validate Contrastive Flow Matching by conducting extensive experiments across varying model architectures on both class-conditioned (ImageNet-1k) and text-to-image (CC3M) benchmarks. Notably, we find that…
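The objective described in the abstract can be illustrated with a small sketch. The abstract does not give the loss formula, so everything specific here is an assumption for illustration: the contrastive term is taken to be a squared distance between each predicted flow and the target flow of an unrelated sample, subtracted with a hypothetical weight `lam`; negatives are drawn by rolling the batch by one position; and the target flow is taken to be `data - noise`, as for a straight-line path. Model predictions are stood in by noisy copies of the targets. None of this should be read as the authors' implementation.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def roll_negatives(targets):
    """Pair each sample with the next sample's target flow (assumed negative sampling)."""
    return targets[1:] + targets[:1]


def contrastive_fm_loss(pred, targets, lam=0.05):
    """Flow matching term minus a weighted dissimilarity term (assumed form).

    pred    -- predicted flows, one vector per sample
    targets -- target flows for the same samples
    lam     -- hypothetical weight on the contrastive term
    """
    negatives = roll_negatives(targets)
    n = len(pred)
    # Pull each prediction toward its own target flow.
    fm_term = sum(sq_dist(p, t) for p, t in zip(pred, targets)) / n
    # Push each prediction away from another sample's target flow.
    contrast_term = sum(sq_dist(p, m) for p, m in zip(pred, negatives)) / n
    return fm_term - lam * contrast_term


random.seed(0)
dim, batch = 4, 8
noise = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(batch)]
data = [[random.gauss(3, 1) for _ in range(dim)] for _ in range(batch)]
# Assumed straight-line path: the target flow is data minus noise.
targets = [[d - z for d, z in zip(dv, zv)] for dv, zv in zip(data, noise)]
# Stand-in for a model: targets plus small noise.
pred = [[v + random.gauss(0, 0.1) for v in t] for t in targets]

plain = contrastive_fm_loss(pred, targets, lam=0.0)
contrastive = contrastive_fm_loss(pred, targets, lam=0.05)
print("flow matching only:", plain)
print("with contrastive term:", contrastive)
```

Setting `lam=0.0` recovers the plain flow matching term in this sketch, which makes the effect of the added term easy to isolate. In a real training loop the predictions would come from the conditional model evaluated at an intermediate point along the path.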

Code & Models

Repositories

gstoica27/deltafm (PyTorch, official)

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Medical Image Segmentation Techniques · Domain Adaptation and Few-Shot Learning

Methods: Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings