Flow Matching for Count Data

Ganchao Wei; John Pearson

arXiv:2605.07746·stat.ML·May 11, 2026

Flow Matching for Count Data

Ganchao Wei, John Pearson

PDF

TL;DR

Count-FM introduces a flow-matching method for high-dimensional count data using a birth-death process, enabling efficient distribution transport and improved sample quality in applications like scRNA-seq.

Contribution

It extends flow-matching models to count data with a birth-death process, allowing simulation-free training and better performance on count-based applications.

Findings

01

Count-FM outperforms baselines in sample quality.

02

Uses fewer parameters for comparable or better results.

03

Provides interpretable transport paths in count space.

Abstract

High-dimensional count data arise in applications such as single-cell RNA sequencing and neural spike trains, where mapping between distributions across successive batches or time points form critical components of data analysis. The recent success of diffusion- and flow-based deep generative models for images, video, and text motivates extending these ideas to count-valued settings, but many existing methods either treat each count as a categorical state or transform counts into a continuous space, neither of which is natural or efficient when the count range is large. We propose count-FM, a flow-matching framework for count data based on a continuous-time birth-death process with local unit jumps. Count-FM learns marginal transitions efficiently in count space through simulation-free training of conditional transition rates, allowing transport between arbitrary count-distributed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.