
TL;DR
Count-FM introduces a flow-matching method for high-dimensional count data using a birth-death process, enabling efficient distribution transport and improved sample quality in applications like scRNA-seq.
Contribution
It extends flow-matching models to count data with a birth-death process, allowing simulation-free training and better performance on count-based applications.
Findings
Count-FM outperforms baselines in sample quality.
Uses fewer parameters for comparable or better results.
Provides interpretable transport paths in count space.
Abstract
High-dimensional count data arise in applications such as single-cell RNA sequencing and neural spike trains, where mapping between distributions across successive batches or time points form critical components of data analysis. The recent success of diffusion- and flow-based deep generative models for images, video, and text motivates extending these ideas to count-valued settings, but many existing methods either treat each count as a categorical state or transform counts into a continuous space, neither of which is natural or efficient when the count range is large. We propose count-FM, a flow-matching framework for count data based on a continuous-time birth-death process with local unit jumps. Count-FM learns marginal transitions efficiently in count space through simulation-free training of conditional transition rates, allowing transport between arbitrary count-distributed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
