LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation

Huynh Trinh Ngoc; Hoang Anh Nguyen Kim; Toan Nguyen Hai; Long Tran Quoc

arXiv:2512.04821·cs.CV·April 1, 2026

LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation

Huynh Trinh Ngoc, Hoang Anh Nguyen Kim, Toan Nguyen Hai, Long Tran Quoc

PDF

TL;DR

LatentFM introduces a flow-based latent space model for medical image segmentation that produces accurate, diverse, and uncertainty-aware predictions with confidence maps, outperforming prior methods.

Contribution

It is the first to combine flow matching with VAEs for latent space medical segmentation, enabling diversity and uncertainty quantification.

Findings

01

Achieves superior segmentation accuracy on ISIC-2018 and CVC-Clinic datasets.

02

Generates diverse segmentation outputs reflecting data distribution.

03

Provides confidence maps for uncertainty estimation.

Abstract

Generative models have achieved remarkable progress with the emergence of flow matching (FM). It has demonstrated strong generative capabilities and attracted significant attention as a simulation-free flow-based framework capable of learning exact data densities. Motivated by these advances, we propose LatentFM, a flow-based model operating in the latent space for medical image segmentation. To model the data distribution, we first design two variational autoencoders (VAEs) to encode both medical images and their corresponding masks into a lower-dimensional latent space. We then estimate a conditional velocity field that guides the flow based on the input image. By sampling multiple latent representations, our method synthesizes diverse segmentation outputs whose pixel-wise variance reliably captures the underlying data distribution, enabling both highly accurate and uncertainty-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.