FlowDIS: Language-Guided Dichotomous Image Segmentation with Flow Matching

Andranik Sargsyan; Shant Navasardyan

arXiv:2605.05077·cs.CV·May 13, 2026

FlowDIS: Language-Guided Dichotomous Image Segmentation with Flow Matching

Andranik Sargsyan, Shant Navasardyan

PDF

1 Repo 1 Models 1 Datasets

TL;DR

FlowDIS introduces a novel flow matching-based approach for language-guided dichotomous image segmentation, significantly improving accuracy and controllability over existing methods.

Contribution

The paper presents FlowDIS, a new flow matching framework with position-aware training for precise, language-guided image segmentation, outperforming prior state-of-the-art methods.

Findings

01

FlowDIS achieves 5.5% higher $F_{eta}^{ ext{w}}$ measure than previous methods.

02

FlowDIS reduces MAE by 43% compared to the best prior DIS approach.

03

FlowDIS demonstrates strong controllability and fine-grained segmentation with language prompts.

Abstract

Accurate image segmentation is essential for modern computer vision applications such as image editing, autonomous driving, and medical image analysis. In recent years, Dichotomous Image Segmentation (DIS) has become a standard task for training and evaluating highly accurate segmentation models. Existing DIS approaches often fail to preserve fine-grained details or fully capture the semantic structure of the foreground. To address these challenges, we present FlowDIS, a novel dichotomous image segmentation method built on the flow matching framework, which learns a time-dependent vector field to transport the image distribution to the corresponding mask distribution, optionally conditioned on a text prompt. Moreover, with our Position-Aware Instance Pairing (PAIP) training strategy, FlowDIS offers strong controllability through text prompts, enabling precise, pixel-level object…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Picsart-AI-Research/FlowDIS
github

Models

🤗
PAIR/FlowDIS
model· 55 dl· ♡ 6
55 dl♡ 6

Datasets

PAIR/FlowDIS-DIS5K-language-prompts
dataset· 55 dl
55 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.