Rethinking Vector Field Learning for Generative Segmentation
Chaoyang Wang, Yaobo Liang, Boci Peng, Fan Duan, Jingdong Wang, Yunhai Tong

TL;DR
This paper reexamines diffusion-based generative segmentation through vector field learning, addressing key limitations with a novel reshaping strategy and an efficient category encoding scheme, leading to improved segmentation performance.
Contribution
It introduces a vector field reshaping method and a quasi-random category encoding scheme to enhance diffusion segmentation, bridging the gap with discriminative methods.
Findings
Significant performance improvements over vanilla flow matching.
Enhanced class separation and convergence speed.
Narrowed gap between generative and discriminative segmentation methods.
Abstract
Taming diffusion models for generative segmentation has attracted increasing attention. While existing approaches primarily focus on architectural tweaks or training heuristics, there remains a limited understanding of the intrinsic mismatch between continuous flow matching objectives and discrete perception tasks. In this work, we revisit diffusion segmentation from the perspective of vector field learning. We identify two key limitations of the commonly used flow matching objective: gradient vanishing and trajectory traversing, which result in slow convergence and poor class separation. To tackle these issues, we propose a principled vector field reshaping strategy that augments the learned velocity field with a detached distance-aware correction term. This correction introduces both attractive and repulsive interactions, enhancing gradient magnitudes near centroids while preserving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques
