iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang

TL;DR
iSeg is a training-free segmentation framework that iteratively refines cross-attention maps using entropy-reduced self-attention, achieving improved accuracy across various datasets and tasks.
Contribution
The paper introduces an iterative refinement framework for training-free segmentation that leverages entropy-reduced self-attention and category-enhanced cross-attention modules.
Findings
Achieves 3.8% mIoU improvement on Cityscapes for unsupervised segmentation.
Supports diverse images and interactions in segmentation tasks.
Demonstrates a stable and effective iterative refinement process.
Abstract
Stable Diffusion has demonstrated a strong ability to synthesize images from text descriptions, suggesting that it contains strong semantic clues for grouping objects. Researchers have therefore explored employing Stable Diffusion for training-free segmentation. Most existing approaches refine the cross-attention map with the self-attention map once, demonstrating that the self-attention map contains useful semantic information to improve segmentation. To fully utilize the self-attention map, we present a deep experimental analysis of iteratively refining the cross-attention map with the self-attention map, and propose an effective iterative refinement framework for training-free segmentation, named iSeg. The proposed iSeg introduces an entropy-reduced self-attention module that utilizes a gradient descent scheme to reduce the entropy of the self-attention map, thereby suppressing the weak responses corresponding to irrelevant…
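The two ideas named in the abstract, lowering the entropy of the self-attention map by gradient descent and then repeatedly applying it to the cross-attention map, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the choice to run gradient descent on softmax logits, the min-max normalization between iterations, the step size, and the random toy inputs are all assumptions made for illustration.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax along the given axis.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def row_entropy(p, eps=1e-12):
    # Shannon entropy of each row of a row-stochastic matrix.
    return -(p * np.log(p + eps)).sum(axis=-1)

def reduce_entropy(self_attn, lr=0.5, steps=10, eps=1e-12):
    # Assumed scheme: treat log(self_attn) as logits z and run gradient
    # descent on the row entropy H(softmax(z)).
    # For p = softmax(z), dH/dz_j = -p_j * (log p_j + H).
    z = np.log(self_attn + eps)
    for _ in range(steps):
        p = softmax(z)
        h = row_entropy(p)[:, None]
        grad = -p * (np.log(p + eps) + h)
        z = z - lr * grad
    return softmax(z)

def iterative_refine(cross_attn, self_attn, iters=5):
    # Repeatedly propagate per-category scores through the self-attention
    # map, rescaling each category to [0, 1] after every step (assumed).
    c = cross_attn
    for _ in range(iters):
        c = self_attn @ c
        lo = c.min(axis=0, keepdims=True)
        hi = c.max(axis=0, keepdims=True)
        c = (c - lo) / (hi - lo + 1e-8)
    return c

# Toy inputs standing in for attention maps extracted from a diffusion model.
rng = np.random.default_rng(0)
n_pixels, n_classes = 16, 3
self_attn = softmax(rng.normal(size=(n_pixels, n_pixels)))  # (N, N), rows sum to 1
cross_attn = rng.random((n_pixels, n_classes))              # (N, K) category scores

sharp_attn = reduce_entropy(self_attn)
refined = iterative_refine(cross_attn, sharp_attn)
labels = refined.argmax(axis=1)  # per-pixel category index
```

Working in logit space keeps every row of the self-attention map a valid probability distribution after each update, so no clipping or renormalization is needed; other ways of applying the gradient step are equally plausible readings of the abstract.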
Taxonomy
Topics: Advanced Neural Network Applications
Methods: Softmax · Diffusion · Concatenated Skip Connection
