Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation
Guohuan Xie, Xin He, Dingying Fan, Le Zhang, Ming-Ming Cheng, Yun Liu

TL;DR
This paper introduces Syn4Seg, a novel framework that leverages synthetic images and advanced pseudo-label refinement to significantly improve generalized few-shot semantic segmentation performance.
Contribution
Syn4Seg enhances GFSS by generating diverse synthetic images and refining pseudo-labels through a support-guided, multi-stage process, addressing coverage and supervision issues.
Findings
Consistent improvements on PASCAL-5i and COCO-20i datasets.
Effective synthetic data generation increases novel-class coverage.
Refined pseudo-labels lead to more accurate segmentation boundaries.
Abstract
Generalized few-shot semantic segmentation (GFSS) is fundamentally limited by the coverage of novel-class appearances under scarce annotations. While diffusion models can synthesize novel-class images at scale, practical gains are often hindered by insufficient coverage and noisy supervision when masks are unavailable or unreliable. We propose Syn4Seg, a generation-enhanced GFSS framework designed to expand novel-class coverage while improving pseudo-label quality. Syn4Seg first maximizes prompt-space coverage by constructing an embedding-deduplicated prompt bank for each novel class, yielding diverse yet class-consistent synthetic images. It then performs support-guided pseudo-label estimation via a two-stage refinement that i) filters low-consistency regions to obtain high-precision seeds and ii) relabels uncertain pixels with image-adaptive prototypes that combine global (support)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
