Towards PerSense++: Advancing Training-Free Personalized Instance Segmentation in Dense Images
Muhammad Ibraheem Siddiqui, Muhammad Umer Sheikh, Hassan Abid, Kevin Henry, Muhammad Haris Khan

TL;DR
PerSense++ is a training-free, model-agnostic framework for personalized instance segmentation in dense images, introducing novel modules and a new benchmark to improve segmentation accuracy in complex scenes.
Contribution
We propose PerSense++, an advanced training-free framework with novel modules and a new benchmark, significantly improving dense scene segmentation performance.
Findings
PerSense++ outperforms existing methods in dense image segmentation.
The proposed modules enhance robustness and accuracy in cluttered scenes.
PerSense-D benchmark facilitates evaluation of personalized segmentation methods.
Abstract
Segmentation in dense visual scenes poses significant challenges due to occlusions, background clutter, and scale variations. To address this, we introduce PerSense, an end-to-end, training-free, and model-agnostic one-shot framework for Personalized instance Segmentation in dense images. PerSense employs a novel Instance Detection Module (IDM) that leverages density maps (DMs) to generate instance-level candidate point prompts, followed by a Point Prompt Selection Module (PPSM) that filters false positives via adaptive thresholding and spatial gating. A feedback mechanism further enhances segmentation by automatically selecting effective exemplars to improve DM quality. We additionally present PerSense++, an enhanced variant that incorporates three additional components to improve robustness in cluttered scenes: (i) a diversity-aware exemplar selection strategy that leverages feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
