Dense Point-to-Mask Optimization with Reinforced Point Selection for Crowd Instance Segmentation
Hongru Chen, Jiyang Huang, Jia Wan, Antoni B.Chan

TL;DR
This paper introduces a novel dense point-to-mask optimization method combined with reinforced point selection to improve crowd instance segmentation and counting accuracy, leveraging limited point annotations.
Contribution
It proposes Dense Point-to-Mask Optimization with NNEC constraints and a Reinforced Point Selection framework trained with GRPO, achieving state-of-the-art results.
Findings
Achieved top performance on multiple crowd datasets.
Enhanced counting accuracy using mask-supervised loss functions.
Demonstrated the effectiveness of the proposed methods in dense crowd scenarios.
Abstract
Crowd instance segmentation is a crucial task with a wide range of applications, including surveillance and transportation. Currently, point labels are common in crowd datasets, while region labels (e.g., boxes) are rare and inaccurate. The masks obtained through segmentation help to improve the accuracy of region labels and resolve the correspondence between individual location coordinates and crowd density maps. However, directly applying currently popular large foundation models such as SAM does not yield ideal results in dense crowds. To this end, we first propose Dense Point-to-Mask Optimization (DPMO), which integrates SAM with the Nearest Neighbor Exclusive Circle (NNEC) constraint to generate dense instance segmentation from point annotations. With DPMO and manual correction, we obtain mask annotations from the existing point annotations for traditional crowd datasets. Then, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
