Human Instance Segmentation and Tracking via Data Association and Single-stage Detector
Lu Cheng, Mingbo Zhao

TL;DR
This paper introduces a novel single-stage detector-based method for human video instance segmentation that improves real-time performance and tracking accuracy by using data association and centroid sampling strategies.
Contribution
It proposes a new end-to-end approach combining a single-stage detector with data association and centroid sampling to enhance human instance segmentation and tracking.
Findings
Effective in maintaining instance identity despite activity changes
Reduces computational cost compared to Mask-RCNN-based methods
Achieves competitive accuracy and efficiency on the PVIS dataset
Abstract
Human video instance segmentation plays an important role in computer understanding of human activities and is widely used in video processing, video surveillance, and human modeling in virtual reality. Most current VIS methods are based on Mask-RCNN framework, where the target appearance and motion information for data matching will increase computational cost and have an impact on segmentation real-time performance; on the other hand, the existing datasets for VIS focus less on all the people appearing in the video. In this paper, to solve the problems, we develop a new method for human video instance segmentation based on single-stage detector. To tracking the instance across the video, we have adopted data association strategy for matching the same instance in the video sequence, where we jointly learn target instance appearances and their affinities in a pair of video frames in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Advanced Neural Network Applications
