Region Generation and Assessment Network for Occluded Person Re-Identification
Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang,, Henghui Ding

TL;DR
This paper introduces RGANet, a novel network that detects and assesses human body regions for occluded person re-identification, reducing reliance on external tools and improving accuracy across multiple benchmarks.
Contribution
The paper proposes RGANet with a Region Generation Module using CLIP and a Region Assessment Module to improve occluded person ReID without external tools.
Findings
Outperforms state-of-the-art methods on six benchmarks
Effectively handles occlusion, partial, and holistic ReID tasks
Reduces negative impact of occlusions through confidence scoring
Abstract
Person Re-identification (ReID) plays a more and more crucial role in recent years with a wide range of applications. Existing ReID methods are suffering from the challenges of misalignment and occlusions, which degrade the performance dramatically. Most methods tackle such challenges by utilizing external tools to locate body parts or exploiting matching strategies. Nevertheless, the inevitable domain gap between the datasets utilized for external tools and the ReID datasets and the complicated matching process make these methods unreliable and sensitive to noises. In this paper, we propose a Region Generation and Assessment Network (RGANet) to effectively and efficiently detect the human body regions and highlight the important regions. In the proposed RGANet, we first devise a Region Generation Module (RGM) which utilizes the pre-trained CLIP to locate the human body regions using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition
MethodsContrastive Language-Image Pre-training
