Weakly Supervised Person Search with Region Siamese Networks
Chuchu Han, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang,, Yi Yang, Changhu Wang

TL;DR
This paper introduces a weakly supervised person search method using Region Siamese Networks that only requires bounding box annotations, achieving competitive results and surpassing some fully supervised methods.
Contribution
The paper proposes a novel weakly supervised framework for person search that eliminates the need for identity labels, using instance-level and cluster-level losses in a Siamese network architecture.
Findings
Achieves 87.1% rank-1 accuracy on CUHK-SYSU benchmark
Surpasses several fully supervised methods like OIM and MGTS
Demonstrates effectiveness of weak supervision with bounding boxes only
Abstract
Supervised learning is dominant in person search, but it requires elaborate labeling of bounding boxes and identities. Large-scale labeled training data is often difficult to collect, especially for person identities. A natural question is whether a good person search model can be trained without the need of identity supervision. In this paper, we present a weakly supervised setting where only bounding box annotations are available. Based on this new setting, we provide an effective baseline model termed Region Siamese Networks (R-SiamNets). Towards learning useful representations for recognition in the absence of identity labels, we supervise the R-SiamNet with instance-level consistency loss and cluster-level contrastive loss. For instance-level consistency learning, the R-SiamNet is constrained to extract consistent features from each person region with or without out-of-region…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Mobility and Location-Based Analysis
