Gated Siamese Convolutional Neural Network Architecture for Human Re-Identification
Rahul Rama Varior, Mrinal Haloi, and Gang Wang

TL;DR
This paper introduces a gated Siamese CNN architecture that adaptively emphasizes local features for improved human re-identification across camera views, outperforming baseline models.
Contribution
The paper proposes a novel gating mechanism that compares mid-level features across image pairs to produce flexible, context-aware representations for human re-identification.
Findings
Improved accuracy on CUHK03, Market-1501, and VIPeR datasets.
Enhanced ability to distinguish hard negative pairs.
Demonstrated effectiveness of mid-level feature comparison.
Abstract
Matching pedestrians across multiple camera views, known as human re-identification, is a challenging research problem that has numerous applications in visual surveillance. With the resurgence of Convolutional Neural Networks (CNNs), several end-to-end deep Siamese CNN architectures have been proposed for human re-identification with the objective of projecting the images of similar pairs (i.e. same identity) to be closer to each other and those of dissimilar pairs to be distant from each other. However, current networks extract fixed representations for each image regardless of other images which are paired with it and the comparison with other images is done only at the final level. In this setting, the network is at risk of failing to extract finer local patterns that may be essential to distinguish positive pairs from hard negative pairs. In this paper, we propose a gating function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Human Pose and Action Recognition
