What-and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification
Lin Wu, Yang Wang, Xue Li, Junbin Gao

TL;DR
This paper introduces a novel deep neural network for person re-identification that dynamically emphasizes matching relevant local patterns and aligns spatial features to improve accuracy across camera views.
Contribution
It proposes a deep multiplicative integration gating function combined with spatially recurrent pooling, enabling flexible and spatially aligned local feature matching for person re-id.
Findings
Outperforms existing methods on VIPeR, CUHK03, and Market-1501 datasets.
Effectively captures local patterns and spatial dependencies.
End-to-end trainable network enhances person re-identification accuracy.
Abstract
Matching pedestrians across disjoint camera views, known as person re-identification (re-id), is a challenging problem that is of importance to visual recognition and surveillance. Most existing methods exploit local regions within spatial manipulation to perform matching in local correspondence. However, they essentially extract \emph{fixed} representations from pre-divided regions for each image and perform matching based on the extracted representation subsequently. For models in this pipeline, local finer patterns that are crucial to distinguish positive pairs from negative ones cannot be captured, and thus making them underperformed. In this paper, we propose a novel deep multiplicative integration gating function, which answers the question of \emph{what-and-where to match} for effective person re-id. To address \emph{what} to match, our deep network emphasizes common local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
