End-to-End Comparative Attention Networks for Person Re-identification
Hao Liu, Jiashi Feng, Meibin Qi, Jianguo Jiang, Shuicheng Yan

TL;DR
This paper introduces an end-to-end comparative attention network that selectively focuses on discriminative parts of person images to improve re-identification accuracy across camera views, outperforming existing methods.
Contribution
The novel end-to-end CAN model employs soft attention to adaptively compare local regions of person images, enhancing re-identification performance over prior approaches.
Findings
Outperforms baseline methods on CUHK01, CHUHK03, and Market-1501 datasets.
Achieves state-of-the-art accuracy in person re-identification.
Effectively learns to focus on relevant image parts for identification.
Abstract
Person re-identification across disjoint camera views has been widely applied in video surveillance yet it is still a challenging problem. One of the major challenges lies in the lack of spatial and temporal cues, which makes it difficult to deal with large variations of lighting conditions, viewing angles, body poses and occlusions. Recently, several deep learning based person re-identification approaches have been proposed and achieved remarkable performance. However, most of those approaches extract discriminative features from the whole frame at one glimpse without differentiating various parts of the persons to identify. It is essentially important to examine multiple highly discriminative local regions of the person images in details through multiple glimpses for dealing with the large appearance variance. In this paper, we propose a new soft attention based model, i.e., the end…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
