Video Person Re-identification using Attribute-enhanced Features
Tianrui Chai, Zhiyuan Chen, Annan Li, Jiaxin Chen, Xinyu Mei, Yunhong, Wang

TL;DR
This paper introduces ASA-Net, a novel network for video person re-identification that leverages attribute-enhanced features and attention mechanisms to improve accuracy by focusing on salient regions and invariant attributes.
Contribution
The work proposes a new network architecture that uses attribute salience and a novel triplet loss to enhance video person Re-ID performance, addressing background separation and view/movement variations.
Findings
Significant improvement over existing methods in accuracy.
Effective attention on pedestrian body regions.
Robustness to view angle and movement variations.
Abstract
Video-based person re-identification (Re-ID) which aims to associate people across non-overlapping cameras using surveillance video is a challenging task. Pedestrian attribute, such as gender, age and clothing characteristics contains rich and supplementary information but is less explored in video person Re-ID. In this work, we propose a novel network architecture named Attribute Salience Assisted Network (ASA-Net) for attribute-assisted video person Re-ID, which achieved considerable improvement to existing works by two methods.First, to learn a better separation of the target from background, we propose to learn the visual attention from middle-level attribute instead of high-level identities. The proposed Attribute Salient Region Enhance (ASRE) module can attend more accurately on the body of pedestrian. Second, we found that many identity-irrelevant but object or subject-relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Human Pose and Action Recognition
MethodsTriplet Loss
