Deep Attributes Driven Multi-Camera Person Re-identification
Chi Su, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian

TL;DR
This paper introduces a semi-supervised deep attribute learning framework for multi-camera person re-identification, leveraging robust mid-level attributes to improve accuracy across diverse datasets.
Contribution
It proposes a novel three-stage training process that enhances attribute robustness and generalization for person ReID using limited labeled data.
Findings
Deep attributes outperform traditional features in ReID accuracy.
The method achieves state-of-the-art results on four datasets.
A simple metric learning module further improves performance.
Abstract
The visual appearance of a person is easily affected by many factors like pose variations, viewpoint changes and camera parameter differences. This makes person Re-Identification (ReID) among multiple cameras a very challenging task. This work is motivated to learn mid-level human attributes which are robust to such visual appearance variations. And we propose a semi-supervised attribute learning framework which progressively boosts the accuracy of attributes only using a limited number of labeled data. Specifically, this framework involves a three-stage training. A deep Convolutional Neural Network (dCNN) is first trained on an independent dataset labeled with attributes. Then it is fine-tuned on another dataset only labeled with person IDs using our defined triplet loss. Finally, the updated dCNN predicts attribute labels for the target dataset, which is combined with the independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion-Convolutional Neural Networks
