Pose Invariant Embedding for Deep Person Re-identification
Liang Zheng, Yujia Huang, Huchuan Lu, Yi Yang

TL;DR
This paper proposes a pose invariant embedding (PIE) for person re-identification that uses pose estimation and a novel fusion network to improve robustness against pose variations and misalignments.
Contribution
It introduces the PoseBox structure and a PoseBox fusion CNN to enhance feature alignment and reduce errors in person re-ID systems.
Findings
PIE achieves competitive re-ID accuracy on benchmark datasets.
PoseBox alone provides decent initial re-ID performance.
The PBF network improves robustness against pose variations.
Abstract
Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With bad alignment, the background noise will significantly compromise the feature learning and matching process. To address this problem, this paper introduces the pose invariant embedding (PIE) as a pedestrian descriptor. First, in order to align pedestrians to a standard pose, the PoseBox structure is introduced, which is generated through pose estimation followed by affine transformations. Second, to reduce the impact of pose estimation errors and information loss during PoseBox construction, we design a PoseBox fusion (PBF) CNN architecture that takes the original image, the PoseBox, and the pose estimation confidence as input. The proposed PIE descriptor is thus defined as the fully connected layer of the PBF network for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Gait Recognition and Analysis
