TL;DR
This paper introduces a novel 3D space-based person re-identification method using a parameter-efficient graph network that leverages 3D body structure, improving matching robustness and combining 2D appearance with 3D geometry.
Contribution
It pioneers person re-identification in 3D space by projecting 2D images into 3D and employing a new OG-Net architecture for efficient 3D feature learning.
Findings
Achieves competitive results on large-scale datasets.
Eases matching difficulty compared to 2D methods.
Exploits complementary 2D and 3D information.
Abstract
People live in a 3D world. However, existing works on person re-identification (re-id) mostly consider the semantic representation learning in a 2D space, intrinsically limiting the understanding of people. In this work, we address this limitation by exploring the prior knowledge of the 3D body structure. Specifically, we project 2D images to a 3D space and introduce a novel parameter-efficient Omni-scale Graph Network (OG-Net) to learn the pedestrian representation directly from 3D point clouds. OG-Net effectively exploits the local information provided by sparse 3D points and takes advantage of the structure and appearance information in a coherent manner. With the help of 3D geometry information, we can learn a new type of deep re-id feature free from noisy variants, such as scale and viewpoint. To our knowledge, we are among the first attempts to conduct person re-identification in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
