TL;DR
Ego3DPose introduces a novel binocular egocentric 3D pose estimation system that leverages stereo cues and perspective-aware representations to significantly improve accuracy in challenging scenarios.
Contribution
The paper presents a two-path network architecture and a perspective-aware representation to enhance 3D pose estimation from egocentric binocular views, addressing occlusion and bias issues.
Findings
Outperforms state-of-the-art models with 23.1% MPJPE reduction
Effective in challenging occlusion and self-occlusion scenarios
Demonstrates superior qualitative results across various scenarios
Abstract
We present Ego3DPose, a highly accurate binocular egocentric 3D pose reconstruction system. The binocular egocentric setup offers practicality and usefulness in various applications, however, it remains largely under-explored. It has been suffering from low pose estimation accuracy due to viewing distortion, severe self-occlusion, and limited field-of-view of the joints in egocentric 2D images. Here, we notice that two important 3D cues, stereo correspondences, and perspective, contained in the egocentric binocular input are neglected. Current methods heavily rely on 2D image features, implicitly learning 3D information, which introduces biases towards commonly observed motions and leads to low overall accuracy. We observe that they not only fail in challenging occlusion cases but also in estimating visible joint positions. To address these challenges, we propose two novel approaches.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodsfail
