TL;DR
This paper introduces Pose-REN, a novel neural network architecture that improves single depth image hand pose estimation by leveraging pose-guided region extraction and hierarchical feature integration, achieving superior accuracy.
Contribution
The paper presents a pose-guided structured region ensemble network with hierarchical feature integration for enhanced hand pose estimation from depth images.
Findings
Outperforms state-of-the-art methods on public datasets.
Achieves higher accuracy in hand pose estimation.
Effectively utilizes pose guidance for feature extraction.
Abstract
Hand pose estimation from a single depth image is an essential topic in computer vision and human computer interaction. Despite recent advancements in this area promoted by convolutional neural network, accurate hand pose estimation is still a challenging problem. In this paper we propose a Pose guided structured Region Ensemble Network (Pose-REN) to boost the performance of hand pose estimation. The proposed method extracts regions from the feature maps of convolutional neural network under the guide of an initially estimated pose, generating more optimal and representative features for hand pose estimation. The extracted feature regions are then integrated hierarchically according to the topology of hand joints by employing tree-structured fully connections. A refined estimation of hand pose is directly regressed by the proposed network and the final hand pose is obtained by utilizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
