DeepPrior++: Improving Fast and Accurate 3D Hand Pose Estimation

Markus Oberweger; Vincent Lepetit

arXiv:1708.08325·cs.CV·August 29, 2017

DeepPrior++: Improving Fast and Accurate 3D Hand Pose Estimation

Markus Oberweger, Vincent Lepetit

PDF

4 Repos

TL;DR

DeepPrior++ enhances 3D hand pose estimation from depth maps by integrating ResNet, data augmentation, and improved localization, achieving state-of-the-art results with a simple approach.

Contribution

It introduces simple yet effective improvements to DeepPrior, significantly boosting performance while maintaining simplicity.

Findings

01

Outperforms recent methods on NYU, ICVL, MSRA benchmarks

02

Achieves comparable or better accuracy with simpler model

03

Open-source implementation available

Abstract

DeepPrior is a simple approach based on Deep Learning that predicts the joint 3D locations of a hand given a depth map. Since its publication early 2015, it has been outperformed by several impressive works. Here we show that with simple improvements: adding ResNet layers, data augmentation, and better initial hand localization, we achieve better or similar performance than more sophisticated recent methods on the three main benchmarks (NYU, ICVL, MSRA) while keeping the simplicity of the original method. Our new implementation is available at https://github.com/moberweger/deep-prior-pp .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection