SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal   Transformers

Vandad Davoodnia; Saeed Ghorbani; Alexandre Messier; Ali Etemad

arXiv:2404.12625·cs.CV·April 22, 2024·2 cites

SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers

Vandad Davoodnia, Saeed Ghorbani, Alexandre Messier, Ali Etemad

PDF

Open Access

TL;DR

SkelFormer is a new markerless 3D human pose and shape estimation method that uses skeletal transformers to improve robustness and generalization, especially under noisy and occluded conditions.

Contribution

The paper introduces a novel skeletal transformer architecture that separates 3D keypoint detection from pose estimation, enhancing accuracy and robustness in markerless motion capture.

Findings

01

Strong performance on multiple datasets

02

Robust to noise and occlusions

03

Ablation studies confirm module effectiveness

Abstract

We introduce SkelFormer, a novel markerless motion capture pipeline for multi-view human pose and shape estimation. Our method first uses off-the-shelf 2D keypoint estimators, pre-trained on large-scale in-the-wild data, to obtain 3D joint positions. Next, we design a regression-based inverse-kinematic skeletal transformer that maps the joint positions to pose and shape representations from heavily noisy observations. This module integrates prior knowledge about pose space and infers the full pose state at runtime. Separating the 3D keypoint detection and inverse-kinematic problems, along with the expressive representations learned by our skeletal transformer, enhance the generalization of our method to unseen noisy data. We evaluate our method on three public datasets in both in-distribution and out-of-distribution settings using three datasets, and observe strong performance with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Robot Manipulation and Learning