SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery
Da Li, Jiping Jin, Xuanlong Yu, Wei Liu, Xiaodong Cun, Kai Chen, Rui Fan, Jiangang Kong, Xi Shen

TL;DR
SKEL-CF introduces a transformer-based coarse-to-fine framework for accurate, anatomically consistent biomechanical skeleton and surface mesh recovery from images, improving over previous methods in pose estimation accuracy.
Contribution
The paper presents SKEL-CF, a novel transformer-based coarse-to-fine approach for SKEL parameter estimation, with new high-quality training data and explicit camera modeling.
Findings
Achieves 85.0 MPJPE on MOYO dataset, outperforming previous methods.
Demonstrates the importance of camera modeling in diverse viewpoints.
Provides a scalable framework for biomechanical human motion analysis.
Abstract
Parametric 3D human models such as SMPL have driven significant advances in human pose and shape estimation, yet their simplified kinematics limit biomechanical realism. The recently proposed SKEL model addresses this limitation by re-rigging SMPL with an anatomically accurate skeleton. However, estimating SKEL parameters directly remains challenging due to limited training data, perspective ambiguities, and the inherent complexity of human articulation. We introduce SKEL-CF, a coarse-to-fine framework for SKEL parameter estimation. SKEL-CF employs a transformer-based encoder-decoder architecture, where the encoder predicts coarse camera and SKEL parameters, and the decoder progressively refines them in successive layers. To ensure anatomically consistent supervision, we convert the existing SMPL-based dataset 4DHuman into a SKEL-aligned version, 4DHuman-SKEL, providing high-quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Gait Recognition and Analysis
