PAD-Hand: Physics-Aware Diffusion for Hand Motion Recovery
Elkhan Ismayilzada, Yufei Zhang, Zijun Cui

TL;DR
This paper introduces PAD-Hand, a physics-aware diffusion method that refines hand motion estimates from images into physically plausible sequences with interpretable confidence measures.
Contribution
It presents a novel physics-aware diffusion framework with a MeshCNN-Transformer backbone that models hand dynamics and estimates physics consistency variances.
Findings
Achieves consistent improvements over existing methods on hand datasets.
Provides interpretable variance maps indicating physical plausibility.
Produces physically plausible hand motions with confidence estimates.
Abstract
Significant advancements made in reconstructing hands from images have delivered accurate single-frame estimates, yet they often lack physics consistency and provide no notion of how confidently the motion satisfies physics. In this paper, we propose a novel physics-aware conditional diffusion framework that refines noisy pose sequences into physically plausible hand motion while estimating the physics variance in motion estimates. Building on a MeshCNN-Transformer backbone, we formulate Euler-Lagrange dynamics for articulated hands. Unlike prior works that enforce zero residuals, we treat the resulting dynamic residuals as virtual observables to more effectively integrate physics. Through a last-layer Laplace approximation, our method produces per-joint, per-time variances that measure physics consistency and offers interpretable variance maps indicating where physical consistency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
