Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views
Wentian Qu, Zhaopeng Cui, Yinda Zhang, Chenyu Meng, Cuixia Ma,, Xiaoming Deng, Hongan Wang

TL;DR
This paper introduces a neural rendering system for hand-object interaction that enables novel view synthesis and pose estimation from sparse views, using a two-stage approach with shape and appearance priors and a joint model fitting framework.
Contribution
It extends scene understanding techniques to dynamic hand-object scenarios, proposing a two-stage method with offline prior learning and online interaction modeling.
Findings
Outperforms state-of-the-art methods in experiments
Enables stable contact modeling during interactions
Allows for novel view synthesis of hand-object scenes
Abstract
Hand-object interaction understanding and the barely addressed novel view synthesis are highly desired in the immersive communication, whereas it is challenging due to the high deformation of hand and heavy occlusions between hand and object. In this paper, we propose a neural rendering and pose estimation system for hand-object interaction from sparse views, which can also enable 3D hand-object interaction editing. We share the inspiration from recent scene understanding work that shows a scene specific model built beforehand can significantly improve and unblock vision tasks especially when inputs are sparse, and extend it to the dynamic hand-object interaction scenario and propose to solve the problem in two stages. We first learn the shape and appearance prior knowledge of hands and objects separately with the neural representation at the offline stage. During the online stage, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Novel-View Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views· youtube
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Robot Manipulation and Learning
