Decaf: Monocular Deformation Capture for Face and Hand Interactions
Soshi Shimada, Vladislav Golyanik, Patrick P\'erez, Christian Theobalt

TL;DR
Decaf introduces a novel monocular 3D tracking method for human hands and faces during interaction, capturing realistic non-rigid deformations and contacts, advancing AR/VR and avatar realism.
Contribution
It is the first approach to model and track non-rigid face deformations caused by hand interactions from monocular videos.
Findings
Realistic 3D hand-face interaction reconstructions
Improved plausibility over baseline methods
New dataset with annotated face deformations
Abstract
Existing methods for 3D tracking from monocular RGB videos predominantly consider articulated and rigid objects. Modelling dense non-rigid object deformations in this setting remained largely unaddressed so far, although such effects can improve the realism of the downstream applications such as AR/VR and avatar communications. This is due to the severe ill-posedness of the monocular view setting and the associated challenges. While it is possible to naively track multiple non-rigid objects independently using 3D templates or parametric 3D models, such an approach would suffer from multiple artefacts in the resulting 3D estimates such as depth ambiguity, unnatural intra-object collisions and missing or implausible deformations. Hence, this paper introduces the first method that addresses the fundamental challenges depicted above and that allows tracking human hands interacting with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Face recognition and analysis · Hand Gesture Recognition Systems
