TL;DR
This paper introduces a novel approach for 3D human self-contact estimation from monocular images, improving the accuracy of detailed 3D reconstructions by explicitly modeling self-contact regions and leveraging new datasets.
Contribution
It develops a Self-Contact Prediction model that estimates self-contact signatures and enforces them during 3D human reconstruction, supported by two large annotated datasets.
Findings
Enhanced 3D reconstructions with realistic self-contact regions
Successful monocular detection of face-touch events
Large datasets enabling detailed self-contact analysis
Abstract
Monocular estimation of three dimensional human self-contact is fundamental for detailed scene analysis including body language understanding and behaviour modeling. Existing 3d reconstruction methods do not focus on body regions in self-contact and consequently recover configurations that are either far from each other or self-intersecting, when they should just touch. This leads to perceptually incorrect estimates and limits impact in those very fine-grained analysis domains where detailed 3d models are expected to play an important role. To address such challenges we detect self-contact and design 3d losses to explicitly enforce it. Specifically, we develop a model for Self-Contact Prediction (SCP), that estimates the body surface signature of self-contact, leveraging the localization of self-contact in the image, during both training and inference. We collect two large datasets to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
