TL;DR
This paper introduces POSA, a body-centric model that encodes human-scene interactions to improve scene placement of 3D human scans and monocular pose estimation, demonstrating significant advancements over prior methods.
Contribution
The paper presents a novel body-centric representation called POSA for human-scene interaction, enabling better scene placement and pose estimation in 3D environments.
Findings
POSA improves automatic placement of 3D human scans in scenes.
POSA enhances monocular human pose estimation with scene consistency.
Significant performance gains over state-of-the-art methods.
Abstract
Humans live within a 3D space and constantly interact with it to perform tasks. Such interactions involve physical contact between surfaces that is semantically meaningful. Our goal is to learn how humans interact with scenes and leverage this to enable virtual characters to do the same. To that end, we introduce a novel Human-Scene Interaction (HSI) model that encodes proximal relationships, called POSA for "Pose with prOximitieS and contActs". The representation of interaction is body-centric, which enables it to generalize to new scenes. Specifically, POSA augments the SMPL-X parametric human body model such that, for every mesh vertex, it encodes (a) the contact probability with the scene surface and (b) the corresponding semantic scene label. We learn POSA with a VAE conditioned on the SMPL-X vertices, and train on the PROX dataset, which contains SMPL-X meshes of people…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsUSD Coin Customer Service Number +1-833-534-1729
