Social EgoMesh Estimation
Luca Scofano, Alessio Sampieri, Edoardo De Matteis, Indro Spinelli,, Fabio Galasso

TL;DR
This paper introduces SEE-ME, a novel probabilistic framework that estimates the 3D mesh of a camera wearer in egocentric videos by incorporating scene context and social interactions, significantly improving accuracy.
Contribution
It is the first to use a latent probabilistic diffusion model conditioned on social interactions for ego-mesh estimation in egocentric videos.
Findings
Reduces pose estimation error (MPJPE) by 53%.
Quantifies the impact of interpersonal distance and gaze on ego-mesh accuracy.
Outperforms existing methods in social egocentric pose estimation.
Abstract
Accurately estimating the 3D pose of the camera wearer in egocentric video sequences is crucial to modeling human behavior in virtual and augmented reality applications. The task presents unique challenges due to the limited visibility of the user's body caused by the front-facing camera mounted on their head. Recent research has explored the utilization of the scene and ego-motion, but it has overlooked humans' interactive nature. We propose a novel framework for Social Egocentric Estimation of body MEshes (SEE-ME). Our approach is the first to estimate the wearer's mesh using only a latent probabilistic diffusion model, which we condition on the scene and, for the first time, on the social wearer-interactee interactions. Our in-depth study sheds light on when social interaction matters most for ego-mesh estimation; it quantifies the impact of interpersonal distance and gaze direction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics
MethodsDiffusion
