SOAR: Self-Occluded Avatar Recovery from a Single Video In the Wild
Zhuoyang Pan, Angjoo Kanazawa, Hang Gao

TL;DR
SOAR introduces a novel method for reconstructing complete human avatars from single videos with self-occlusion, using structural priors and generative diffusion to handle partial observations in unconstrained environments.
Contribution
The paper presents SOAR, a new approach combining structural normal priors and generative diffusion models for full human reconstruction from partial data in the wild.
Findings
Outperforms existing reconstruction methods on various benchmarks.
Achieves comparable results to state-of-the-art approaches.
Effectively reconstructs complete human avatars from partial observations.
Abstract
Self-occlusion is common when capturing people in the wild, where the performer do not follow predefined motion scripts. This challenges existing monocular human reconstruction systems that assume full body visibility. We introduce Self-Occluded Avatar Recovery (SOAR), a method for complete human reconstruction from partial observations where parts of the body are entirely unobserved. SOAR leverages structural normal prior and generative diffusion prior to address such an ill-posed reconstruction problem. For structural normal prior, we model human with an reposable surfel model with well-defined and easily readable shapes. For generative diffusion prior, we perform an initial reconstruction and refine it using score distillation. On various benchmarks, we show that SOAR performs favorably than state-of-the-art reconstruction and generation methods, and on-par comparing to concurrent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation
MethodsFast Attention Via Positive Orthogonal Random Features · Diffusion · Performer
