Reconstructing People, Places, and Cameras
Lea M\"uller, Hongsuk Choi, Anthony Zhang, Brent Yi, Jitendra Malik, Angjoo Kanazawa

TL;DR
This paper introduces HSfM, a joint reconstruction method that combines human meshes, scene point clouds, and camera parameters from multi-view images, improving accuracy and metric scale estimation by integrating data-driven models with traditional SfM.
Contribution
It presents a novel joint optimization framework that reconstructs humans, scenes, and cameras simultaneously, leveraging human statistical models to estimate metric scale and enhance reconstruction accuracy.
Findings
Significant reduction in human localization error (from 3.51m to 1.04m) on EgoHumans.
Improved camera pose estimation accuracy (RRA@15 increased by 20.3%).
Enhanced overall scene reconstruction quality.
Abstract
We present "Humans and Structure from Motion" (HSfM), a method for jointly reconstructing multiple human meshes, scene point clouds, and camera parameters in a metric world coordinate system from a sparse set of uncalibrated multi-view images featuring people. Our approach combines data-driven scene reconstruction with the traditional Structure-from-Motion (SfM) framework to achieve more accurate scene reconstruction and camera estimation, while simultaneously recovering human meshes. In contrast to existing scene reconstruction and SfM methods that lack metric scale information, our method estimates approximate metric scale by leveraging a human statistical model. Furthermore, it reconstructs multiple human meshes within the same world coordinate system alongside the scene point cloud, effectively capturing spatial relationships among individuals and their positions in the environment.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhotography and Visual Culture
MethodsSparse Evolutionary Training
