Reconstructing People, Places, and Cameras

Lea Müller; Hongsuk Choi; Anthony Zhang; Brent Yi; Jitendra Malik; Angjoo Kanazawa

arXiv:2412.17806 · cs.CV · May 22, 2025

TL;DR

This paper introduces HSfM, a method that jointly reconstructs human meshes, scene point clouds, and camera parameters from sparse, uncalibrated multi-view images. By integrating data-driven scene reconstruction with traditional SfM and a human statistical model, it improves reconstruction accuracy and recovers approximate metric scale.

Contribution

It presents a novel joint optimization framework that reconstructs humans, scenes, and cameras simultaneously, leveraging human statistical models to estimate metric scale and enhance reconstruction accuracy.

Findings

01. Significant reduction in human localization error (from 3.51 m to 1.04 m) on EgoHumans.

02. Improved camera pose estimation accuracy (RRA@15, the relative rotation accuracy at a 15° threshold, increased by 20.3%).

03. Enhanced overall scene reconstruction quality.
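
The RRA@15 figure in the findings is easier to interpret with the metric written out. The following is a minimal sketch, assuming the common definition of relative rotation accuracy: the fraction of camera pairs whose relative rotation differs from ground truth by less than a threshold angle. The function names, the toy rotations, and the pairwise convention are illustrative assumptions, not code from the HSfM release.

```python
import itertools
import numpy as np

def rotation_angle_deg(R):
    """Geodesic angle (in degrees) of a 3x3 rotation matrix."""
    cos = (np.trace(R) - 1.0) / 2.0
    return np.degrees(np.arccos(np.clip(cos, -1.0, 1.0)))

def rra_at(R_pred, R_gt, tau_deg=15.0):
    """Fraction of camera pairs with relative-rotation error below tau_deg."""
    errors = []
    for i, j in itertools.combinations(range(len(R_pred)), 2):
        rel_pred = R_pred[j] @ R_pred[i].T   # predicted rotation from camera i to j
        rel_gt = R_gt[j] @ R_gt[i].T         # ground-truth rotation from camera i to j
        errors.append(rotation_angle_deg(rel_pred.T @ rel_gt))
    return float(np.mean(np.array(errors) < tau_deg))

def rot_z(deg):
    """Rotation about the z-axis, used only to build toy data."""
    a = np.radians(deg)
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

# Toy example (assumed data): three cameras, one of which is badly estimated.
R_gt = [rot_z(0), rot_z(30), rot_z(60)]
R_pred = [rot_z(0), rot_z(35), rot_z(100)]
score = rra_at(R_pred, R_gt)
```

Because the metric uses relative rotations between pairs, it does not depend on how the predicted and ground-truth world frames are aligned. In the toy data only one of the three pairs falls under 15°, so a single poorly estimated camera lowers the score for every pair it participates in.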

Abstract

We present "Humans and Structure from Motion" (HSfM), a method for jointly reconstructing multiple human meshes, scene point clouds, and camera parameters in a metric world coordinate system from a sparse set of uncalibrated multi-view images featuring people. Our approach combines data-driven scene reconstruction with the traditional Structure-from-Motion (SfM) framework to achieve more accurate scene reconstruction and camera estimation, while simultaneously recovering human meshes. In contrast to existing scene reconstruction and SfM methods that lack metric scale information, our method estimates approximate metric scale by leveraging a human statistical model. Furthermore, it reconstructs multiple human meshes within the same world coordinate system alongside the scene point cloud, effectively capturing spatial relationships among individuals and their positions in the environment.…
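
To illustrate why a human statistical model can supply metric scale, here is a minimal sketch under strong simplifying assumptions. It supposes that the same body joints are available twice: once in arbitrary SfM units and once in meters from a body model. The skeleton `BONES`, the function names, and the median-ratio estimator are hypothetical choices made for illustration; HSfM itself estimates scale inside a joint optimization, which this sketch does not reproduce.

```python
import numpy as np

# Hypothetical toy skeleton: pairs of joint indices that form bones.
BONES = [(0, 1), (1, 2), (2, 3)]

def bone_lengths(joints, bones=BONES):
    """Euclidean length of each bone for an (N, 3) array of joints."""
    joints = np.asarray(joints, dtype=float)
    return np.array([np.linalg.norm(joints[a] - joints[b]) for a, b in bones])

def estimate_metric_scale(joints_sfm, joints_metric, bones=BONES):
    """Median ratio of metric bone lengths to up-to-scale bone lengths."""
    ratios = bone_lengths(joints_metric, bones) / bone_lengths(joints_sfm, bones)
    return float(np.median(ratios))

# Assumed data: joints of one person in meters, as a body model might give them.
joints_metric = np.array([[0.0, 0.0, 0.0],
                          [0.0, 0.45, 0.0],
                          [0.0, 0.9, 0.0],
                          [0.0, 1.4, 0.0]])

# The same joints in arbitrary SfM units: shrunk by 4x and shifted.
joints_sfm = 0.25 * joints_metric + np.array([3.0, -1.0, 2.0])

scale = estimate_metric_scale(joints_sfm, joints_metric)

# Applying the scale converts up-to-scale scene points to approximate meters.
points_sfm = np.array([[4.0, 0.0, 8.0]])
points_metric = scale * points_sfm
```

Bone lengths are unaffected by rotation and translation, so the ratio isolates the unknown scale factor (4 in this toy case). The median is one simple way to reduce the influence of a few badly reconstructed joints.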

Code & Models

Repositories

hongsukchoi/hsfm_release (PyTorch, official)

Taxonomy

Topics: Photography and Visual Culture

Methods: Sparse Evolutionary Training