SmartMocap: Joint Estimation of Human and Camera Motion using Uncalibrated RGB Cameras
Nitin Saini, Chun-hao P. Huang, Michael J. Black, Aamir Ahmad

TL;DR
SmartMocap introduces a method for joint human and camera motion estimation using uncalibrated RGB cameras, eliminating the need for prior calibration and enabling accurate motion capture in dynamic camera setups.
Contribution
It proposes a novel approach that uses the ground plane as a common reference and a learned motion prior to jointly estimate human and camera motions without calibration.
Findings
Works on diverse datasets including aerial and smartphone cameras.
Achieves more accurate results than state-of-the-art monocular human mocap methods.
Handles both static and moving uncalibrated cameras effectively.
Abstract
Markerless human motion capture (mocap) from multiple RGB cameras is a widely studied problem. Existing methods either need calibrated cameras or calibrate them relative to a static camera, which acts as the reference frame for the mocap system. The calibration step has to be done a priori for every capture session, which is a tedious process, and re-calibration is required whenever cameras are intentionally or accidentally moved. In this paper, we propose a mocap method which uses multiple static and moving extrinsically uncalibrated RGB cameras. The key components of our method are as follows. First, since the cameras and the subject can move freely, we select the ground plane as a common reference to represent both the body and the camera motions unlike existing methods which represent bodies in the camera coordinate. Second, we learn a probability distribution of short human motion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Advanced Vision and Imaging
