TL;DR
AirPose is a novel multi-view fusion network that enables markerless 3D human pose and shape estimation using uncalibrated aerial cameras, suitable for outdoor environments and requiring minimal pre-calibration.
Contribution
This work introduces the first method to estimate human pose and shape with uncalibrated flying cameras, using distributed neural networks and the SMPL-X model, with a new offline refinement technique.
Findings
Successfully estimates 3D human pose and shape from uncalibrated UAV images.
Employs a distributed neural network approach for multi-view fusion.
Provides code and data for further research.
Abstract
In this letter, we present a novel markerless 3D human motion capture (MoCap) system for unstructured, outdoor environments that uses a team of autonomous unmanned aerial vehicles (UAVs) with on-board RGB cameras and computation. Existing methods are limited by calibrated cameras and off-line processing. Thus, we present the first method (AirPose) to estimate human pose and shape using images captured by multiple extrinsically uncalibrated flying cameras. AirPose itself calibrates the cameras relative to the person instead of relying on any pre-calibration. It uses distributed neural networks running on each UAV that communicate viewpoint-independent information with each other about the person (i.e., their 3D shape and articulated pose). The person's shape and pose are parameterized using the SMPL-X body model, resulting in a compact representation, that minimizes communication between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
