MonoPerfCap: Human Performance Capture from Monocular Video

Weipeng Xu; Avishek Chatterjee; Michael Zollhöfer; Helge Rhodin; Dushyant Mehta; Hans-Peter Seidel; Christian Theobalt

arXiv:1708.02136·cs.CV·February 26, 2018·43 cites

Open Access

TL;DR

MonoPerfCap introduces a marker-less method for capturing 3D human performance from monocular video. It handles occlusions and non-rigid deformations and supports applications such as video editing and free-viewpoint video.

Contribution

It is presented as the first approach to achieve temporally coherent 3D performance capture of a human in general clothing from monocular video, combining neural-network-based 2D and 3D pose detections with a batch-based pose estimation strategy.

Findings

01. Outperforms previous monocular methods in accuracy and robustness

02. Handles complex scenes with occlusions and non-rigid deformations

03. Enables applications such as video editing and free-viewpoint video

Abstract

We present the first marker-less approach for temporally coherent 3D performance capture of a human with general clothing from monocular video. Our approach reconstructs articulated human skeleton motion as well as medium-scale non-rigid surface deformations in general scenes. Human performance capture is a challenging problem due to the large range of articulation, potentially fast motion, and considerable non-rigid deformations, even from multi-view data. Reconstruction from monocular video alone is drastically more challenging, since strong occlusions and the inherent depth ambiguity lead to a highly ill-posed reconstruction problem. We tackle these challenges by a novel approach that employs sparse 2D and 3D human pose detections from a convolutional neural network using a batch-based pose estimation strategy. Joint recovery of per-batch motion allows to resolve the ambiguities of…

Taxonomy

Topics: Human Pose and Action Recognition · Advanced Vision and Imaging · Human Motion and Animation