TL;DR
DeepMoCap introduces a novel marker-based optical motion capture system utilizing multiple depth sensors and retro-reflectors, employing deep learning to automatically localize reflectors and accurately capture 3D motion data.
Contribution
It presents a multi-stage FCN architecture for automatic reflector localization and temporal correlation modeling, advancing marker-based optical motion capture with deep learning.
Findings
Outperforms existing methods on the DMC2.5D dataset in 2D keypoint accuracy.
Achieves 4.5% higher 3D PCK accuracy compared to other RGB-D and inertial fusion approaches.
Provides publicly available datasets for benchmarking motion capture methods.
Abstract
In this paper, a marker-based, single-person optical motion capture method (DeepMoCap) is proposed using multiple spatio-temporally aligned infrared-depth sensors and retro-reflective straps and patches (reflectors). DeepMoCap explores motion capture by automatically localizing and labeling reflectors on depth images and, subsequently, on 3D space. Introducing a non-parametric representation to encode the temporal correlation among pairs of colorized depthmaps and 3D optical flow frames, a multi-stage Fully Convolutional Network (FCN) architecture is proposed to jointly learn reflector locations and their temporal dependency among sequential frames. The extracted reflector 2D locations are spatially mapped in 3D space, resulting in robust 3D optical data extraction. The subject's motion is efficiently captured by applying a template-based fitting technique on the extracted optical data.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMax Pooling · Convolution · Fully Convolutional Network
