TL;DR
This paper presents a real-time deep learning method that predicts lower-body pose from sparse upper-body tracking signals, improving robustness and reducing artifacts in VR motion capture.
Contribution
Introduces a GRU-based neural network that uses velocity-based input signals to accurately predict lower-body pose and contact states from sparse upper-body data in real-time.
Findings
Velocity representation improves correlation modeling between upper and lower body.
The method effectively reduces foot-skating and floating artifacts.
Achieves real-time performance on commercial VR tracking data.
Abstract
With the popularization of game and VR/AR devices, there is a growing need for capturing human motion with a sparse set of tracking data. In this paper, we introduce a deep neural-network (DNN) based method for real-time prediction of the lower-body pose only from the tracking signals of the upper-body joints. Specifically, our Gated Recurrent Unit (GRU)-based recurrent architecture predicts the lower-body pose and feet contact probability from past sequence of tracking signals of the head, hands and pelvis. A major feature of our method is that the input signal is represented with the velocity of tracking signals. We show that the velocity representation better models the correlation between the upper-body and lower-body motions and increase the robustness against the diverse scales and proportions of the user body than position-orientation representations. In addition, to remove…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
