NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action
Kuan-Chieh Wang, Zhenzhen Weng, Maria Xenochristou, Joao Pedro Araujo,, Jeffrey Gu, C. Karen Liu, Serena Yeung

TL;DR
NeMo introduces a novel method for reconstructing 3D human motion from multiple videos of the same action, leveraging shared information to outperform existing monocular methods in accuracy and robustness.
Contribution
The paper proposes the Neural Motion (NeMo) field, a new approach that improves 3D motion recovery by utilizing shared information across multiple video instances of the same action.
Findings
NeMo outperforms existing HMR methods in 2D keypoint detection.
NeMo achieves better 3D reconstruction metrics on a collected MoCap dataset.
NeMo effectively captures dynamic motions in sports videos.
Abstract
The task of reconstructing 3D human motion has wideranging applications. The gold standard Motion capture (MoCap) systems are accurate but inaccessible to the general public due to their cost, hardware and space constraints. In contrast, monocular human mesh recovery (HMR) methods are much more accessible than MoCap as they take single-view videos as inputs. Replacing the multi-view Mo- Cap systems with a monocular HMR method would break the current barriers to collecting accurate 3D motion thus making exciting applications like motion analysis and motiondriven animation accessible to the general public. However, performance of existing HMR methods degrade when the video contains challenging and dynamic motion that is not in existing MoCap datasets used for training. This reduces its appeal as dynamic motion is frequently the target in 3D motion recovery in the aforementioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Human Motion and Animation
