RoMo: A Robust Solver for Full-body Unlabeled Optical Motion Capture
Xiaoyu Pan, Bowen Zheng, Xinwei Jiang, Zijiao Zeng, Qilong Kou, He, Wang, Xiaogang Jin

TL;DR
RoMo is a learning-based framework that robustly labels and solves full-body optical motion capture data, effectively handling errors like mislabeling and occlusion, and outperforming existing methods.
Contribution
Introduces RoMo, a novel divide-and-conquer, graph-based, and hybrid inverse kinematic approach for accurate, automated full-body motion capture data labeling and solving.
Findings
Achieves high labeling and solving accuracy across multiple datasets.
Improves hand labeling F1 score from 0.94 to 0.98.
Reduces joint position error by 25%.
Abstract
Optical motion capture (MoCap) is the "gold standard" for accurately capturing full-body motions. To make use of raw MoCap point data, the system labels the points with corresponding body part locations and solves the full-body motions. However, MoCap data often contains mislabeling, occlusion and positional errors, requiring extensive manual correction. To alleviate this burden, we introduce RoMo, a learning-based framework for robustly labeling and solving raw optical motion capture data. In the labeling stage, RoMo employs a divide-and-conquer strategy to break down the complex full-body labeling challenge into manageable subtasks: alignment, full-body segmentation and part-specific labeling. To utilize the temporal continuity of markers, RoMo generates marker tracklets using a K-partite graph-based clustering algorithm, where markers serve as nodes, and edges are formed based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
