RAM: Recover Any 3D Human Motion in-the-Wild

Sen Jia; Ning Zhu; Jinqin Zhong; Jiale Zhou; Huaping Zhang; Jenq-Neng Hwang; Lei Li

arXiv:2603.19929·cs.CV·April 13, 2026

RAM: Recover Any 3D Human Motion in-the-Wild

Sen Jia, Ning Zhu, Jinqin Zhong, Jiale Zhou, Huaping Zhang, Jenq-Neng Hwang, Lei Li

PDF

TL;DR

RAM introduces a comprehensive framework combining semantic tracking, temporal priors, and predictive modeling to robustly recover 3D human motion in challenging, real-world scenarios.

Contribution

It presents a novel integrated system with adaptive filtering, memory-augmented modules, and future pose prediction for improved in-the-wild 3D human motion reconstruction.

Findings

01

Outperforms previous methods in Zero-shot tracking stability.

02

Achieves higher 3D accuracy on PoseTrack and 3DPW benchmarks.

03

Demonstrates robustness under occlusions and dynamic interactions.

Abstract

RAM incorporates a motion-aware semantic tracker with adaptive Kalman filtering to achieve robust identity association under severe occlusions and dynamic interactions. A memory-augmented Temporal HMR module further enhances human motion reconstruction by injecting spatio-temporal priors for consistent and smooth motion estimation. Moreover, a lightweight Predictor module forecasts future poses to maintain reconstruction continuity, while a gated combiner adaptively fuses reconstructed and predicted features to ensure coherence and robustness. Experiments on in-the-wild multi-person benchmarks such as PoseTrack and 3DPW, demonstrate that RAM substantially outperforms previous state-of-the-art in both Zero-shot tracking stability and 3D accuracy, offering a generalizable paradigm for markerless 3D human motion capture in-the-wild.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.