M4Human: A Large-Scale Multimodal mmWave Radar Benchmark for Human Mesh Reconstruction
Junqiao Fan, Yunjiao Zhou, Yizhuo Yang, Xinyuan Cui, Jiarui Zhang, Lihua Xie, Jianfei Yang, Chris Xiaoxuan Lu, Fangqiang Ding

TL;DR
M4Human is the largest multimodal radar dataset for human mesh reconstruction, combining high-resolution mmWave radar, RGB, and depth data with extensive annotations to advance RF-based human modeling research.
Contribution
It introduces M4Human, a large-scale, multimodal benchmark dataset with high-quality annotations, enabling new research in radar-based human mesh reconstruction.
Findings
Benchmark results highlight the importance of multimodal data for human modeling.
Challenges remain in modeling fast, unconstrained motions.
The dataset supports research across raw radar, point clouds, and multimodal fusion.
Abstract
Human mesh reconstruction (HMR) provides direct insights into body-environment interaction, which enables various immersive applications. While existing large-scale HMR datasets rely heavily on line-of-sight RGB input, vision-based sensing is limited by occlusion, lighting variation, and privacy concerns. To overcome these limitations, recent efforts have explored radio-frequency (RF) mmWave radar for privacy-preserving indoor human sensing. However, current radar datasets are constrained by sparse skeleton labels, limited scale, and simple in-place actions. To advance the HMR research community, we introduce M4Human, the current largest-scale (661K-frame) ( prior largest) multimodal benchmark, featuring high-resolution mmWave radar, RGB, and depth data. M4Human provides both raw radar tensors (RT) and processed radar point clouds (RPC) to enable research across different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
