Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation
Xiaoqi An, Lin Zhao, Chen Gong, Jun Li, Jian Yang

TL;DR
This paper introduces a density-aware pose transformer for LiDAR-based 3D human pose estimation, leveraging intrinsic point cloud properties and augmentation techniques to improve robustness and achieve state-of-the-art results.
Contribution
It proposes a novel density-aware pose transformer and a comprehensive augmentation method for pre-training, enhancing robustness in low-quality LiDAR point clouds.
Findings
Achieves state-of-the-art performance on multiple datasets.
Reduces MPJPE by 10.0mm on Waymo dataset.
Reduces MPJPE by 20.7mm on SLOPER4D dataset.
Abstract
With the rapid development of autonomous driving, LiDAR-based 3D Human Pose Estimation (3D HPE) is becoming a research focus. However, due to the noise and sparsity of LiDAR-captured point clouds, robust human pose estimation remains challenging. Most of the existing methods use temporal information, multi-modal fusion, or SMPL optimization to correct biased results. In this work, we try to obtain sufficient information for 3D HPE only by modeling the intrinsic properties of low-quality point clouds. Hence, a simple yet powerful method is proposed, which provides insights both on modeling and augmentation of point clouds. Specifically, we first propose a concise and effective density-aware pose transformer (DAPT) to get stable keypoint representations. By using a set of joint anchors and a carefully designed exchange module, valid information is extracted from point clouds with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Hand Gesture Recognition Systems
MethodsSparse Evolutionary Training
