Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes

Bohao Fan; Siqi Wang; Wenxuan Guo; Wenzhao Zheng; Jianjiang Feng; Jie Zhou

arXiv:2308.00628 · cs.CV · August 8, 2023 · 1 citation

TL;DR

Human-M3 is a comprehensive outdoor multi-view, multi-modal dataset for 3D human pose estimation, featuring RGB videos and pointclouds, along with a novel annotation algorithm, advancing research in outdoor multi-person pose estimation.

Contribution

The paper introduces Human-M3, a new multi-modal, multi-view dataset with an annotation method, and demonstrates the benefits of multi-modal data for 3D human pose estimation.

Findings

01. The dataset is challenging and suitable for future research.

02. Multi-modal data improves 3D pose estimation accuracy.

03. The proposed annotation algorithm enhances ground truth reliability.

Abstract

3D human pose estimation in outdoor environments has garnered increasing attention recently. However, prevalent 3D human pose datasets pertaining to outdoor scenes lack diversity, as they predominantly utilize only one type of modality (RGB image or pointcloud), and often feature only one individual within each scene. This limited scope of dataset infrastructure considerably hinders the variability of available data. In this article, we propose Human-M3, an outdoor multi-modal multi-view multi-person human pose database which includes not only multi-view RGB videos of outdoor scenes but also corresponding pointclouds. In order to obtain accurate human poses, we propose an algorithm based on multi-modal data input to generate ground truth annotation. This benefits from robust pointcloud detection and tracking, which solves the problem of inaccurate human localization and matching…
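
The abstract pairs multi-view RGB videos with pointclouds, which presupposes a geometric link between 3D points and each camera image. The sketch below illustrates that link with a plain pinhole projection. It is not the paper's annotation algorithm: the function name `project_point`, the calibration values, and the assumption of calibrated, distortion-free cameras are all hypothetical and chosen only for illustration.

```python
from typing import Optional, Sequence, Tuple


def project_point(
    point_world: Sequence[float],
    rotation: Sequence[Sequence[float]],
    translation: Sequence[float],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> Optional[Tuple[float, float]]:
    """Project a 3D point (world or LiDAR frame) into pixel coordinates.

    Assumes an ideal pinhole camera with no lens distortion (illustrative only).
    Returns None when the point lies behind the camera.
    """
    # World -> camera frame: X_cam = R * X_world + t
    x_cam = [
        sum(rotation[r][c] * point_world[c] for c in range(3)) + translation[r]
        for r in range(3)
    ]
    if x_cam[2] <= 0:
        return None  # behind the image plane, not visible in this view
    # Perspective division followed by the intrinsic mapping to pixels
    u = fx * x_cam[0] / x_cam[2] + cx
    v = fy * x_cam[1] / x_cam[2] + cy
    return (u, v)


# Hypothetical calibration: camera at the world origin, looking along +Z.
IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
pixel = project_point((1.0, 0.0, 5.0), IDENTITY, (0.0, 0.0, 0.0),
                      fx=1000.0, fy=1000.0, cx=960.0, cy=540.0)
print(pixel)  # (1160.0, 540.0)
```

Running the same projection once per camera, each with its own rotation and translation, gives the 2D location of a single 3D joint in every view. This is the kind of correspondence a multi-view, multi-modal annotation pipeline can check for consistency.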

Code & Models

Repositories

soullessrobot/human-m3-dataset (PyTorch, official)

Taxonomy

Topics: Human Pose and Action Recognition · Video Surveillance and Tracking Methods · Hand Gesture Recognition Systems