Towards Balanced Multi-Modal Learning in 3D Human Pose Estimation
Mengshi Qi, Jiaxuan Peng, Xianlin Zhang, Huadong Ma

TL;DR
This paper introduces a balanced multi-modal learning approach for 3D human pose estimation that effectively combines RGB, LiDAR, mmWave, and WiFi data, addressing modality imbalance with a novel contribution assessment and regulation strategy.
Contribution
It presents a new method using Shapley value-based contribution assessment and a learning regulation strategy to balance modalities in 3D pose estimation.
Findings
Outperforms existing methods on the MM-Fi dataset
Effectively detects and mitigates modality imbalance
Enhances pose estimation accuracy in complex scenarios
Abstract
3D human pose estimation (3D HPE) has emerged as a prominent research topic, particularly in the realm of RGB-based methods. However, the use of RGB images is often limited by issues such as occlusion and privacy constraints. Consequently, multi-modal sensing, which leverages non-intrusive sensors, is gaining increasing attention. Nevertheless, multi-modal 3D HPE still faces challenges, including modality imbalance. In this work, we introduce a novel balanced multi-modal learning method for 3D HPE, which harnesses the power of RGB, LiDAR, mmWave, and WiFi. Specifically, we propose a Shapley value-based contribution algorithm to assess the contribution of each modality and detect modality imbalance. To address this imbalance, we design a modality learning regulation strategy that decelerates the learning process during the early stages of training. We conduct extensive experiments on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Hand Gesture Recognition Systems
MethodsElastic Weight Consolidation
