Proactive Multi-Camera Collaboration For 3D Human Pose Estimation
Hai Ci, Mickel Liu, Xuehai Pan, Fangwei Zhong, Yizhou Wang

TL;DR
This paper introduces a multi-agent reinforcement learning approach with a novel reward mechanism for proactive multi-camera collaboration, significantly improving 3D human pose estimation in dynamic, occlusion-prone environments.
Contribution
It proposes a new Collaborative Triangulation Contribution Reward (CTCR) and joint training with environment dynamics learning to enhance multi-camera coordination and pose estimation accuracy.
Findings
Outperforms fixed and active camera baselines in diverse scenarios
Improves convergence and credit assignment in multi-agent settings
Demonstrates robustness across different numbers of cameras and humans
Abstract
This paper presents a multi-agent reinforcement learning (MARL) scheme for proactive Multi-Camera Collaboration in 3D Human Pose Estimation in dynamic human crowds. Traditional fixed-viewpoint multi-camera solutions for human motion capture (MoCap) are limited in capture space and susceptible to dynamic occlusions. Active camera approaches proactively control camera poses to find optimal viewpoints for 3D reconstruction. However, current methods still face challenges with credit assignment and environment dynamics. To address these issues, our proposed method introduces a novel Collaborative Triangulation Contribution Reward (CTCR) that improves convergence and alleviates multi-agent credit assignment issues resulting from using 3D reconstruction accuracy as the shared reward. Additionally, we jointly train our model with multiple world dynamics learning tasks to better capture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Video Surveillance and Tracking Methods
