Mean of Means: Human Localization with Calibration-free and Unconstrained Camera Settings (extended version)
Tianyi Zhang, Wengyu Zhang, Xulu Zhang, Jiaxin Wu, Xiao-Yong Wei,, Jiannong Cao, Qing Li

TL;DR
This paper introduces a low-cost, calibration-free vision-based human localization method using probabilistic modeling, achieving high accuracy with minimal hardware in diverse camera setups.
Contribution
It presents a novel probabilistic approach that models human body points as distributions, enabling high-precision localization without calibration or expensive hardware.
Findings
96% accuracy within 0.3 meters
Nearly 100% accuracy within 0.5 meters
Achieved with only two low-cost webcams
Abstract
Accurate human localization is crucial for various applications, especially in the Metaverse era. Existing high precision solutions rely on expensive, tag-dependent hardware, while vision-based methods offer a cheaper, tag-free alternative. However, current vision solutions based on stereo vision face limitations due to rigid perspective transformation principles and error propagation in multi-stage SVD solvers. These solutions also require multiple high-resolution cameras with strict setup constraints.To address these limitations, we propose a probabilistic approach that considers all points on the human body as observations generated by a distribution centered around the body's geometric center. This enables us to improve sampling significantly, increasing the number of samples for each point of interest from hundreds to billions. By modeling the relation between the means of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Video Surveillance and Tracking Methods · Advanced Vision and Imaging
