Three-dimensional Human Tracking of a Mobile Robot by Fusion of Tracking Results of Two Cameras
Shinya Matsubara, Akihiko Honda, Yonghoon Ji, Kazunori Umeda

TL;DR
This paper introduces a novel stereo vision framework that fuses detection results from two cameras to accurately track a human in 3D space, addressing calibration and matching issues in traditional methods.
Contribution
A new stereo vision approach that improves 3D human tracking accuracy by integrating multi-camera detection results and reducing calibration complexity.
Findings
Effective 3D human tracking demonstrated in experiments
Improved accuracy over traditional stereo vision methods
Reduced calibration and matching errors
Abstract
This paper proposes a process that uses two cameras to obtain three-dimensional (3D) information of a target object for human tracking. Results of human detection and tracking from two cameras are integrated to obtain the 3D information. OpenPose is used for human detection. In the case of a general processing a stereo camera, a range image of the entire scene is acquired as precisely as possible, and then the range image is processed. However, there are problems such as incorrect matching and computational cost for the calibration process. A new stereo vision framework is proposed to cope with the problems. The effectiveness of the proposed framework and the method is verified through target-tracking experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging
MethodsOpenPose
