MonStereo: When Monocular and Stereo Meet at the Tail of 3D Human   Localization

Lorenzo Bertoni; Sven Kreiss; Taylor Mordan; Alexandre Alahi

arXiv:2008.10913·cs.CV·March 23, 2021

MonStereo: When Monocular and Stereo Meet at the Tail of 3D Human Localization

Lorenzo Bertoni, Sven Kreiss, Taylor Mordan, Alexandre Alahi

PDF

2 Repos

TL;DR

This paper introduces MonStereo, a unified framework combining monocular and stereo cues for improved 3D human localization, especially in occluded or distant scenarios, with a new evaluation metric for human localization.

Contribution

It presents a novel joint learning approach that leverages both monocular and stereo information, addressing their individual limitations for 3D human localization.

Findings

01

Effective handling of occlusions and distant pedestrians

02

Improved accuracy over existing methods on KITTI dataset

03

Proposed a practical 3D localization metric for humans

Abstract

Monocular and stereo visions are cost-effective solutions for 3D human localization in the context of self-driving cars or social robots. However, they are usually developed independently and have their respective strengths and limitations. We propose a novel unified learning framework that leverages the strengths of both monocular and stereo cues for 3D human localization. Our method jointly (i) associates humans in left-right images, (ii) deals with occluded and distant cases in stereo settings by relying on the robustness of monocular cues, and (iii) tackles the intrinsic ambiguity of monocular perspective projection by exploiting prior knowledge of the human height distribution. We specifically evaluate outliers as well as challenging instances, such as occluded and far-away pedestrians, by analyzing the entire error distribution and by estimating calibrated confidence intervals.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.