Probabilistic and Geometric Depth: Detecting Objects in Perspective
Tai Wang, Xinge Zhu, Jiangmiao Pang, Dahua Lin

TL;DR
This paper introduces PGD, a novel monocular 3D object detection method that leverages geometric relation graphs and probabilistic depth estimation to improve accuracy and efficiency in perspective-based detection tasks.
Contribution
The paper proposes a new approach combining geometric relation graphs and probabilistic depth modeling to enhance monocular 3D object detection performance.
Findings
Achieves 1st place on KITTI and nuScenes benchmarks among monocular methods.
Significantly improves detection accuracy while maintaining real-time efficiency.
Effectively models uncertainty in depth estimation to guide predictions.
Abstract
3D object detection is an important capability needed in various practical applications such as driver assistance systems. Monocular 3D detection, as a representative general setting among image-based approaches, provides a more economical solution than conventional settings relying on LiDARs but still yields unsatisfactory results. This paper first presents a systematic study on this problem. We observe that the current monocular 3D detection can be simplified as an instance depth estimation problem: The inaccurate instance depth blocks all the other 3D attribute predictions from improving the overall detection performance. Moreover, recent methods directly estimate the depth based on isolated instances or pixels while ignoring the geometric relations across different objects. To this end, we construct geometric relation graphs across predicted objects and use the graph to facilitate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods
