Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud
Xinshuo Weng, Kris Kitani

TL;DR
This paper introduces a novel monocular 3D object detection method that converts single images into pseudo-LiDAR point clouds, significantly improving detection accuracy by leveraging LiDAR-based algorithms and innovative noise reduction techniques.
Contribution
It presents a new approach to monocular 3D detection by generating pseudo-LiDAR point clouds from single images and proposes two key innovations to handle noise and improve accuracy.
Findings
Achieves top-ranked performance on KITTI benchmark for monocular methods.
Quadruples previous state-of-the-art detection accuracy.
Effectively bridges the gap between 2D and 3D scene understanding.
Abstract
Monocular 3D scene understanding tasks, such as object size estimation, heading angle estimation and 3D localization, is challenging. Successful modern day methods for 3D scene understanding require the use of a 3D sensor. On the other hand, single image based methods have significantly worse performance. In this work, we aim at bridging the performance gap between 3D sensing and 2D sensing for 3D object detection by enhancing LiDAR-based algorithms to work with single image input. Specifically, we perform monocular depth estimation and lift the input image to a point cloud representation, which we call pseudo-LiDAR point cloud. Then we can train a LiDAR-based 3D detection network with our pseudo-LiDAR end-to-end. Following the pipeline of two-stage 3D detection algorithms, we detect 2D object proposals in the input image and extract a point cloud frustum from the pseudo-LiDAR for each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Optical Sensing Technologies
