Ground-aware Monocular 3D Object Detection for Autonomous Driving

Yuxuan Liu; Yuan Yixuan; Ming Liu

arXiv:2102.00690·cs.CV·February 2, 2021·6 cites

Ground-aware Monocular 3D Object Detection for Autonomous Driving

Yuxuan Liu, Yuan Yixuan, Ming Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a ground-aware neural network module that leverages ground plane information to improve monocular 3D object detection accuracy for autonomous driving, achieving state-of-the-art results.

Contribution

It proposes a novel ground-aware module that enhances deep learning-based 3D detection by utilizing ground plane priors, improving performance over existing methods.

Findings

01

Achieved state-of-the-art results on KITTI 3D detection benchmark.

02

Enhanced monocular depth prediction accuracy.

03

Demonstrated the effectiveness of ground priors in 3D reasoning.

Abstract

Estimating the 3D position and orientation of objects in the environment with a single RGB camera is a critical and challenging task for low-cost urban autonomous driving and mobile robots. Most of the existing algorithms are based on the geometric constraints in 2D-3D correspondence, which stems from generic 6D object pose estimation. We first identify how the ground plane provides additional clues in depth reasoning in 3D detection in driving scenes. Based on this observation, we then improve the processing of 3D anchors and introduce a novel neural network module to fully utilize such application-specific priors in the framework of deep learning. Finally, we introduce an efficient neural network embedded with the proposed module for 3D object detection. We further verify the power of the proposed module with a neural network designed for monocular depth prediction. The two proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Owen-Liuyuxuan/visualDet3D
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Advanced Vision and Imaging