MonoCD: Monocular 3D Object Detection with Complementary Depths

Longfei Yan; Pei Yan; Shengzhou Xiong; Xuanyu Xiang; Yihua Tan

arXiv:2404.03181·cs.CV·April 5, 2024·3 cites

MonoCD: Monocular 3D Object Detection with Complementary Depths

Longfei Yan, Pei Yan, Shengzhou Xiong, Xuanyu Xiang, Yihua Tan

PDF

Open Access 1 Repo

TL;DR

MonoCD introduces a novel approach to monocular 3D object detection by enhancing depth prediction complementarity through a new global depth branch and geometric relation exploitation, achieving state-of-the-art results.

Contribution

The paper proposes two innovative designs to increase depth prediction complementarity, significantly improving monocular 3D detection accuracy without extra data.

Findings

01

Achieves state-of-the-art performance on KITTI benchmark

02

Enhances depth complementarity with a new global depth branch

03

Boosts existing detectors with a lightweight plug-and-play module

Abstract

Monocular 3D object detection has attracted widespread attention due to its potential to accurately obtain object 3D localization from a single image at a low cost. Depth estimation is an essential but challenging subtask of monocular 3D object detection due to the ill-posedness of 2D to 3D mapping. Many methods explore multiple local depth clues such as object heights and keypoints and then formulate the object depth estimation as an ensemble of multiple depth predictions to mitigate the insufficiency of single-depth information. However, the errors of existing multiple depths tend to have the same sign, which hinders them from neutralizing each other and limits the overall accuracy of combined depth. To alleviate this problem, we propose to increase the complementarity of depths with two novel designs. First, we add a new depth prediction branch named complementary depth that utilizes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

elvintanhust/monocd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection