Understanding Depth Map Progressively: Adaptive Distance Interval   Separation for Monocular 3d Object Detection

Xianhui Cheng; Shoumeng Qiu; Zhikang Zou; Jian Pu; Xiangyang Xue

arXiv:2306.10921·cs.CV·June 21, 2023·1 cites

Understanding Depth Map Progressively: Adaptive Distance Interval Separation for Monocular 3d Object Detection

Xianhui Cheng, Shoumeng Qiu, Zhikang Zou, Jian Pu, Xiangyang Xue

PDF

Open Access

TL;DR

This paper introduces ADISN, a novel framework for monocular 3D object detection that adaptively separates depth maps into subgraphs for improved feature extraction, addressing depth estimation inaccuracies.

Contribution

The paper proposes an adaptive separation approach for depth maps, combining image and depth information with an uncertainty module for enhanced monocular 3D detection.

Findings

01

Effective depth map partitioning improves feature extraction.

02

Adaptive separation enhances detection accuracy.

03

Uncertainty modeling mitigates depth estimation errors.

Abstract

Monocular 3D object detection aims to locate objects in different scenes with just a single image. Due to the absence of depth information, several monocular 3D detection techniques have emerged that rely on auxiliary depth maps from the depth estimation task. There are multiple approaches to understanding the representation of depth maps, including treating them as pseudo-LiDAR point clouds, leveraging implicit end-to-end learning of depth information, or considering them as an image input. However, these methods have certain drawbacks, such as their reliance on the accuracy of estimated depth maps and suboptimal utilization of depth maps due to their image-based nature. While LiDAR-based methods and convolutional neural networks (CNNs) can be utilized for pseudo point clouds and depth maps, respectively, it is always an alternative. In this paper, we propose a framework named the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Robotics and Sensor-Based Localization