3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection
Yi Zhang, Yi Wang, Yawen Cui, Lap-Pui Chau

TL;DR
3DGeoDet introduces a geometry-aware 3D object detection method that effectively uses predicted depth to generate explicit and implicit 3D representations, significantly improving accuracy in diverse environments without requiring 3D supervision.
Contribution
The paper presents a novel approach combining explicit voxel occupancy and implicit TSDF representations for improved 3D detection from images, without needing 3D supervision.
Findings
Achieved 9.3 [email protected] on SUN RGB-D
Improved 3.3 [email protected] on ScanNetV2
Enhanced [email protected] on KITTI
Abstract
This paper proposes 3DGeoDet, a novel geometry-aware 3D object detection approach that effectively handles single- and multi-view RGB images in indoor and outdoor environments, showcasing its general-purpose applicability. The key challenge for image-based 3D object detection tasks is the lack of 3D geometric cues, which leads to ambiguity in establishing correspondences between images and 3D representations. To tackle this problem, 3DGeoDet generates efficient 3D geometric representations in both explicit and implicit manners based on predicted depth information. Specifically, we utilize the predicted depth to learn voxel occupancy and optimize the voxelized 3D feature volume explicitly through the proposed voxel occupancy attention. To further enhance 3D awareness, the feature volume is integrated with an implicit 3D representation, the truncated signed distance function (TSDF).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · 3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization
