MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization
Zengyi Qin, Jinglu Wang, Yan Lu

TL;DR
MonoGRNet is a unified neural network that performs monocular 3D object detection by combining 2D detection, depth estimation, and 3D localization, overcoming the challenge of geometric information loss from single images.
Contribution
It introduces a novel instance depth estimation method and a unified framework for amodal 3D detection from monocular images, achieving state-of-the-art results.
Findings
Achieves state-of-the-art performance on challenging datasets.
Effectively estimates 3D bounding boxes using sparse supervision.
Integrates multiple tasks into a single network for improved accuracy.
Abstract
Detecting and localizing objects in the real 3D space, which plays a crucial role in scene understanding, is particularly challenging given only a single RGB image due to the geometric information loss during imagery projection. We propose MonoGRNet for the amodal 3D object detection from a monocular RGB image via geometric reasoning in both the observed 2D projection and the unobserved depth dimension. MonoGRNet is a single, unified network composed of four task-specific subnetworks, responsible for 2D object detection, instance depth estimation (IDE), 3D localization and local corner regression. Unlike the pixel-level depth estimation that needs per-pixel annotations, we propose a novel IDE method that directly predicts the depth of the targeting 3D bounding box's center using sparse supervision. The 3D localization is further achieved by estimating the position in the horizontal and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Neural Network Applications
