Towards Unified 3D Object Detection via Algorithm and Data Unification
Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao

TL;DR
This paper introduces a unified approach for 3D object detection across indoor and outdoor scenes, leveraging algorithmic innovations and multi-modal data, and presents a new benchmark to evaluate such models.
Contribution
It proposes a novel monocular 3D detector with a two-stage BEV architecture, a unified multi-modal detector, and a new benchmark for comprehensive evaluation.
Findings
Multi-modal data improves detection robustness.
The proposed strategies enhance model performance across diverse scenarios.
The unified benchmark facilitates fair comparison of 3D detection methods.
Abstract
Realizing unified 3D object detection, including both indoor and outdoor scenes, holds great importance in applications like robot navigation. However, involving various scenarios of data to train models poses challenges due to their significantly distinct characteristics, \eg, diverse geometry properties and heterogeneous domain distributions. In this work, we propose to address the challenges from two perspectives, the algorithm perspective and data perspective. In terms of the algorithm perspective, we first build a monocular 3D object detector based on the bird's-eye-view (BEV) detection paradigm, where the explicit feature projection is beneficial to addressing the geometry learning ambiguity. In this detector, we split the classical BEV detection architecture into two stages and propose an uneven BEV grid design to handle the convergence instability caused by geometry difference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Image and Object Detection Techniques
