Uni3DETR: Unified 3D Detection Transformer
Zhenyu Wang, Yali Li, Xi Chen, Hengshuang Zhao, Shengjin Wang

TL;DR
Uni3DETR is a unified 3D detection transformer that effectively handles indoor and outdoor point cloud data within a single framework, demonstrating strong generalization and consistent performance across diverse scenes.
Contribution
The paper introduces Uni3DETR, a novel unified 3D detection framework that employs point-voxel interaction, mixture of query points, and decoupled IoU for improved cross-scene 3D detection.
Findings
Excellent performance on both indoor and outdoor datasets.
Strong generalization ability across heterogeneous scenes.
Outperforms specialized detectors in diverse environments.
Abstract
Existing point cloud based 3D detectors are designed for the particular scene, either indoor or outdoor ones. Because of the substantial differences in object distribution and point density within point clouds collected from various environments, coupled with the intricate nature of 3D metrics, there is still a lack of a unified network architecture that can accommodate diverse scenes. In this paper, we propose Uni3DETR, a unified 3D detector that addresses indoor and outdoor 3D detection within the same framework. Specifically, we employ the detection transformer with point-voxel interaction for object prediction, which leverages voxel features and points for cross-attention and behaves resistant to the discrepancies from data. We then propose the mixture of query points, which sufficiently exploits global information for dense small-range indoor scenes and local information for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Video Surveillance and Tracking Methods
