Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection

Bo Zhang; Jiakang Yuan; Botian Shi; Tao Chen; Yikang Li; Yu Qiao

arXiv:2303.06880·cs.CV·May 1, 2023·1 cites

Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection

Bo Zhang, Jiakang Yuan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao

PDF

Open Access 1 Repo

TL;DR

Uni3D introduces a unified 3D object detection framework that effectively combines multiple datasets, addressing data and taxonomy differences to improve generalization and detection accuracy across diverse datasets.

Contribution

This paper proposes Uni3D, a simple yet effective method with data-level correction and semantic coupling modules for multi-dataset 3D detection, enhancing generalization.

Findings

01

Outperforms single-dataset detectors on multiple dataset consolidations

02

Achieves 1.04x parameter efficiency over baseline detectors

03

Demonstrates effective handling of data and taxonomy differences

Abstract

Current 3D object detection models follow a single dataset-specific training and testing paradigm, which often faces a serious detection accuracy drop when they are directly deployed in another dataset. In this paper, we study the task of training a unified 3D detector from multiple datasets. We observe that this appears to be a challenging task, which is mainly due to that these datasets present substantial data-level differences and taxonomy-level variations caused by different LiDAR types and data acquisition standards. Inspired by such observation, we present a Uni3D which leverages a simple data-level correction operation and a designed semantic-level coupling-and-recoupling module to alleviate the unavoidable data-level and taxonomy-level differences, respectively. Our method is simple and easily combined with many 3D object detection baselines such as PV-RCNN and Voxel-RCNN,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pjlab-adg/3dtrans
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Human Pose and Action Recognition