SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera   Videos

Haisong Liu; Yao Teng; Tao Lu; Haiguang Wang; Limin Wang

arXiv:2308.09244·cs.CV·September 6, 2023·1 cites

SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Haisong Liu, Yao Teng, Tao Lu, Haiguang Wang, Limin Wang

PDF

Open Access 1 Repo 1 Models 1 Video

TL;DR

SparseBEV introduces a fully sparse 3D object detection method that outperforms dense methods by using adaptive mechanisms in BEV and image spaces, achieving state-of-the-art results efficiently.

Contribution

The paper proposes SparseBEV, a novel sparse detector with adaptive features that significantly improves 3D detection performance over dense methods.

Findings

01

Achieves 67.5 NDS on nuScenes test split.

02

Maintains real-time inference at 23.5 FPS.

03

Outperforms dense detectors in accuracy.

Abstract

Camera-based 3D object detection in BEV (Bird's Eye View) space has drawn great attention over the past few years. Dense detectors typically follow a two-stage pipeline by first constructing a dense BEV feature and then performing object detection in BEV space, which suffers from complex view transformations and high computation cost. On the other side, sparse detectors follow a query-based paradigm without explicit dense BEV feature construction, but achieve worse performance than the dense counterparts. In this paper, we find that the key to mitigate this performance gap is the adaptability of the detector in both BEV and image space. To achieve this goal, we propose SparseBEV, a fully sparse 3D object detector that outperforms the dense counterparts. SparseBEV contains three key designs, which are (1) scale-adaptive self attention to aggregate features with adaptive receptive field…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mcg-nju/sparsebev
pytorchOfficial

Models

🤗
chinmaygarde/SparseBev
model

Videos

SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos· youtube

Taxonomy

TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques