CVFNet: Real-time 3D Object Detection by Learning Cross View Features

Jiaqi Gu; Zhiyu Xiang; Pan Zhao; Tingming Bai; Lingxuan Wang; Xijun Zhao; Zhiyuan Zhang

arXiv:2203.06585 · cs.CV · July 18, 2022

TL;DR

CVFNet is a real-time 3D object detection framework that efficiently fuses multi-view features from LiDAR data, achieving high accuracy and speed suitable for time-critical applications.

Contribution

The paper introduces a novel Point-Range feature fusion module and a Slice Pillar design for efficient, accurate, real-time 3D object detection from LiDAR data.

Findings

01. Achieves state-of-the-art accuracy on the KITTI and nuScenes benchmarks.

02. Operates in real time with high computational efficiency.

03. Effectively balances detection accuracy and speed.

Abstract

In recent years, 3D object detection from LiDAR point clouds has made great progress thanks to the development of deep learning technologies. Although voxel-based and point-based methods are popular in 3D object detection, they usually involve time-consuming operations such as 3D convolutions on voxels or ball queries among points, which makes the resulting networks unsuitable for time-critical applications. 2D view-based methods, on the other hand, are computationally efficient but usually perform worse than voxel-based or point-based methods. In this work, we present CVFNet, a real-time, view-based, single-stage 3D object detector. To strengthen cross-view feature learning under demanding efficiency constraints, our framework extracts the features of different views and fuses them in an efficient, progressive way. We first propose a novel…
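To make the notion of "views" in the abstract concrete, the sketch below maps a single LiDAR point into two common 2D views: a range image (spherical projection) and a bird's-eye-view grid. It is a generic illustration, not the CVFNet implementation; the function names, image size, field-of-view limits, and grid extents are all assumptions chosen for the example.

```python
import math


def range_view_index(x, y, z, width=512, height=64,
                     fov_up_deg=3.0, fov_down_deg=-25.0):
    """Map one LiDAR point to a (row, col, range) entry of a range image.

    Image size and vertical field of view are illustrative assumptions,
    not values taken from the paper.
    """
    rng = math.sqrt(x * x + y * y + z * z)
    if rng == 0.0:
        return None  # the sensor origin has no defined direction

    yaw = math.atan2(y, x)        # azimuth angle in [-pi, pi]
    pitch = math.asin(z / rng)    # elevation angle

    fov_up = math.radians(fov_up_deg)
    fov_down = math.radians(fov_down_deg)

    # Normalise the angles to [0, 1] and scale to pixel coordinates.
    col = int(0.5 * (1.0 - yaw / math.pi) * width)
    row = int((fov_up - pitch) / (fov_up - fov_down) * height)

    # Clamp points that fall on or outside the image border.
    col = min(max(col, 0), width - 1)
    row = min(max(row, 0), height - 1)
    return row, col, rng


def bev_index(x, y, x_range=(0.0, 80.0), y_range=(-40.0, 40.0), cell=0.5):
    """Map one point to a (row, col) cell of a bird's-eye-view grid.

    Returns None when the point lies outside the (assumed) grid extents.
    """
    if not (x_range[0] <= x < x_range[1] and y_range[0] <= y < y_range[1]):
        return None
    col = int((x - x_range[0]) / cell)
    row = int((y - y_range[0]) / cell)
    return row, col


if __name__ == "__main__":
    point = (10.0, 0.0, 0.0)  # a point 10 m straight ahead of the sensor
    print("range view:", range_view_index(*point))
    print("bird's-eye view:", bev_index(point[0], point[1]))
```

The same point lands in a different cell in each view. A detector that fuses views therefore has to carry features between these index spaces, and doing that cheaply is the efficiency problem the abstract describes.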

Taxonomy

Topics: Advanced Neural Network Applications · Robotics and Sensor-Based Localization · Face recognition and analysis