CVFNet: Real-time 3D Object Detection by Learning Cross View Features
Jiaqi Gu, Zhiyu Xiang, Pan Zhao, Tingming Bai, Lingxuan Wang, Xijun, Zhao, Zhiyuan Zhang

TL;DR
CVFNet is a real-time 3D object detection framework that efficiently fuses multi-view features from LiDAR data, achieving high accuracy and speed suitable for time-critical applications.
Contribution
The paper introduces a novel Point-Range feature fusion module and Slice Pillar design for efficient, accurate 3D detection from LiDAR data in real-time.
Findings
Achieves state-of-the-art accuracy on KITTI and NuScenes benchmarks.
Operates in real-time with high computational efficiency.
Effectively balances detection accuracy and speed.
Abstract
In recent years 3D object detection from LiDAR point clouds has made great progress thanks to the development of deep learning technologies. Although voxel or point based methods are popular in 3D object detection, they usually involve time-consuming operations such as 3D convolutions on voxels or ball query among points, making the resulting network inappropriate for time critical applications. On the other hand, 2D view-based methods feature high computing efficiency while usually obtaining inferior performance than the voxel or point based methods. In this work, we present a real-time view-based single stage 3D object detector, namely CVFNet to fulfill this task. To strengthen the cross-view feature learning under the condition of demanding efficiency, our framework extracts the features of different views and fuses them in an efficient progressive way. We first propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Face recognition and analysis
