Range-Aware Attention Network for LiDAR-based 3D Object Detection with   Auxiliary Point Density Level Estimation

Yantao Lu; Xuetao Hao; Yilan Li; Weiheng Chai; Shiqi Sun; Senem; Velipasalar

arXiv:2111.09515·cs.CV·August 9, 2022·6 cites

Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Point Density Level Estimation

Yantao Lu, Xuetao Hao, Yilan Li, Weiheng Chai, Shiqi Sun, Senem, Velipasalar

PDF

Open Access 1 Repo

TL;DR

This paper introduces RAANet, a range-aware attention network that improves LiDAR-based 3D object detection by addressing point sparsity at greater distances, achieving state-of-the-art results with real-time performance.

Contribution

The paper proposes a novel lightweight RAA convolution and an auxiliary point density estimation loss to enhance BEV feature extraction and detection accuracy for distant and occluded objects.

Findings

01

Outperforms state-of-the-art methods on nuScenes and KITTI datasets.

02

Achieves real-time inference at 16 Hz (full) and 22 Hz (lite).

03

Effectively improves detection of distant and occluded objects.

Abstract

3D object detection from LiDAR data for autonomous driving has been making remarkable strides in recent years. Among the state-of-the-art methodologies, encoding point clouds into a bird's eye view (BEV) has been demonstrated to be both effective and efficient. Different from perspective views, BEV preserves rich spatial and distance information between objects. Yet, while farther objects of the same type do not appear smaller in the BEV, they contain sparser point cloud features. This fact weakens BEV feature extraction using shared-weight convolutional neural networks (CNNs). In order to address this challenge, we propose Range-Aware Attention Network (RAANet), which extracts effective BEV features and generates superior 3D object detection outputs. The range-aware attention (RAA) convolutions significantly improve feature extraction for near as well as far objects. Moreover, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

erbloo/raan
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Video Surveillance and Tracking Methods

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution