Context-Aware Dynamic Feature Extraction for 3D Object Detection in   Point Clouds

Yonglin Tian; Lichao Huang; Xuesong Li; Kunfeng Wang; Zilei Wang,; Fei-Yue Wang

arXiv:1912.04775·cs.CV·July 29, 2020

Context-Aware Dynamic Feature Extraction for 3D Object Detection in Point Clouds

Yonglin Tian, Lichao Huang, Xuesong Li, Kunfeng Wang, Zilei Wang,, Fei-Yue Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces CADNet, a context-aware dynamic network for 3D object detection in point clouds that adaptively captures density variations and improves detection accuracy and speed.

Contribution

The paper proposes a novel context-aware dynamic network with a dynamic convolutional layer and dual-path convolution block for enhanced 3D detection in varying density point clouds.

Findings

01

Outperforms SECOND and PointPillars in accuracy.

02

Achieves 30 FPS detection speed.

03

Effective in capturing density variance for improved detection.

Abstract

Varying density of point clouds increases the difficulty of 3D detection. In this paper, we present a context-aware dynamic network (CADNet) to capture the variance of density by considering both point context and semantic context. Point-level contexts are generated from original point clouds to enlarge the effective receptive filed. They are extracted around the voxelized pillars based on our extended voxelization method and processed with the context encoder in parallel with the pillar features. With a large perception range, we are able to capture the variance of features for potential objects and generate attentive spatial guidance to help adjust the strengths for different regions. In the region proposal network, considering the limited representation ability of traditional convolution where same kernels are shared among different samples and positions, we propose a decomposable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Hub-Tian/CADNet
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution