Context-Aware Dynamic Feature Extraction for 3D Object Detection in Point Clouds
Yonglin Tian, Lichao Huang, Xuesong Li, Kunfeng Wang, Zilei Wang,, Fei-Yue Wang

TL;DR
This paper introduces CADNet, a context-aware dynamic network for 3D object detection in point clouds that adaptively captures density variations and improves detection accuracy and speed.
Contribution
The paper proposes a novel context-aware dynamic network with a dynamic convolutional layer and dual-path convolution block for enhanced 3D detection in varying density point clouds.
Findings
Outperforms SECOND and PointPillars in accuracy.
Achieves 30 FPS detection speed.
Effective in capturing density variance for improved detection.
Abstract
Varying density of point clouds increases the difficulty of 3D detection. In this paper, we present a context-aware dynamic network (CADNet) to capture the variance of density by considering both point context and semantic context. Point-level contexts are generated from original point clouds to enlarge the effective receptive filed. They are extracted around the voxelized pillars based on our extended voxelization method and processed with the context encoder in parallel with the pillar features. With a large perception range, we are able to capture the variance of features for potential objects and generate attentive spatial guidance to help adjust the strengths for different regions. In the region proposal network, considering the limited representation ability of traditional convolution where same kernels are shared among different samples and positions, we propose a decomposable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution
