HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds
Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Xiaolin Hu

TL;DR
HEDNet is a hierarchical encoder-decoder network designed for 3D object detection in point clouds, effectively capturing long-range spatial dependencies to improve accuracy while maintaining efficiency.
Contribution
The paper introduces HEDNet, a novel hierarchical encoder-decoder architecture that enhances 3D object detection by capturing long-range dependencies among features in point clouds.
Findings
Achieved superior detection accuracy on Waymo and nuScenes datasets.
Outperformed previous state-of-the-art methods in accuracy.
Maintained competitive computational efficiency.
Abstract
3D object detection in point clouds is important for autonomous driving systems. A primary challenge in 3D object detection stems from the sparse distribution of points within the 3D scene. Existing high-performance methods typically employ 3D sparse convolutional neural networks with small kernels to extract features. To reduce computational costs, these methods resort to submanifold sparse convolutions, which prevent the information exchange among spatially disconnected features. Some recent approaches have attempted to address this problem by introducing large-kernel convolutions or self-attention mechanisms, but they either achieve limited accuracy improvements or incur excessive computational costs. We propose HEDNet, a hierarchical encoder-decoder network for 3D object detection, which leverages encoder-decoder blocks to capture long-range dependencies among features in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage
