AeDet: Azimuth-invariant Multi-view 3D Object Detection
Chengjian Feng, Zequn Jie, Yujie Zhong, Xiangxiang Chu, Lin Ma

TL;DR
AeDet introduces azimuth-equivariant convolutions and anchors to improve multi-view 3D object detection by preserving BEV feature symmetry, resulting in significant performance gains on nuScenes.
Contribution
The paper proposes a novel azimuth-equivariant convolution and anchor, enhancing BEV feature learning and detection robustness in multi-view 3D object detection.
Findings
Achieves 62.0% NDS on nuScenes, outperforming recent methods.
Introduces camera-decoupled virtual depth for consistent depth prediction.
Demonstrates improved detection accuracy and robustness.
Abstract
Recent LSS-based multi-view 3D object detection has made tremendous progress, by processing the features in Brid-Eye-View (BEV) via the convolutional detector. However, the typical convolution ignores the radial symmetry of the BEV features and increases the difficulty of the detector optimization. To preserve the inherent property of the BEV features and ease the optimization, we propose an azimuth-equivariant convolution (AeConv) and an azimuth-equivariant anchor. The sampling grid of AeConv is always in the radial direction, thus it can learn azimuth-invariant BEV features. The proposed anchor enables the detection head to learn predicting azimuth-irrelevant targets. In addition, we introduce a camera-decoupled virtual depth to unify the depth prediction for the images with different camera intrinsic parameters. The resultant detector is dubbed Azimuth-equivariant Detector (AeDet).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image Processing Techniques and Applications · Visual Attention and Saliency Detection
MethodsConvolution
