RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection
Ozsel Kilinc, Cem Tarhan

TL;DR
RQR3D introduces a novel 3D object detection approach using a quadrilateral representation and keypoint regression, achieving state-of-the-art results in BEV-based perception for autonomous driving.
Contribution
It proposes a new quadrilateral-based regression target and a simplified radar fusion backbone, improving accuracy and efficiency in BEV 3D detection.
Findings
Achieves 67.5 NDS and 59.7 mAP on nuScenes dataset.
Outperforms existing methods with a lightweight architecture.
Reduces translation and orientation errors significantly.
Abstract
Accurate, fast, and reliable 3D perception is essential for autonomous driving. Recently, bird's-eye view (BEV)-based perception approaches have emerged as superior alternatives to perspective-based solutions, offering enhanced spatial understanding and more natural outputs for planning. Existing BEV-based 3D object detection methods, typically using an angle-based representation, directly estimate the size and orientation of rotated bounding boxes. We observe that BEV-based 3D object detection is analogous to aerial oriented object detection, where angle-based methods are known to suffer from discontinuities in their loss functions. Drawing inspiration from this domain, we propose \textbf{R}estricted \textbf{Q}uadrilateral \textbf{R}epresentation to define \textbf{3D} regression targets. RQR3D regresses the smallest horizontal bounding box encapsulating the oriented box, along with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
