DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation
Yueru Luo, Shuguang Cui, Zhen Li

TL;DR
DV-3DLane introduces an end-to-end multi-modal 3D lane detection framework that combines image and LiDAR data in dual-view spaces, significantly improving accuracy and robustness for autonomous driving applications.
Contribution
The paper proposes a novel dual-view multi-modal framework with bidirectional feature fusion, unified query generation, and dual-view deformable attention for enhanced 3D lane detection.
Findings
Achieves state-of-the-art F1 score with 11.2 gain on OpenLane benchmark.
Reduces detection errors by 53.5%.
Demonstrates effective multi-modal feature integration in dual-view spaces.
Abstract
Accurate 3D lane estimation is crucial for ensuring safety in autonomous driving. However, prevailing monocular techniques suffer from depth loss and lighting variations, hampering accurate 3D lane detection. In contrast, LiDAR points offer geometric cues and enable precise localization. In this paper, we present DV-3DLane, a novel end-to-end Dual-View multi-modal 3D Lane detection framework that synergizes the strengths of both images and LiDAR points. We propose to learn multi-modal features in dual-view spaces, i.e., perspective view (PV) and bird's-eye-view (BEV), effectively leveraging the modal-specific information. To achieve this, we introduce three designs: 1) A bidirectional feature fusion strategy that integrates multi-modal features into each view space, exploiting their unique strengths. 2) A unified query generation approach that leverages lane-aware knowledge from both PV…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Advanced Vision and Imaging
MethodsSoftmax · Attention Is All You Need
