DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view   Representation

Yueru Luo; Shuguang Cui; Zhen Li

arXiv:2406.16072·cs.CV·June 25, 2024·1 cites

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Yueru Luo, Shuguang Cui, Zhen Li

PDF

Open Access 1 Repo

TL;DR

DV-3DLane introduces an end-to-end multi-modal 3D lane detection framework that combines image and LiDAR data in dual-view spaces, significantly improving accuracy and robustness for autonomous driving applications.

Contribution

The paper proposes a novel dual-view multi-modal framework with bidirectional feature fusion, unified query generation, and dual-view deformable attention for enhanced 3D lane detection.

Findings

01

Achieves state-of-the-art F1 score with 11.2 gain on OpenLane benchmark.

02

Reduces detection errors by 53.5%.

03

Demonstrates effective multi-modal feature integration in dual-view spaces.

Abstract

Accurate 3D lane estimation is crucial for ensuring safety in autonomous driving. However, prevailing monocular techniques suffer from depth loss and lighting variations, hampering accurate 3D lane detection. In contrast, LiDAR points offer geometric cues and enable precise localization. In this paper, we present DV-3DLane, a novel end-to-end Dual-View multi-modal 3D Lane detection framework that synergizes the strengths of both images and LiDAR points. We propose to learn multi-modal features in dual-view spaces, i.e., perspective view (PV) and bird's-eye-view (BEV), effectively leveraging the modal-specific information. To achieve this, we introduce three designs: 1) A bidirectional feature fusion strategy that integrates multi-modal features into each view space, exploiting their unique strengths. 2) A unified query generation approach that leverages lane-aware knowledge from both PV…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jmoonr/dv-3dlane
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Advanced Vision and Imaging

MethodsSoftmax · Attention Is All You Need