Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure Prior
Chenguang Li, Jia Shi, Ya Wang, Guangliang Cheng

TL;DR
This paper introduces a geometry-structure-prior-based method for monocular 3D lane detection that improves detection range and accuracy by leveraging explicit supervision, BEV lane info, and data augmentation, achieving real-time performance.
Contribution
It is the first to incorporate geometry prior into DNN-based 3D lane detection, enhancing long-distance detection and generalization without extra costs.
Findings
Outperforms state-of-the-art by 3.8% F-Score on Apollo dataset.
Doubles detection range compared to previous methods.
Achieves real-time speed of 82 FPS.
Abstract
In this paper, we propose an advanced approach in targeting the problem of monocular 3D lane detection by leveraging geometry structure underneath the process of 2D to 3D lane reconstruction. Inspired by previous methods, we first analyze the geometry heuristic between the 3D lane and its 2D representation on the ground and propose to impose explicit supervision based on the structure prior, which makes it achievable to build inter-lane and intra-lane relationships to facilitate the reconstruction of 3D lanes from local to global. Second, to reduce the structure loss in 2D lane representation, we directly extract BEV lane information from front view images, which tremendously eases the confusion of distant lane features in previous methods. Furthermore, we propose a novel task-specific data augmentation method by synthesizing new training data for both segmentation and reconstruction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Adaptive Parameter-wise Diagonal Quasi-Newton Method
