Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints
Ruijin Liu, Dapeng Chen, Tie Liu, Zhiliang Xiong, Zejian Yuan

TL;DR
This paper introduces a novel two-stage framework that predicts 3D lane shapes and camera pose from a single image using geometry constraints, eliminating the need for expensive sensor calibration.
Contribution
It proposes a method to estimate camera pose and 3D lanes simultaneously from a single image, outperforming methods that rely on perfect camera calibration.
Findings
Outperforms state-of-the-art methods without ground truth camera pose
Uses fewer parameters and computations than existing approaches
Enhances pose and lane estimation accuracy through multi-task learning
Abstract
Detecting 3D lanes from the camera is a rising problem for autonomous vehicles. In this task, the correct camera pose is the key to generating accurate lanes, which can transform an image from perspective-view to the top-view. With this transformation, we can get rid of the perspective effects so that 3D lanes would look similar and can accurately be fitted by low-order polynomials. However, mainstream 3D lane detectors rely on perfect camera poses provided by other sensors, which is expensive and encounters multi-sensor calibration issues. To overcome this problem, we propose to predict 3D lanes by estimating camera pose from a single image with a two-stage framework. The first stage aims at the camera pose task from perspective-view images. To improve pose estimation, we introduce an auxiliary 3D lane task and geometry constraints to benefit from multi-task learning, which enhances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
