Monocular Vision based Crowdsourced 3D Traffic Sign Positioning with   Unknown Camera Intrinsics and Distortion Coefficients

Hemang Chawla; Matti Jukola; Elahe Arani; and Bahram Zonooz

arXiv:2007.04592·cs.CV·March 3, 2021

Monocular Vision based Crowdsourced 3D Traffic Sign Positioning with Unknown Camera Intrinsics and Distortion Coefficients

Hemang Chawla, Matti Jukola, Elahe Arani, and Bahram Zonooz

PDF

TL;DR

This paper presents a method for crowdsourced 3D traffic sign mapping using monocular vision without prior knowledge of camera parameters, achieving high accuracy on a public dataset.

Contribution

It introduces a novel approach to estimate 3D traffic sign positions without known camera intrinsics or distortion coefficients, reducing reliance on precise camera calibration.

Findings

01

Achieves 0.26 m relative positioning accuracy

02

Achieves 1.38 m absolute positioning accuracy

03

Validates on KITTI dataset with monocular camera and GPS

Abstract

Autonomous vehicles and driver assistance systems utilize maps of 3D semantic landmarks for improved decision making. However, scaling the mapping process as well as regularly updating such maps come with a huge cost. Crowdsourced mapping of these landmarks such as traffic sign positions provides an appealing alternative. The state-of-the-art approaches to crowdsourced mapping use ground truth camera parameters, which may not always be known or may change over time. In this work, we demonstrate an approach to computing 3D traffic sign positions without knowing the camera focal lengths, principal point, and distortion coefficients a priori. We validate our proposed approach on a public dataset of traffic signs in KITTI. Using only a monocular color camera and GPS, we achieve an average single journey relative and absolute positioning accuracy of 0.26 m and 1.38 m, respectively.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.