Crowdsourced 3D Mapping: A Combined Multi-View Geometry and   Self-Supervised Learning Approach

Hemang Chawla; Matti Jukola; Terence Brouns; Elahe Arani; and Bahram; Zonooz

arXiv:2007.12918·cs.CV·February 3, 2023

Crowdsourced 3D Mapping: A Combined Multi-View Geometry and Self-Supervised Learning Approach

Hemang Chawla, Matti Jukola, Terence Brouns, Elahe Arani, and Bahram, Zonooz

PDF

1 Repo

TL;DR

This paper introduces a novel framework for crowdsourced 3D mapping that estimates the positions of traffic signs without prior knowledge of camera intrinsics, combining multi-view geometry and self-supervised learning.

Contribution

It presents a new method that jointly estimates camera parameters and 3D landmarks from monocular images and GPS, without needing pre-calibrated cameras.

Findings

01

Achieved 39cm average relative positioning accuracy

02

Achieved 1.26m absolute positioning accuracy

03

Constructed a new KITTI-based traffic sign dataset

Abstract

The ability to efficiently utilize crowdsourced visual data carries immense potential for the domains of large scale dynamic mapping and autonomous driving. However, state-of-the-art methods for crowdsourced 3D mapping assume prior knowledge of camera intrinsics. In this work, we propose a framework that estimates the 3D positions of semantically meaningful landmarks such as traffic signs without assuming known camera intrinsics, using only monocular color camera and GPS. We utilize multi-view geometry as well as deep learning based self-calibration, depth, and ego-motion estimation for traffic sign positioning, and show that combining their strengths is important for increasing the map coverage. To facilitate research on this task, we construct and make available a KITTI based 3D traffic sign ground truth positioning dataset. Using our proposed framework, we achieve an average…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hemangchawla/3d-groundtruth-traffic-sign-positions
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.