Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images
Matteo Toso, Stefano Fiorini, Stuart James, Alessio Del Bue

TL;DR
This paper introduces a novel graph-based framework to automatically generate accurate 2D semantic maps from sparse multi-view images, reducing manual effort and errors in traditional map-making.
Contribution
It presents a new method that aligns local object detections into a global map using a graph-based approach, handling incomplete and noisy data effectively.
Findings
Achieves 4-meter accuracy in global 2D registration, outperforming GPS and COLMAP.
Successfully handles sparse sequences with strong viewpoint changes.
Provides robust results where standard methods fail.
Abstract
World-wide detailed 2D maps require enormous collective efforts. OpenStreetMap is the result of 11 million registered users manually annotating the GPS location of over 1.75 billion entries, including distinctive landmarks and common urban objects. At the same time, manual annotations can include errors and are slow to update, limiting the map's accuracy. Maps from Motion (MfM) is a step forward to automatize such time-consuming map making procedure by computing 2D maps of semantic objects directly from a collection of uncalibrated multi-view images. From each image, we extract a set of object detections, and estimate their spatial arrangement in a top-down local map centered in the reference frame of the camera that captured the image. Aligning these local maps is not a trivial problem, since they provide incomplete, noisy fragments of the scene, and matching detections across them is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Advanced Vision and Imaging
MethodsSparse Evolutionary Training · Greedy Policy Search
