LASER: LAtent SpacE Rendering for 2D Visual Localization

Zhixiang Min; Naji Khosravan; Zachary Bessinger; Manjunath Narayana,; Sing Bing Kang; Enrique Dunn; Ivaylo Boyadzhiev

arXiv:2204.00157·cs.CV·March 28, 2023·1 cites

LASER: LAtent SpacE Rendering for 2D Visual Localization

Zhixiang Min, Naji Khosravan, Zachary Bessinger, Manjunath Narayana,, Sing Bing Kang, Enrique Dunn, Ivaylo Boyadzhiev

PDF

Open Access 1 Repo

TL;DR

LASER introduces a fast, view-dependent latent space rendering framework for 2D indoor localization, achieving state-of-the-art accuracy and high speed in large-scale datasets.

Contribution

LASER's novel latent space rendering with a dynamic codebook enables rapid, view-dependent localization, surpassing existing methods in speed and accuracy.

Findings

01

Achieves over 10KHz rendering speed.

02

Outperforms existing learning-based localization methods.

03

Excels in large-scale indoor datasets like ZInD and Structured3D.

Abstract

We present LASER, an image-based Monte Carlo Localization (MCL) framework for 2D floor maps. LASER introduces the concept of latent space rendering, where 2D pose hypotheses on the floor map are directly rendered into a geometrically-structured latent space by aggregating viewing ray features. Through a tightly coupled rendering codebook scheme, the viewing ray features are dynamically determined at rendering-time based on their geometries (i.e. length, incident-angle), endowing our representation with view-dependent fine-grain variability. Our codebook scheme effectively disentangles feature encoding from rendering, allowing the latent space rendering to run at speeds above 10KHz. Moreover, through metric learning, our geometrically-structured latent space is common to both pose hypotheses and query images with arbitrary field of views. As a result, LASER achieves state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zillow/laser
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques