SSR: A Generic Framework for Text-Aided Map Compression for Localization

Mohammad Omama; Po-han Li; Harsh Goel; Minkyu Choi; Behdad Chalaki; Vaishnav Tadiparthi; Hossein Nourkhiz Mahjoub; Ehsan Moradi Pari; Sandeep P. Chinchali

arXiv:2603.04272·cs.CV·March 5, 2026

SSR: A Generic Framework for Text-Aided Map Compression for Localization

Mohammad Omama, Po-han Li, Harsh Goel, Minkyu Choi, Behdad Chalaki, Vaishnav Tadiparthi, Hossein Nourkhiz Mahjoub, Ehsan Moradi Pari, Sandeep P. Chinchali

PDF

Open Access

TL;DR

This paper introduces SSR, a text-enhanced map compression framework that leverages large language models and compact image features to significantly reduce memory and bandwidth needs for robotic localization tasks.

Contribution

The paper presents a novel framework, Similarity Space Replication (SSR), which uses text descriptions and learned image embeddings for efficient map compression in localization.

Findings

01

SSR achieves 2x better compression than baselines.

02

Validated on multiple datasets including TokyoVal and KITTI.

03

Effective in indoor and outdoor localization tasks.

Abstract

Mapping is crucial in robotics for localization and downstream decision-making. As robots are deployed in ever-broader settings, the maps they rely on continue to increase in size. However, storing these maps indefinitely (cold storage), transferring them across networks, or sending localization queries to cloud-hosted maps imposes prohibitive memory and bandwidth costs. We propose a text-enhanced compression framework that reduces both memory and bandwidth footprints while retaining high-fidelity localization. The key idea is to treat text as an alternative modality: one that can be losslessly compressed with large language models. We propose leveraging lightweight text descriptions combined with very small image feature vectors, which capture "complementary information" as a compact representation for the mapping task. Building on this, our novel technique, Similarity Space…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization