Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding
Christina Kassab, Matias Mattamala, Lintong Zhang, and Maurice Fallon

TL;DR
LEXIS is a real-time indoor SLAM system that leverages large language models for versatile scene understanding, enabling adaptive room classification, segmentation, and improved place recognition in diverse environments.
Contribution
Introduces LEXIS, a novel SLAM system integrating open-vocabulary LLMs for flexible semantic understanding and place recognition in indoor environments.
Findings
Successfully categorizes rooms with varying layouts and dimensions.
Outperforms state-of-the-art in room classification and segmentation.
Achieves comparable performance to SOTA in place recognition and trajectory estimation.
Abstract
Versatile and adaptive semantic understanding would enable autonomous systems to comprehend and interact with their surroundings. Existing fixed-class models limit the adaptability of indoor mobile and assistive autonomous systems. In this work, we introduce LEXIS, a real-time indoor Simultaneous Localization and Mapping (SLAM) system that harnesses the open-vocabulary nature of Large Language Models (LLMs) to create a unified approach to scene understanding and place recognition. The approach first builds a topological SLAM graph of the environment (using visual-inertial odometry) and embeds Contrastive Language-Image Pretraining (CLIP) features in the graph nodes. We use this representation for flexible room classification and segmentation, serving as a basis for room-centric place recognition. This allows loop closure searches to be directed towards semantically relevant places. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · 3D Surveying and Cultural Heritage
