Multi-modal Semantic SLAM for Complex Dynamic Environments

Han Wang; Jing Ying Ko; Lihua Xie

arXiv:2205.04300·cs.RO·May 17, 2022·5 cites

Multi-modal Semantic SLAM for Complex Dynamic Environments

Han Wang, Jing Ying Ko, Lihua Xie

PDF

Open Access 1 Repo

TL;DR

This paper introduces a robust multi-modal semantic SLAM framework that effectively handles dynamic environments by improving object recognition and combining geometric and semantic data, achieving real-time dense mapping.

Contribution

The paper presents a novel multi-modal semantic SLAM approach with enhanced object feature learning and a dual recognition mechanism, improving performance in dynamic, complex scenes.

Findings

01

Accurately identifies dynamic objects despite segmentation imperfections.

02

Achieves dense static mapping at over 10 Hz processing rate.

03

Effectively combines geometric and semantic information to reduce segmentation errors.

Abstract

Simultaneous Localization and Mapping (SLAM) is one of the most essential techniques in many real-world robotic applications. The assumption of static environments is common in most SLAM algorithms, which however, is not the case for most applications. Recent work on semantic SLAM aims to understand the objects in an environment and distinguish dynamic information from a scene context by performing image-based segmentation. However, the segmentation results are often imperfect or incomplete, which can subsequently reduce the quality of mapping and the accuracy of localization. In this paper, we present a robust multi-modal semantic framework to solve the SLAM problem in complex and highly dynamic environments. We propose to learn a more powerful object feature representation and deploy the mechanism of looking and thinking twice to the backbone network, which leads to a better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wh200720041/mms_slam
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning