Multi-modal Semantic SLAM for Complex Dynamic Environments
Han Wang, Jing Ying Ko, Lihua Xie

TL;DR
This paper introduces a robust multi-modal semantic SLAM framework that effectively handles dynamic environments by improving object recognition and combining geometric and semantic data, achieving real-time dense mapping.
Contribution
The paper presents a novel multi-modal semantic SLAM approach with enhanced object feature learning and a dual recognition mechanism, improving performance in dynamic, complex scenes.
Findings
Accurately identifies dynamic objects despite segmentation imperfections.
Achieves dense static mapping at over 10 Hz processing rate.
Effectively combines geometric and semantic information to reduce segmentation errors.
Abstract
Simultaneous Localization and Mapping (SLAM) is one of the most essential techniques in many real-world robotic applications. The assumption of static environments is common in most SLAM algorithms, which however, is not the case for most applications. Recent work on semantic SLAM aims to understand the objects in an environment and distinguish dynamic information from a scene context by performing image-based segmentation. However, the segmentation results are often imperfect or incomplete, which can subsequently reduce the quality of mapping and the accuracy of localization. In this paper, we present a robust multi-modal semantic framework to solve the SLAM problem in complex and highly dynamic environments. We propose to learn a more powerful object feature representation and deploy the mechanism of looking and thinking twice to the backbone network, which leads to a better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
