Robust Loop Closure by Textual Cues in Challenging Environments

Tongxing Jin; Thien-Minh Nguyen; Xinhang Xu; Yizhuo Yang; Shenghai; Yuan; Jianping Li; Lihua Xie

arXiv:2410.15869·cs.RO·October 22, 2024

Robust Loop Closure by Textual Cues in Challenging Environments

Tongxing Jin, Thien-Minh Nguyen, Xinhang Xu, Yizhuo Yang, Shenghai, Yuan, Jianping Li, Lihua Xie

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-modal loop closure method leveraging explicit textual cues extracted via OCR, improving robot navigation in featureless environments where traditional visual or LiDAR-based methods struggle.

Contribution

It presents a novel approach combining OCR and LiDAR data for loop closure detection in challenging environments, outperforming existing sensor-only methods.

Findings

01

Superior performance over visual and LiDAR-only methods

02

Effective in corridors, tunnels, and warehouses

03

Source code and datasets publicly available

Abstract

Loop closure is an important task in robot navigation. However, existing methods mostly rely on some implicit or heuristic features of the environment, which can still fail to work in common environments such as corridors, tunnels, and warehouses. Indeed, navigating in such featureless, degenerative, and repetitive (FDR) environments would also pose a significant challenge even for humans, but explicit text cues in the surroundings often provide the best assistance. This inspires us to propose a multi-modal loop closure method based on explicit human-readable textual cues in FDR environments. Specifically, our approach first extracts scene text entities based on Optical Character Recognition (OCR), then creates a local map of text cues based on accurate LiDAR odometry and finally identifies loop closure events by a graph-theoretic scheme. Experiment results demonstrate that this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tongxingjin/txtlcd
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Natural Language Processing Techniques