ThermalLoc: A Vision Transformer-Based Approach for Robust Thermal Camera Relocalization in Large-Scale Environments

Yu Liu; Yangtao Meng; Xianfei Pan; Jie Jiang; Changhao Chen

arXiv:2506.18268·cs.CV·June 24, 2025

ThermalLoc: A Vision Transformer-Based Approach for Robust Thermal Camera Relocalization in Large-Scale Environments

Yu Liu, Yangtao Meng, Xianfei Pan, Jie Jiang, Changhao Chen

PDF

TL;DR

ThermalLoc is a new deep learning approach that combines EfficientNet and Transformers to improve thermal camera relocalization accuracy and robustness in large-scale environments.

Contribution

It introduces ThermalLoc, the first end-to-end deep learning method specifically designed for thermal image relocalization, integrating local and global feature extraction.

Findings

01

ThermalLoc outperforms existing methods like AtLoc, MapNet, PoseNet, and RobustLoc.

02

It achieves higher accuracy and robustness in thermal camera relocalization.

03

Evaluations on multiple datasets validate its effectiveness.

Abstract

Thermal cameras capture environmental data through heat emission, a fundamentally different mechanism compared to visible light cameras, which rely on pinhole imaging. As a result, traditional visual relocalization methods designed for visible light images are not directly applicable to thermal images. Despite significant advancements in deep learning for camera relocalization, approaches specifically tailored for thermal camera-based relocalization remain underexplored. To address this gap, we introduce ThermalLoc, a novel end-to-end deep learning method for thermal image relocalization. ThermalLoc effectively extracts both local and global features from thermal images by integrating EfficientNet with Transformers, and performs absolute pose regression using two MLP networks. We evaluated ThermalLoc on both the publicly available thermal-odometry dataset and our own dataset. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.