Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3
H\"urkan \c{S}ahin, Huy Xuan Pham, Van Huyen Dang, Alper Yegenoglu, Erdal Kayacan

TL;DR
This paper presents a novel thermal image refinement and depth estimation pipeline using recurrent networks, integrated into ORB-SLAM3 for robust UAV navigation in low-light environments, trained on non-radiometric thermal data.
Contribution
Introduces a lightweight recurrent network for thermal image refinement and depth estimation, enabling thermal-only SLAM without high-cost radiometric cameras.
Findings
Achieves approximately 0.06 relative error on radiometric datasets.
Maintains trajectory error under 0.4 meters in thermal-only SLAM.
Demonstrates robustness in low-light UAV navigation environments.
Abstract
Autonomous navigation in GPS-denied and visually degraded environments remains challenging for unmanned aerial vehicles (UAVs). To this end, we investigate the use of a monocular thermal camera as a standalone sensor on a UAV platform for real-time depth estimation and simultaneous localization and mapping (SLAM). To extract depth information from thermal images, we propose a novel pipeline employing a lightweight supervised network with recurrent blocks (RBs) integrated to capture temporal dependencies, enabling more robust predictions. The network combines lightweight convolutional backbones with a thermal refinement network (T-RefNet) to refine raw thermal inputs and enhance feature visibility. The refined thermal images and predicted depth maps are integrated into ORB-SLAM3, enabling thermal-only localization. Unlike previous methods, the network is trained on a custom…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Neural Network Applications
