Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object Detection
Mohsen Vadidar, Ali Kariminezhad, Christian Mayr, Laurent Kloeker and, Lutz Eckstein

TL;DR
This paper presents a multi-modal perception pipeline combining RGB and infrared sensors with a novel attention module for robust object detection in autonomous driving, outperforming existing methods by 10% mAP.
Contribution
It introduces a unified learning pipeline that fuses RGB and thermal data using a new entropy-block attention module, enhancing detection accuracy in varying lighting conditions.
Findings
RGB-thermal fusion network achieves 82.9% mAP
The proposed EBAM improves detection performance
Multi-modal approach enhances robustness in diverse environments
Abstract
The RGB complementary metal-oxidesemiconductor (CMOS) sensor works within the visible light spectrum. Therefore it is very sensitive to environmental light conditions. On the contrary, a long-wave infrared (LWIR) sensor operating in 8-14 micro meter spectral band, functions independent of visible light. In this paper, we exploit both visual and thermal perception units for robust object detection purposes. After delicate synchronization and (cross-) labeling of the FLIR [1] dataset, this multi-modal perception data passes through a convolutional neural network (CNN) to detect three critical objects on the road, namely pedestrians, bicycles, and cars. After evaluation of RGB and infrared (thermal and infrared are often used interchangeably) sensors separately, various network structures are compared to fuse the data at the feature level effectively. Our RGB-thermal (RGBT) fusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · CCD and CMOS Imaging Sensors · Advanced Chemical Sensor Technologies
