Robust pedestrian detection in thermal imagery using synthesized images
My Kieu, Lorenzo Berlincioni, Leonardo Galteri, Marco Bertini, Andrew, D. Bagdanov, Alberto Del Bimbo

TL;DR
This paper introduces a two-stage method combining generative data augmentation and domain adaptation to enhance pedestrian detection in thermal images, achieving state-of-the-art results with less real thermal data.
Contribution
The paper presents a novel approach using GAN-generated thermal images to improve thermal pedestrian detection and domain adaptation of RGB detectors.
Findings
Achieves state-of-the-art results on KAIST benchmark with limited real thermal data.
GAN-generated images effectively augment training data, improving detection performance.
Adding synthetic images enhances detection even with more real thermal data.
Abstract
In this paper we propose a method for improving pedestrian detection in the thermal domain using two stages: first, a generative data augmentation approach is used, then a domain adaptation method using generated data adapts an RGB pedestrian detector. Our model, based on the Least-Squares Generative Adversarial Network, is trained to synthesize realistic thermal versions of input RGB images which are then used to augment the limited amount of labeled thermal pedestrian images available for training. We apply our generative data augmentation strategy in order to adapt a pretrained YOLOv3 pedestrian detector to detection in the thermal-only domain. Experimental results demonstrate the effectiveness of our approach: using less than 50\% of available real thermal training data, and relying on synthesized data generated by our model in the domain adaptation phase, our detector achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAverage Pooling · 1x1 Convolution · Softmax · Global Average Pooling · Residual Connection · Batch Normalization · Convolution · BNB Customer Service Number +1-833-534-1729 · k-Means Clustering · Logistic Regression
