Cascaded information enhancement and cross-modal attention feature fusion for multispectral pedestrian detection
Yang Yang, Kaixiong Xu, Kaizheng Wang

TL;DR
This paper introduces a novel multispectral pedestrian detection method combining cascaded information enhancement and cross-modal attention fusion, significantly improving detection accuracy by effectively suppressing background noise and leveraging complementary features.
Contribution
The paper proposes a new multispectral pedestrian detection algorithm with cascaded attention modules and cross-modal fusion, enhancing feature representation and detection accuracy over existing methods.
Findings
Lower pedestrian miss rate compared to previous methods
More accurate detection boxes in experiments
Effective suppression of background noise
Abstract
Multispectral pedestrian detection is a technology designed to detect and locate pedestrians in Color and Thermal images, which has been widely used in automatic driving, video surveillance, etc. So far most available multispectral pedestrian detection algorithms only achieved limited success in pedestrian detection because of the lacking take into account the confusion of pedestrian information and background noise in Color and Thermal images. Here we propose a multispectral pedestrian detection algorithm, which mainly consists of a cascaded information enhancement module and a cross-modal attention feature fusion module. On the one hand, the cascaded information enhancement module adopts the channel and spatial attention mechanism to perform attention weighting on the features fused by the cascaded feature fusion block. Moreover, it multiplies the single-modal features with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Automated Road and Building Extraction
