TL;DR
Camo-M3FD introduces a new benchmark dataset for cross-spectral camouflaged pedestrian detection, addressing a critical gap in safety-critical applications by combining visible and thermal imagery with high-quality annotations.
Contribution
The paper presents a novel dataset with registered visible-thermal image pairs and a standardized evaluation framework for camouflaged pedestrian detection.
Findings
Thermal signals are crucial for localization.
Multispectral fusion improves structural detail detection.
The dataset enables benchmarking of state-of-the-art models.
Abstract
Pedestrian detection is fundamental to autonomous driving, robotics, and surveillance. Despite progress in deep learning, reliable identification remains challenging due to occlusions, cluttered backgrounds, and degraded visibility. While multispectral detection-combining visible and thermal sensors-mitigates poor visibility, the challenge of camouflaged pedestrians remains largely unexplored. Existing Camouflaged Object Detection (COD) benchmarks focus on biological species, leaving a gap in safety-critical human detection where targets blend into their surroundings. To address this, we introduce Camo-M3FD (derived from the M3FD dataset), a novel benchmark for cross-spectral camouflaged pedestrian detection, consisting of registered visible-thermal image pairs. The dataset is curated using quantitative metrics to ensure high foreground-background similarity. We provide high-quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
