Fusion for Visual-Infrared Person ReID in Real-World Surveillance Using Corrupted Multimodal Data
Arthur Josi, Mahdi Alehdaghi, Rafael M. O. Cruz, Eric Granger

TL;DR
This paper introduces a robust multimodal fusion model for visible-infrared person re-identification that effectively handles corrupted data and outperforms existing methods in real-world scenarios.
Contribution
The paper proposes the MMSF model and new evaluation datasets, along with data augmentation strategies, to improve robustness of V-I ReID under corrupted multimodal conditions.
Findings
MMSF outperforms existing methods in co-located and non-co-located camera scenarios.
ML-MDA improves model robustness to corrupted multimodal images.
Proposed datasets enable realistic evaluation of multimodal ReID models.
Abstract
Visible-infrared person re-identification (V-I ReID) seeks to match images of individuals captured over a distributed network of RGB and IR cameras. The task is challenging due to the significant differences between V and I modalities, especially under real-world conditions, where images are corrupted by, e.g, blur, noise, and weather. Indeed, state-of-art V-I ReID models cannot leverage corrupted modality information to sustain a high level of accuracy. In this paper, we propose an efficient model for multimodal V-I ReID -- named Multimodal Middle Stream Fusion (MMSF) -- that preserves modality-specific knowledge for improved robustness to corrupted multimodal images. In addition, three state-of-art attention-based multimodal fusion models are adapted to address corrupted multimodal data in V-I ReID, allowing to dynamically balance each modality importance. Recently, evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Infrared Target Detection Methodologies
MethodsNeighborhood Contrastive Learning
