MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection
Heitor R. Medeiros, David Latortue, Eric Granger, Marco Pedersoli

TL;DR
This paper introduces MiPa, a novel training technique that enables a single transformer-based object detection model to effectively learn from both RGB and IR modalities, achieving competitive results with reduced memory usage.
Contribution
MiPa proposes a patch-mixing training method and a modality-agnostic module to balance multimodal learning in a single encoder, addressing modality imbalance issues.
Findings
MiPa achieves competitive detection performance on RGB/IR benchmarks.
The method enables inference with only one modality, reducing computational requirements.
MiPa effectively balances modality influence during training.
Abstract
In real-world scenarios, using multiple modalities like visible (RGB) and infrared (IR) can greatly improve the performance of a predictive task such as object detection (OD). Multimodal learning is a common way to leverage these modalities, where multiple modality-specific encoders and a fusion module are used to improve performance. In this paper, we tackle a different way to employ RGB and IR modalities, where only one modality or the other is observed by a single shared vision encoder. This realistic setting requires a lower memory footprint and is more suitable for applications such as autonomous driving and surveillance, which commonly rely on RGB and IR data. However, when learning a single encoder on multiple modalities, one modality can dominate the other, producing uneven recognition results. This work investigates how to efficiently leverage RGB and IR modalities to train a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies
