Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation
Xu Zheng, Yuanhuiyi Lyu, Jiazhou Zhou, Lin Wang

TL;DR
This paper introduces MAGIC, a flexible multi-modal fusion method for semantic segmentation that identifies and leverages the most robust modalities, achieving state-of-the-art results and resilience against sensor failures.
Contribution
MAGIC is a novel, plug-and-play framework that adaptively selects and fuses modalities, reducing dependence on RGB and enhancing robustness in multi-modal semantic segmentation.
Findings
Achieves state-of-the-art performance in multi-modal segmentation.
Reduces model parameters by 60%.
Outperforms prior methods by +19.41% mIoU in modality-agnostic setting.
Abstract
Fusing an arbitrary number of modalities is vital for achieving robust multi-modal fusion of semantic segmentation yet remains less explored to date. Recent endeavors regard RGB modality as the center and the others as the auxiliary, yielding an asymmetric architecture with two branches. However, the RGB modality may struggle in certain circumstances, e.g., nighttime, while others, e.g., event data, own their merits; thus, it is imperative for the fusion model to discern robust and fragile modalities, and incorporate the most robust and fragile ones to learn a resilient multi-modal framework. To this end, we propose a novel method, named MAGIC, that can be flexibly paired with various backbones, ranging from compact to high-performance models. Our method comprises two key plug-and-play modules. Firstly, we introduce a multi-modal aggregation module to efficiently process features from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
