CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation
Xuming Zhang, Xingfa Gu, Qingjiu Tian, and Lorenzo Bruzzone

TL;DR
CoMiX introduces a novel asymmetric encoder-decoder architecture with deformable convolutions and cross-modality feature exchange modules to improve hyperspectral image and supplementary data fusion for semantic segmentation, achieving superior results.
Contribution
This paper presents CoMiX, a new architecture that effectively extracts, calibrates, and fuses multimodal information using deformable convolutions and feature exchange modules for HSI-X segmentation.
Findings
Achieves superior segmentation accuracy on HSI-X datasets.
Effectively captures geometric variations and spatial-spectral features.
Demonstrates strong generalization to various multimodal recognition tasks.
Abstract
Improving hyperspectral image (HSI) semantic segmentation by exploiting complementary information from a supplementary data type (referred to X-modality) is promising but challenging due to differences in imaging sensors, image content, and resolution. Current techniques struggle to enhance modality-specific and modality-shared information, as well as to capture dynamic interaction and fusion between different modalities. In response, this study proposes CoMiX, an asymmetric encoder-decoder architecture with deformable convolutions (DCNs) for HSI-X semantic segmentation. CoMiX is designed to extract, calibrate, and fuse information from HSI and X data. Its pipeline includes an encoder with two parallel and interacting backbones and a lightweight all-multilayer perceptron (ALL-MLP) decoder. The encoder consists of four stages, each incorporating 2D DCN blocks for the X model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
