Object Segmentation by Mining Cross-Modal Semantics
Zongwei Wu, Jingjing Wang, Zhuyun Zhou, Zhaochong An, Qiuping Jiang,, C\'edric Demonceaux, Guolei Sun, Radu Timofte

TL;DR
This paper introduces XMSNet, a novel multi-sensor object segmentation network that effectively mines cross-modal semantics to improve fusion, decoding, and robustness across various datasets and challenging tasks.
Contribution
The paper proposes a new network, XMSNet, which explicitly models shared and specific features across modalities to enhance segmentation accuracy and robustness.
Findings
Outperforms existing methods on eleven datasets.
Effective in salient and camouflage object segmentation.
Demonstrates robustness to sensor noise and calibration errors.
Abstract
Multi-sensor clues have shown promise for object segmentation, but inherent noise in each sensor, as well as the calibration error in practice, may bias the segmentation accuracy. In this paper, we propose a novel approach by mining the Cross-Modal Semantics to guide the fusion and decoding of multimodal features, with the aim of controlling the modal contribution based on relative entropy. We explore semantics among the multimodal inputs in two aspects: the modality-shared consistency and the modality-specific variation. Specifically, we propose a novel network, termed XMSNet, consisting of (1) all-round attentive fusion (AF), (2) coarse-to-fine decoder (CFD), and (3) cross-layer self-supervision. On the one hand, the AF block explicitly dissociates the shared and specific representation and learns to weight the modal contribution by adjusting the \textit{proportion, region,} and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Olfactory and Sensory Function Studies · Infrared Target Detection Methodologies
