MOS: Mitigating Optical-SAR Modality Gap for Cross-Modal Ship Re-Identification
Yujian Zhao, Hankun Liu, Guanglin Niu

TL;DR
This paper introduces MOS, a framework that reduces the modality gap between optical and SAR images to improve cross-modal ship re-identification, using representation learning and data synthesis techniques.
Contribution
MOS is the first framework to effectively mitigate the optical-SAR modality gap for ship ReID, combining modality alignment and synthetic data generation.
Findings
Significantly outperforms state-of-the-art methods on HOSS dataset.
Achieves +16.4% R1 accuracy in SAR to Optical setting.
Demonstrates robust modality alignment and discriminability.
Abstract
Cross-modal ship re-identification (ReID) between optical and synthetic aperture radar (SAR) imagery has recently emerged as a critical yet underexplored task in maritime intelligence and surveillance. However, the substantial modality gap between optical and SAR images poses a major challenge for robust identification. To address this issue, we propose MOS, a novel framework designed to mitigate the optical-SAR modality gap and achieve modality-consistent feature learning for optical-SAR cross-modal ship ReID. MOS consists of two core components: (1) Modality-Consistent Representation Learning (MCRL) applies denoise SAR image procession and a class-wise modality alignment loss to align intra-identity feature distributions across modalities. (2) Cross-modal Data Generation and Feature fusion (CDGF) leverages a brownian bridge diffusion model to synthesize cross-modal samples, which are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced SAR Imaging Techniques · Advanced Neural Network Applications · Synthetic Aperture Radar (SAR) Applications and Techniques
