ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification
Pan Zhang, Baochai Peng, Chaoran Lu, Quanjin Huang

TL;DR
ASANet is a novel neural network architecture designed for improved RGB and SAR image land cover classification by explicitly modeling modality-specific features and effectively fusing multimodal data.
Contribution
The paper introduces ASANet, which employs asymmetric feature processing and novel modules for better multimodal feature fusion in land cover classification.
Findings
ASANet outperforms existing methods on three datasets.
Achieves up to 17.69% accuracy improvement.
Runs at 48.7 FPS for 256x256 images.
Abstract
Synthetic Aperture Radar (SAR) images have proven to be a valuable cue for multimodal Land Cover Classification (LCC) when combined with RGB images. Most existing studies on cross-modal fusion assume that consistent feature information is necessary between the two modalities, and as a result, they construct networks without adequately addressing the unique characteristics of each modality. In this paper, we propose a novel architecture, named the Asymmetric Semantic Aligning Network (ASANet), which introduces asymmetry at the feature level to address the issue that multi-modal architectures frequently fail to fully utilize complementary features. The core of this network is the Semantic Focusing Module (SFM), which explicitly calculates differential weights for each modality to account for the modality-specific features. Furthermore, ASANet incorporates a Cascade Fusion Module (CFM),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Image Retrieval and Classification Techniques
