P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation
Qi Zhang, Guohua Geng, Longquan Yan, Pengbo Zhou, Zhaodi Li, Kang Li, and Qinglin Liu

TL;DR
This paper introduces P-MSDiff, a novel parallel multi-scale diffusion model with a cross-bridge attention mechanism for improved remote sensing image segmentation, demonstrating superior performance on benchmark datasets.
Contribution
The paper proposes P-MSDiff, a multi-scale diffusion architecture with a new CBLA module, enhancing semantic understanding and attention mechanisms in remote sensing segmentation.
Findings
Achieved 1.60% improvement on UAVid dataset
Achieved 1.40% improvement on Vaihingen dataset
Enhanced multi-scale feature integration and attention in segmentation
Abstract
Diffusion models and multi-scale features are essential components in semantic segmentation tasks that deal with remote-sensing images. They contribute to improved segmentation boundaries and offer significant contextual information. U-net-like architectures are frequently employed in diffusion models for segmentation tasks. These architectural designs include dense skip connections that may pose challenges for interpreting intermediate features. Consequently, they might not efficiently convey semantic information throughout various layers of the encoder-decoder architecture. To address these challenges, we propose a new model for semantic segmentation known as the diffusion model with parallel multi-scale branches. This model consists of Parallel Multiscale Diffusion modules (P-MSDiff) and a Cross-Bridge Linear Attention mechanism (CBLA). P-MSDiff enhances the understanding of semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques
MethodsAttention Is All You Need · Softmax · Linear Layer · Multi-Head Attention · Diffusion
