# Channel reconstruction and dual attention dynamic fusion for remote sensing image semantic segmentation

**Authors:** Xin Wang, Longxing Niu, Zhiwen Zheng, Qun Yang, Jia Lu, Hao Yang, Qin Qin, Guan Lian, Jiawei Wang

PMC · DOI: 10.1371/journal.pone.0343777 · 2026-03-20

## TL;DR

This paper introduces CRDFNet, a new network for remote sensing image segmentation that improves accuracy by combining global and local features.

## Contribution

The novel CRDFNet integrates channel reconstruction and dual attention dynamic fusion for better segmentation of complex remote sensing images.

## Key findings

- CRDFNet outperforms existing methods on multiple datasets in terms of F1 score, OA, and mIoU.
- The channel feature aggregation module enhances high-resolution detail features for complex boundary shapes.
- The dual attention feature refinement module improves small target segmentation accuracy.

## Abstract

As the spatial resolution of remote sensing imagery continues to be improved, the complexity of the information also increases. Remote sensing images generally have characteristics such as wide imaging ranges, dispersed distribution of similar land objects, complex boundary shapes, and dense small targets, which pose severe challenges to semantic segmentation tasks. To address these challenges, we propose a channel reconstruction and dual attention dynamic fusion network (CRDFNet), which is a semantic segmentation network for remote sensing image that can effectively integrate global and local contexts. To better handle complex boundary shapes, we designed a channel feature aggregation module (CFAM), which can extract spatially redundant information during feature fusion and enhance high-resolution detail features. Through a channel reconstruction block, it promotes the alignment of fine-grained information from the encoder with high-level semantic information from the decoder, effectively aggregating multi-scale features extracted by the encoder and significantly improving segmentation accuracy. At the same time, to optimize the segmentation performance of small targets, we propose a dual attention feature refinement module (DAFRM), which achieves precise segmentation of small targets by effectively fuses the shallow spatial features of the encoder and the deep semantic features of the decoder through a dynamic fusion mechanism guided by dual attention. Experimental results on the Potsdam, Vaihingen, UAVid, and MSIDBG datasets demonstrate that CRDFNet outperforms existing methods in terms of F1 score, OA, and mIoU (Intersection over Union), validating its excellent performance.

## Full-text entities

- **Genes:** VIT (vitrin) [NCBI Gene 5212] {aka VIT1}
- **Diseases:** FD (MESH:D000795), OA (MESH:D010003)
- **Chemicals:** CFAM (-)
- **Species:** Kandelia obovata (species) [taxon 413952], Sonneratia apetala (species) [taxon 122813], Homo sapiens (human, species) [taxon 9606], Rhizophora stylosa (species) [taxon 98588], Bruguiera gymnorhiza (Burma mangrove, species) [taxon 39984], Avicennia marina (species) [taxon 82927]

## Figures

50 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13004410/full.md

---
Source: https://tomesphere.com/paper/PMC13004410