# MCFNet: Multi-Scale Contextual Fusion Network for Salient Object Detection in Optical Remote Sensing Images

**Authors:** Jinting Ding, Yueqian Quan, Honghui Xu

PMC · DOI: 10.3390/s25103035 · Sensors (Basel, Switzerland) · 2025-05-12

## TL;DR

This paper introduces MCFNet, a new deep learning network for detecting salient objects in optical remote sensing images, improving accuracy and boundary delineation.

## Contribution

The novel MCFNet combines a Semantic-Aware Attention Module and Contextual Interconnection Module for enhanced salient object detection in remote sensing.

## Key findings

- MCFNet outperforms existing methods on standard ORSI-SOD benchmark datasets.
- The network achieves robust and efficient performance in challenging remote sensing scenarios.
- MCFNet improves localization accuracy and boundary delineation for complex and varying-scale objects.

## Abstract

The rapid advancement of deep learning has catalyzed progress in salient object detection (SOD), extending its impact to the domain of optical remote sensing images (ORSIs). Despite increasing attention, salient object detection for optical remote sensing images (ORSI-SOD) remains highly challenging due to the intrinsic complexities of remote sensing scenes. In particular, severe variations in object scale and quantity, cluttered backgrounds, and irregular object morphologies significantly hinder accurate target localization and boundary delineation. In response to these challenges, we introduce the Multi-scale Contextual Fusion Network (MCFNet) for ORSI-SOD. MCFNet incorporates a Semantic-Aware Attention Module (SAM), which provides explicit semantic guidance during feature extraction. By producing preliminary semantic masks, SAM enables the network to capture long-range contextual dependencies, thereby enhancing localization accuracy for salient objects exhibiting substantial scale variation and structural complexity. In addition, MCFNet integrates a Contextual Interconnection Module (CIM), which promotes effective fusion of local and global contextual features. By facilitating cross-layer interactions and adopting a multiscale refinement strategy, CIM enriches texture representations while suppressing background interference, leading to smoother object boundaries and more precise delineation of salient regions. Extensive evaluations conducted on three standard ORSI-SOD benchmark datasets demonstrate the superior performance of MCFNet compared to existing methods, highlighting its robustness and efficiency in handling challenging remote sensing scenarios.

## Full-text entities

- **Genes:** SOD1 (superoxide dismutase 1) [NCBI Gene 6647] {aka ALS, ALS1, HEL-S-44, IPOA, SOD, STAHP}
- **Diseases:** SAM (MESH:D057180), injury to (MESH:D014947), WITHOUT (MESH:D001321), CIM (MESH:C538399)
- **Chemicals:** SAM (-), CB (MESH:C063451)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12114842/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12114842/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12114842/full.md

---
Source: https://tomesphere.com/paper/PMC12114842