# Nested Network with Two-Stream Pyramid for Salient Object Detection in   Optical Remote Sensing Images

**Authors:** Chongyi Li, Runmin Cong, Junhui Hou, Sanyi Zhang, Yue Qian, Sam Kwong

arXiv: 1906.08462 · 2020-01-08

## TL;DR

This paper introduces LV-Net, a novel deep learning architecture with a two-stream pyramid and nested encoder-decoder modules, designed to improve salient object detection in optical remote sensing images with diverse scales and cluttered backgrounds.

## Contribution

The paper presents LV-Net, the first publicly available dataset for optical RSI salient object detection, and demonstrates its superior performance over existing methods.

## Key findings

- LV-Net outperforms state-of-the-art methods quantitatively.
- Constructed the first optical RSI salient object detection dataset.
- Effective multi-scale and detail perception in complex remote sensing images.

## Abstract

Arising from the various object types and scales, diverse imaging orientations, and cluttered backgrounds in optical remote sensing image (RSI), it is difficult to directly extend the success of salient object detection for nature scene image to the optical RSI. In this paper, we propose an end-to-end deep network called LV-Net based on the shape of network architecture, which detects salient objects from optical RSIs in a purely data-driven fashion. The proposed LV-Net consists of two key modules, i.e., a two-stream pyramid module (L-shaped module) and an encoder-decoder module with nested connections (V-shaped module). Specifically, the L-shaped module extracts a set of complementary information hierarchically by using a two-stream pyramid structure, which is beneficial to perceiving the diverse scales and local details of salient objects. The V-shaped module gradually integrates encoder detail features with decoder semantic features through nested connections, which aims at suppressing the cluttered backgrounds and highlighting the salient objects. In addition, we construct the first publicly available optical RSI dataset for salient object detection, including 800 images with varying spatial resolutions, diverse saliency types, and pixel-wise ground truth. Experiments on this benchmark dataset demonstrate that the proposed method outperforms the state-of-the-art salient object detection methods both qualitatively and quantitatively.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.08462/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1906.08462/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/1906.08462/full.md

---
Source: https://tomesphere.com/paper/1906.08462