Salient Object Detection via High-to-Low Hierarchical Context Aggregation
Yun Liu, Yu Qiu, Le Zhang, JiaWang Bian, Guang-Yu Nie, Ming-Ming Cheng

TL;DR
This paper introduces a high-to-low hierarchical context aggregation method using an hourglass network with intermediate supervision, significantly improving salient object detection accuracy with a simpler approach.
Contribution
It proposes a novel high-to-low hierarchical context aggregation framework with an hourglass network for salient object detection, simplifying fusion strategies while enhancing performance.
Findings
Achieves state-of-the-art results on six challenging datasets.
Uses a high-to-low self-learning approach for contextual feature extraction.
Demonstrates effectiveness of hierarchical context aggregation in saliency detection.
Abstract
Recent progress on salient object detection mainly aims at exploiting how to effectively integrate convolutional side-output features in convolutional neural networks (CNN). Based on this, most of the existing state-of-the-art saliency detectors design complex network structures to fuse the side-output features of the backbone feature extraction networks. However, should the fusion strategies be more and more complex for accurate salient object detection? In this paper, we observe that the contexts of a natural image can be well expressed by a high-to-low self-learning of side-output convolutional features. As we know, the contexts of an image usually refer to the global structures, and the top layers of CNN usually learn to convey global information. On the other hand, it is difficult for the intermediate side-output features to express contextual information. Here, we design an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Image and Video Quality Assessment
