Agile Amulet: Real-Time Salient Object Detection with Contextual   Attention

Pingping Zhang; Luyao Wang; Dong Wang; Huchuan Lu; Chunhua Shen

arXiv:1802.06960·cs.CV·February 21, 2018·20 cites

Agile Amulet: Real-Time Salient Object Detection with Contextual Attention

Pingping Zhang, Luyao Wang, Dong Wang, Huchuan Lu, Chunhua Shen

PDF

Open Access

TL;DR

Agile Amulet introduces a fast, lightweight, and accurate salient object detection framework that leverages a novel contextual attention module and multi-level feature aggregation to improve speed and performance.

Contribution

The paper presents a new framework with a contextual attention module and efficient multi-level feature aggregation, reducing model size and increasing speed and accuracy.

Findings

01

Runs at 30 fps in real-time

02

Achieves higher accuracy on seven benchmarks

03

Model size is reduced to 67 MB

Abstract

This paper proposes an Agile Aggregating Multi-Level feaTure framework (Agile Amulet) for salient object detection. The Agile Amulet builds on previous works to predict saliency maps using multi-level convolutional features. Compared to previous works, Agile Amulet employs some key innovations to improve training and testing speed while also increase prediction accuracy. More specifically, we first introduce a contextual attention module that can rapidly highlight most salient objects or regions with contextual pyramids. Thus, it effectively guides the learning of low-layer convolutional features and tells the backbone network where to look. The contextual attention module is a fully convolutional mechanism that simultaneously learns complementary features and predicts saliency scores at each pixel. In addition, we propose a novel method to aggregate multi-level deep convolutional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Olfactory and Sensory Function Studies

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings