Agile Amulet: Real-Time Salient Object Detection with Contextual Attention
Pingping Zhang, Luyao Wang, Dong Wang, Huchuan Lu, Chunhua Shen

TL;DR
Agile Amulet introduces a fast, lightweight, and accurate salient object detection framework that leverages a novel contextual attention module and multi-level feature aggregation to improve speed and performance.
Contribution
The paper presents a new framework with a contextual attention module and efficient multi-level feature aggregation, reducing model size and increasing speed and accuracy.
Findings
Runs at 30 fps in real-time
Achieves higher accuracy on seven benchmarks
Model size is reduced to 67 MB
Abstract
This paper proposes an Agile Aggregating Multi-Level feaTure framework (Agile Amulet) for salient object detection. The Agile Amulet builds on previous works to predict saliency maps using multi-level convolutional features. Compared to previous works, Agile Amulet employs some key innovations to improve training and testing speed while also increase prediction accuracy. More specifically, we first introduce a contextual attention module that can rapidly highlight most salient objects or regions with contextual pyramids. Thus, it effectively guides the learning of low-layer convolutional features and tells the backbone network where to look. The contextual attention module is a fully convolutional mechanism that simultaneously learns complementary features and predicts saliency scores at each pixel. In addition, we propose a novel method to aggregate multi-level deep convolutional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Olfactory and Sensory Function Studies
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
