PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency   Detection

Nian Liu; Junwei Han; Ming-Hsuan Yang

arXiv:1812.06314·cs.CV·December 18, 2018·1 cites

PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency Detection

Nian Liu, Junwei Han, Ming-Hsuan Yang

PDF

Open Access 2 Repos

TL;DR

PiCANet introduces pixel-wise attention mechanisms that selectively focus on relevant contextual regions, significantly improving saliency detection and demonstrating versatility across semantic segmentation and object detection tasks.

Contribution

The paper proposes a novel pixel-wise contextual attention network, PiCANet, which learns to attend to informative context locations for each pixel, enhancing saliency detection accuracy.

Findings

01

PiCANet improves saliency detection performance over state-of-the-art methods.

02

Global and local attention mechanisms effectively incorporate contrast and smoothness.

03

PiCANet enhances semantic segmentation and object detection results.

Abstract

In saliency detection, every pixel needs contextual information to make saliency prediction. Previous models usually incorporate contexts holistically. However, for each pixel, usually only part of its context region is useful and contributes to its prediction, while some other part may serve as noises and distractions. In this paper, we propose a novel pixel-wise contextual attention network, \ie PiCANet, to learn to selectively attend to informative context locations at each pixel. Specifically, PiCANet generates an attention map over the context region of each pixel, where each attention weight corresponds to the relevance of a context location w.r.t the referred pixel. Then, attentive contextual features can be constructed via selectively incorporating the features of useful context locations with the learned attention. We propose three specific formulations of the PiCANet via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image Fusion Techniques

MethodsConcatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · U-Net · Convolution