Pixel-wise Attentional Gating for Parsimonious Pixel Labeling
Shu Kong, Charless Fowlkes

TL;DR
This paper introduces Pixel-wise Attentional Gating (PAG), a versatile mechanism that enables deep networks to selectively process spatial locations, improving efficiency and performance in various pixel labeling tasks without significant computational overhead.
Contribution
The paper presents PAG, a generic, architecture-independent gating mechanism that learns to dynamically allocate computation across spatial locations in deep networks for pixel labeling.
Findings
PAG achieves state-of-the-art or competitive results on multiple pixel labeling tasks.
PAG reduces computation by approximately 10% with minimal accuracy loss.
Dynamic spatial allocation improves performance trade-offs over static or layer-skipping methods.
Abstract
To achieve parsimonious inference in per-pixel labeling tasks with a limited computational budget, we propose a \emph{Pixel-wise Attentional Gating} unit (\emph{PAG}) that learns to selectively process a subset of spatial locations at each layer of a deep convolutional network. PAG is a generic, architecture-independent, problem-agnostic mechanism that can be readily "plugged in" to an existing model with fine-tuning. We utilize PAG in two ways: 1) learning spatially varying pooling fields that improve model performance without the extra computation cost associated with multi-scale pooling, and 2) learning a dynamic computation policy for each pixel to decrease total computation while maintaining accuracy. We extensively evaluate PAG on a variety of per-pixel labeling tasks, including semantic segmentation, boundary detection, monocular depth and surface normal estimation. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Visual Attention and Saliency Detection · Cell Image Analysis Techniques
