Polarized Self-Attention: Towards High-quality Pixel-wise Regression
Huajun Liu, Fuqiang Liu, Xinyi Fan, Dong Huang

TL;DR
This paper introduces Polarized Self-Attention (PSA), a novel attention mechanism designed to improve pixel-wise regression tasks in computer vision by maintaining high resolution and modeling output distributions effectively.
Contribution
The paper proposes PSA with polarized filtering and enhancement, achieving superior performance in pixel-wise regression tasks compared to existing methods.
Findings
Boosts baseline models by 2-4 points.
Improves state-of-the-art results by 1-2 points.
Effective in 2D pose estimation and semantic segmentation.
Abstract
Pixel-wise regression is probably the most common problem in fine-grained computer vision tasks, such as estimating keypoint heatmaps and segmentation masks. These regression problems are very challenging particularly because they require, at low computation overheads, modeling long-range dependencies on high-resolution inputs/outputs to estimate the highly nonlinear pixel-wise semantics. While attention mechanisms in Deep Convolutional Neural Networks(DCNNs) has become popular for boosting long-range dependencies, element-specific attention, such as Nonlocal blocks, is highly complex and noise-sensitive to learn, and most of simplified attention hybrids try to reach the best compromise among multiple types of tasks. In this paper, we present the Polarized Self-Attention(PSA) block that incorporates two critical designs towards high-quality pixel-wise regression: (1) Polarized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
