Polarized Self-Attention: Towards High-quality Pixel-wise Regression

Huajun Liu; Fuqiang Liu; Xinyi Fan; Dong Huang

arXiv:2107.00782·cs.CV·July 9, 2021·152 cites

Polarized Self-Attention: Towards High-quality Pixel-wise Regression

Huajun Liu, Fuqiang Liu, Xinyi Fan, Dong Huang

PDF

Open Access 5 Repos

TL;DR

This paper introduces Polarized Self-Attention (PSA), a novel attention mechanism designed to improve pixel-wise regression tasks in computer vision by maintaining high resolution and modeling output distributions effectively.

Contribution

The paper proposes PSA with polarized filtering and enhancement, achieving superior performance in pixel-wise regression tasks compared to existing methods.

Findings

01

Boosts baseline models by 2-4 points.

02

Improves state-of-the-art results by 1-2 points.

03

Effective in 2D pose estimation and semantic segmentation.

Abstract

Pixel-wise regression is probably the most common problem in fine-grained computer vision tasks, such as estimating keypoint heatmaps and segmentation masks. These regression problems are very challenging particularly because they require, at low computation overheads, modeling long-range dependencies on high-resolution inputs/outputs to estimate the highly nonlinear pixel-wise semantics. While attention mechanisms in Deep Convolutional Neural Networks(DCNNs) has become popular for boosting long-range dependencies, element-specific attention, such as Nonlocal blocks, is highly complex and noise-sensitive to learn, and most of simplified attention hybrids try to reach the best compromise among multiple types of tasks. In this paper, we present the Polarized Self-Attention(PSA) block that incorporates two critical designs towards high-quality pixel-wise regression: (1) Polarized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications