Focus On What's Important: Self-Attention Model for Human Pose   Estimation

Guanxiong Sun; Chengqin Ye; Kuanquan Wang

arXiv:1809.08371·cs.CV·April 5, 2019

Focus On What's Important: Self-Attention Model for Human Pose Estimation

Guanxiong Sun, Chengqin Ye, Kuanquan Wang

PDF

Open Access

TL;DR

This paper introduces ACNN, a multi-stage self-attention convolutional network that effectively filters redundant regions and improves human pose estimation accuracy without extra annotations.

Contribution

It proposes a novel attention-based architecture for human pose estimation that learns to focus on important regions, enhancing accuracy and efficiency.

Findings

01

Achieved state-of-the-art results on MPII benchmark.

02

Effectively filters out background and redundant regions.

03

Does not require additional manual annotations.

Abstract

Human pose estimation is an essential yet challenging task in computer vision. One of the reasons for this difficulty is that there are many redundant regions in the images. In this work, we proposed a convolutional network architecture combined with the novel attention model. We named it attention convolutional neural network (ACNN). ACNN learns to focus on specific regions of different input features. It's a multi-stage architecture. Early stages filtrate the "nothing-regions", such as background and redundant body parts. And then, they submit the important regions which contain the joints of the human body to the following stages to get a more accurate result. What's more, it does not require extra manual annotations and self-learning is one of our intentions. We separately trained the network because the attention learning task and the pose estimation task are not independent.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications