A Context-and-Spatial Aware Network for Multi-Person Pose Estimation

Dongdong Yu; Kai Su; Xin Geng; Changhu Wang

arXiv:1905.05355 · cs.CV · May 15, 2019 · 6 cites

Open Access

TL;DR

This paper introduces CSANet, a network that combines a Context Aware Path, a Spatial Aware Path, and a Heavy Head Path to capture both context and spatial information for multi-person pose estimation, achieving state-of-the-art results on the COCO keypoint benchmark.

Contribution

The paper proposes a new network architecture with dedicated context and spatial paths, enhancing feature extraction for pose estimation.

Findings

01. Outperforms existing methods on the COCO keypoint benchmark

02. Effectively integrates context and spatial information

03. Validates the importance of combined feature paths

Abstract

Multi-person pose estimation is a fundamental yet challenging task in computer vision. Both rich context information and spatial information are required to precisely locate the keypoints of all persons in an image. In this paper, a novel Context-and-Spatial Aware Network (CSANet), which integrates both a Context Aware Path and a Spatial Aware Path, is proposed to obtain effective features involving both context information and spatial information. Specifically, we design a Context Aware Path with a structure supervision strategy and a spatial pyramid pooling strategy to enhance the context information. Meanwhile, a Spatial Aware Path is proposed to preserve the spatial information, which also shortens the information propagation path from low-level features to high-level features. On top of these two paths, we employ a Heavy Head Path to further combine and enhance the features effectively.…
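
The abstract names spatial pyramid pooling as one ingredient of the Context Aware Path but does not say which variant is used. As a rough illustration only, the sketch below shows the generic idea: a single-channel feature map is max-pooled over progressively finer grids, and the results are concatenated into a fixed-length descriptor that mixes global and local context. The function name `spatial_pyramid_pool` and the grid sizes in `levels` are assumptions made for this example and are not taken from the paper.

```python
def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a 2D feature map over an n x n grid for each n in `levels`.

    `feature_map` is a non-empty list of equal-length rows of numbers.
    Returns a flat list of length sum(n * n for n in levels), independent
    of the input height and width.

    Hypothetical helper for illustration; not the paper's implementation.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    pooled = []
    for n in levels:
        for i in range(n):
            # Row range of bin i: floor for the start, ceil for the end,
            # so the bins cover the whole map and are never empty.
            r0 = (i * height) // n
            r1 = ((i + 1) * height + n - 1) // n
            for j in range(n):
                c0 = (j * width) // n
                c1 = ((j + 1) * width + n - 1) // n
                pooled.append(
                    max(
                        feature_map[r][c]
                        for r in range(r0, r1)
                        for c in range(c0, c1)
                    )
                )
    return pooled


# Usage: a 4 x 4 map with values 0..15 in row-major order.
fm = [[4 * r + c for c in range(4)] for r in range(4)]
print(spatial_pyramid_pool(fm, levels=(1, 2)))  # [15, 5, 7, 13, 15]
```

The coarsest level (a 1 x 1 grid) summarizes the whole map, while finer levels keep some positional detail. Segmentation-style pyramid pooling modules typically upsample the pooled maps and concatenate them with the input features rather than flattening them; whether CSANet does so is not stated in the text above.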

Taxonomy

Topics: Human Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications

Methods: Spatial Pyramid Pooling