Multi-Person Pose Estimation with Enhanced Feature Aggregation and   Selection

Xixia Xu; Qi Zou; Xue Lin

arXiv:2003.10238·cs.CV·March 24, 2020·1 cites

Multi-Person Pose Estimation with Enhanced Feature Aggregation and Selection

Xixia Xu, Qi Zou, Xue Lin

PDF

Open Access

TL;DR

This paper introduces EFASNet, a novel network for multi-person 2D pose estimation that enhances feature aggregation and selection, leading to more accurate joint localization in crowded scenes.

Contribution

The paper presents a new EFASNet architecture with a Feature Aggregation and Selection Module, a feature fusion strategy, and a Dense Upsampling Convolution module, improving pose estimation accuracy.

Findings

01

Outperforms state-of-the-art methods on CrowdPose, COCO, and MPII datasets.

02

Achieves higher accuracy in crowded and occluded scenes.

03

Demonstrates robustness and superior performance across benchmarks.

Abstract

We propose a novel Enhanced Feature Aggregation and Selection network (EFASNet) for multi-person 2D human pose estimation. Due to enhanced feature representation, our method can well handle crowded, cluttered and occluded scenes. More specifically, a Feature Aggregation and Selection Module (FASM), which constructs hierarchical multi-scale feature aggregation and makes the aggregated features discriminative, is proposed to get more accurate fine-grained representation, leading to more precise joint locations. Then, we perform a simple Feature Fusion (FF) strategy which effectively fuses high-resolution spatial features and low-resolution semantic features to obtain more reliable context information for well-estimated joints. Finally, we build a Dense Upsampling Convolution (DUC) module to generate more precise prediction, which can recover missing joint details that are usually…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications

MethodsConvolution