DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation   Model

Eldar Insafutdinov; Leonid Pishchulin; Bjoern Andres; Mykhaylo; Andriluka; and Bernt Schiele

arXiv:1605.03170·cs.CV·December 1, 2016·116 cites

DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model

Eldar Insafutdinov, Leonid Pishchulin, Bjoern Andres, Mykhaylo, Andriluka, and Bernt Schiele

PDF

Open Access 5 Repos 1 Models

TL;DR

This paper introduces an advanced multi-person pose estimation model that improves detection accuracy and speed by integrating better body part proposals, image-conditioned pairwise terms, and an efficient optimization strategy, outperforming existing methods.

Contribution

It presents a novel multi-person pose estimation framework with improved detectors, pairwise terms, and optimization, achieving state-of-the-art results and faster performance.

Findings

01

Significantly outperforms previous multi-person pose estimation methods.

02

Demonstrates competitive performance on single-person pose estimation.

03

Achieves faster inference with improved accuracy.

Abstract

The goal of this paper is to advance the state-of-the-art of articulated pose estimation in scenes with multiple people. To that end we contribute on three fronts. We propose (1) improved body part detectors that generate effective bottom-up proposals for body parts; (2) novel image-conditioned pairwise terms that allow to assemble the proposals into a variable number of consistent body part configurations; and (3) an incremental optimization strategy that explores the search space more efficiently thus leading both to better performance and significant speed-up factors. Evaluation is done on two single-person and two multi-person pose estimation benchmarks. The proposed approach significantly outperforms best known multi-person pose estimation results while demonstrating competitive performance on the task of single person pose estimation. Models and code available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
mwmathis/DeepLabCutModelZoo-DLC_human_fullbody_resnet_101
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging

MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling