Deep Poselets for Human Detection

Lubomir Bourdev; Fei Yang; Rob Fergus

arXiv:1407.0717·cs.CV·July 4, 2014·21 cites

Deep Poselets for Human Detection

Lubomir Bourdev, Fei Yang, Rob Fergus

PDF

Open Access

TL;DR

This paper introduces a novel human detection method using poselets and CNNs, achieving state-of-the-art results by leveraging weakly labeled data and a new feature representation called PDF vectors.

Contribution

It presents a bootstrapping approach to gather large-scale poselet data and a CNN-based pose discriminative feature for improved human detection.

Findings

01

Achieved state-of-the-art performance on PASCAL datasets.

02

Developed a compact 256-dimensional PDF feature vector.

03

Combined poselet and object-level CNNs for enhanced detection.

Abstract

We address the problem of detecting people in natural scenes using a part approach based on poselets. We propose a bootstrapping method that allows us to collect millions of weakly labeled examples for each poselet type. We use these examples to train a Convolutional Neural Net to discriminate different poselet types and separate them from the background class. We then use the trained CNN as a way to represent poselet patches with a Pose Discriminative Feature (PDF) vector -- a compact 256-dimensional feature vector that is effective at discriminating pose from appearance. We train the poselet model on top of PDF features and combine them with object-level CNNs for detection and bounding box prediction. The resulting model leads to state-of-the-art performance for human detection on the PASCAL datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications