MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation

Arjun Jain; Jonathan Tompson; Yann LeCun; Christoph Bregler

arXiv:1409.7963 · cs.CV · September 30, 2014 · 30 cites

Open Access

TL;DR

This paper introduces MoDeep, a deep learning framework that combines motion and color features for human pose estimation in videos. It also introduces a new dataset, FLIC-motion, on which the authors report better performance than existing pose detection methods.

Contribution

The paper presents a convolutional network architecture that incorporates motion features alongside color features, and introduces the FLIC-motion dataset, which extends the FLIC dataset with motion features.

Findings

01 Significantly improved pose detection accuracy

02 Effective use of motion features in deep learning models

03 New dataset extends FLIC with motion information

Abstract

In this work, we propose a novel and efficient method for articulated human pose estimation in videos using a convolutional network architecture, which incorporates both color and motion features. We propose a new human body pose dataset, FLIC-motion, that extends the FLIC dataset with additional motion features. We apply our architecture to this dataset and report significantly better performance than current state-of-the-art pose detection systems.
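
To make the idea of feeding both color and motion features to a network concrete, here is a minimal sketch of one way to build such an input. It assumes a simple per-pixel frame difference as the motion feature; that choice, the function name `stack_color_and_motion`, and the nested-list frame layout are illustrative assumptions, not details taken from the abstract.

```python
from typing import List

Frame = List[List[List[float]]]  # height x width x channels


def stack_color_and_motion(prev_frame: Frame, curr_frame: Frame) -> Frame:
    """Append a per-pixel frame difference to the color channels.

    The difference (curr - prev) is a stand-in motion feature chosen for
    illustration; it is an assumption, not the paper's stated feature.
    """
    if len(prev_frame) != len(curr_frame):
        raise ValueError("frames must have the same height")
    stacked: Frame = []
    for prev_row, curr_row in zip(prev_frame, curr_frame):
        if len(prev_row) != len(curr_row):
            raise ValueError("frames must have the same width")
        row = []
        for prev_px, curr_px in zip(prev_row, curr_row):
            # Motion channels: how much each color channel changed.
            motion = [c - p for c, p in zip(curr_px, prev_px)]
            # Output pixel: current color channels followed by motion channels.
            row.append(list(curr_px) + motion)
        stacked.append(row)
    return stacked


# Two tiny 1x2 RGB frames: the second pixel brightens between frames.
prev = [[[0.0, 0.0, 0.0], [0.25, 0.25, 0.25]]]
curr = [[[0.0, 0.0, 0.0], [0.75, 0.75, 0.75]]]
features = stack_color_and_motion(prev, curr)
```

Each output pixel carries six values: three color channels and three motion channels. A static pixel gets zero motion channels, while a changing pixel gets a nonzero difference, which is the kind of extra signal a convolutional network could consume next to color.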

Taxonomy

Topics: Human Pose and Action Recognition · Video Surveillance and Tracking Methods · Video Analysis and Summarization