MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation
Arjun Jain, Jonathan Tompson, Yann LeCun, Christoph Bregler

TL;DR
This paper introduces MoDeep, a deep learning framework that leverages motion and color features for improved human pose estimation in videos, supported by a new dataset and outperforming existing methods.
Contribution
The paper presents a novel convolutional network architecture that integrates motion features and introduces the FLIC-motion dataset for enhanced pose estimation.
Findings
Significantly improved pose detection accuracy
Effective use of motion features in deep learning models
New dataset extends FLIC with motion information
Abstract
In this work, we propose a novel and efficient method for articulated human pose estimation in videos using a convolutional network architecture, which incorporates both color and motion features. We propose a new human body pose dataset, FLIC-motion, that extends the FLIC dataset with additional motion features. We apply our architecture to this dataset and report significantly better performance than current state-of-the-art pose detection systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Video Analysis and Summarization
