TL;DR
This paper introduces a new learning-based method for 3D human pose estimation from asynchronous event camera data, demonstrating improved accuracy and robustness over existing RGB-based methods, especially in fast-moving scenarios.
Contribution
It presents the first learning-based approach for 3D human pose estimation from a single stream of event data and provides a new dataset for this task.
Findings
Achieves solid accuracy in 3D pose estimation from event data
Narrowing the performance gap between RGB and event-based methods
Provides a new dataset for event-based human pose estimation
Abstract
This paper presents a novel 3D human pose estimation approach using a single stream of asynchronous events as input. Most of the state-of-the-art approaches solve this task with RGB cameras, however struggling when subjects are moving fast. On the other hand, event-based 3D pose estimation benefits from the advantages of event-cameras, especially their efficiency and robustness to appearance changes. Yet, finding human poses in asynchronous events is in general more challenging than standard RGB pose estimation, since little or no events are triggered in static scenes. Here we propose the first learning-based method for 3D human pose from a single stream of events. Our method consists of two steps. First, we process the event-camera stream to predict three orthogonal heatmaps per joint; each heatmap is the projection of of the joint onto one orthogonal plane. Next, we fuse the sets of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHeatmap
