Learning Temporal 3D Human Pose Estimation with Pseudo-Labels

Arij Bouazizi; Ulrich Kressel; Vasileios Belagiannis

arXiv:2110.07578·cs.CV·October 15, 2021

Learning Temporal 3D Human Pose Estimation with Pseudo-Labels

Arij Bouazizi, Ulrich Kressel, Vasileios Belagiannis

PDF

Open Access 2 Repos

TL;DR

This paper introduces a self-supervised approach for 3D human pose estimation that leverages temporal information and multi-view triangulation to improve accuracy, achieving state-of-the-art results on standard benchmarks.

Contribution

It proposes a novel method combining temporal convolutional networks with multi-view triangulation for self-supervised 3D human pose estimation.

Findings

01

Achieves state-of-the-art results on Human3.6M and MPI-INF-3DHP datasets.

02

Effectively uses temporal and multi-view information during training.

03

Operates with single-view 2D inputs during inference.

Abstract

We present a simple, yet effective, approach for self-supervised 3D human pose estimation. Unlike the prior work, we explore the temporal information next to the multi-view self-supervision. During training, we rely on triangulating 2D body pose estimates of a multiple-view camera system. A temporal convolutional neural network is trained with the generated 3D ground-truth and the geometric multi-view consistency loss, imposing geometrical constraints on the predicted 3D body skeleton. During inference, our model receives a sequence of 2D body pose estimates from a single-view to predict the 3D body pose for each of them. An extensive evaluation shows that our method achieves state-of-the-art performance in the Human3.6M and MPI-INF-3DHP benchmarks. Our code and models are publicly available at \url{https://github.com/vru2020/TM_HPE/}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging