Conditional Directed Graph Convolution for 3D Human Pose Estimation

Wenbo Hu; Changgong Zhang; Fangneng Zhan; Lei Zhang; Tien-Tsin Wong

arXiv:2107.07797 · cs.CV · August 5, 2021 · 6 cites

TL;DR

This paper introduces a directed graph convolutional network that explicitly models the hierarchical structure of the human skeleton for 3D pose estimation from monocular videos, and reports better results than existing methods on the Human3.6M and MPI-INF-3DHP benchmarks.

Contribution

It proposes a directed graph representation of the human skeleton and a spatial-temporal conditional graph convolution to better capture pose dependencies and hierarchy.
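The directed skeleton representation can be illustrated with a short sketch. The 17-joint parent table below is an assumed layout in the style commonly used with Human3.6M; it is not taken from the paper, and the helper names (`directed_edges`, `directed_adjacency`, `undirected_adjacency`, `depth`) are hypothetical.

```python
# Parent index of each joint; -1 marks the root.
# ASSUMPTION: a commonly used 17-joint layout, not the paper's own table.
PARENTS = [-1, 0, 1, 2, 0, 4, 5, 0, 7, 8, 9, 8, 11, 12, 8, 14, 15]


def directed_edges(parents):
    """Return bones as (parent, child) pairs, directed from parent to child."""
    return [(p, c) for c, p in enumerate(parents) if p >= 0]


def directed_adjacency(parents):
    """adj[p][c] == 1 only when p is the parent of c."""
    n = len(parents)
    adj = [[0] * n for _ in range(n)]
    for p, c in directed_edges(parents):
        adj[p][c] = 1
    return adj


def undirected_adjacency(parents):
    """Symmetrized version, which discards the parent/child order."""
    adj = directed_adjacency(parents)
    n = len(adj)
    return [[adj[i][j] | adj[j][i] for j in range(n)] for i in range(n)]


def depth(parents, joint):
    """Number of bones between a joint and the root."""
    d = 0
    while parents[joint] >= 0:
        joint = parents[joint]
        d += 1
    return d


EDGES = directed_edges(PARENTS)
ADJ = directed_adjacency(PARENTS)
UNDIR = undirected_adjacency(PARENTS)
```

In the directed matrix, `ADJ[0][1]` is set while `ADJ[1][0]` is not, so the hierarchy can be read off the edges; the symmetrized matrix sets both entries and loses that information.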

Findings

01. Outperforms existing methods on Human3.6M and MPI-INF-3DHP datasets.

02. Directed graphs better exploit skeletal hierarchy than undirected graphs.

03. Conditional graph topology adapts to different poses, improving accuracy.

Abstract

Graph convolutional networks have significantly improved 3D human pose estimation by representing the human skeleton as an undirected graph. However, this representation fails to reflect the articulated characteristic of human skeletons as the hierarchical orders among the joints are not explicitly presented. In this paper, we propose to represent the human skeleton as a directed graph with the joints as nodes and bones as edges that are directed from parent joints to child joints. By so doing, the directions of edges can explicitly reflect the hierarchical relationships among the nodes. Based on this representation, we further propose a spatial-temporal conditional directed graph convolution to leverage varying non-local dependence for different poses by conditioning the graph topology on input poses. Altogether, we form a U-shaped network, named U-shaped Conditional Directed Graph…
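The idea of conditioning the graph topology on the input pose can be sketched in a few lines. This is only an illustration under stated assumptions, not the paper's formulation: the input-dependent term here is a softmax over dot-product similarities between joint features, learned weights and the temporal dimension are omitted, and all function names are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def conditional_adjacency(base_adj, feats):
    """Fixed directed adjacency plus an affinity term computed from the input.

    ASSUMPTION: dot-product similarity with a row-wise softmax stands in for
    whatever conditioning the paper actually learns.
    """
    n = len(feats)
    sim = [[sum(a * b for a, b in zip(feats[i], feats[j])) for j in range(n)]
           for i in range(n)]
    attn = [softmax(r) for r in sim]
    return [[base_adj[i][j] + attn[i][j] for j in range(n)] for i in range(n)]


def graph_conv(adj, feats):
    """Aggregate features along edges: out[j] = sum_i adj[i][j] * feats[i]."""
    n, d = len(feats), len(feats[0])
    return [[sum(adj[i][j] * feats[i][k] for i in range(n)) for k in range(d)]
            for j in range(n)]


# Toy 3-joint chain, directed parent -> child: 0 -> 1 -> 2.
BASE = [[0, 1, 0], [0, 0, 1], [0, 0, 0]]
POSE_A = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
POSE_B = [[0.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

ADJ_A = conditional_adjacency(BASE, POSE_A)
ADJ_B = conditional_adjacency(BASE, POSE_B)
OUT_A = graph_conv(ADJ_A, POSE_A)
```

The two toy poses share the same skeleton but yield different effective adjacency matrices, which is the sense in which the topology is conditioned on the input.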

Code & Models

Repositories

tamasino52/U-CondDGCN (PyTorch)

Taxonomy

Topics: Human Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications

Methods: Convolution