Conditional Directed Graph Convolution for 3D Human Pose Estimation
Wenbo Hu, Changgong Zhang, Fangneng Zhan, Lei Zhang, Tien-Tsin Wong

TL;DR
This paper introduces a novel directed graph convolutional network that explicitly models the hierarchical structure of human skeletons for improved 3D pose estimation from monocular videos, achieving top performance on benchmark datasets.
Contribution
It proposes a directed graph representation of the human skeleton and a spatial-temporal conditional graph convolution to better capture pose dependencies and hierarchy.
Findings
Outperforms existing methods on Human3.6M and MPI-INF-3DHP datasets.
Directed graphs better exploit skeletal hierarchy than undirected graphs.
Conditional graph topology adapts to different poses, improving accuracy.
Abstract
Graph convolutional networks have significantly improved 3D human pose estimation by representing the human skeleton as an undirected graph. However, this representation fails to reflect the articulated characteristic of human skeletons as the hierarchical orders among the joints are not explicitly presented. In this paper, we propose to represent the human skeleton as a directed graph with the joints as nodes and bones as edges that are directed from parent joints to child joints. By so doing, the directions of edges can explicitly reflect the hierarchical relationships among the nodes. Based on this representation, we further propose a spatial-temporal conditional directed graph convolution to leverage varying non-local dependence for different poses by conditioning the graph topology on input poses. Altogether, we form a U-shaped network, named U-shaped Conditional Directed Graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
MethodsConvolution
