3D Human Pose Estimation via Spatial Graph Order Attention and Temporal Body Aware Transformer
Kamel Aouaidjia, Aofan Li, Wenhao Zhang, Chongsheng Zhang

TL;DR
This paper introduces a novel 3D human pose estimation method combining graph order attention and a body-aware transformer to better model spatial and temporal dependencies, achieving state-of-the-art results.
Contribution
It proposes a new graph modeling approach with multiple graph orders and a graph order attention module, along with a temporal body-aware transformer that incorporates pose centrality.
Findings
Effective on Human3.6m, MPIINF-3DHP, and HumanEva-I datasets.
Outperforms existing methods in 3D pose estimation accuracy.
Code and models are publicly available.
Abstract
Nowadays, Transformers and Graph Convolutional Networks (GCNs) are the prevailing techniques for 3D human pose estimation. However, Transformer-based methods either ignore the spatial neighborhood relationships between the joints when used for skeleton representations or disregard the local temporal patterns of the local joint movements in skeleton sequence modeling, while GCN-based methods often neglect the need for pose-specific representations. To address these problems, we propose a new method that exploits the graph modeling capability of GCN to represent each skeleton with multiple graphs of different orders, incorporated with a newly introduced Graph Order Attention module that dynamically emphasizes the most representative orders for each joint. The resulting spatial features of the sequence are further processed using a proposed temporal Body Aware Transformer that models the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Hand Gesture Recognition Systems
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Attentive Walk-Aggregating Graph Neural Network · Adam · Dropout · Layer Normalization · Focus · Position-Wise Feed-Forward Layer
