MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network
Soroush Mehraban, Vida Adeli, Babak Taati

TL;DR
MotionAGFormer introduces a hybrid transformer-GCNFormer network that captures both global and local joint relationships, achieving state-of-the-art 3D human pose estimation with high efficiency.
Contribution
The paper proposes a novel AGFormer block combining transformer and GCNFormer streams, and a MotionAGFormer architecture with variants for speed-accuracy trade-offs, setting new benchmarks.
Findings
State-of-the-art P1 errors on Human3.6M and MPI-INF-3DHP datasets.
Uses a quarter of the parameters of previous models.
Threefold increase in computational efficiency.
Abstract
Recent transformer-based approaches have demonstrated excellent performance in 3D human pose estimation. However, they have a holistic view and by encoding global relationships between all the joints, they do not capture the local dependencies precisely. In this paper, we present a novel Attention-GCNFormer (AGFormer) block that divides the number of channels by using two parallel transformer and GCNFormer streams. Our proposed GCNFormer module exploits the local relationship between adjacent joints, outputting a new representation that is complementary to the transformer output. By fusing these two representation in an adaptive way, AGFormer exhibits the ability to better learn the underlying 3D structure. By stacking multiple AGFormer blocks, we propose MotionAGFormer in four different variants, which can be chosen based on the speed-accuracy trade-off. We evaluate our model on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
MotionAGFormer: Enhancing 3D Human Pose Estimation With a Transformer-GCNFormer Network· youtube
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Gait Recognition and Analysis
