HopFIR: Hop-wise GraphFormer with Intragroup Joint Refinement for 3D Human Pose Estimation
Kai Zhai, Qiang Nie, Bo Ouyang, Xiang Li, Shanlin Yang

TL;DR
HopFIR introduces a novel hop-wise transformer attention mechanism and joint refinement modules to better model joint interactions and improve 3D human pose estimation accuracy.
Contribution
The paper proposes HopFIR, a new architecture with hop-wise GraphFormer and intragroup joint refinement for enhanced 3D human pose estimation.
Findings
Outperforms state-of-the-art methods with MPJPE of 32.67 mm on Human3.6M.
HopFIR improves GCN-based methods significantly, e.g., SemGCN by 8.9%.
HopFIR effectively models latent joint synergies and limb information.
Abstract
2D-to-3D human pose lifting is fundamental for 3D human pose estimation (HPE), for which graph convolutional networks (GCNs) have proven inherently suitable for modeling the human skeletal topology. However, the current GCN-based 3D HPE methods update the node features by aggregating their neighbors' information without considering the interaction of joints in different joint synergies. Although some studies have proposed importing limb information to learn the movement patterns, the latent synergies among joints, such as maintaining balance are seldom investigated. We propose the Hop-wise GraphFormer with Intragroup Joint Refinement (HopFIR) architecture to tackle the 3D HPE problem. HopFIR mainly consists of a novel hop-wise GraphFormer (HGF) module and an intragroup joint refinement (IJR) module. The HGF module groups the joints by k-hop neighbors and applies a hopwise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Stroke Rehabilitation and Recovery
