HDFormer: High-order Directed Transformer for 3D Human Pose Estimation
Hanyuan Chen, Jun-Yan He, Wangmeng Xiang, Zhi-Qi Cheng, Wei Liu,, Hanbing Liu, Bin Luo, Yifeng Geng, Xuansong Xie

TL;DR
HDFormer introduces a high-order attention transformer that models complex joint relationships for more accurate and efficient 3D human pose estimation, especially in occlusion-heavy scenarios, outperforming existing methods.
Contribution
The paper proposes a novel high-order attention mechanism within a transformer architecture for 3D pose estimation, effectively capturing complex joint interactions and reducing computational costs.
Findings
Outperforms state-of-the-art on Human3.6M and MPI-INF-3DHP datasets.
Uses only 1/10 of the parameters of comparable models.
Enables real-time 3D human pose estimation.
Abstract
Human pose estimation is a challenging task due to its structured data sequence nature. Existing methods primarily focus on pair-wise interaction of body joints, which is insufficient for scenarios involving overlapping joints and rapidly changing poses. To overcome these issues, we introduce a novel approach, the High-order Directed Transformer (HDFormer), which leverages high-order bone and joint relationships for improved pose estimation. Specifically, HDFormer incorporates both self-attention and high-order attention to formulate a multi-order attention module. This module facilitates first-order "jointjoint", second-order "bonejoint", and high-order "hyperbonejoint" interactions, effectively addressing issues in complex and occlusion-heavy situations. In addition, modern CNN techniques are integrated into the transformer-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Anomaly Detection Techniques and Applications
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Position-Wise Feed-Forward Layer · Adam
