Hypergraph Transformer for Skeleton-based Action Recognition
Yuxuan Zhou, Zhi-Qi Cheng, Chao Li, Yanwen Fang, Yifeng Geng, Xuansong, Xie, Margret Keuper

TL;DR
This paper introduces Hyperformer, a hypergraph transformer model that incorporates skeletal structure and higher-order joint relations for improved skeleton-based action recognition.
Contribution
It proposes HyperSA, a novel self-attention mechanism on hypergraphs, to model high-order dependencies, and integrates bone connectivity into a transformer architecture.
Findings
Outperforms state-of-the-art graph models in accuracy.
Achieves higher efficiency on benchmark datasets.
Effectively models high-order joint relations.
Abstract
Skeleton-based action recognition aims to recognize human actions given human joint coordinates with skeletal interconnections. By defining a graph with joints as vertices and their natural connections as edges, previous works successfully adopted Graph Convolutional networks (GCNs) to model joint co-occurrences and achieved superior performance. More recently, a limitation of GCNs is identified, i.e., the topology is fixed after training. To relax such a restriction, Self-Attention (SA) mechanism has been adopted to make the topology of GCNs adaptive to the input, resulting in the state-of-the-art hybrid models. Concurrently, attempts with plain Transformers have also been made, but they still lag behind state-of-the-art GCN-based methods due to the lack of structural prior. Unlike hybrid models, we propose a more elegant solution to incorporate the bone connectivity into Transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Artificial Intelligence in Healthcare
MethodsMulti-Head Attention · Attention Is All You Need · HyperGraph Self-Attention · Residual Connection · Transformer
