Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition
Abhisek Ray, Ayush Raj, Maheshkumar H. Kolekar

TL;DR
This paper introduces AutoregAd-HGformer, a novel transformer-based hypergraph model that effectively captures multiscale and long-range dependencies in skeleton sequences for improved activity recognition.
Contribution
It proposes an autoregressive adaptive hypergraph transformer with in-phase and out-phase hypergraph generation, enhancing feature representation for skeleton-based action recognition.
Findings
Outperforms state-of-the-art hypergraph models on NTU RGB+D datasets.
Demonstrates superior accuracy through extensive experiments and ablation studies.
Effectively captures complex spatial, temporal, and channel dependencies.
Abstract
Extracting multiscale contextual information and higher-order correlations among skeleton sequences using Graph Convolutional Networks (GCNs) alone is inadequate for effective action classification. Hypergraph convolution addresses the above issues but cannot harness the long-range dependencies. The transformer proves to be effective in capturing these dependencies and making complex contextual features accessible. We propose an Autoregressive Adaptive HyperGraph Transformer (AutoregAd-HGformer) model for in-phase (autoregressive and discrete) and out-phase (adaptive) hypergraph generation. The vector quantized in-phase hypergraph equipped with powerful autoregressive learned priors produces a more robust and informative representation suitable for hyperedge formation. The out-phase hypergraph generator provides a model-agnostic hyperedge learning technique to align the attributes with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis
MethodsLinear Layer · Multi-Head Attention · Residual Connection · Softmax · Byte Pair Encoding · Dropout · Absolute Position Encodings · Attention Is All You Need · Dense Connections · Label Smoothing
