Autoregressive Adaptive Hypergraph Transformer for Skeleton-based   Activity Recognition

Abhisek Ray; Ayush Raj; Maheshkumar H. Kolekar

arXiv:2411.05692·cs.CV·March 3, 2025

Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition

Abhisek Ray, Ayush Raj, Maheshkumar H. Kolekar

PDF

Open Access 1 Repo

TL;DR

This paper introduces AutoregAd-HGformer, a novel transformer-based hypergraph model that effectively captures multiscale and long-range dependencies in skeleton sequences for improved activity recognition.

Contribution

It proposes an autoregressive adaptive hypergraph transformer with in-phase and out-phase hypergraph generation, enhancing feature representation for skeleton-based action recognition.

Findings

01

Outperforms state-of-the-art hypergraph models on NTU RGB+D datasets.

02

Demonstrates superior accuracy through extensive experiments and ablation studies.

03

Effectively captures complex spatial, temporal, and channel dependencies.

Abstract

Extracting multiscale contextual information and higher-order correlations among skeleton sequences using Graph Convolutional Networks (GCNs) alone is inadequate for effective action classification. Hypergraph convolution addresses the above issues but cannot harness the long-range dependencies. The transformer proves to be effective in capturing these dependencies and making complex contextual features accessible. We propose an Autoregressive Adaptive HyperGraph Transformer (AutoregAd-HGformer) model for in-phase (autoregressive and discrete) and out-phase (adaptive) hypergraph generation. The vector quantized in-phase hypergraph equipped with powerful autoregressive learned priors produces a more robust and informative representation suitable for hyperedge formation. The out-phase hypergraph generator provides a model-agnostic hyperedge learning technique to align the attributes with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rayabhisek123/autoregad-hgformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis

MethodsLinear Layer · Multi-Head Attention · Residual Connection · Softmax · Byte Pair Encoding · Dropout · Absolute Position Encodings · Attention Is All You Need · Dense Connections · Label Smoothing