Generative Data Augmentation for Skeleton Action Recognition

Xu Dong; Wanqing Li; Anthony Adeyemi-Ejeye; Andrew Gilbert

arXiv:2604.14933·cs.CV·April 17, 2026

Generative Data Augmentation for Skeleton Action Recognition

Xu Dong, Wanqing Li, Anthony Adeyemi-Ejeye, Andrew Gilbert

PDF

TL;DR

This paper introduces a Transformer-based generative pipeline for augmenting skeleton action recognition data, improving model performance especially in low-data scenarios.

Contribution

It presents a novel conditional generative method with a Transformer architecture to synthesize diverse, high-fidelity skeleton sequences for better action recognition.

Findings

01

Improves recognition accuracy in low-data settings.

02

Enhances model generalization with synthetic skeleton data.

03

Validates effectiveness on multiple datasets.

Abstract

Skeleton-based human action recognition is a powerful approach for understanding human behaviour from pose data, but collecting large-scale, diverse, and well-annotated 3D skeleton datasets is both expensive and labor-intensive. To address this challenge, we propose a conditional generative pipeline for data augmentation in skeleton action recognition. Our method learns the distribution of real skeleton sequences under the constraint of action labels, enabling the synthesis of diverse and high-fidelity data. Even with limited training samples, it can effectively generate skeleton sequences and achieve competitive recognition performance in low-data scenarios, demonstrating strong generalisation in downstream tasks. Specifically, we introduce a Transformer-based encoder-decoder architecture, combined with a generative refinement module and a dropout mechanism, to balance fidelity and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.