Towards Universal Skeleton-Based Action Recognition

Jidong Kuang; Hongsong Wang; Jie Gui

arXiv:2604.17013 · cs.CV · April 21, 2026

TL;DR

This paper introduces a Transformer-based approach for universal skeleton-based action recognition that handles heterogeneous skeleton data and open vocabularies, supported by a large-scale dataset and multi-level motion-text alignment.

Contribution

The paper presents a model that combines a unified skeleton representation with multi-grained motion-text alignment, targeting two problems in action recognition: heterogeneous skeleton data and open vocabularies.
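To make the open-vocabulary idea concrete, here is a minimal sketch of how a motion embedding can be matched against text embeddings of candidate action labels by cosine similarity. This is a generic illustration and not the authors' model: the function names, the three-dimensional toy embeddings, and the label set are all hypothetical, and in practice both embeddings would come from trained motion and text encoders.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def classify_open_vocab(motion_embedding, label_embeddings):
    """Score a motion embedding against every label embedding.

    label_embeddings maps a label string to its (hypothetical) text embedding.
    Labels can be added or removed at inference time without retraining,
    which is what makes the matching step open-vocabulary.
    """
    scores = {
        label: cosine_similarity(motion_embedding, emb)
        for label, emb in label_embeddings.items()
    }
    best_label = max(scores, key=scores.get)
    return best_label, scores


# Toy embeddings (assumed values, for illustration only).
label_embeddings = {
    "wave": [1.0, 0.0, 0.0],
    "jump": [0.0, 1.0, 0.0],
    "sit down": [0.0, 0.0, 1.0],
}
motion_embedding = [0.9, 0.1, 0.2]

best_label, scores = classify_open_vocab(motion_embedding, label_embeddings)
print(best_label)
```

Because classification reduces to a nearest-label lookup in a shared embedding space, an unseen action can be supported by adding its text embedding to `label_embeddings`; the quality of the result then depends entirely on how well the two encoders are aligned.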

Findings

01 Effective on benchmarks with heterogeneous skeleton data

02 Demonstrates strong generalization ability

03 Code available at https://github.com/jidongkuang/Universal-Skeleton

Abstract

With the development of robotics, skeleton-based action recognition has become increasingly important, as human-robot interaction requires understanding the actions of humans and humanoid robots. Due to different sources of human skeletons and structures of humanoid robots, skeleton data naturally exhibit heterogeneity. However, previous works overlook the data heterogeneity of skeletons and solely construct models using homogeneous skeletons. Moreover, open-vocabulary action recognition is also essential for real-world applications. To this end, this work studies the challenging problem of heterogeneous skeleton-based action recognition with open vocabularies. We construct a large-scale Heterogeneous Open-Vocabulary (HOV) Skeleton dataset by integrating and refining multiple representative large-scale skeleton-based action datasets. To address universal skeleton-based action…

Code & Models

Repositories

jidongkuang/Universal-Skeleton (GitHub)