Navigating Open Set Scenarios for Skeleton-based Action Recognition

Kunyu Peng; Cheng Yin; Junwei Zheng; Ruiping Liu; David Schneider,; Jiaming Zhang; Kailun Yang; M. Saquib Sarfraz; Rainer Stiefelhagen; Alina; Roitberg

arXiv:2312.06330·cs.CV·December 12, 2023·2 cites

Navigating Open Set Scenarios for Skeleton-based Action Recognition

Kunyu Peng, Cheng Yin, Junwei Zheng, Ruiping Liu, David Schneider,, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina, Roitberg

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new open-set skeleton-based action recognition benchmark, evaluates existing methods, and proposes CrossMax, a novel cross-modality ensemble approach that significantly improves recognition of known and unknown actions.

Contribution

It formalizes the OS-SAR task, evaluates existing approaches, and proposes CrossMax, a novel cross-modality method with a discrepancy suppression mechanism for improved open-set recognition.

Findings

01

CrossMax outperforms existing methods on all datasets.

02

The benchmark provides a new standard for OS-SAR evaluation.

03

CrossMax achieves state-of-the-art results across various backbones.

Abstract

In real-world scenarios, human actions often fall outside the distribution of training data, making it crucial for models to recognize known actions and reject unknown ones. However, using pure skeleton data in such open-set conditions poses challenges due to the lack of visual background cues and the distinct sparse structure of body pose sequences. In this paper, we tackle the unexplored Open-Set Skeleton-based Action Recognition (OS-SAR) task and formalize the benchmark on three skeleton-based datasets. We assess the performance of seven established open-set approaches on our task and identify their limits and critical generalization issues when dealing with skeleton information. To address these challenges, we propose a distance-based cross-modality ensemble method that leverages the cross-modal alignment of skeleton joints, bones, and velocities to achieve superior open-set…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kpeng9510/os-sar
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications

MethodsALIGN