Exploring Self-supervised Skeleton-based Action Recognition in Occluded Environments
Yifei Chen, Kunyu Peng, Alina Roitberg, David Schneider, Jiaming, Zhang, Junwei Zheng, Yufan Chen, Ruiping Liu, Kailun Yang, Rainer, Stiefelhagen

TL;DR
This paper introduces IosPSTL, a self-supervised learning framework that effectively handles occlusions in skeleton-based action recognition by combining a KNN imputer and an occluded partial spatio-temporal learning strategy, achieving state-of-the-art results.
Contribution
The work presents a novel self-supervised framework with a KNN imputer and OPSTL strategy specifically designed for occluded skeleton sequences in action recognition.
Findings
Achieves state-of-the-art performance on occluded NTU datasets.
Effectively imputes missing skeleton data using KNN in latent space.
Enhances learning with adaptive spatial masking during training.
Abstract
To integrate action recognition into autonomous robotic systems, it is essential to address challenges such as person occlusions-a common yet often overlooked scenario in existing self-supervised skeleton-based action recognition methods. In this work, we propose IosPSTL, a simple and effective self-supervised learning framework designed to handle occlusions. IosPSTL combines a cluster-agnostic KNN imputer with an Occluded Partial Spatio-Temporal Learning (OPSTL) strategy. First, we pre-train the model on occluded skeleton sequences. Then, we introduce a cluster-agnostic KNN imputer that performs semantic grouping using k-means clustering on sequence embeddings. It imputes missing skeleton data by applying K-Nearest Neighbors in the latent space, leveraging nearby sample representations to restore occluded joints. This imputation generates more complete skeleton sequences, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Gait Recognition and Analysis
Methodsk-Means Clustering
