Unsupervised Spatial-Temporal Feature Enrichment and Fidelity Preservation Network for Skeleton based Action Recognition
Chuankun Li, Shuai Li, Yanbo Gao, Ping Chen, Jian Li, Wanqing Li

TL;DR
This paper introduces U-FEFP, an unsupervised framework that enriches spatial-temporal features and preserves fidelity in skeleton-based action recognition, effectively addressing overfitting and improving discriminative capabilities.
Contribution
The paper proposes a novel unsupervised learning framework combining feature enrichment and fidelity preservation using graph neural networks and bootstrap learning.
Findings
Achieves state-of-the-art results on NTU-RGB+D-60, NTU-RGB+D-120, and PKU-MMD datasets.
Demonstrates more discriminative feature learning through t-SNE visualization.
Effectively mitigates overfitting in unsupervised skeleton-based action recognition.
Abstract
Unsupervised skeleton based action recognition has achieved remarkable progress recently. Existing unsupervised learning methods suffer from severe overfitting problem, and thus small networks are used, significantly reducing the representation capability. To address this problem, the overfitting mechanism behind the unsupervised learning for skeleton based action recognition is first investigated. It is observed that the skeleton is already a relatively high-level and low-dimension feature, but not in the same manifold as the features for action recognition. Simply applying the existing unsupervised learning method may tend to produce features that discriminate the different samples instead of action classes, resulting in the overfitting problem. To solve this problem, this paper presents an Unsupervised spatial-temporal Feature Enrichment and Fidelity Preservation framework (U-FEFP)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Medical Imaging and Analysis
