Delving Deep into One-Shot Skeleton-based Action Recognition with Diverse Occlusions
Kunyu Peng, Alina Roitberg, Kailun Yang, Jiaming Zhang, Rainer, Stiefelhagen

TL;DR
This paper introduces a new benchmark for one-shot skeleton-based action recognition under realistic occlusions, evaluates existing models, and proposes a transformer-based model that improves robustness against occlusions.
Contribution
It formalizes the first benchmark for occluded skeleton-based one-shot action recognition and introduces Trans4SOAR, a transformer model that enhances recognition accuracy under occlusions.
Findings
Trans4SOAR outperforms existing models on all datasets.
Recognition accuracy declines with occlusions, but less so with Trans4SOAR.
State-of-the-art performance achieved on NTU-120 without occlusion.
Abstract
Occlusions are universal disruptions constantly present in the real world. Especially for sparse representations, such as human skeletons, a few occluded points might destroy the geometrical and temporal continuity critically affecting the results. Yet, the research of data-scarce recognition from skeleton sequences, such as one-shot action recognition, does not explicitly consider occlusions despite their everyday pervasiveness. In this work, we explicitly tackle body occlusions for Skeleton-based One-shot Action Recognition (SOAR). We mainly consider two occlusion variants: 1) random occlusions and 2) more realistic occlusions caused by diverse everyday objects, which we generate by projecting the existing IKEA 3D furniture models into the camera coordinate system of the 3D skeletons with different geometric parameters. We leverage the proposed pipeline to blend out portions of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Gait Recognition and Analysis
