SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition
Shaojie Zhang, Jianqin Yin, Yonghao Dang, Jiajun Fu

TL;DR
This paper introduces SiT-MLP, an MLP-based model that effectively captures spatial dependencies in skeleton data for action recognition without relying on complex priors, achieving competitive results with fewer parameters.
Contribution
The paper proposes the first MLP-based model, SiT-MLP, utilizing a novel Spatial Topology Gating Unit for point-wise topology feature learning in skeleton-based action recognition.
Findings
Achieves competitive performance on large-scale datasets.
Reduces model parameters significantly.
Introduces a novel gate-based feature interaction mechanism.
Abstract
Graph convolution networks (GCNs) have achieved remarkable performance in skeleton-based action recognition. However, previous GCN-based methods rely on elaborate human priors excessively and construct complex feature aggregation mechanisms, which limits the generalizability and effectiveness of networks. To solve these problems, we propose a novel Spatial Topology Gating Unit (STGU), an MLP-based variant without extra priors, to capture the co-occurrence topology features that encode the spatial dependency across all joints. In STGU, to learn the point-wise topology features, a new gate-based feature interaction mechanism is introduced to activate the features point-to-point by the attention map generated from the input sample. Based on the STGU, we propose the first MLP-based model, SiT-MLP, for skeleton-based action recognition in this work. Compared with previous methods on three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Gait Recognition and Analysis
MethodsConvolution
