FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition
Wenhan Wu, Pengfei Wang, Chen Chen, Aidong Lu

TL;DR
FreqMixFormerV2 introduces a lightweight, frequency-aware transformer architecture for human skeleton action recognition, achieving high accuracy with significantly fewer parameters suitable for resource-limited environments.
Contribution
It presents a novel frequency-aware design and simplified attention mechanism that reduce model complexity while maintaining or improving recognition performance.
Findings
Outperforms state-of-the-art methods on standard datasets.
Uses only 60% of the parameters of comparable models.
Achieves a superior efficiency-accuracy trade-off.
Abstract
Transformer-based human skeleton action recognition has been developed for years. However, the complexity and high parameter count demands of these models hinder their practical applications, especially in resource-constrained environments. In this work, we propose FreqMixForemrV2, which was built upon the Frequency-aware Mixed Transformer (FreqMixFormer) for identifying subtle and discriminative actions with pioneered frequency-domain analysis. We design a lightweight architecture that maintains robust performance while significantly reducing the model complexity. This is achieved through a redesigned frequency operator that optimizes high-frequency and low-frequency parameter adjustments, and a simplified frequency-aware attention module. These improvements result in a substantial reduction in model parameters, enabling efficient deployment with only a minimal sacrifice in accuracy.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Anomaly Detection Techniques and Applications
MethodsByte Pair Encoding · Linear Layer · Absolute Position Encodings · Dropout · Softmax · Attention Is All You Need · Dense Connections · Residual Connection · Multi-Head Attention · Adam
