GaitMixer: Skeleton-based Gait Representation Learning via Wide-spectrum Multi-axial Mixer
Ekkasit Pinyoanuntapong, Ayman Ali, Pu Wang, Minwoo Lee, Chen Chen

TL;DR
GaitMixer introduces a novel skeleton-based gait recognition model that leverages multi-axial mixing and attention mechanisms to significantly improve performance, narrowing the gap with appearance-based methods.
Contribution
The paper proposes GaitMixer, a new network architecture that enhances skeleton-based gait recognition through heterogeneous multi-axial mixing and attention, achieving state-of-the-art results.
Findings
GaitMixer outperforms previous skeleton-based methods on CASIA-B.
GaitMixer achieves competitive results with appearance-based methods.
The model effectively captures multi-frequency gait features.
Abstract
Most existing gait recognition methods are appearance-based, which rely on the silhouettes extracted from the video data of human walking activities. The less-investigated skeleton-based gait recognition methods directly learn the gait dynamics from 2D/3D human skeleton sequences, which are theoretically more robust solutions in the presence of appearance changes caused by clothes, hairstyles, and carrying objects. However, the performance of skeleton-based solutions is still largely behind the appearance-based ones. This paper aims to close such performance gap by proposing a novel network model, GaitMixer, to learn more discriminative gait representation from skeleton sequence data. In particular, GaitMixer follows a heterogeneous multi-axial mixer architecture, which exploits the spatial self-attention mixer followed by the temporal large-kernel convolution mixer to learn rich…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGait Recognition and Analysis · Diabetic Foot Ulcer Assessment and Management · Human Pose and Action Recognition
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Dropout · Byte Pair Encoding · Adam · Dense Connections · Softmax
