Multi-view Gait Recognition based on Siamese Vision Transformer
Yanchen Yang, Lijun Yun, Ruoyu Li, Feiyan Cheng

TL;DR
This paper introduces SMViT, a Siamese Vision Transformer model that effectively captures multi-view gait features, achieving state-of-the-art recognition accuracy on CASIA B dataset by considering local and long-distance gait characteristics.
Contribution
The paper presents a novel Siamese Mobile Vision Transformer for multi-view gait recognition, emphasizing local features and long-distance attention, outperforming existing models.
Findings
Achieved 96.4% recognition accuracy on CASIA B dataset.
Outperformed models like GaitGAN, Multi_view GAN, Posegait.
Effectively models multi-view gait characteristics.
Abstract
While the Vision Transformer has been used in gait recognition, its application in multi-view gait recognition is still limited. Different views significantly affect the extraction and identification accuracy of the characteristics of gait contour. To address this, this paper proposes a Siamese Mobile Vision Transformer (SMViT). This model not only focuses on the local characteristics of the human gait space but also considers the characteristics of long-distance attention associations, which can extract multi-dimensional step status characteristics. In addition, it describes how different perspectives affect gait characteristics and generate reliable perspective feature relationship factors. The average recognition rate of SMViT on the CASIA B data set reached 96.4%. The experimental results show that SMViT can attain state-of-the-art performance compared to advanced step recognition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGait Recognition and Analysis · Diabetic Foot Ulcer Assessment and Management · Hand Gesture Recognition Systems
MethodsAttention Is All You Need · Softmax · Adam · Label Smoothing · Linear Layer · Absolute Position Encodings · Layer Normalization · Byte Pair Encoding · Residual Connection · Dropout
