Multi-view Gait Recognition based on Siamese Vision Transformer

Yanchen Yang; Lijun Yun; Ruoyu Li; Feiyan Cheng

arXiv:2210.10421·cs.CV·October 20, 2022

Multi-view Gait Recognition based on Siamese Vision Transformer

Yanchen Yang, Lijun Yun, Ruoyu Li, Feiyan Cheng

PDF

Open Access

TL;DR

This paper introduces SMViT, a Siamese Vision Transformer model that effectively captures multi-view gait features, achieving state-of-the-art recognition accuracy on CASIA B dataset by considering local and long-distance gait characteristics.

Contribution

The paper presents a novel Siamese Mobile Vision Transformer for multi-view gait recognition, emphasizing local features and long-distance attention, outperforming existing models.

Findings

01

Achieved 96.4% recognition accuracy on CASIA B dataset.

02

Outperformed models like GaitGAN, Multi_view GAN, Posegait.

03

Effectively models multi-view gait characteristics.

Abstract

While the Vision Transformer has been used in gait recognition, its application in multi-view gait recognition is still limited. Different views significantly affect the extraction and identification accuracy of the characteristics of gait contour. To address this, this paper proposes a Siamese Mobile Vision Transformer (SMViT). This model not only focuses on the local characteristics of the human gait space but also considers the characteristics of long-distance attention associations, which can extract multi-dimensional step status characteristics. In addition, it describes how different perspectives affect gait characteristics and generate reliable perspective feature relationship factors. The average recognition rate of SMViT on the CASIA B data set reached 96.4%. The experimental results show that SMViT can attain state-of-the-art performance compared to advanced step recognition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGait Recognition and Analysis · Diabetic Foot Ulcer Assessment and Management · Hand Gesture Recognition Systems

MethodsAttention Is All You Need · Softmax · Adam · Label Smoothing · Linear Layer · Absolute Position Encodings · Layer Normalization · Byte Pair Encoding · Residual Connection · Dropout