LiCAF: LiDAR-Camera Asymmetric Fusion for Gait Recognition
Yunze Deng, Haijun Xiong, Bin Feng

TL;DR
This paper introduces LiCAF, a novel LiDAR-camera fusion network for gait recognition that employs asymmetric modeling strategies to improve robustness and accuracy, achieving state-of-the-art results on the SUSTech1K dataset.
Contribution
LiCAF is the first to use asymmetric cross-modal channel attention and interlaced temporal modeling for LiDAR-camera gait recognition.
Findings
Achieves 93.9% Rank-1 accuracy on SUSTech1K dataset.
Outperforms existing methods in gait recognition accuracy.
Demonstrates effective cross-modal feature fusion and temporal modeling.
Abstract
Gait recognition is a biometric technology that identifies individuals by using walking patterns. Due to the significant achievements of multimodal fusion in gait recognition, we consider employing LiDAR-camera fusion to obtain robust gait representations. However, existing methods often overlook intrinsic characteristics of modalities, and lack fine-grained fusion and temporal modeling. In this paper, we introduce a novel modality-sensitive network LiCAF for LiDAR-camera fusion, which employs an asymmetric modeling strategy. Specifically, we propose Asymmetric Cross-modal Channel Attention (ACCA) and Interlaced Cross-modal Temporal Modeling (ICTM) for cross-modal valuable channel information selection and powerful temporal modeling. Our method achieves state-of-the-art performance (93.9% in Rank-1 and 98.8% in Rank-5) on the SUSTech1K dataset, demonstrating its effectiveness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGait Recognition and Analysis · Video Surveillance and Tracking Methods · Diabetic Foot Ulcer Assessment and Management
MethodsSoftmax · Attention Is All You Need
