Skeleton-Guided Spatial-Temporal Feature Learning for Video-Based Visible-Infrared Person Re-Identification

Wenjia Jiang; Xiaoke Zhu; Jiakang Gao; Di Liao

arXiv:2411.11069·cs.CV·December 12, 2024

TL;DR

This paper introduces STAR, a skeleton-guided method that improves spatial-temporal feature learning for video-based visible-infrared person re-identification (VVI-ReID) by drawing on skeleton information, which stays reliable under poor image quality and occlusions.

Contribution

The paper proposes a skeleton-guided framework that applies frame-level and sequence-level strategies to improve spatial-temporal features in VVI-ReID for both modalities, including infrared videos, which existing methods have largely neglected.

Findings

01. STAR outperforms existing methods on benchmark datasets.

02. Skeleton information improves robustness to occlusions and low-quality videos.

03. The method effectively integrates body part contributions for better feature representation.

Abstract

Video-based visible-infrared person re-identification (VVI-ReID) is challenging due to significant modality feature discrepancies. Spatial-temporal information in videos is crucial, but the accuracy of spatial-temporal information is often influenced by issues like low quality and occlusions in videos. Existing methods mainly focus on reducing modality differences, but pay limited attention to improving spatial-temporal features, particularly for infrared videos. To address this, we propose a novel Skeleton-guided spatial-Temporal feAture leaRning (STAR) method for VVI-ReID. By using skeleton information, which is robust to issues such as poor image quality and occlusions, STAR improves the accuracy of spatial-temporal features in videos of both modalities. Specifically, STAR employs two levels of skeleton-guided strategies: frame level and sequence level. At the frame level, the robust…

Taxonomy

Topics: Video Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition

Methods: Softmax · Attention Is All You Need · Focus