Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K., Roy-Chowdhury, Ziyan Wu

TL;DR
This paper introduces Spatio-Temporal Representation Factorization (STRF), a modular unit for 3D CNNs that enhances video-based person re-ID by explicitly modeling spatial and temporal features, leading to state-of-the-art results.
Contribution
The paper proposes a novel STRF module that explicitly factorizes spatial and temporal features, improving re-ID accuracy and compatibility with existing architectures.
Findings
STRF improves baseline model performance across multiple architectures.
Achieves new state-of-the-art results on three standard re-ID benchmarks.
Effectively handles occlusions and spatial misalignments in videos.
Abstract
Despite much recent progress in video-based person re-identification (re-ID), the current state-of-the-art still suffers from common real-world challenges such as appearance similarity among various people, occlusions, and frame misalignment. To alleviate these problems, we propose Spatio-Temporal Representation Factorization (STRF), a flexible new computational unit that can be used in conjunction with most existing 3D convolutional neural network architectures for re-ID. The key innovations of STRF over prior work include explicit pathways for learning discriminative temporal and spatial features, with each component further factorized to capture complementary person-specific appearance and motion information. Specifically, temporal factorization comprises two branches, one each for static features (e.g., the color of clothes) that do not change much over time, and dynamic features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition
