SAS-VPReID: A Scale-Adaptive Framework with Shape Priors for Video-based Person Re-Identification at Extreme Far Distances

Qiwei Yang; Pingping Zhang; Yuhao Wang; Zijing Gong

arXiv:2601.05535·cs.CV·January 12, 2026

SAS-VPReID: A Scale-Adaptive Framework with Shape Priors for Video-based Person Re-Identification at Extreme Far Distances

Qiwei Yang, Pingping Zhang, Yuhao Wang, Zijing Gong

PDF

Open Access

TL;DR

SAS-VPReID introduces a scale-adaptive framework with shape priors that significantly improves video-based person re-identification at extreme far distances by addressing resolution, viewpoint, and appearance challenges.

Contribution

The paper presents a novel framework combining a memory-enhanced visual backbone, multi-granularity temporal modeling, and shape dynamics regularization for improved VPReID performance.

Findings

01

Achieved top rank on VReID-XFD benchmark.

02

Each module significantly enhances re-identification accuracy.

03

Framework effectively handles extreme far-distance challenges.

Abstract

Video-based Person Re-IDentification (VPReID) aims to retrieve the same person from videos captured by non-overlapping cameras. At extreme far distances, VPReID is highly challenging due to severe resolution degradation, drastic viewpoint variation and inevitable appearance noise. To address these issues, we propose a Scale-Adaptive framework with Shape Priors for VPReID, named SAS-VPReID. The framework is built upon three complementary modules. First, we deploy a Memory-Enhanced Visual Backbone (MEVB) to extract discriminative feature representations, which leverages the CLIP vision encoder and multi-proxy memory. Second, we propose a Multi-Granularity Temporal Modeling (MGTM) to construct sequences at multiple temporal granularities and adaptively emphasize motion cues across scales. Third, we incorporate Prior-Regularized Shape Dynamics (PRSD) to capture body structure dynamics. With…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition