Local-Global Associative Frame Assemble in Video Re-ID
Qilei Li, Jiabo Huang, Shaogang Gong

TL;DR
This paper introduces a novel approach for video re-identification that jointly leverages local alignments and global appearance correlations to improve discriminative feature learning, effectively handling noisy frames.
Contribution
It proposes a joint optimization framework combining local aligned quality and global correlated quality modules for better frame selection in video Re-ID.
Findings
Outperforms state-of-the-art on five Re-ID benchmarks.
Effectively handles noisy and unrepresentative frames.
Demonstrates significant accuracy improvements over existing methods.
Abstract
Noisy and unrepresentative frames in automatically generated object bounding boxes from video sequences cause significant challenges in learning discriminative representations in video re-identification (Re-ID). Most existing methods tackle this problem by assessing the importance of video frames according to either their local part alignments or global appearance correlations separately. However, given the diverse and unknown sources of noise which usually co-exist in captured video data, existing methods have not been effective satisfactorily. In this work, we explore jointly both local alignments and global correlations with further consideration of their mutual promotion/reinforcement so to better assemble complementary discriminative Re-ID information within all the relevant frames in video tracklets. Specifically, we concurrently optimise a local aligned quality (LAQ) module that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques · Human Pose and Action Recognition
