Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification
Dripta S. Raychaudhuri, Amit K. Roy-Chowdhury

TL;DR
This paper introduces a self-supervised learning framework leveraging temporal coherence to improve one-shot video re-identification, significantly enhancing label estimation and re-identification accuracy on challenging datasets.
Contribution
It proposes a novel Temporal Consistency Progressive Learning framework that exploits intra-unlabeled data relationships using local and global consistency losses.
Findings
Achieves up to 8% better label estimation accuracy.
Outperforms state-of-the-art methods on MARS and DukeMTMC-VideoReID datasets.
Demonstrates improved re-identification performance with richer representations.
Abstract
While supervised techniques in re-identification are extremely effective, the need for large amounts of annotations makes them impractical for large camera networks. One-shot re-identification, which uses a singular labeled tracklet for each identity along with a pool of unlabeled tracklets, is a potential candidate towards reducing this labeling effort. Current one-shot re-identification methods function by modeling the inter-relationships amongst the labeled and the unlabeled data, but fail to fully exploit such relationships that exist within the pool of unlabeled data itself. In this paper, we propose a new framework named Temporal Consistency Progressive Learning, which uses temporal coherence as a novel self-supervised auxiliary task in the one-shot learning paradigm to capture such relationships amongst the unlabeled tracklets. Optimizing two new losses, which enforce consistency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
