Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification
Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen

TL;DR
This paper introduces an uncertainty-aware multi-shot teacher-student framework that leverages multiple images of the same object to improve single-image object re-identification, achieving state-of-the-art results.
Contribution
It is the first to utilize multi-shots in a teacher-student learning setup for enhancing single-image re-id performance.
Findings
S-net outperforms baselines in re-id tasks
Achieves state-of-the-art performance on vehicle and person re-id datasets
Effectively transfers knowledge from multi-shot teacher to single-shot student
Abstract
Object re-identification (re-id) aims to identify a specific object across times or camera views, with the person re-id and vehicle re-id as the most widely studied applications. Re-id is challenging because of the variations in viewpoints, (human) poses, and occlusions. Multi-shots of the same object can cover diverse viewpoints/poses and thus provide more comprehensive information. In this paper, we propose exploiting the multi-shots of the same identity to guide the feature learning of each individual image. Specifically, we design an Uncertainty-aware Multi-shot Teacher-Student (UMTS) Network. It consists of a teacher network (T-net) that learns the comprehensive features from multiple images of the same object, and a student network (S-net) that takes a single image as input. In particular, we take into account the data dependent heteroscedastic uncertainty for effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Visual Attention and Saliency Detection
