Large-Scale Pre-training for Person Re-identification with Noisy Labels
Dengpan Fu, Dongdong Chen, Hao Yang, Jianmin Bao, Lu Yuan, and Lei Zhang, Houqiang Li, Fang Wen, Dong Chen

TL;DR
This paper introduces a scalable pre-training framework for person re-identification using noisy labels derived from raw videos, improving state-of-the-art performance and transferability.
Contribution
It proposes a novel large-scale pre-training method utilizing noisy labels with three integrated learning modules, enhancing Re-ID representations from scratch.
Findings
Pre-trained models outperform unsupervised counterparts in multiple datasets.
Significant improvements in small-scale and few-shot scenarios.
Effective noise rectification through joint learning modules.
Abstract
This paper aims to address the problem of pre-training for person re-identification (Re-ID) with noisy labels. To setup the pre-training task, we apply a simple online multi-object tracking system on raw videos of an existing unlabeled Re-ID dataset "LUPerson" nd build the Noisy Labeled variant called "LUPerson-NL". Since theses ID labels automatically derived from tracklets inevitably contain noises, we develop a large-scale Pre-training framework utilizing Noisy Labels (PNL), which consists of three learning modules: supervised Re-ID learning, prototype-based contrastive learning, and label-guided contrastive learning. In principle, joint learning of these three modules not only clusters similar examples to one prototype, but also rectifies noisy labels based on the prototype assignment. We demonstrate that learning directly from raw videos is a promising alternative for pre-training,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Automated Road and Building Extraction
