FlipReID: Closing the Gap between Training and Inference in Person Re-Identification
Xingyang Ni, Esa Rahtu

TL;DR
FlipReID introduces a training method that aligns training and inference features in person re-identification, leading to improved performance by minimizing the gap caused by test-time augmentation practices.
Contribution
The paper proposes the FlipReID structure with a flipping loss to unify training and inference features, enhancing re-identification accuracy.
Findings
Achieves state-of-the-art results on MSMT17 dataset.
Consistent performance improvements across multiple benchmarks.
Effectively reduces the gap between training and inference features.
Abstract
Since neural networks are data-hungry, incorporating data augmentation in training is a widely adopted technique that enlarges datasets and improves generalization. On the other hand, aggregating predictions of multiple augmented samples (i.e., test-time augmentation) could boost performance even further. In the context of person re-identification models, it is common practice to extract embeddings for both the original images and their horizontally flipped variants. The final representation is the mean of the aforementioned feature vectors. However, such scheme results in a gap between training and inference, i.e., the mean feature vectors calculated in inference are not part of the training pipeline. In this study, we devise the FlipReID structure with the flipping loss to address this issue. More specifically, models using the FlipReID structure are trained on the original images and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Face recognition and analysis
