TL;DR
This paper introduces a comprehensive, efficient data pruning method for object re-identification that accurately identifies important, mislabeled, and outlier samples, reducing training costs with minimal accuracy loss.
Contribution
It is the first to adapt and extend data pruning techniques specifically for object re-identification tasks, leveraging logit history for improved sample importance estimation.
Findings
Reduces training data by up to 35% with negligible accuracy loss
Achieves 10x faster importance score estimation
Applicable across multiple ReID datasets
Abstract
Previous studies have demonstrated that not each sample in a dataset is of equal importance during training. Data pruning aims to remove less important or informative samples while still achieving comparable results as training on the original (untruncated) dataset, thereby reducing storage and training costs. However, the majority of data pruning methods are applied to image classification tasks. To our knowledge, this work is the first to explore the feasibility of these pruning methods applied to object re-identification (ReID) tasks, while also presenting a more comprehensive data pruning approach. By fully leveraging the logit history during training, our approach offers a more accurate and comprehensive metric for quantifying sample importance, as well as correcting mislabeled samples and recognizing outliers. Furthermore, our approach is highly efficient, reducing the cost of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
