Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification

Zi Yang; Haojin Yang; Soumajit Majumder; Jorge Cardoso; Guillermo Gallego

arXiv:2412.10091·cs.CV·December 16, 2024

TL;DR

This paper introduces a comprehensive, efficient data pruning method for object re-identification that accurately identifies important, mislabeled, and outlier samples, reducing training costs with minimal accuracy loss.

Contribution

To the authors' knowledge, this is the first work to adapt and extend data pruning techniques to object re-identification, using the logit history recorded during training to estimate sample importance more accurately.

Findings

01. Reduces training data by up to 35% with negligible accuracy loss

02. Achieves 10x faster importance score estimation

03. Applicable across multiple ReID datasets

Abstract

Previous studies have demonstrated that not every sample in a dataset is equally important during training. Data pruning aims to remove less important or less informative samples while still achieving results comparable to training on the original (untruncated) dataset, thereby reducing storage and training costs. However, the majority of data pruning methods are applied to image classification tasks. To our knowledge, this work is the first to explore the feasibility of applying these pruning methods to object re-identification (ReID) tasks, while also presenting a more comprehensive data pruning approach. By fully leveraging the logit history during training, our approach offers a more accurate and comprehensive metric for quantifying sample importance, and it also corrects mislabeled samples and recognizes outliers. Furthermore, our approach is highly efficient, reducing the cost of…
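The abstract does not define the importance metric, so the following is only a minimal sketch of the general idea of scoring samples from a logit history, not the authors' method. It assumes logits are recorded for every sample at every epoch, and it uses a hypothetical proxy score (one minus the mean softmax probability of the labeled class). The function names `importance_from_logit_history` and `select_kept_indices` are illustrative. This proxy does not separate hard samples from mislabeled ones or outliers, which the abstract names as distinct concerns.

```python
import numpy as np


def softmax(logits, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def importance_from_logit_history(logit_history, labels):
    """Hypothetical proxy score, NOT the paper's metric.

    logit_history: array of shape (epochs, samples, classes)
    labels:        integer array of shape (samples,)
    Returns one score per sample; a higher score means the model was,
    on average, less confident in the labeled class during training.
    """
    probs = softmax(logit_history, axis=-1)
    sample_idx = np.arange(labels.shape[0])
    # Probability assigned to the labeled class, shape (epochs, samples).
    true_prob = probs[:, sample_idx, labels]
    return 1.0 - true_prob.mean(axis=0)


def select_kept_indices(scores, prune_fraction):
    """Drop the lowest-scoring fraction of samples and return kept indices."""
    n_remove = int(round(prune_fraction * len(scores)))
    order = np.argsort(scores, kind="stable")  # least important first
    return np.sort(order[n_remove:])


# Toy usage: 2 epochs, 4 samples, 3 classes (synthetic values).
labels = np.array([0, 1, 2, 0])
one_epoch = np.array([
    [10.0, 0.0, 0.0],  # confidently correct
    [0.0, 0.0, 0.0],   # uncertain
    [10.0, 0.0, 0.0],  # confidently wrong with respect to its label
    [2.0, 0.0, 0.0],   # mildly correct
])
logit_history = np.stack([one_epoch, one_epoch])
scores = importance_from_logit_history(logit_history, labels)
kept = select_kept_indices(scores, prune_fraction=0.5)
```

Averaging over epochs rather than using only the final logits is the design choice that makes this a history-based score: a sample that is learned late still accumulates a higher score than one that is easy from the start.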

Code & Models

Repositories

zi-y/data-pruning-reid (Official)

Taxonomy

Methods: Pruning