Extending Dataset Pruning to Object Detection: A Variance-based Approach

Ryota Yagi

arXiv:2505.17245·cs.CV·May 26, 2025

Extending Dataset Pruning to Object Detection: A Variance-based Approach

Ryota Yagi

PDF

TL;DR

This paper extends dataset pruning techniques from image classification to object detection by introducing a variance-based scoring method, VPS, which effectively identifies informative samples and improves detection performance on PASCAL VOC and MS COCO.

Contribution

It presents the first principled approach to dataset pruning for object detection, addressing key challenges and proposing the Variance-based Prediction Score (VPS) for sample selection.

Findings

01

VPS outperforms prior pruning methods in mAP on benchmark datasets.

02

Informative sample selection is more critical than dataset size or class balance.

03

Pruning enhances detection performance while reducing dataset complexity.

Abstract

Dataset pruning -- selecting a small yet informative subset of training data -- has emerged as a promising strategy for efficient machine learning, offering significant reductions in computational cost and storage compared to alternatives like dataset distillation. While pruning methods have shown strong performance in image classification, their extension to more complex computer vision tasks, particularly object detection, remains relatively underexplored. In this paper, we present the first principled extension of classification pruning techniques to the object detection domain, to the best of our knowledge. We identify and address three key challenges that hinder this transition: the Object-Level Attribution Problem, the Scoring Strategy Problem, and the Image-Level Aggregation Problem. To overcome these, we propose tailored solutions, including a novel scoring method called…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.