Fine-tuning Vision Classifiers On A Budget

Sunil Kumar; Ted Sandler; Paulina Varshavskaya

arXiv:2410.00085·cs.LG·October 2, 2024

Fine-tuning Vision Classifiers On A Budget

Sunil Kumar, Ted Sandler, Paulina Varshavskaya

PDF

Open Access

TL;DR

This paper introduces GTX, a method that leverages prior labeler accuracy estimates with a naive-Bayes model to improve label quality and quantity, enabling effective fine-tuning of vision models with fewer labels.

Contribution

The paper presents GTX, a novel approach that uses labeler accuracy estimates to enhance label quality, reducing labeling costs for fine-tuning vision classifiers.

Findings

01

GTX improves label efficiency in fine-tuning.

02

Using prior labeler accuracy estimates maintains model performance.

03

GTX reduces the need for extensive human labeling.

Abstract

Fine-tuning modern computer vision models requires accurately labeled data for which the ground truth may not exist, but a set of multiple labels can be obtained from labelers of variable accuracy. We tie the notion of label quality to confidence in labeler accuracy and show that, when prior estimates of labeler accuracy are available, using a simple naive-Bayes model to estimate the true labels allows us to label more data on a fixed budget without compromising label or fine-tuning quality. We present experiments on a dataset of industrial images that demonstrates that our method, called Ground Truth Extension (GTX), enables fine-tuning ML models using fewer human labels.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques

MethodsSparse Evolutionary Training