Active Testing: An Efficient and Robust Framework for Estimating Accuracy

Phuc Nguyen; Deva Ramanan; Charless Fowlkes

arXiv:1807.00493·cs.CV·July 3, 2018·6 cites

Open Access

TL;DR

This paper introduces an active testing framework that efficiently estimates model accuracy on large, noisy datasets by minimizing human annotation effort and improving robustness over traditional evaluation methods.

Contribution

The paper reformulates the evaluation of models on large, noisily labeled test sets as active testing: it examines strategies for choosing which examples a human should vet, so that an accurate performance estimate is obtained with minimal vetting.

Findings

01 Effective estimation of Precision@K and mean Average Precision

02 Significant reduction in human annotation effort

03 More robust evaluation compared to existing protocols

Abstract

Much recent work on visual recognition aims to scale up learning to massive, noisily-annotated datasets. We address the problem of scaling up the evaluation of such models to large-scale datasets with noisy labels. Current protocols for doing so require a human user to either vet (re-annotate) a small fraction of the test set and ignore the rest, or else correct errors in annotation as they are found through manual inspection of results. In this work, we re-formulate the problem as one of active testing, and examine strategies for efficiently querying a user so as to obtain an accurate performance estimate with minimal vetting. We demonstrate the effectiveness of our proposed active testing framework on estimating two performance metrics, Precision@K and mean Average Precision, for two popular computer vision tasks, multi-label classification and instance segmentation. We further…
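To make the two metrics named in the abstract concrete, here is a minimal Python sketch of Precision@K and Average Precision computed from binary labels listed in ranked order. The function names, the toy labels, and the `merge_vetted` helper are illustrative assumptions. In particular, `merge_vetted` is only a naive baseline that swaps in human-vetted labels where they exist; it is not the estimator proposed in the paper.

```python
from typing import List, Optional, Sequence


def precision_at_k(labels_by_rank: Sequence[int], k: int) -> float:
    """Fraction of the top-k ranked items labeled positive (1).

    Divides by k even if fewer than k items are available.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(labels_by_rank[:k]) / k


def average_precision(labels_by_rank: Sequence[int]) -> float:
    """Mean of Precision@i over the ranks i that hold a positive item."""
    hits = 0
    total = 0.0
    for rank, label in enumerate(labels_by_rank, start=1):
        if label:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def mean_average_precision(per_class_labels: Sequence[Sequence[int]]) -> float:
    """Average of the per-class Average Precision values."""
    if not per_class_labels:
        raise ValueError("need at least one class")
    return sum(average_precision(c) for c in per_class_labels) / len(per_class_labels)


def merge_vetted(noisy: Sequence[int], vetted: Sequence[Optional[int]]) -> List[int]:
    """Hypothetical helper: use the vetted label where present, else the noisy one."""
    return [n if v is None else v for n, v in zip(noisy, vetted)]


# Toy example: five items sorted by model score, with noisy labels.
noisy_labels = [1, 1, 0, 1, 0]
# A human vetted ranks 2 and 5 only; None means "not vetted".
vetted_labels = [None, 0, None, None, 1]

merged_labels = merge_vetted(noisy_labels, vetted_labels)
print(precision_at_k(noisy_labels, 3), precision_at_k(merged_labels, 3))
print(average_precision(noisy_labels), average_precision(merged_labels))
```

In this toy case, vetting two of the five items changes both metrics, which illustrates why the choice of which examples to vet matters when the annotation budget is small.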

Taxonomy

Topics: VLSI and Analog Circuit Testing · Fault Detection and Control Systems · Scientific Measurement and Uncertainty Evaluation