Active Testing: An Efficient and Robust Framework for Estimating Accuracy
Phuc Nguyen, Deva Ramanan, Charless Fowlkes

TL;DR
This paper introduces an active testing framework that efficiently estimates model accuracy on large, noisy datasets by minimizing human annotation effort and improving robustness over traditional evaluation methods.
Contribution
The paper proposes a novel active testing approach for large-scale noisy datasets, reducing annotation effort and enhancing robustness in accuracy estimation.
Findings
Effective estimation of Precision@K and mean Average Precision
Significant reduction in human annotation effort
More robust evaluation compared to existing protocols
Abstract
Much recent work on visual recognition aims to scale up learning to massive, noisily-annotated datasets. We address the problem of scaling- up the evaluation of such models to large-scale datasets with noisy labels. Current protocols for doing so require a human user to either vet (re-annotate) a small fraction of the test set and ignore the rest, or else correct errors in annotation as they are found through manual inspection of results. In this work, we re-formulate the problem as one of active testing, and examine strategies for efficiently querying a user so as to obtain an accu- rate performance estimate with minimal vetting. We demonstrate the effectiveness of our proposed active testing framework on estimating two performance metrics, Precision@K and mean Average Precision, for two popular computer vision tasks, multi-label classification and instance segmentation. We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVLSI and Analog Circuit Testing · Fault Detection and Control Systems · Scientific Measurement and Uncertainty Evaluation
