ALT-MAS: A Data-Efficient Framework for Active Testing of Machine Learning Algorithms
Huong Ha, Sunil Gupta, Santu Rana, Svetha Venkatesh

TL;DR
This paper introduces ALT-MAS, a data-efficient framework that uses Bayesian neural networks and an information-based sampling strategy to accurately estimate model performance metrics with minimal labeled data.
Contribution
It presents a novel data augmentation and sampling approach for active testing of machine learning models, improving metric estimation accuracy with limited labeled data.
Findings
Outperforms existing baselines in metric estimation accuracy
Effective with small labeled test datasets
Applicable to various machine learning models and metrics
Abstract
Machine learning models are being used extensively in many important areas, but there is no guarantee a model will always perform well or as its developers intended. Understanding the correctness of a model is crucial to prevent potential failures that may have significant detrimental impact in critical application areas. In this paper, we propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data. The idea is to estimate the metrics of interest for a model-under-test using Bayesian neural network (BNN). We develop a novel data augmentation method helping to train the BNN to achieve high accuracy. We also devise a theoretic information based sampling strategy to sample data points so as to achieve accurate estimations for the metrics of interest. Finally, we conduct an extensive set of experiments to test various machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Software Testing and Debugging Techniques
