Generalized Adversarial Distances to Efficiently Discover Classifier Errors
Walter Bennette, Sally Dufek, Karsten Maurer, Sean Sisti, Bunyod, Tusmatov

TL;DR
This paper introduces a generalized adversarial distance method that efficiently identifies high-confidence errors in black-box classifiers, outperforming existing approaches and aiding in more effective model evaluation.
Contribution
The paper proposes a novel generalization of the Adversarial Distance search that applies to any classifier and data domain, improving high-confidence error detection.
Findings
The method finds errors at higher-than-expected rates based on confidence.
It outperforms competing error discovery methods.
Effective across various classifiers and data domains.
Abstract
Given a black-box classification model and an unlabeled evaluation dataset from some application domain, efficient strategies need to be developed to evaluate the model. Random sampling allows a user to estimate metrics like accuracy, precision, and recall, but may not provide insight to high-confidence errors. High-confidence errors are rare events for which the model is highly confident in its prediction, but is wrong. Such errors can represent costly mistakes and should be explicitly searched for. In this paper we propose a generalization to the Adversarial Distance search that leverages concepts from adversarial machine learning to identify predictions for which a classifier may be overly confident. These predictions are useful instances to sample when looking for high-confidence errors because they are prone to a higher rate of error than expected. Our generalization allows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
