Model Rectification via Unknown Unknowns Extraction from Deployment Samples
Bruno Abrahao, Zheng Wang, Haider Ahmed, Yuchen Zhu

TL;DR
This paper introduces RTSCV, a post-training framework that identifies and rectifies unknown unknowns in classifiers during deployment, significantly improving model performance without active learning.
Contribution
The paper proposes RTSCV, a novel algorithmic framework for model rectification that extracts unknown unknowns during deployment, with theoretical guarantees and superior empirical results.
Findings
RTSCV reduces performance gaps by up to 41%.
RTSCV outperforms state-of-the-art methods across datasets.
Theoretical guarantees support RTSCV's effectiveness.
Abstract
Model deficiency that results from incomplete training data is a form of structural blindness that leads to costly errors, oftentimes with high confidence. During the training of classification tasks, underrepresented class-conditional distributions that a given hypothesis space can recognize results in a mismatch between the model and the target space. To mitigate the consequences of this discrepancy, we propose Random Test Sampling and Cross-Validation (RTSCV) as a general algorithmic framework that aims to perform a post-training model rectification at deployment time in a supervised way. RTSCV extracts unknown unknowns (u.u.s), i.e., examples from the class-conditional distributions that a classifier is oblivious to, and works in combination with a diverse family of modern prediction models. RTSCV augments the training set with a sample of the test set (or deployment data) and uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
