Model Rectification via Unknown Unknowns Extraction from Deployment   Samples

Bruno Abrahao; Zheng Wang; Haider Ahmed; Yuchen Zhu

arXiv:2102.04145·cs.LG·February 9, 2021

Model Rectification via Unknown Unknowns Extraction from Deployment Samples

Bruno Abrahao, Zheng Wang, Haider Ahmed, Yuchen Zhu

PDF

Open Access

TL;DR

This paper introduces RTSCV, a post-training framework that identifies and rectifies unknown unknowns in classifiers during deployment, significantly improving model performance without active learning.

Contribution

The paper proposes RTSCV, a novel algorithmic framework for model rectification that extracts unknown unknowns during deployment, with theoretical guarantees and superior empirical results.

Findings

01

RTSCV reduces performance gaps by up to 41%.

02

RTSCV outperforms state-of-the-art methods across datasets.

03

Theoretical guarantees support RTSCV's effectiveness.

Abstract

Model deficiency that results from incomplete training data is a form of structural blindness that leads to costly errors, oftentimes with high confidence. During the training of classification tasks, underrepresented class-conditional distributions that a given hypothesis space can recognize results in a mismatch between the model and the target space. To mitigate the consequences of this discrepancy, we propose Random Test Sampling and Cross-Validation (RTSCV) as a general algorithmic framework that aims to perform a post-training model rectification at deployment time in a supervised way. RTSCV extracts unknown unknowns (u.u.s), i.e., examples from the class-conditional distributions that a classifier is oblivious to, and works in combination with a diverse family of modern prediction models. RTSCV augments the training set with a sample of the test set (or deployment data) and uses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications