GRASP: A Goodness-of-Fit Test for Classification Learning
Adel Javanmard, Mohammad Mehrabi

TL;DR
This paper introduces GRASP, a nonparametric goodness-of-fit test for classifiers that assesses how well a model fits the true label distribution without relying on parametric assumptions, applicable in finite samples.
Contribution
The paper proposes a novel, distribution-free goodness-of-fit test called GRASP for binary classifiers, including a model-X version that leverages known feature distributions for improved power.
Findings
GRASP effectively tests classifier fit in finite samples.
Model-X GRASP outperforms in settings with known feature distributions.
Extensive experiments validate the test's robustness and accuracy.
Abstract
Performance of classifiers is often measured in terms of average accuracy on test data. Despite being a standard measure, average accuracy fails in characterizing the fit of the model to the underlying conditional law of labels given the features vector (), e.g. due to model misspecification, over fitting, and high-dimensionality. In this paper, we consider the fundamental problem of assessing the goodness-of-fit for a general binary classifier. Our framework does not make any parametric assumption on the conditional law , and treats that as a black box oracle model which can be accessed only through queries. We formulate the goodness-of-fit assessment problem as a tolerance hypothesis testing of the form \[ H_0: \mathbb{E}\Big[D_f\Big({\sf Bern}(\eta(X))\|{\sf Bern}(\hat{\eta}(X))\Big)\Big]\leq \tau\,, \] where represents an -divergence function, and ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Machine Learning and Algorithms
MethodsTest
