Understanding Prediction Discrepancies in Machine Learning Classifiers
Xavier Renard, Thibault Laugel, Marcin Detyniecki

TL;DR
This paper investigates why different high-performing classifiers trained on the same data can produce significantly different predictions, and introduces a model-agnostic method to analyze and explain these discrepancies to aid better model selection.
Contribution
It presents DIG, a novel local explanation algorithm for prediction discrepancies, helping practitioners understand differences between models for improved decision-making.
Findings
DIG effectively captures local prediction discrepancies.
Analysis reveals significant differences in learned patterns among top models.
Using DIG can improve model selection and fairness considerations.
Abstract
A multitude of classifiers can be trained on the same data to achieve similar performances during test time, while having learned significantly different classification patterns. This phenomenon, which we call prediction discrepancies, is often associated with the blind selection of one model instead of another with similar performances. When making a choice, the machine learning practitioner has no understanding on the differences between models, their limits, where they agree and where they don't. But his/her choice will result in concrete consequences for instances to be classified in the discrepancy zone, since the final decision will be based on the selected classification pattern. Besides the arbitrary nature of the result, a bad choice could have further negative consequences such as loss of opportunity or lack of fairness. This paper proposes to address this question by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Imbalanced Data Classification Techniques
