FreaAI: Automated extraction of data slices to test machine learning   models

Samuel Ackerman; Orna Raz; Marcel Zalmanovici

arXiv:2108.05620·cs.LG·August 13, 2021

FreaAI: Automated extraction of data slices to test machine learning models

Samuel Ackerman, Orna Raz, Marcel Zalmanovici

PDF

TL;DR

FreaAI automatically identifies specific data slices where machine learning models underperform, enhancing explainability and helping ensure models meet business requirements.

Contribution

The paper introduces FreaAI, a novel automated technique for extracting explainable data slices with underperformance, bridging the gap between model metrics and business needs.

Findings

01

Successfully extracted meaningful data slices from seven datasets

02

Demonstrated statistically significant underperformance in identified slices

03

Enhanced explainability of model performance issues

Abstract

Machine learning (ML) solutions are prevalent. However, many challenges exist in making these solutions business-grade. One major challenge is to ensure that the ML solution provides its expected business value. In order to do that, one has to bridge the gap between the way ML model performance is measured and the solution requirements. In previous work (Barash et al, "Bridging the gap...") we demonstrated the effectiveness of utilizing feature models in bridging this gap. Whereas ML performance metrics, such as the accuracy or F1-score of a classifier, typically measure the average ML performance, feature models shed light on explainable data slices that are too far from that average, and therefore might indicate unsatisfied requirements. For example, the overall accuracy of a bank text terms classifier may be very high, say $98% \pm 2%$ , yet it might perform poorly for terms that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.