FreaAI: Automated extraction of data slices to test machine learning models
Samuel Ackerman, Orna Raz, Marcel Zalmanovici

TL;DR
FreaAI automatically identifies specific data slices where machine learning models underperform, enhancing explainability and helping ensure models meet business requirements.
Contribution
The paper introduces FreaAI, a novel automated technique for extracting explainable data slices with underperformance, bridging the gap between model metrics and business needs.
Findings
Successfully extracted meaningful data slices from seven datasets
Demonstrated statistically significant underperformance in identified slices
Enhanced explainability of model performance issues
Abstract
Machine learning (ML) solutions are prevalent. However, many challenges exist in making these solutions business-grade. One major challenge is to ensure that the ML solution provides its expected business value. In order to do that, one has to bridge the gap between the way ML model performance is measured and the solution requirements. In previous work (Barash et al, "Bridging the gap...") we demonstrated the effectiveness of utilizing feature models in bridging this gap. Whereas ML performance metrics, such as the accuracy or F1-score of a classifier, typically measure the average ML performance, feature models shed light on explainable data slices that are too far from that average, and therefore might indicate unsatisfied requirements. For example, the overall accuracy of a bank text terms classifier may be very high, say , yet it might perform poorly for terms that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
