Slice-based Learning: A Programming Model for Residual Learning in   Critical Data Slices

Vincent S. Chen; Sen Wu; Zhenzhen Weng; Alexander Ratner and; Christopher R\'e

arXiv:1909.06349·cs.LG·March 3, 2020·23 cites

Slice-based Learning: A Programming Model for Residual Learning in Critical Data Slices

Vincent S. Chen, Sen Wu, Zhenzhen Weng, Alexander Ratner and, Christopher R\'e

PDF

Open Access 2 Repos

TL;DR

Slice-based Learning introduces a programming model that enhances model performance on critical data subsets by learning slice-specific representations, improving slice and overall accuracy across diverse datasets.

Contribution

The paper proposes Slice-based Learning, a novel programming model that enables models to focus on critical data slices using slice-specific representations and attention mechanisms.

Findings

01

Up to 19.0 F1 improvement on slices

02

Up to 4.6 F1 overall improvement

03

Effective across language, vision, and industrial datasets

Abstract

In real-world machine learning applications, data subsets correspond to especially critical outcomes: vulnerable cyclist detections are safety-critical in an autonomous driving task, and "question" sentences might be important to a dialogue agent's language understanding for product purposes. While machine learning models can achieve high quality performance on coarse-grained metrics like F1-score and overall accuracy, they may underperform on critical subsets---we define these as slices, the key abstraction in our approach. To address slice-level performance, practitioners often train separate "expert" models on slice subsets or use multi-task hard parameter sharing. We propose Slice-based Learning, a new programming model in which the slicing function (SF), a programming interface, specifies critical data subsets for which the model should commit additional capacity. Any model can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning