WeShap: Weak Supervision Source Evaluation with Shapley Values
Naiqing Guan, Nick Koudas

TL;DR
WeShap introduces a Shapley value-based metric for evaluating and improving weak supervision sources, enabling better understanding and refinement of programmatic labeling pipelines to enhance machine learning model accuracy.
Contribution
This paper proposes WeShap, a novel, efficient Shapley value-based evaluation metric for weak supervision sources, with demonstrated versatility and generalizability across different PWS pipelines.
Findings
WeShap effectively identifies beneficial and detrimental labeling functions.
Refining PWS pipelines with WeShap improves downstream model accuracy by an average of 5.0 points.
WeShap generalizes well to various PWS pipeline configurations.
Abstract
Efficient data annotation stands as a significant bottleneck in training contemporary machine learning models. The Programmatic Weak Supervision (PWS) pipeline presents a solution by utilizing multiple weak supervision sources to automatically label data, thereby expediting the annotation process. Given the varied contributions of these weak supervision sources to the accuracy of PWS, it is imperative to employ a robust and efficient metric for their evaluation. This is crucial not only for understanding the behavior and performance of the PWS pipeline but also for facilitating corrective measures. In our study, we introduce WeShap values as an evaluation metric, which quantifies the average contribution of weak supervision sources within a proxy PWS pipeline, leveraging the theoretical underpinnings of Shapley values. We demonstrate efficient computation of WeShap values using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning
