XPASC: Measuring Generalization in Weak Supervision by Explainability   and Association

Luisa M\"arz; Ehsaneddin Asgari; Fabienne Braune; Franziska; Zimmermann; Benjamin Roth

arXiv:2206.01444·cs.LG·November 23, 2022

XPASC: Measuring Generalization in Weak Supervision by Explainability and Association

Luisa M\"arz, Ehsaneddin Asgari, Fabienne Braune, Franziska, Zimmermann, Benjamin Roth

PDF

Open Access 1 Repo

TL;DR

This paper introduces XPASC, a novel metric to measure how well weakly supervised models generalize, revealing that more generalization does not always lead to better performance, and provides tools to control overfitting.

Contribution

The paper presents XPASC, a new explainability-based score for assessing generalization in weak supervision, along with its evaluation and implementation release.

Findings

01

XPASC effectively measures model generalization in weak supervision.

02

KnowMAN can control the degree of generalization via a hyperparameter.

03

Higher generalization does not necessarily improve model performance.

Abstract

Weak supervision is leveraged in a wide range of domains and tasks due to its ability to create massive amounts of labeled data, requiring only little manual effort. Standard approaches use labeling functions to specify signals that are relevant for the labeling. It has been conjectured that weakly supervised models over-rely on those signals and as a result suffer from overfitting. To verify this assumption, we introduce a novel method, XPASC (eXPlainability-Association SCore), for measuring the generalization of a model trained with a weakly supervised dataset. Considering the occurrences of features, classes and labeling functions in a dataset, XPASC takes into account the relevance of each feature for the predictions of the model as well as the associations of the feature with the class and the labeling function, respectively. The association in XPASC can be measured in two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luisamaerz/xpasc
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning