Lifting Weak Supervision To Structured Prediction
Harit Vishwakarma, Nicholas Roberts, Frederic Sala

TL;DR
This paper extends the theoretical guarantees of weak supervision from binary classification to structured prediction tasks involving complex output spaces like graphs and manifolds, introducing new techniques for noise estimation.
Contribution
It introduces novel methods based on pseudo-Euclidean embeddings and tensor decompositions for noise rate estimation in structured labels, achieving near-consistent results and generalization guarantees.
Findings
Methods achieve nearly-consistent noise estimation in structured label spaces.
Theoretical guarantees extend to complex structured outputs.
Empirical results validate the effectiveness of the proposed techniques.
Abstract
Weak supervision (WS) is a rich set of techniques that produce pseudolabels by aggregating easily obtained but potentially noisy label estimates from a variety of sources. WS is theoretically well understood for binary classification, where simple approaches enable consistent estimation of pseudolabel noise rates. Using this result, it has been shown that downstream models trained on the pseudolabels have generalization guarantees nearly identical to those trained on clean labels. While this is exciting, users often wish to use WS for structured prediction, where the output space consists of more than a binary or multi-class label set: e.g. rankings, graphs, manifolds, and more. Do the favorable theoretical properties of WS for binary classification lift to this setting? We answer this question in the affirmative for a wide range of scenarios. For labels taking values in a finite metric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
