AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels
Nicholas Roberts, Xintong Li, Tzu-Heng Huang, Dyah Adila, Spencer, Schoenberg, Cheng-Yu Liu, Lauren Pick, Haotian Ma, Aws Albarghouthi, Frederic, Sala

TL;DR
AutoWS-Bench-101 evaluates automated weak supervision techniques across diverse challenging domains, comparing their effectiveness with zero-shot and few-shot methods, and highlights the importance of integrating foundation model signals for improved performance.
Contribution
Introduces a comprehensive benchmark framework for automated weak supervision in complex domains, facilitating comparison with modern zero-shot and few-shot learners.
Findings
AutoWS methods often need foundation model signals to outperform simple baselines.
Benchmark reveals the importance of integrating foundation models in AutoWS.
Thorough ablation studies identify key factors influencing AutoWS performance.
Abstract
Weak supervision (WS) is a powerful method to build labeled datasets for training supervised models in the face of little-to-no labeled data. It replaces hand-labeling data with aggregating multiple noisy-but-cheap label estimates expressed by labeling functions (LFs). While it has been used successfully in many domains, weak supervision's application scope is limited by the difficulty of constructing labeling functions for domains with complex or high-dimensional features. To address this, a handful of methods have proposed automating the LF design process using a small set of ground truth labels. In this work, we introduce AutoWS-Bench-101: a framework for evaluating automated WS (AutoWS) techniques in challenging WS settings -- a set of diverse application domains on which it has been previously difficult or impossible to apply traditional WS techniques. While AutoWS is a promising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
