AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100   Labels

Nicholas Roberts; Xintong Li; Tzu-Heng Huang; Dyah Adila; Spencer; Schoenberg; Cheng-Yu Liu; Lauren Pick; Haotian Ma; Aws Albarghouthi; Frederic; Sala

arXiv:2208.14362·cs.LG·November 28, 2023·1 cites

AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels

Nicholas Roberts, Xintong Li, Tzu-Heng Huang, Dyah Adila, Spencer, Schoenberg, Cheng-Yu Liu, Lauren Pick, Haotian Ma, Aws Albarghouthi, Frederic, Sala

PDF

Open Access 1 Video

TL;DR

AutoWS-Bench-101 evaluates automated weak supervision techniques across diverse challenging domains, comparing their effectiveness with zero-shot and few-shot methods, and highlights the importance of integrating foundation model signals for improved performance.

Contribution

Introduces a comprehensive benchmark framework for automated weak supervision in complex domains, facilitating comparison with modern zero-shot and few-shot learners.

Findings

01

AutoWS methods often need foundation model signals to outperform simple baselines.

02

Benchmark reveals the importance of integrating foundation models in AutoWS.

03

Thorough ablation studies identify key factors influencing AutoWS performance.

Abstract

Weak supervision (WS) is a powerful method to build labeled datasets for training supervised models in the face of little-to-no labeled data. It replaces hand-labeling data with aggregating multiple noisy-but-cheap label estimates expressed by labeling functions (LFs). While it has been used successfully in many domains, weak supervision's application scope is limited by the difficulty of constructing labeling functions for domains with complex or high-dimensional features. To address this, a handful of methods have proposed automating the LF design process using a small set of ground truth labels. In this work, we introduce AutoWS-Bench-101: a framework for evaluating automated WS (AutoWS) techniques in challenging WS settings -- a set of diverse application domains on which it has been previously difficult or impossible to apply traditional WS techniques. While AutoWS is a promising…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition