Self-Training with Weak Supervision

Giannis Karamanolakis; Subhabrata Mukherjee; Guoqing Zheng; Ahmed Hassan Awadallah

arXiv:2104.05514·cs.CL·April 13, 2021

TL;DR

This paper introduces ASTRA, a semi-supervised framework that combines weak supervision, self-training, and rule attention to improve text classification by leveraging all available data, including unlabeled and weakly labeled instances.

Contribution

ASTRA is a novel weak supervision approach that utilizes self-training and rule attention to effectively incorporate unlabeled data and weak rules for improved learning.

Findings

01. Significant performance improvements over state-of-the-art baselines.
02. Effective use of unlabeled data through self-training.
03. Robust rule aggregation via rule attention network.

Abstract

State-of-the-art deep neural networks require large-scale labeled training data that is often expensive to obtain or not available for many tasks. Weak supervision in the form of domain-specific rules has been shown to be useful in such settings to automatically generate weakly labeled training data. However, learning with weak rules is challenging due to their inherent heuristic and noisy nature. An additional challenge is rule coverage and overlap, where prior work on weak supervision only considers instances that are covered by weak rules, thus leaving valuable unlabeled data behind. In this work, we develop a weak supervision framework (ASTRA) that leverages all the available data for a given task. To this end, we leverage task-specific unlabeled data through self-training with a model (student) that considers contextualized representations and predicts pseudo-labels for instances…
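The loop the abstract describes (weak rules label the instances they cover, then a student predicts pseudo-labels for the rest) can be illustrated with a toy sketch. This is not the ASTRA implementation: the keyword rules, the `weak_label`, `train_student`, `predict`, and `self_train` helpers, and the example texts are all hypothetical; a plain majority vote stands in for the rule attention network, and a word-count scorer stands in for the neural student with contextualized representations.

```python
from collections import Counter

# Hypothetical keyword rules: each returns a label, or None to abstain.
RULES = [
    lambda text: "spam" if "free" in text else None,
    lambda text: "spam" if "winner" in text else None,
    lambda text: "ham" if "meeting" in text else None,
]

# Hypothetical task data; the last two texts are covered by no rule.
TEXTS = [
    "free prize winner",
    "claim your free prize",
    "project meeting at noon",
    "meeting agenda for the project",
    "prize inside claim now",
    "agenda for the project review",
]


def weak_label(text):
    """Majority vote over the rules that fire (assumed stand-in for rule attention)."""
    votes = [rule(text) for rule in RULES]
    votes = [v for v in votes if v is not None]
    if not votes:
        return None  # instance is not covered by any rule
    return Counter(votes).most_common(1)[0][0]


def train_student(labeled):
    """Count words per label (toy stand-in for training a neural student)."""
    counts = {}
    for text, label in labeled:
        counts.setdefault(label, Counter()).update(text.split())
    return counts


def predict(counts, text):
    """Pick the label whose training words overlap most with the text."""
    def score(label):
        total = sum(counts[label].values()) or 1
        return sum(counts[label][w] for w in text.split()) / total
    return max(sorted(counts), key=score)


def self_train(texts, rounds=2):
    """Train on weakly labeled data, then iterate with pseudo-labels on the rest."""
    labeled = [(t, weak_label(t)) for t in texts if weak_label(t) is not None]
    uncovered = [t for t in texts if weak_label(t) is None]
    student = train_student(labeled)
    for _ in range(rounds):
        # The student pseudo-labels instances that no rule covers.
        pseudo = [(t, predict(student, t)) for t in uncovered]
        student = train_student(labeled + pseudo)
    return student


student = self_train(TEXTS)
print(predict(student, "claim prize now"))
```

The point of the sketch is only the data flow: instances without rule coverage are not discarded but enter training through the student's pseudo-labels.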

Code & Models

Repositories

microsoft/ASTRA · tf · Official

Taxonomy

Topics: Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Topic Modeling