Meta Self-Refinement for Robust Learning with Weak Supervision

Dawei Zhu; Xiaoyu Shen; Michael A. Hedderich; Dietrich Klakow

arXiv:2205.07290·cs.CL·May 2, 2023

Meta Self-Refinement for Robust Learning with Weak Supervision

Dawei Zhu, Xiaoyu Shen, Michael A. Hedderich, Dietrich Klakow

PDF

Open Access 1 Repo

TL;DR

This paper introduces Meta Self-Refinement (MSR), a novel framework that enhances robustness of deep neural networks trained with weak supervision by refining pseudo-labels through meta-learning, leading to improved generalization and noise resistance.

Contribution

MSR is the first method to dynamically refine pseudo-labels via meta-gradient descent, significantly reducing error propagation from noisy weak labels.

Findings

01

MSR outperforms state-of-the-art methods by up to 11.4% in accuracy.

02

MSR demonstrates robustness against label noise across eight NLP benchmarks.

03

MSR effectively refines pseudo-labels to improve model generalization.

Abstract

Training deep neural networks (DNNs) under weak supervision has attracted increasing research attention as it can significantly reduce the annotation cost. However, labels from weak supervision can be noisy, and the high capacity of DNNs enables them to easily overfit the label noise, resulting in poor generalization. Recent methods leverage self-training to build noise-resistant models, in which a teacher trained under weak supervision is used to provide highly confident labels for teaching the students. Nevertheless, the teacher derived from such frameworks may have fitted a substantial amount of noise and therefore produce incorrect pseudo-labels with high confidence, leading to severe error propagation. In this work, we propose Meta Self-Refinement (MSR), a noise-resistant learning framework, to effectively combat label noise from weak supervision. Instead of relying on a fixed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uds-lsv/msr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Infrastructure Maintenance and Monitoring · Advanced Neural Network Applications