Meta Self-Refinement for Robust Learning with Weak Supervision
Dawei Zhu, Xiaoyu Shen, Michael A. Hedderich, Dietrich Klakow

TL;DR
This paper introduces Meta Self-Refinement (MSR), a novel framework that enhances robustness of deep neural networks trained with weak supervision by refining pseudo-labels through meta-learning, leading to improved generalization and noise resistance.
Contribution
MSR is the first method to dynamically refine pseudo-labels via meta-gradient descent, significantly reducing error propagation from noisy weak labels.
Findings
MSR outperforms state-of-the-art methods by up to 11.4% in accuracy.
MSR demonstrates robustness against label noise across eight NLP benchmarks.
MSR effectively refines pseudo-labels to improve model generalization.
Abstract
Training deep neural networks (DNNs) under weak supervision has attracted increasing research attention as it can significantly reduce the annotation cost. However, labels from weak supervision can be noisy, and the high capacity of DNNs enables them to easily overfit the label noise, resulting in poor generalization. Recent methods leverage self-training to build noise-resistant models, in which a teacher trained under weak supervision is used to provide highly confident labels for teaching the students. Nevertheless, the teacher derived from such frameworks may have fitted a substantial amount of noise and therefore produce incorrect pseudo-labels with high confidence, leading to severe error propagation. In this work, we propose Meta Self-Refinement (MSR), a noise-resistant learning framework, to effectively combat label noise from weak supervision. Instead of relying on a fixed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Infrastructure Maintenance and Monitoring · Advanced Neural Network Applications
