Avoiding Your Teacher's Mistakes: Training Neural Networks with   Controlled Weak Supervision

Mostafa Dehghani; Aliaksei Severyn; Sascha Rothe; Jaap Kamps

arXiv:1711.00313·cs.LG·December 8, 2017·29 cites

Avoiding Your Teacher's Mistakes: Training Neural Networks with Controlled Weak Supervision

Mostafa Dehghani, Aliaksei Severyn, Sascha Rothe, Jaap Kamps

PDF

Open Access

TL;DR

This paper introduces a semi-supervised learning approach with two neural networks, one for the task and one for confidence estimation, to improve training with weak labels and reduce the impact of noisy data.

Contribution

The paper proposes a novel multi-task training method using a confidence network to weight updates, effectively handling noisy labels in weak supervision scenarios.

Findings

01

Improved performance over baseline methods in document ranking and sentiment classification.

02

Faster learning convergence with the proposed confidence-weighted approach.

03

Effective mitigation of noisy label impact during training.

Abstract

Training deep neural networks requires massive amounts of training data, but for many tasks only limited labeled data is available. This makes weak supervision attractive, using weak or noisy signals like the output of heuristic methods or user click-through data for training. In a semi-supervised setting, we can use a large set of data with weak labels to pretrain a neural network and then fine-tune the parameters with a small amount of data with true labels. This feels intuitively sub-optimal as these two independent stages leave the model unaware about the varying label quality. What if we could somehow inform the model about the label quality? In this paper, we propose a semi-supervised learning method where we train two neural networks in a multi-task fashion: a "target network" and a "confidence network". The target network is optimized to perform a given task and is trained using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Imbalanced Data Classification Techniques