Self Training with Ensemble of Teacher Models

Soumyadeep Ghosh; Sanjay Kumar; Janu Verma; Awanish Kumar

arXiv:2107.08211·cs.LG·July 20, 2021

Self Training with Ensemble of Teacher Models

Soumyadeep Ghosh, Sanjay Kumar, Janu Verma, Awanish Kumar

PDF

Open Access

TL;DR

This paper proposes an ensemble-based self-training method for semi-supervised learning that improves model accuracy and calibration, especially when unlabeled data contains out-of-distribution samples.

Contribution

It introduces a novel ensemble approach for self-training that mitigates calibration loss and enhances robustness in semi-supervised classification tasks.

Findings

01

Improved model accuracy over vanilla self-training methods

02

Enhanced calibration of the trained models

03

Effective handling of out-of-distribution unlabeled data

Abstract

In order to train robust deep learning models, large amounts of labelled data is required. However, in the absence of such large repositories of labelled data, unlabeled data can be exploited for the same. Semi-Supervised learning aims to utilize such unlabeled data for training classification models. Recent progress of self-training based approaches have shown promise in this area, which leads to this study where we utilize an ensemble approach for the same. A by-product of any semi-supervised approach may be loss of calibration of the trained model especially in scenarios where unlabeled data may contain out-of-distribution samples, which leads to this investigation on how to adapt to such effects. Our proposed algorithm carefully avoids common pitfalls in utilizing unlabeled data and leads to a more accurate and calibrated supervised model compared to vanilla self-training based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Machine Learning and Algorithms