Improving self-training under distribution shifts via anchored   confidence with theoretical guarantees

Taejong Joo; Diego Klabjan

arXiv:2411.00586·cs.LG·November 4, 2024

Improving self-training under distribution shifts via anchored confidence with theoretical guarantees

Taejong Joo, Diego Klabjan

PDF

Open Access

TL;DR

This paper introduces a theoretically grounded method using anchored confidence and temporal ensembles to enhance self-training under distribution shifts, achieving significant accuracy improvements without extra computational costs.

Contribution

It proposes a novel uncertainty-aware temporal ensemble approach with theoretical guarantees to improve self-training under distribution shifts.

Findings

01

Improves self-training accuracy by 8-16% across various shifts

02

Enhances calibration and robustness to hyperparameters

03

Provides theoretical guarantees for asymptotic correctness

Abstract

Self-training often falls short under distribution shifts due to an increased discrepancy between prediction confidence and actual accuracy. This typically necessitates computationally demanding methods such as neighborhood or ensemble-based label corrections. Drawing inspiration from insights on early learning regularization, we develop a principled method to improve self-training under distribution shifts based on temporal consistency. Specifically, we build an uncertainty-aware temporal ensemble with a simple relative thresholding. Then, this ensemble smooths noisy pseudo labels to promote selective temporal consistency. We show that our temporal ensemble is asymptotically correct and our label smoothing technique can reduce the optimality gap of self-training. Our extensive experiments validate that our approach consistently improves self-training performances by 8% to 16% across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications