Confidence Regularized Self-Training

Yang Zou; Zhiding Yu; Xiaofeng Liu; B. V. K. Vijaya Kumar; Jinsong; Wang

arXiv:1908.09822·cs.CV·July 16, 2020·33 cites

Confidence Regularized Self-Training

Yang Zou, Zhiding Yu, Xiaofeng Liu, B. V. K. Vijaya Kumar, Jinsong, Wang

PDF

Open Access 2 Repos

TL;DR

This paper introduces Confidence Regularized Self-Training (CRST), a novel framework for unsupervised domain adaptation that mitigates pseudo-label noise through confidence regularization, leading to improved performance in image classification and segmentation.

Contribution

It proposes a new confidence regularized self-training framework with label and model regularization, jointly optimizing pseudo-labels as continuous variables for better domain adaptation.

Findings

01

CRST outperforms non-regularized methods in experiments

02

Achieves state-of-the-art results in image classification and segmentation

03

Effectively reduces noise in pseudo-labels during training

Abstract

Recent advances in domain adaptation show that deep self-training presents a powerful means for unsupervised domain adaptation. These methods often involve an iterative process of predicting on target domain and then taking the confident predictions as pseudo-labels for retraining. However, since pseudo-labels can be noisy, self-training can put overconfident label belief on wrong classes, leading to deviated solutions with propagated errors. To address the problem, we propose a confidence regularized self-training (CRST) framework, formulated as regularized self-training. Our method treats pseudo-labels as continuous latent variables jointly optimized via alternating optimization. We propose two types of confidence regularization: label regularization (LR) and model regularization (MR). CRST-LR generates soft pseudo-labels while CRST-MR encourages the smoothness on network output.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications

MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling