SCOMatch: Alleviating Overtrusting in Open-set Semi-supervised Learning

Zerun Wang; Liuyu Xiang; Lang Huang; Jiafeng Mao; Ling Xiao; Toshihiko; Yamasaki

arXiv:2409.17512·cs.CV·September 27, 2024

SCOMatch: Alleviating Overtrusting in Open-set Semi-supervised Learning

Zerun Wang, Liuyu Xiang, Lang Huang, Jiafeng Mao, Ling Xiao, Toshihiko, Yamasaki

PDF

Open Access 1 Repo

TL;DR

SCOMatch introduces a novel open-set semi-supervised learning approach that treats OOD samples as an additional class, effectively reducing overtrusting of labeled data and improving decision boundary accuracy.

Contribution

It proposes a new SSL method that selects reliable OOD samples as labeled data and integrates this into the training process, addressing overtrusting issues in prior methods.

Findings

01

Outperforms state-of-the-art methods on various benchmarks.

02

Effectively refines decision boundaries between ID and OOD classes.

03

Validated through extensive ablation studies and visualizations.

Abstract

Open-set semi-supervised learning (OSSL) leverages practical open-set unlabeled data, comprising both in-distribution (ID) samples from seen classes and out-of-distribution (OOD) samples from unseen classes, for semi-supervised learning (SSL). Prior OSSL methods initially learned the decision boundary between ID and OOD with labeled ID data, subsequently employing self-training to refine this boundary. These methods, however, suffer from the tendency to overtrust the labeled ID data: the scarcity of labeled data caused the distribution bias between the labeled samples and the entire ID data, which misleads the decision boundary to overfit. The subsequent self-training process, based on the overfitted result, fails to rectify this problem. In this paper, we address the overtrusting issue by treating OOD samples as an additional class, forming a new SSL process. Specifically, we propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

komejisatori/SCOMatch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Imbalanced Data Classification Techniques