IOMatch: Simplifying Open-Set Semi-Supervised Learning with Joint Inliers and Outliers Utilization
Zekun Li, Lei Qi, Yinghuan Shi, Yang Gao

TL;DR
IOMatch introduces a joint utilization approach for inliers and outliers in open-set semi-supervised learning, improving performance by avoiding unreliable outlier filtering and leveraging all unlabeled data effectively.
Contribution
The paper proposes a novel framework that jointly utilizes inliers and outliers with a multi-binary classifier, simplifying open-set SSL and enhancing accuracy without explicit outlier detection.
Findings
Significantly outperforms baseline methods on benchmark datasets.
Effectively leverages all unlabeled data, including outliers.
Simplifies open-set SSL with a unified classification approach.
Abstract
Semi-supervised learning (SSL) aims to leverage massive unlabeled data when labels are expensive to obtain. Unfortunately, in many real-world applications, the collected unlabeled data will inevitably contain unseen-class outliers not belonging to any of the labeled classes. To deal with the challenging open-set SSL task, the mainstream methods tend to first detect outliers and then filter them out. However, we observe a surprising fact that such approach could result in more severe performance degradation when labels are extremely scarce, as the unreliable outlier detector may wrongly exclude a considerable portion of valuable inliers. To tackle with this issue, we introduce a novel open-set SSL framework, IOMatch, which can jointly utilize inliers and outliers, even when it is difficult to distinguish exactly between them. Specifically, we propose to employ a multi-binary classifier…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning · Imbalanced Data Classification Techniques
