They are Not Completely Useless: Towards Recycling Transferable Unlabeled Data for Class-Mismatched Semi-Supervised Learning
Zhuo Huang, Ying Tai, Chengjie Wang, Jian Yang, Chen Gong

TL;DR
This paper introduces TOOR, a novel method that recycles certain out-of-distribution data in semi-supervised learning with class mismatch, improving data utilization and classification performance.
Contribution
The paper proposes a transferable OOD data recycling approach that leverages adversarial domain adaptation to utilize recyclable OOD data in class-mismatched SSL.
Findings
TOOR outperforms existing methods on benchmark datasets.
Recycling OOD data enhances the effectiveness of SSL.
The approach improves classification accuracy by utilizing more unlabeled data.
Abstract
Semi-Supervised Learning (SSL) with mismatched classes deals with the problem that the classes-of-interests in the limited labeled data is only a subset of the classes in massive unlabeled data. As a result, the classes only possessed by the unlabeled data may mislead the classifier training and thus hindering the realistic landing of various SSL methods. To solve this problem, existing methods usually divide unlabeled data to in-distribution (ID) data and out-of-distribution (OOD) data, and directly discard or weaken the OOD data to avoid their adverse impact. In other words, they treat OOD data as completely useless and thus the potential valuable information for classification contained by them is totally ignored. To remedy this defect, this paper proposes a "Transferable OOD data Recycling" (TOOR) method which properly utilizes ID data as well as the "recyclable" OOD data to enrich…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications
