Reinforcement Learning-Guided Semi-Supervised Learning

Marzi Heidari; Hanping Zhang; Yuhong Guo

arXiv:2405.01760·cs.LG·May 6, 2024

Reinforcement Learning-Guided Semi-Supervised Learning

Marzi Heidari, Hanping Zhang, Yuhong Guo

PDF

Open Access

TL;DR

This paper introduces RLGSSL, a reinforcement learning-guided semi-supervised learning method that adaptively leverages labeled and unlabeled data, outperforming existing SSL techniques through a novel reward-based approach.

Contribution

It proposes a reinforcement learning framework for SSL, formulating it as a bandit problem with a new reward function, and employs a teacher-student model for improved stability.

Findings

01

RLGSSL achieves superior performance on benchmark datasets.

02

The RL-guided approach effectively balances labeled and unlabeled data.

03

Experimental results outperform state-of-the-art SSL methods.

Abstract

In recent years, semi-supervised learning (SSL) has gained significant attention due to its ability to leverage both labeled and unlabeled data to improve model performance, especially when labeled data is scarce. However, most current SSL methods rely on heuristics or predefined rules for generating pseudo-labels and leveraging unlabeled data. They are limited to exploiting loss functions and regularization methods within the standard norm. In this paper, we propose a novel Reinforcement Learning (RL) Guided SSL method, RLGSSL, that formulates SSL as a one-armed bandit problem and deploys an innovative RL loss based on weighted reward to adaptively guide the learning process of the prediction model. RLGSSL incorporates a carefully designed reward function that balances the use of labeled and unlabeled data to enhance generalization performance. A semi-supervised teacher-student…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuzzy Logic and Control Systems