Self Iterative Label Refinement via Robust Unlabeled Learning

Hikaru Asano; Tadashi Kozuno; Yukino Baba

arXiv:2502.12565·cs.CL·December 1, 2025

Self Iterative Label Refinement via Robust Unlabeled Learning

Hikaru Asano, Tadashi Kozuno, Yukino Baba

PDF

Open Access

TL;DR

This paper introduces an iterative label refinement method using unlabeled data to improve large language model classification, reducing bias and enhancing performance across diverse tasks with minimal supervision.

Contribution

It proposes a novel unlabeled-unlabeled learning framework for iterative pseudo-label refinement, addressing biases in self-refinement of LLMs with minimal human supervision.

Findings

01

Outperforms initial LLM classification results.

02

Effective across diverse datasets including low-resource languages.

03

Enhances safety alignment and generative task performance.

Abstract

Recent advances in large language models (LLMs) have yielded impressive performance on various tasks, yet they often depend on high-quality feedback that can be costly. Self-refinement methods attempt to leverage LLMs' internal evaluation mechanisms with minimal human supervision; however, these approaches frequently suffer from inherent biases and overconfidence, especially in domains where the models lack sufficient internal knowledge, resulting in performance degradation. As an initial step toward enhancing self-refinement for broader applications, we introduce an iterative refinement pipeline that employs the Unlabeled-Unlabeled learning framework to improve LLM-generated pseudo-labels for classification tasks. By exploiting two unlabeled datasets with differing positive class ratios, our approach iteratively denoises and refines the initial pseudo-labels, thereby mitigating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Text and Document Classification Technologies