Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large   Language Models Iteratively without Gold Labels

Chaoqun Liu; Qin Chao; Wenxuan Zhang; Xiaobao Wu; Boyang Li; Anh Tuan; Luu; Lidong Bing

arXiv:2409.12425·cs.CL·September 20, 2024

Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large Language Models Iteratively without Gold Labels

Chaoqun Liu, Qin Chao, Wenxuan Zhang, Xiaobao Wu, Boyang Li, Anh Tuan, Luu, Lidong Bing

PDF

Open Access

TL;DR

This paper introduces a novel zero-to-strong generalization paradigm that iteratively prompts LLMs to annotate unlabeled data, effectively unlocking their capabilities without relying on gold labels, and demonstrating success across diverse tasks and model sizes.

Contribution

The study proposes an iterative prompting framework that enables LLMs to improve their performance on tasks using only unlabeled data, bypassing the need for gold labels.

Findings

01

Effective on classification and reasoning tasks

02

Works for various model sizes and learning paradigms

03

Enhances LLM capabilities without gold labels

Abstract

Large Language Models (LLMs) have demonstrated remarkable performance through supervised fine-tuning or in-context learning using gold labels. However, this paradigm is limited by the availability of gold labels, while in certain scenarios, LLMs may need to perform tasks that are too complex for humans to provide such labels. To tackle this challenge, this study explores whether solely utilizing unlabeled data can elicit strong model capabilities. We propose a new paradigm termed zero-to-strong generalization. We iteratively prompt LLMs to annotate unlabeled data and retain high-quality labels by filtering. Surprisingly, we obverse that this iterative process gradually unlocks LLMs' potential on downstream tasks. Our experiments on extensive classification and reasoning tasks confirm the effectiveness of our proposed framework. Our analysis indicates that this paradigm is effective for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling