LLMs Could Autonomously Learn Without External Supervision

Ke Ji, Junying Chen, Anningzhe Gao, Wenya Xie, Xiang Wan, and Benyou Wang

arXiv:2406.00606 · cs.CL · June 10, 2024

TL;DR

This paper introduces Autonomous Learning for LLMs, a paradigm in which models self-educate from text without human-labeled data; the authors report that it outperforms pre-training and supervised fine-tuning.

Contribution

It proposes a novel self-supervised learning paradigm that allows LLMs to independently identify and fill knowledge gaps through interaction with text.

Findings

01. Autonomous Learning surpasses pre-training and supervised fine-tuning performance.

02. Models effectively self-educate by interacting with diverse textual materials.

03. The approach reduces reliance on annotated datasets, improving training efficiency.

Abstract

In the quest for super-human performance, Large Language Models (LLMs) have traditionally been tethered to human-annotated datasets and predefined training objectives, a process that is both labor-intensive and inherently limited. This paper presents a transformative approach: Autonomous Learning for LLMs, a self-sufficient learning paradigm that frees models from the constraints of human supervision. This method endows LLMs with the ability to self-educate through direct interaction with text, akin to a human reading and comprehending literature. Our approach eliminates the reliance on annotated data, fostering an Autonomous Learning environment where the model independently identifies and reinforces its knowledge gaps. Empirical results from our comprehensive experiments, which utilized a diverse array of learning materials and were evaluated against standard public quizzes, reveal…

Code & Models

Repositories

freedomintelligence/autonomous_learning
PyTorch · Official

Taxonomy

Topics: Semantic Web and Ontologies · Scientific Computing and Data Management