Online Experiential Learning for Language Models

Tianzhu Ye; Li Dong; Qingxiu Dong; Xun Wu; Shaohan Huang; Furu Wei

arXiv:2603.16856·cs.CL·March 18, 2026

Online Experiential Learning for Language Models

Tianzhu Ye, Li Dong, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei

PDF

Open Access

TL;DR

This paper introduces Online Experiential Learning (OEL), a framework enabling language models to learn continuously from their deployment experiences, improving performance through iterative knowledge extraction and consolidation without requiring environment access.

Contribution

The paper proposes a novel online learning framework for language models that leverages real-world deployment experiences to enhance performance iteratively.

Findings

01

OEL improves task accuracy and token efficiency over iterations.

02

Extracted experiential knowledge outperforms raw trajectories in effectiveness.

03

On-policy consistency is crucial for effective experiential learning.

Abstract

The prevailing paradigm for improving large language models relies on offline training with human annotations or simulated environments, leaving the rich experience accumulated during real-world deployment entirely unexploited. We propose Online Experiential Learning (OEL), a framework that enables language models to continuously improve from their own deployment experience. OEL operates in two stages: first, transferable experiential knowledge is extracted and accumulated from interaction trajectories collected on the user side; second, this knowledge is consolidated into model parameters via on-policy context distillation, requiring no access to the user-side environment. The two stages are iterated to form an online learning loop, where the improved model collects higher-quality trajectories that yield richer experiential knowledge for subsequent rounds. We evaluate OEL on text-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Reinforcement Learning in Robotics