Natural Language Reinforcement Learning

Xidong Feng; Bo Liu; Yan Song; Haotian Fu; Ziyu Wan; Girish A. Koushik; Zhiyuan Hu; Mengyue Yang; Ying Wen; Jun Wang

arXiv:2411.14251·cs.LG·May 29, 2025

Natural Language Reinforcement Learning

Xidong Feng, Bo Liu, Yan Song, Haotian Fu, Ziyu Wan, Girish A. Koushik, Zhiyuan Hu, Mengyue Yang, Ying Wen, Jun Wang

PDF

Open Access 1 Repo 3 Models

TL;DR

This paper introduces Natural Language Reinforcement Learning (NLRL), a framework that uses interpretable linguistic narratives for value estimation, enabling agents to learn more actively and understand environments deeply through language-based representations.

Contribution

The paper proposes NLRL, extending RL with language-based value functions and components, leveraging LLMs for practical implementation and improved agent understanding.

Findings

01

NLRL demonstrates effectiveness on 4 multi-step tasks.

02

NLRL achieves improved learning efficiency.

03

NLRL fosters deeper environment understanding.

Abstract

Artificial intelligence progresses towards the "Era of Experience," where agents are expected to learn from continuous, grounded interaction. We argue that traditional Reinforcement Learning (RL), which typically represents value as a scalar, can restrict agent's deep understanding of environments and hinders the active, deliberative learning crucial for navigating this new paradigm. To address the issue, we introduce Natural Language Reinforcement Learning (NLRL), a framework that extends RL principles into natural language counterparts. Central to NLRL is the Language Value Function (LVF), which redefines value as an interpretable linguistic narrative articulating the rationale behind an evaluation. NLRL further extends this concept to core RL components, including policy, the Bellman equation, and policy iteration. Leveraging recent advancements in Large Language Models (LLMs), NLRL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

waterhorse1/natural-language-rl
jaxOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Automated Systems