Policy Learning with a Language Bottleneck

Megha Srivastava; Cedric Colas; Dorsa Sadigh; Jacob Andreas

arXiv:2405.04118·cs.LG·February 5, 2026

Policy Learning with a Language Bottleneck

Megha Srivastava, Cedric Colas, Dorsa Sadigh, Jacob Andreas

PDF

Open Access 1 Repo

TL;DR

The paper introduces PLLB, a framework where AI agents generate linguistic rules to improve interpretability and generalization, facilitating better human-AI interaction across diverse tasks.

Contribution

It presents a novel policy learning framework that integrates language models to produce interpretable rules guiding AI behavior, enhancing transparency and human collaboration.

Findings

01

Agents learn more interpretable behaviors

02

Rules enable better generalization across tasks

03

Shared rules improve human-AI coordination

Abstract

Modern AI systems such as self-driving cars and game-playing agents achieve superhuman performance, but often lack human-like generalization, interpretability, and inter-operability with human users. Inspired by the rich interactions between language and decision-making in humans, we introduce Policy Learning with a Language Bottleneck (PLLB), a framework enabling AI agents to generate linguistic rules that capture the high-level strategies underlying rewarding behaviors. PLLB alternates between a *rule generation* step guided by language models, and an *update* step where agents learn new policies guided by rules, even when a rule is insufficient to describe an entire complex policy. Across five diverse tasks, including a two-player signaling game, maze navigation, image reconstruction, and robot grasp planning, we show that PLLB agents are not only able to learn more interpretable and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

meghabyte/bottleneck
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling