Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling

Margaret Li; Weijia Shi; Artidoro Pagnoni; Peter West; Ari Holtzman

arXiv:2407.02446 · cs.CL · July 3, 2024

TL;DR

This paper investigates a trade-off in RLHF-aligned language models between world modeling (predicting what comes next in arbitrary documents) and agent modeling (acting coherently when interacting with humans, as in long-form text generation), finding that RLHF models gain the latter while losing much of the former.

Contribution

It empirically demonstrates the trade-off between world modeling and agent modeling in RLHF models and proposes a potential explanation: to generate coherent long-form text, RLHF models restrict randomness via implicit blueprints.

Findings

01. RLHF models focus probability on anchor spans, reducing diversity.

02. A trade-off exists between prediction accuracy and coherent long-form generation.

03. Alignment techniques may inherently limit a model's predictive flexibility.
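
Next-token prediction quality is commonly summarized as perplexity, the exponential of the average negative log-likelihood a model assigns to the tokens of a held-out document. The sketch below shows that calculation only; this summary does not state which metric the paper reports, and the function name and the toy log-probabilities are illustrative assumptions, not results from the paper.

```python
import math


def perplexity(token_logprobs):
    """Perplexity from per-token natural-log probabilities.

    `token_logprobs` is a hypothetical input: one log-probability per
    token of a document, as scored by some language model.
    """
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    avg_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(avg_nll)


# Toy values, made up purely for illustration (not measurements).
# A model that spreads probability more evenly over plausible next tokens
# can score an arbitrary document better than one that has concentrated
# its probability mass elsewhere.
spread_out = [math.log(0.5), math.log(0.5), math.log(0.5)]
concentrated = [math.log(0.9), math.log(0.9), math.log(0.01)]

ppl_spread = perplexity(spread_out)
ppl_concentrated = perplexity(concentrated)
```

Lower perplexity means better prediction. In the toy numbers, one token that the concentrated model considers very unlikely is enough to push its perplexity above that of the spread-out model.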

Abstract

RLHF-aligned LMs have shown unprecedented ability on both benchmarks and long-form text generation, yet they struggle with one foundational task: next-token prediction. As RLHF models become agent models aimed at interacting with humans, they seem to lose their world modeling -- the ability to predict what comes next in arbitrary documents, which is the foundational training objective of the Base LMs that RLHF adapts. Besides empirically demonstrating this trade-off, we propose a potential explanation: to perform coherent long-form generation, RLHF models restrict randomness via implicit blueprints. In particular, RLHF models concentrate probability on sets of anchor spans that co-occur across multiple generations for the same prompt, serving as textual scaffolding but also limiting a model's ability to generate documents that do not include these spans. We study this trade-off on the…
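
The abstract describes anchor spans as spans that co-occur across multiple generations for the same prompt. One rough way to picture this is to count word n-grams that recur across sampled generations. The sketch below is an illustrative assumption, not the paper's procedure: the function names, the whitespace tokenization, the n-gram length, and the `min_fraction` threshold are all hypothetical choices.

```python
from collections import Counter


def ngrams(tokens, n):
    """Set of distinct n-grams (as tuples) in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def shared_spans(generations, n=3, min_fraction=0.5):
    """Map each n-gram found in at least `min_fraction` of the generations
    to the number of generations containing it.

    Each generation contributes at most one count per n-gram, so the
    counts measure co-occurrence across generations, not repetition
    within a single one.
    """
    counts = Counter()
    for text in generations:
        counts.update(ngrams(text.lower().split(), n))
    threshold = min_fraction * len(generations)
    return {" ".join(gram): c for gram, c in counts.items() if c >= threshold}


# Made-up stand-ins for several generations sampled from one prompt.
samples = [
    "once upon a time there was a small fox",
    "once upon a time a dragon lived alone",
    "the fox ran far away from home",
]

spans = shared_spans(samples, n=3, min_fraction=0.6)
# spans == {"once upon a": 2, "upon a time": 2}
```

Under this toy measure, a model whose samples share many such spans is reusing the same textual scaffolding, which matches the abstract's description of concentrating probability on documents that contain them.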

Taxonomy

Topics: Complex Systems and Decision Making · Multi-Agent Systems and Negotiation

Methods: Balanced Selection