PIANIST: Learning Partially Observable World Models with LLMs for   Multi-Agent Decision Making

Jonathan Light; Sixue Xing; Yuanzhe Liu; Weiqin Chen; Min Cai; Xiusi; Chen; Guanzhi Wang; Wei Cheng; Yisong Yue; Ziniu Hu

arXiv:2411.15998·cs.AI·November 26, 2024

PIANIST: Learning Partially Observable World Models with LLMs for Multi-Agent Decision Making

Jonathan Light, Sixue Xing, Yuanzhe Liu, Weiqin Chen, Min Cai, Xiusi, Chen, Guanzhi Wang, Wei Cheng, Yisong Yue, Ziniu Hu

PDF

Open Access

TL;DR

PIANIST is a framework that enables LLMs to generate effective world models for multi-agent decision making in complex games without domain-specific training, by decomposing the world into intuitive components for efficient planning.

Contribution

The paper introduces PIANIST, a novel approach that decomposes world models into components for zero-shot LLM generation, facilitating decision making without domain-specific training.

Findings

01

Works well on two different challenging games

02

Enables fast MCTS simulation using LLM-generated world models

03

Operates without domain-specific training data

Abstract

Effective extraction of the world knowledge in LLMs for complex decision-making tasks remains a challenge. We propose a framework PIANIST for decomposing the world model into seven intuitive components conducive to zero-shot LLM generation. Given only the natural language description of the game and how input observations are formatted, our method can generate a working world model for fast and efficient MCTS simulation. We show that our method works well on two different games that challenge the planning and decision making skills of the agent for both language and non-language based action taking, without any training on domain-specific training data or explicitly defined world model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Data Stream Mining Techniques · Bayesian Modeling and Causal Inference