Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, Sameer Singh

arXiv:2307.11922 · cs.LG · July 25, 2023

Open Access

TL;DR

This paper introduces BLINDER, a reinforcement learning method that automatically selects concise environment state descriptions for language model actors, improving efficiency and success rates in decision-making tasks.

Contribution

BLINDER is a novel approach that learns to optimize state descriptions for LLM actors, reducing input size and computational costs while enhancing task performance.

Findings

01. Improves task success rate in NetHack and robotic tasks
02. Reduces input size and inference costs
03. Generalizes across different LLM actors

Abstract

Large language models (LLMs) are being applied as actors for sequential decision making tasks in domains such as robotics and games, utilizing their general world knowledge and planning abilities. However, previous work does little to explore what environment state information is provided to LLM actors via language. Exhaustively describing high-dimensional states can impair performance and raise inference costs for LLM actors. Previous LLM actors avoid the issue by relying on hand-engineered, task-specific protocols to determine which features to communicate about a state and which to leave out. In this work, we propose Brief Language INputs for DEcision-making Responses (BLINDER), a method for automatically selecting concise state descriptions by learning a value function for task-conditioned state descriptions. We evaluate BLINDER on the challenging video game NetHack and a robotic…
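The selection idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract says only that BLINDER learns a value function over task-conditioned state descriptions, so the greedy search, the `toy_value` scorer (a keyword-overlap stand-in for a learned value function), the `select_description` helper, and the `max_features` cap are all hypothetical names and choices made for illustration.

```python
from typing import Callable, List, Sequence


def toy_value(task: str, description: Sequence[str]) -> float:
    """Hypothetical stand-in for a learned value function.

    Counts task words mentioned by each selected feature and subtracts
    a small per-feature penalty so that shorter descriptions are preferred.
    """
    task_words = set(task.lower().split())
    overlap = sum(len(task_words & set(f.lower().split())) for f in description)
    return overlap - 0.1 * len(description)


def select_description(
    task: str,
    features: Sequence[str],
    value_fn: Callable[[str, Sequence[str]], float] = toy_value,
    max_features: int = 5,
) -> List[str]:
    """Greedily add the state feature that most increases the value estimate.

    Stops when no remaining feature improves the value or the cap is reached.
    The greedy strategy is an assumption of this sketch.
    """
    selected: List[str] = []
    remaining = list(features)
    best = value_fn(task, selected)
    while remaining and len(selected) < max_features:
        scored = [(value_fn(task, selected + [f]), f) for f in remaining]
        top_value, top_feature = max(scored, key=lambda pair: pair[0])
        if top_value <= best:
            break  # no candidate improves the description
        selected.append(top_feature)
        remaining.remove(top_feature)
        best = top_value
    return selected


if __name__ == "__main__":
    state_features = [
        "a key lies on the floor",
        "the wall is gray",
        "a locked door is to the east",
        "you feel hungry",
    ]
    print(select_description("open locked door with key", state_features))
```

In this toy setting the two task-relevant features are kept and the irrelevant ones are dropped, which mirrors the stated goal of shorter inputs for the LLM actor. A learned value function would replace `toy_value`, and how that function is trained is not covered by this sketch.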

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques