Pre-Trained Language Models for Interactive Decision-Making
Shuang Li, Xavier Puig, Chris Paxton, Yilun Du, Clinton Wang, Linxi, Fan, Tao Chen, De-An Huang, Ekin Aky\"urek, Anima Anandkumar, Jacob Andreas,, Igor Mordatch, Antonio Torralba, Yuke Zhu

TL;DR
This paper explores leveraging pre-trained language models to improve sequential decision-making tasks, demonstrating significant gains in task completion and generalization through policy initialization, fine-tuning, and active data gathering.
Contribution
It introduces a novel framework that uses language models to scaffold learning in decision-making, showing how LM-based representations enhance generalization and task performance.
Findings
Pre-trained LMs improve task completion rates by 43.6%.
Active data gathering boosts generalization, outperforming baselines by 25.1%.
Sequential input representations and LM initialization are key factors for success.
Abstract
Language model (LM) pre-training is useful in many language processing tasks. But can pre-trained LMs be further leveraged for more general machine learning problems? We propose an approach for using LMs to scaffold learning and generalization in general sequential decision-making problems. In this approach, goals and observations are represented as a sequence of embeddings, and a policy network initialized with a pre-trained LM predicts the next action. We demonstrate that this framework enables effective combinatorial generalization across different environments and supervisory modalities. We begin by assuming access to a set of expert demonstrations, and show that initializing policies with LMs and fine-tuning them via behavior cloning improves task completion rates by 43.6% in the VirtualHome environment. Next, we integrate an active data gathering procedure in which agents…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
