Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang, Tianhua Tao, Yi Gu, Tianmin Shu, Zirui Wang, Zichao, Yang, Zhiting Hu

TL;DR
This paper introduces a method to enhance large language models by finetuning them with embodied experiences from a physical world simulator, significantly improving their reasoning and planning abilities in real-world tasks.
Contribution
The paper proposes combining world models with language models using embodied experiences, employing techniques like EWC and LoRA to retain generality and improve physical reasoning capabilities.
Findings
Improved performance on 18 downstream tasks by 64.28% on average.
Small LMs with embodied finetuning outperform larger models like ChatGPT.
Effective integration of embodied knowledge enhances reasoning and planning in LMs.
Abstract
While large language models (LMs) have shown remarkable capabilities across numerous tasks, they often struggle with simple reasoning and planning in physical environments, such as understanding object permanence or planning household activities. The limitation arises from the fact that LMs are trained only on written text and miss essential embodied knowledge and skills. In this paper, we propose a new paradigm of enhancing LMs by finetuning them with world models, to gain diverse embodied knowledge while retaining their general language capabilities. Our approach deploys an embodied agent in a world model, particularly a simulator of the physical world (VirtualHome), and acquires a diverse set of embodied experiences through both goal-oriented planning and random exploration. These experiences are then used to finetune LMs to teach diverse abilities of reasoning and acting in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
MethodsBalanced Selection
